Ryzen 9 3900X Linux SMT Performance

AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1907314-AS-RYZEN939048
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
SMT Enabled - Default
July 31 2019
  3 Hours, 22 Minutes
SMT Disabled
July 31 2019
  3 Hours, 33 Minutes
Invert Behavior (Only Show Selected Data)
  3 Hours, 28 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Ryzen 9 3900X Linux SMT PerformanceOpenBenchmarking.orgPhoronix Test SuiteAMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 7.4.0ext43840x2160ProcessorsMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionRyzen 9 3900X Linux SMT Performance BenchmarksSystem Logs- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v - SMT Enabled - Default: Scaling Governor: acpi-cpufreq schedutil- SMT Disabled: Scaling Governor: acpi-cpufreq ondemand- Python 2.7.15+ + Python 3.6.8- SMT Enabled - Default: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling - SMT Disabled: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling

SMT Enabled - Default vs. SMT Disabled ComparisonPhoronix Test SuiteBaseline+35.7%+35.7%+71.4%+71.4%+107.1%+107.1%142.6%134.6%110.2%103.5%56.1%32.3%31.5%28.3%20.3%15.9%8%6.5%4.8%4.8%3.9%3.8%3.5%3.3%3%2.5%2.3%2.3%IP Batch 1D - f32IP Batch All - f32H.2.H.T.N.DOpenMP LBMOpenMP Stencil91.1%O.M.GBedroom47.1%C.u.1.0.3.s.i.i.C.L.945.2%OpenMP CUTCP45%Barbershop - CPU-Only44.6%Supercar42.2%CoreMark Size 666 - I.P.S41.3%1.H.M.2.D40.3%Fayalite-FIST Data38.3%Emily36.3%Total Time36.1%P.R.W.S.S.M33.3%O.SMaterial Tester32.3%C.S.T31.8%D.B.d - f32P.N.T.T.2.0.031.1%CPU30.9%D.B.d - f321.8.b.Y.T.A.V.E28.3%G.I.R.1.S27.5%1.8.b.Y.T.H.V.E25.2%Time To Compile20.9%H.2.1.V.EATPase Simulation - 327,506 Atoms19.8%C.u.1.0.3.s.i.i.C.L.119.1%OpenMP LavaMD19%Disney Material18.1%Noise-Gaussian16.6%L.E.HOpenMP CFD Solver11.4%1.P.N.G8.4%Total TimeWater Benchmark7.6%C.B.c - f32Total Time - 4.1.R.P.P5.5%SharpenC.B.c - f32C.B.c - f32LU.AC.B.c - f32D.B.d - f32EnhancedSP.ARotateResizingMKL-DNNMKL-DNNFFmpegParboilParboilParboilIndigoBenchXZ CompressionParboilBlenderIndigoBenchCoremarkasmFishCP2K Molecular DynamicsAppleseedStockfishTTSIOD 3D RendererRodiniaAppleseed7-Zip CompressionMKL-DNNRust Prime BenchmarkChaos Group V-RAYMKL-DNNSVT-AV1SmallptSVT-HEVCTimed Linux Kernel Compilationx265NAMDZstd CompressionRodiniaAppleseedGraphicsMagickCloverLeafRodiniaPrimesieveOpen FMM Nero2DGROMACSMKL-DNNC-RayGraphicsMagickMKL-DNNMKL-DNNNAS Parallel BenchmarksMKL-DNNMKL-DNNGraphicsMagickNAS Parallel BenchmarksGraphicsMagickGraphicsMagickSMT Enabled - DefaultSMT Disabled

Ryzen 9 3900X Linux SMT Performancecompress-7zip: Compress Speed Testappleseed: Emilyappleseed: Disney Materialappleseed: Material Testerasmfish: 1024 Hash Memory, 26 Depthblender: Barbershop - CPU-Onlyc-ray: Total Time - 4K, 16 Rays Per Pixelv-ray: CPUcloverleaf: Lagrangian-Eulerian Hydrodynamicscoremark: CoreMark Size 666 - Iterations Per Secondcp2k: Fayalite-FIST Dataffmpeg: H.264 HD To NTSC DVgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacegromacs: Water Benchmarkhimeno: Poisson Pressure Solverindigobench: Bedroomindigobench: Supercarmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32namd: ATPase Simulation - 327,506 Atomsnpb: BT.Anpb: EP.Cnpb: FT.Anpb: FT.Bnpb: LU.Anpb: LU.Cnpb: SP.Anero2d: Total Timeparboil: OpenMP LBMparboil: OpenMP CUTCPparboil: OpenMP Stencilparboil: OpenMP MRI Griddingprimesieve: 1e12 Prime Number Generationrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrodinia: OpenMP Streamclusterrust-prime: Prime Number Test To 200,000,000smallpt: Global Illumination Renderer; 128 Samplesstockfish: Total Timesvt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodeswet: Averagebuild-linux-kernel: Time To Compilettsiod-renderer: Phong Rendering With Soft-Shadow Mappingx265: H.265 1080p Video Encodingcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9compress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19SMT Enabled - DefaultSMT Disabled77922271.85167.95163.7439682621710.1253.23205583.79532587.52323.088.452312631651972661692850.991386.872.034.3517.25210.1018.702107.8226.525.05257.955993.69112.711.443946417.48477.095711.266193.0250909.3021264.924293.5272.34151.132.2215.1228.8415.4352.2613.9121.2030.878.333913799241.35251.3985232458748.63643.9040.6625.6018.0559108370.45198.39216.56282805921027.1856.14157093.27376805.08446.704.022332691732032721452830.921369.031.383.067.1189.5417.562011.4820.674.89248.324559.32108.941.729446440.11480.695701.856218.6752828.7121548.384399.1667.0174.273.2228.8918.4816.7262.2015.5016.0240.4810.622876588832.24200.7886043376558.80483.1548.9037.1821.50OpenBenchmarking.org

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestSMT Enabled - DefaultSMT Disabled20K40K60K80K100KSE +/- 323.94, N = 3SE +/- 393.22, N = 377922591081. (CXX) g++ options: -pipe -lpthread

Appleseed

Appleseed is an open-source production renderer focused on physically-based global illumination rendering engine primarily designed for animation and visual effects. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilySMT Enabled - DefaultSMT Disabled80160240320400271.85370.45

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialSMT Enabled - DefaultSMT Disabled4080120160200167.95198.39

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterSMT Enabled - DefaultSMT Disabled50100150200250163.74216.56

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthSMT Enabled - DefaultSMT Disabled8M16M24M32M40MSE +/- 201681.41, N = 3SE +/- 41988.40, N = 33968262128280592

Blender

Blender is an open-source 3D creation software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL or CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled2004006008001000710.121027.18

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelSMT Enabled - DefaultSMT Disabled1326395265SE +/- 0.02, N = 3SE +/- 0.04, N = 353.2356.141. (CC) gcc options: -lm -lpthread -O3

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.03Mode: CPUSMT Enabled - DefaultSMT Disabled4K8K12K16K20KSE +/- 55.98, N = 3SE +/- 24.84, N = 32055815709

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm8192.in input file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsSMT Enabled - DefaultSMT Disabled0.85281.70562.55843.41124.264SE +/- 0.00, N = 3SE +/- 0.01, N = 33.793.271. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondSMT Enabled - DefaultSMT Disabled110K220K330K440K550KSE +/- 2501.14, N = 3SE +/- 731.59, N = 3532587.52376805.081. (CC) gcc options: -O2 -lrt" -lrt

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently makes use of the OpenMP implementation and using the Fayalite-FIST molecular dynamics run and measures the total time to complete. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 6.1Fayalite-FIST DataSMT Enabled - DefaultSMT Disabled100200300400500323.08446.70

FFmpeg

This test uses FFmpeg for testing the system's audio/video encoding performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVSMT Enabled - DefaultSMT Disabled246810SE +/- 0.06, N = 3SE +/- 0.01, N = 38.454.021. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlSMT Enabled - DefaultSMT Disabled50100150200250SE +/- 0.67, N = 3SE +/- 1.53, N = 32312331. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.88, N = 3SE +/- 1.00, N = 32632691. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenSMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.33, N = 31651731. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedSMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.33, N = 31972031. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 1.20, N = 3SE +/- 1.20, N = 32662721. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianSMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.33, N = 31691451. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.88, N = 3SE +/- 1.20, N = 32852831. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GROMACS

The Gromacs molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water BenchmarkSMT Enabled - DefaultSMT Disabled0.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 30.990.921. (CXX) g++ options: -march=core-avx2 -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverSMT Enabled - DefaultSMT Disabled30060090012001500SE +/- 1.81, N = 3SE +/- 3.89, N = 31386.871369.031. (CC) gcc options: -O3 -mavx2

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: BedroomSMT Enabled - DefaultSMT Disabled0.45680.91361.37041.82722.284SE +/- 0.00, N = 3SE +/- 0.00, N = 32.031.38

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: SupercarSMT Enabled - DefaultSMT Disabled0.97881.95762.93643.91524.894SE +/- 0.00, N = 3SE +/- 0.01, N = 34.353.06

MKL-DNN

This is a test of the Intel MKL-DNN as the Intel Math Kernel Library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32SMT Enabled - DefaultSMT Disabled48121620SE +/- 0.29, N = 3SE +/- 0.02, N = 317.257.11MIN: 11.06MIN: 7.021. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32SMT Enabled - DefaultSMT Disabled50100150200250SE +/- 1.44, N = 3SE +/- 0.14, N = 3210.1089.54MIN: 126.74MIN: 88.321. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32SMT Enabled - DefaultSMT Disabled510152025SE +/- 0.00, N = 3SE +/- 0.09, N = 318.7017.56MIN: 18.37MIN: 17.11. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32SMT Enabled - DefaultSMT Disabled5001000150020002500SE +/- 0.80, N = 3SE +/- 0.71, N = 32107.822011.48MIN: 2091.72MIN: 1994.11. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32SMT Enabled - DefaultSMT Disabled612182430SE +/- 0.04, N = 3SE +/- 0.15, N = 326.5220.67MIN: 25.61MIN: 20.191. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32SMT Enabled - DefaultSMT Disabled1.13632.27263.40894.54525.6815SE +/- 0.01, N = 3SE +/- 0.01, N = 35.054.89MIN: 4.97MIN: 4.791. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32SMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.49, N = 3SE +/- 0.21, N = 3257.95248.32MIN: 256.12MIN: 246.461. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32SMT Enabled - DefaultSMT Disabled13002600390052006500SE +/- 22.04, N = 3SE +/- 1.43, N = 35993.694559.32MIN: 5634.28MIN: 4450.231. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32SMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.21, N = 3SE +/- 0.16, N = 3112.71108.94MIN: 111.1MIN: 107.131. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 AtomsSMT Enabled - DefaultSMT Disabled0.38910.77821.16731.55641.9455SE +/- 0.00061, N = 3SE +/- 0.00287, N = 31.443941.72944

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.ASMT Enabled - DefaultSMT Disabled14002800420056007000SE +/- 8.30, N = 3SE +/- 15.50, N = 36417.486440.111. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.CSMT Enabled - DefaultSMT Disabled100200300400500SE +/- 0.04, N = 3SE +/- 0.52, N = 3477.09480.691. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.ASMT Enabled - DefaultSMT Disabled12002400360048006000SE +/- 3.40, N = 3SE +/- 4.89, N = 35711.265701.851. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.BSMT Enabled - DefaultSMT Disabled13002600390052006500SE +/- 7.75, N = 3SE +/- 15.63, N = 36193.026218.671. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.ASMT Enabled - DefaultSMT Disabled11K22K33K44K55KSE +/- 371.45, N = 3SE +/- 457.69, N = 350909.3052828.711. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.CSMT Enabled - DefaultSMT Disabled5K10K15K20K25KSE +/- 16.81, N = 3SE +/- 18.22, N = 321264.9221548.381. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.ASMT Enabled - DefaultSMT Disabled9001800270036004500SE +/- 2.82, N = 3SE +/- 9.96, N = 34293.524399.161. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Open FMM Nero2D

This is a test of Nero2D, which is a two-dimensional TM/TE solver for Open FMM. Open FMM is a free collection of electromagnetic software for scattering at very large objects. This test profile times how long it takes to solve one of the included 2D examples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total TimeSMT Enabled - DefaultSMT Disabled1632486480SE +/- 0.29, N = 3SE +/- 0.29, N = 372.3467.011. (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi

Parboil

The Parboil Benchmarks from the IMPACT Research Group at University of Illinois are a set of throughput computing applications for looking at computing architecture and compilers. Parboil test-cases support OpenMP, OpenCL, and CUDA multi-processing environments. However, at this time the test profile is just making use of the OpenMP and OpenCL test workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.04, N = 3SE +/- 0.53, N = 3151.1374.271. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPSMT Enabled - DefaultSMT Disabled0.72451.4492.17352.8983.6225SE +/- 0.00, N = 3SE +/- 0.01, N = 32.223.221. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.02, N = 3SE +/- 0.32, N = 315.1228.891. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.01, N = 3SE +/- 0.02, N = 328.8418.481. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Primesieve

Primesieve generates prime numbers using a highly optimized sieve of Eratosthenes implementation. Primesieve benchmarks the CPU's L1/L2 cache performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.02, N = 3SE +/- 0.02, N = 315.4316.721. (CXX) g++ options: -O3 -lpthread

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes the OpenCL and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDSMT Enabled - DefaultSMT Disabled1428425670SE +/- 0.06, N = 3SE +/- 0.01, N = 352.2662.201. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.02, N = 3SE +/- 0.07, N = 313.9115.501. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterSMT Enabled - DefaultSMT Disabled510152025SE +/- 0.00, N = 3SE +/- 0.02, N = 321.2016.021. (CXX) g++ options: -O2 -lOpenCL

Rust Prime Benchmark

Based on petehunt/rust-benchmark, this is a prime number benchmark that is multi-threaded and written in Rustlang. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRust Prime BenchmarkPrime Number Test To 200,000,000SMT Enabled - DefaultSMT Disabled918273645SE +/- 0.07, N = 3SE +/- 0.08, N = 330.8740.481. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

Smallpt

Smallpt is a C++ global illumination renderer written in less than 100 lines of code. Global illumination is done via unbiased Monte Carlo path tracing and there is multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 38.3310.621. (CXX) g++ options: -fopenmp -O3

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeSMT Enabled - DefaultSMT Disabled8M16M24M32M40MSE +/- 320057.07, N = 3SE +/- 316141.73, N = 339137992287658881. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeSMT Enabled - DefaultSMT Disabled918273645SE +/- 0.25, N = 3SE +/- 0.02, N = 341.3532.241. (CXX) g++ options: -O3 -pie -lpthread -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeSMT Enabled - DefaultSMT Disabled50100150200250SE +/- 1.83, N = 3SE +/- 1.35, N = 3251.39200.781. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

Swet

Swet is a synthetic CPU/RAM benchmark, includes multi-processor test cases. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageSMT Enabled - DefaultSMT Disabled200M400M600M800M1000MSE +/- 11310134.98, N = 4SE +/- 10717369.00, N = 38523245878604337651. (CC) gcc options: -lm -lpthread -lcurses -lrt

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To CompileSMT Enabled - DefaultSMT Disabled1326395265SE +/- 0.70, N = 3SE +/- 0.81, N = 348.6358.80

TTSIOD 3D Renderer

A portable GPL 3D software renderer that supports OpenMP and Intel Threading Building Blocks with many different rendering modes. This version does not use OpenGL but is entirely CPU/software based. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingSMT Enabled - DefaultSMT Disabled140280420560700SE +/- 1.06, N = 3SE +/- 1.29, N = 3643.90483.151. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingSMT Enabled - DefaultSMT Disabled1122334455SE +/- 0.40, N = 3SE +/- 0.19, N = 340.6648.901. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

XZ Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using XZ compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9SMT Enabled - DefaultSMT Disabled918273645SE +/- 0.07, N = 3SE +/- 0.09, N = 325.6037.181. (CC) gcc options: -pthread -fvisibility=hidden -O2

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19SMT Enabled - DefaultSMT Disabled510152025SE +/- 0.08, N = 3SE +/- 0.02, N = 318.0521.501. (CC) gcc options: -O3 -pthread -lz