Ryzen 9 3900X Linux SMT Performance

AMD Ryzen 9 3900X SMT benchmarks by Michael Larabel.

HTML result view exported from: https://openbenchmarking.org/result/1907314-AS-RYZEN939048&grr&rdt.

Ryzen 9 3900X Linux SMT PerformanceProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionSMT Enabled - DefaultSMT DisabledAMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores / 24 Threads)ASUS ROG CROSSHAIR VIII HERO (WI-FI) (0702 BIOS)AMD Device 148016384MB2000GB Force MP600Sapphire AMD Radeon RX 550 640SP / 560/560X 4GB (1300/1750MHz)AMD Device aae0ASUS VP28URealtek Device 8125 + Intel I211 + Intel Device 2723Ubuntu 18.045.3.0-999-generic (x86_64) 20190725GNOME Shell 3.28.4X Server 1.20.4modesetting 1.20.44.5 Mesa 19.0.2 (LLVM 8.0.0)GCC 7.4.0ext43840x2160AMD Ryzen 9 3900X 12-Core @ 3.80GHz (12 Cores)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Processor Details- SMT Enabled - Default: Scaling Governor: acpi-cpufreq schedutil- SMT Disabled: Scaling Governor: acpi-cpufreq ondemandPython Details- Python 2.7.15+ + Python 3.6.8Security Details- SMT Enabled - Default: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: always-on RSB filling - SMT Disabled: l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling

Ryzen 9 3900X Linux SMT Performancemkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_all - f32blender: Barbershop - CPU-Onlyappleseed: Emilygromacs: Water Benchmarkasmfish: 1024 Hash Memory, 26 Depthmkl-dnn: Convolution Batch conv_googlenet_v3 - f32cp2k: Fayalite-FIST Dataappleseed: Material Testerappleseed: Disney Materialparboil: OpenMP LBMnpb: LU.Cnamd: ATPase Simulation - 327,506 Atomsnero2d: Total Timev-ray: CPUstockfish: Total Timeindigobench: Bedroomindigobench: Supercargraphics-magick: Enhancedgraphics-magick: Sharpengraphics-magick: Noise-Gaussiangraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Resizinggraphics-magick: HWB Color Spacerodinia: OpenMP LavaMDc-ray: Total Time - 4K, 16 Rays Per Pixelbuild-linux-kernel: Time To Compilehimeno: Poisson Pressure Solvermkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32coremark: CoreMark Size 666 - Iterations Per Secondrust-prime: Prime Number Test To 200,000,000compress-7zip: Compress Speed Testcompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9ttsiod-renderer: Phong Rendering With Soft-Shadow Mappingnpb: BT.Aparboil: OpenMP MRI Griddingparboil: OpenMP Stencilmkl-dnn: Deconvolution Batch deconv_1d - f32swet: Averagenpb: SP.Acompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19rodinia: OpenMP Streamclusternpb: EP.Cnpb: FT.Bprimesieve: 1e12 Prime Number Generationmkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: IP Batch 1D - f32rodinia: OpenMP CFD Solverx265: H.265 1080p Video Encodingsmallpt: Global Illumination Renderer; 128 Samplessvt-av1: 1080p 8-bit YUV To AV1 Video Encodeffmpeg: H.264 HD To NTSC DVcloverleaf: Lagrangian-Eulerian Hydrodynamicsmkl-dnn: Deconvolution Batch deconv_3d - f32svt-hevc: 1080p 8-bit YUV To HEVC Video Encodeparboil: OpenMP CUTCPnpb: LU.Anpb: FT.ASMT Enabled - DefaultSMT Disabled2107.825993.69710.12271.850.9939682621112.71323.08163.74167.95151.1321264.921.4439472.3420558391379922.034.3519716516923126326628552.2653.2348.631386.87210.1018.70532587.5230.877792225.60643.906417.4828.8415.1226.528523245874293.5218.0521.20477.096193.0215.43257.9517.2513.9140.668.3341.358.453.795.05251.392.2250909.305711.262011.484559.321027.18370.450.9228280592108.94446.70216.56198.3974.2721548.381.7294467.0115709287658881.383.0620317314523326927228362.2056.1458.801369.0389.5417.56376805.0840.485910837.18483.156440.1118.4828.8920.678604337654399.1621.5016.02480.696218.6716.72248.327.1115.5048.9010.6232.244.023.274.89200.783.2252828.715701.85OpenBenchmarking.org

MKL-DNN

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32SMT Enabled - DefaultSMT Disabled5001000150020002500SE +/- 0.80, N = 3SE +/- 0.71, N = 32107.822011.48MIN: 2091.72MIN: 1994.11. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32SMT Enabled - DefaultSMT Disabled13002600390052006500SE +/- 22.04, N = 3SE +/- 1.43, N = 35993.694559.32MIN: 5634.28MIN: 4450.231. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79aBlend File: Barbershop - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled2004006008001000710.121027.18

Appleseed

Scene: Emily

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: EmilySMT Enabled - DefaultSMT Disabled80160240320400271.85370.45

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2018.3Water BenchmarkSMT Enabled - DefaultSMT Disabled0.22280.44560.66840.89121.114SE +/- 0.00, N = 3SE +/- 0.00, N = 30.990.921. (CXX) g++ options: -march=core-avx2 -std=c++11 -O3 -funroll-all-loops -fopenmp -lrt -lpthread -lm

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthSMT Enabled - DefaultSMT Disabled8M16M24M32M40MSE +/- 201681.41, N = 3SE +/- 41988.40, N = 33968262128280592

MKL-DNN

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32SMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.21, N = 3SE +/- 0.16, N = 3112.71108.94MIN: 111.1MIN: 107.131. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 6.1Fayalite-FIST DataSMT Enabled - DefaultSMT Disabled100200300400500323.08446.70

Appleseed

Scene: Material Tester

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Material TesterSMT Enabled - DefaultSMT Disabled50100150200250163.74216.56

Appleseed

Scene: Disney Material

OpenBenchmarking.orgSeconds, Fewer Is BetterAppleseed 2.0 BetaScene: Disney MaterialSMT Enabled - DefaultSMT Disabled4080120160200167.95198.39

Parboil

Test: OpenMP LBM

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP LBMSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.04, N = 3SE +/- 0.53, N = 3151.1374.271. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.CSMT Enabled - DefaultSMT Disabled5K10K15K20K25KSE +/- 16.81, N = 3SE +/- 18.22, N = 321264.9221548.381. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 AtomsSMT Enabled - DefaultSMT Disabled0.38910.77821.16731.55641.9455SE +/- 0.00061, N = 3SE +/- 0.00287, N = 31.443941.72944

Open FMM Nero2D

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpen FMM Nero2D 2.0.2Total TimeSMT Enabled - DefaultSMT Disabled1632486480SE +/- 0.29, N = 3SE +/- 0.29, N = 372.3467.011. (CXX) g++ options: -O2 -lfftw3 -llapack -lblas -lgfortran -lquadmath -lm -pthread -lmpi_cxx -lmpi

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgKsamples, More Is BetterChaos Group V-RAY 4.10.03Mode: CPUSMT Enabled - DefaultSMT Disabled4K8K12K16K20KSE +/- 55.98, N = 3SE +/- 24.84, N = 32055815709

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total TimeSMT Enabled - DefaultSMT Disabled8M16M24M32M40MSE +/- 320057.07, N = 3SE +/- 316141.73, N = 339137992287658881. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

IndigoBench

Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: BedroomSMT Enabled - DefaultSMT Disabled0.45680.91361.37041.82722.284SE +/- 0.00, N = 3SE +/- 0.00, N = 32.031.38

IndigoBench

Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.0.64Scene: SupercarSMT Enabled - DefaultSMT Disabled0.97881.95762.93643.91524.894SE +/- 0.00, N = 3SE +/- 0.01, N = 34.353.06

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedSMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.33, N = 31972031. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenSMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.33, N = 31651731. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianSMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.33, N = 31691451. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlSMT Enabled - DefaultSMT Disabled50100150200250SE +/- 0.67, N = 3SE +/- 1.53, N = 32312331. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.88, N = 3SE +/- 1.00, N = 32632691. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 1.20, N = 3SE +/- 1.20, N = 32662721. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.88, N = 3SE +/- 1.20, N = 32852831. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMDSMT Enabled - DefaultSMT Disabled1428425670SE +/- 0.06, N = 3SE +/- 0.01, N = 352.2662.201. (CXX) g++ options: -O2 -lOpenCL

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelSMT Enabled - DefaultSMT Disabled1326395265SE +/- 0.02, N = 3SE +/- 0.04, N = 353.2356.141. (CC) gcc options: -lm -lpthread -O3

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To CompileSMT Enabled - DefaultSMT Disabled1326395265SE +/- 0.70, N = 3SE +/- 0.81, N = 348.6358.80

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverSMT Enabled - DefaultSMT Disabled30060090012001500SE +/- 1.81, N = 3SE +/- 3.89, N = 31386.871369.031. (CC) gcc options: -O3 -mavx2

MKL-DNN

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32SMT Enabled - DefaultSMT Disabled50100150200250SE +/- 1.44, N = 3SE +/- 0.14, N = 3210.1089.54MIN: 126.74MIN: 88.321. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32SMT Enabled - DefaultSMT Disabled510152025SE +/- 0.00, N = 3SE +/- 0.09, N = 318.7017.56MIN: 18.37MIN: 17.11. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondSMT Enabled - DefaultSMT Disabled110K220K330K440K550KSE +/- 2501.14, N = 3SE +/- 731.59, N = 3532587.52376805.081. (CC) gcc options: -O2 -lrt" -lrt

Rust Prime Benchmark

Prime Number Test To 200,000,000

OpenBenchmarking.orgSeconds, Fewer Is BetterRust Prime BenchmarkPrime Number Test To 200,000,000SMT Enabled - DefaultSMT Disabled918273645SE +/- 0.07, N = 3SE +/- 0.08, N = 330.8740.481. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestSMT Enabled - DefaultSMT Disabled20K40K60K80K100KSE +/- 323.94, N = 3SE +/- 393.22, N = 377922591081. (CXX) g++ options: -pipe -lpthread

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9SMT Enabled - DefaultSMT Disabled918273645SE +/- 0.07, N = 3SE +/- 0.09, N = 325.6037.181. (CC) gcc options: -pthread -fvisibility=hidden -O2

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow MappingSMT Enabled - DefaultSMT Disabled140280420560700SE +/- 1.06, N = 3SE +/- 1.29, N = 3643.90483.151. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

NAS Parallel Benchmarks

Test / Class: BT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: BT.ASMT Enabled - DefaultSMT Disabled14002800420056007000SE +/- 8.30, N = 3SE +/- 15.50, N = 36417.486440.111. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Parboil

Test: OpenMP MRI Gridding

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP MRI GriddingSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.01, N = 3SE +/- 0.02, N = 328.8418.481. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

Parboil

Test: OpenMP Stencil

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP StencilSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.02, N = 3SE +/- 0.32, N = 315.1228.891. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32SMT Enabled - DefaultSMT Disabled612182430SE +/- 0.04, N = 3SE +/- 0.15, N = 326.5220.67MIN: 25.61MIN: 20.191. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

Swet

Average

OpenBenchmarking.orgOperations Per Second, More Is BetterSwet 1.5.16AverageSMT Enabled - DefaultSMT Disabled200M400M600M800M1000MSE +/- 11310134.98, N = 4SE +/- 10717369.00, N = 38523245878604337651. (CC) gcc options: -lm -lpthread -lcurses -lrt

NAS Parallel Benchmarks

Test / Class: SP.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: SP.ASMT Enabled - DefaultSMT Disabled9001800270036004500SE +/- 2.82, N = 3SE +/- 9.96, N = 34293.524399.161. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Zstd Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19SMT Enabled - DefaultSMT Disabled510152025SE +/- 0.08, N = 3SE +/- 0.02, N = 318.0521.501. (CC) gcc options: -O3 -pthread -lz

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP StreamclusterSMT Enabled - DefaultSMT Disabled510152025SE +/- 0.00, N = 3SE +/- 0.02, N = 321.2016.021. (CXX) g++ options: -O2 -lOpenCL

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: EP.CSMT Enabled - DefaultSMT Disabled100200300400500SE +/- 0.04, N = 3SE +/- 0.52, N = 3477.09480.691. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: FT.B

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.BSMT Enabled - DefaultSMT Disabled13002600390052006500SE +/- 7.75, N = 3SE +/- 15.63, N = 36193.026218.671. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.41e12 Prime Number GenerationSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.02, N = 3SE +/- 0.02, N = 315.4316.721. (CXX) g++ options: -O3 -lpthread

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32SMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.49, N = 3SE +/- 0.21, N = 3257.95248.32MIN: 256.12MIN: 246.461. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32SMT Enabled - DefaultSMT Disabled48121620SE +/- 0.29, N = 3SE +/- 0.02, N = 317.257.11MIN: 11.06MIN: 7.021. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD SolverSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.02, N = 3SE +/- 0.07, N = 313.9115.501. (CXX) g++ options: -O2 -lOpenCL

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingSMT Enabled - DefaultSMT Disabled1122334455SE +/- 0.40, N = 3SE +/- 0.19, N = 340.6648.901. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 SamplesSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 38.3310.621. (CXX) g++ options: -fopenmp -O3

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeSMT Enabled - DefaultSMT Disabled918273645SE +/- 0.25, N = 3SE +/- 0.02, N = 341.3532.241. (CXX) g++ options: -O3 -pie -lpthread -lm

FFmpeg

H.264 HD To NTSC DV

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 4.0.2H.264 HD To NTSC DVSMT Enabled - DefaultSMT Disabled246810SE +/- 0.06, N = 3SE +/- 0.01, N = 38.454.021. (CC) gcc options: -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lXv -lX11 -lXext -lm -lxcb -lxcb-shape -lxcb-xfixes -lasound -lSDL2 -lsndio -pthread -lbz2 -llzma -std=c11 -fomit-frame-pointer -fPIC -O3 -fno-math-errno -fno-signed-zeros -fno-tree-vectorize -MMD -MF -MT

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian HydrodynamicsSMT Enabled - DefaultSMT Disabled0.85281.70562.55843.41124.264SE +/- 0.00, N = 3SE +/- 0.01, N = 33.793.271. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32SMT Enabled - DefaultSMT Disabled1.13632.27263.40894.54525.6815SE +/- 0.01, N = 3SE +/- 0.01, N = 35.054.89MIN: 4.97MIN: 4.791. (CXX) g++ options: -std=c++11 -march=native -mtune=native -fPIC -fopenmp -O3 -pie -lmklml_intel -ldl

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeSMT Enabled - DefaultSMT Disabled50100150200250SE +/- 1.83, N = 3SE +/- 1.35, N = 3251.39200.781. (CC) gcc options: -fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native -pie -rdynamic -lpthread -lrt

Parboil

Test: OpenMP CUTCP

OpenBenchmarking.orgSeconds, Fewer Is BetterParboil 2.5Test: OpenMP CUTCPSMT Enabled - DefaultSMT Disabled0.72451.4492.17352.8983.6225SE +/- 0.00, N = 3SE +/- 0.01, N = 32.223.221. (CXX) g++ options: -lm -lpthread -lgomp -O3 -ffast-math -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: LU.ASMT Enabled - DefaultSMT Disabled11K22K33K44K55KSE +/- 371.45, N = 3SE +/- 457.69, N = 350909.3052828.711. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1

NAS Parallel Benchmarks

Test / Class: FT.A

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.3.1Test / Class: FT.ASMT Enabled - DefaultSMT Disabled12002400360048006000SE +/- 3.40, N = 3SE +/- 4.89, N = 35711.265701.851. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi2. Open MPI 2.1.1


Phoronix Test Suite v10.8.5