Dual AMD EPYC 7601 Linux CPU Core/Thread Benchmarks

2 x AMD EPYC 7601 32-Core testing for a future article on Phoronix.com..

HTML result view exported from: https://openbenchmarking.org/result/1810109-SK-DUALAMDEP26&sro&grt.

Dual AMD EPYC 7601 Linux CPU Core/Thread BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopCompilerFile-SystemScreen Resolution2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads128 Threads2 x AMD EPYC 7601 32-Core @ 2.68GHz (2 Cores)Dell 02MJ3T (1.2.5 BIOS)AMD Family 17h16 x 32 GB DDR4-2400MT/s 36ASF4G72PZ-2G6D2120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860Matrox Matrox G200eW3VE228Broadcom Limited BCM57416 NetXtreme-E 10GBase-T RDMAUbuntu 18.044.19.0-041900rc6-generic (x86_64) 20180930GNOME Shell 3.28.3GCC 7.3.0ext41600x12002 x AMD EPYC 7601 32-Core @ 2.69GHz (4 Cores)20 x 500GB Samsung SSD 860 + 120GB SSDSCKJB120G7R2 x AMD EPYC 7601 32-Core @ 2.69GHz (8 Cores)2 x AMD EPYC 7601 32-Core @ 2.70GHz (16 Cores)2 x AMD EPYC 7601 32-Core @ 2.69GHz (32 Cores)2 x AMD EPYC 7601 32-Core @ 3.10GHz (64 Cores)2 x AMD EPYC 7601 32-Core @ 3.12GHz (64 Cores / 128 Threads)120GB SSDSCKJB120G7R + 20 x 500GB Samsung SSD 860OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++ --enable-libmpx --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib --with-tune=generic --without-cuda-driver -v Python Details- Python 2.7.15rc1 + Python 3.6.6Security Details- __user pointer sanitization + Full AMD retpoline IBPB + SSB disabled via prctl and seccomp

Dual AMD EPYC 7601 Linux CPU Core/Thread Benchmarkscompress-7zip: Compress Speed Testaircrack-ng: asmfish: 1024 Hash Memory, 26 Depthblender: BMW27 - CPU-Onlyblender: Fishy Cat - CPU-Onlybrl-cad: VGR Performance Metricc-ray: Total Time - 4K, 16 Rays Per Pixelv-ray: CPUcpuminer-opt: m7mcpuminer-opt: deepcpuminer-opt: lbrycpuminer-opt: skeincpuminer-opt: myr-grcpuminer-opt: sha256tdacapobench: Jythondarktable: Boat - CPU-onlydarktable: Masskrug - CPU-onlydarktable: Server Rack - CPU-onlydarktable: Server Room - CPU-onlyebizzy: graphics-magick: Swirlgraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Noise-Gaussianhimeno: Poisson Pressure Solverm-queens: Time To Solvenamd: ATPase Simulation - 327,506 Atomsopenssl: RSA 4096-bit Performancepgbench: Buffer Test - Normal Load - Read Onlypovray: Trace Timeprimesieve: 1e12 Prime Number Generationrodinia: OpenMP LavaMDrodinia: OpenMP CFD Solverrust-mandel: Time To Complete Serial/Parallel Mandelbrotsmallpt: Global Illumination Renderer; 128 Samplesstockfish: Total Timetachyon: Total Timetensorflow: Cifar10tesseract-ocr: Time To OCR 7 Imagesbuild-imagemagick: Time To Compilebuild-linux-kernel: Time To Compilebuild-llvm: Time To Compilettsiod-renderer: Phong Rendering With Soft-Shadow Mappingx264: H.264 Video Encoding2 Threads4 Threads8 Threads16 Threads32 Threads64 Threads128 Threads5070230826677662305.433292.225684753.87735.063431885348292045921054148.7443.830.9123.169199864313621539712.6719.7017326917375478.70232.19825.65145.85234.44465.60243484655.70304.9993.13194.81592.71353233.1411.069914461652978221165.431645.8713045376.36372.963690576718829133887925.6324.110.5312.5118388791526036535357.699.9173053835792239.22116.55419.1774.65142.97233.60465815127.80225.2458.84108.08204853.9617.7119092928910677009586.30822.5428440188.41190.17825112503137128088875614.0914.770.357.04362148116798760188180.674.98168107565014120.7958.39211.3642.8695.87117.51891024214.17131.6757.9862.80160.0389395.3533.72373571856920917149297.51413.425819994.4697.9922290274039845223382108.079.860.254.566920031331061128969992.082.50156214911529062.2229.30107.0927.5373.0959.35172471447.0497.7853.3343.6287.0647215964.34747513704941744957158.49213.5211946147.6654.354297469296704664718277643581645.497.900.193.00126678014512813211778447.311.25996429620898433.7214.8353.9618.0461.3730.42347653713.5978.2952.0233.9750.44272246119151861782698757836382.20111.4435409016.7128.481426232535894499653386315998766304.036.100.143.16183322117516116915084226.300.51265863238835717.636.2520.5313.6759.1812.35739458851.9559.0548.9227.1530.871633861561481148103212003769461.2188.6842850215.3324.3217882660765192105883631915893364704.276.710.154.241964128196181188180100418.620.45093923853401312.545.8916.9211.9747.629.031033519361.4765.1841.9024.1825.47154365148OpenBenchmarking.org

7-Zip Compression

Compress Speed Test

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed Test128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads30K60K90K120K150KSE +/- 2138.14, N = 12SE +/- 507.06, N = 3SE +/- 26.46, N = 3SE +/- 1281.79, N = 3SE +/- 126.48, N = 3SE +/- 2300.59, N = 4SE +/- 87.93, N = 3148114373575070747519914151861190921. (CXX) g++ options: -pipe -lpthread

Aircrack-ng

OpenBenchmarking.orgk/s, More Is BetterAircrack-ng 1.3128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads20K40K60K80K100KSE +/- 162.76, N = 3SE +/- 22.26, N = 3SE +/- 10.50, N = 3SE +/- 39.72, N = 3SE +/- 5.51, N = 3SE +/- 128.32, N = 3SE +/- 1.53, N = 3810321856923083704946167826992891. (CXX) g++ options: -O3 -fvisibility=hidden -masm=intel -lpthread -lcrypto -lz -ldl -lm -pthread

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2017-09-191024 Hash Memory, 26 Depth128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads30M60M90M120M150MSE +/- 434580.42, N = 3SE +/- 160168.42, N = 3SE +/- 7410.06, N = 3SE +/- 139208.51, N = 3SE +/- 43237.82, N = 3SE +/- 413953.10, N = 3SE +/- 31620.54, N = 31200376942091714926677664174495752978228757836310677009

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79Blend File: BMW27 - Compute: CPU-Only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads5001000150020002500SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 2.96, N = 3SE +/- 0.16, N = 3SE +/- 2.74, N = 3SE +/- 0.16, N = 3SE +/- 0.35, N = 361.21297.512305.43158.491165.4382.20586.30

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.79Blend File: Fishy Cat - Compute: CPU-Only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads7001400210028003500SE +/- 0.22, N = 3SE +/- 0.13, N = 3SE +/- 4.03, N = 3SE +/- 0.22, N = 3SE +/- 1.48, N = 3SE +/- 0.09, N = 3SE +/- 0.56, N = 388.68413.423292.22213.521645.87111.44822.54

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.28.0VGR Performance Metric128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads90K180K270K360K450K42850258199568411946113045354090284401. (CXX) g++ options: -std=c++98 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lX11 -lXext -lXrender -lpthread -ldl -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads160320480640800SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.81, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 315.3394.46753.8747.66376.3616.71188.411. (CC) gcc options: -lm -lpthread -O3

Chaos Group V-RAY

Mode: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterChaos Group V-RAY 1.1.0Mode: CPU128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads160320480640800SE +/- 0.25, N = 3SE +/- 0.36, N = 3SE +/- 2.58, N = 3SE +/- 0.09, N = 3SE +/- 0.65, N = 3SE +/- 0.11, N = 3SE +/- 0.88, N = 324.3297.99735.0654.35372.9628.48190.17

Cpuminer-Opt

Algorithm: m7m

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: m7m128 Threads32 Threads64 Threads400800120016002000SE +/- 4.26, N = 3SE +/- 26.00, N = 12SE +/- 6.78, N = 3178842914261. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: deep

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: deep128 Threads2 Threads32 Threads64 Threads6K12K18K24K30KSE +/- 3.33, N = 3SE +/- 1477.73, N = 9SE +/- 141.11, N = 3266073437469232531. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: lbry

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: lbry128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads14K28K42K56K70KSE +/- 666.47, N = 12SE +/- 297.38, N = 3SE +/- 41.69, N = 12SE +/- 151.44, N = 3SE +/- 33.32, N = 3SE +/- 4190.25, N = 9SE +/- 1187.09, N = 12651922229018852967036905894482511. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: skein

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: skein128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads20K40K60K80K100KSE +/- 1293.38, N = 3SE +/- 865.16, N = 12SE +/- 543.56, N = 9SE +/- 186.58, N = 3SE +/- 11.17, N = 3SE +/- 1073.89, N = 3SE +/- 5398.62, N = 1110588327403348246647576799653125031. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: myr-gr

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: myr-gr128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads14002800420056007000SE +/- 8.00, N = 3SE +/- 102.97, N = 12SE +/- 74.13, N = 12SE +/- 21.55, N = 3SE +/- 27.89, N = 3SE +/- 41.83, N = 12SE +/- 16.70, N = 7631998492018271882386313711. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: sha256t

OpenBenchmarking.orgkH/s - Hash Speed, More Is BetterCpuminer-Opt 3.8.8.1Algorithm: sha256t128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads30K60K90K120K150KSE +/- 8.82, N = 3SE +/- 1126.32, N = 12SE +/- 1.41, N = 3SE +/- 1331.47, N = 4SE +/- 15.80, N = 3SE +/- 775.55, N = 3SE +/- 373.51, N = 6158933522334592764359133159987280881. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads2K4K6K8K10KSE +/- 53.93, N = 4SE +/- 55.77, N = 4SE +/- 106.17, N = 4SE +/- 31.31, N = 4SE +/- 37.10, N = 4SE +/- 25.60, N = 4SE +/- 130.64, N = 464708210105418164887966308756

Darktable

Test: Boat - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Boat - Acceleration: CPU-only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads1122334455SE +/- 0.05, N = 3SE +/- 0.12, N = 5SE +/- 0.39, N = 3SE +/- 0.05, N = 3SE +/- 0.29, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 34.278.0748.745.4925.634.0314.09

Darktable

Test: Masskrug - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Masskrug - Acceleration: CPU-only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads1020304050SE +/- 0.10, N = 12SE +/- 0.10, N = 3SE +/- 0.74, N = 3SE +/- 0.07, N = 3SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.24, N = 46.719.8643.837.9024.116.1014.77

Darktable

Test: Server Rack - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Rack - Acceleration: CPU-only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads0.20480.40960.61440.81921.024SE +/- 0.00, N = 12SE +/- 0.01, N = 12SE +/- 0.00, N = 3SE +/- 0.00, N = 12SE +/- 0.01, N = 3SE +/- 0.00, N = 12SE +/- 0.01, N = 30.150.250.910.190.530.140.35

Darktable

Test: Server Room - Acceleration: CPU-only

OpenBenchmarking.orgSeconds, Fewer Is BetterDarktable 2.4.2Test: Server Room - Acceleration: CPU-only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads612182430SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 12SE +/- 0.14, N = 3SE +/- 0.01, N = 3SE +/- 0.13, N = 34.244.5623.163.0012.513.167.04

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads400K800K1200K1600K2000KSE +/- 24977.03, N = 12SE +/- 13784.45, N = 3SE +/- 69.17, N = 3SE +/- 17866.44, N = 3SE +/- 159.52, N = 3SE +/- 34473.66, N = 12SE +/- 5694.82, N = 4196412869200391998126678018388718332213621481. (CC) gcc options: -pthread -lpthread -O3 -march=native

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Swirl128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads4080120160200SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 2.89, N = 1219613364145911751161. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Sharpen128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads4080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 2.28, N = 121811063112852161791. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Enhanced128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads4080120160200SE +/- 0.67, N = 3SE +/- 0.48, N = 6SE +/- 0.88, N = 3SE +/- 3.25, N = 121881123613260169871. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-Gaussian128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads4080120160200SE +/- 1.69, N = 12180892111736150601. (CC) gcc options: -fopenmp -O2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lgomp -lpthread

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solver128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads2004006008001000SE +/- 2.42, N = 3SE +/- 29.16, N = 9SE +/- 41.17, N = 12SE +/- 1.69, N = 3SE +/- 27.03, N = 12SE +/- 2.41, N = 3SE +/- 50.31, N = 910046995397845358421881. (CC) gcc options: -O3 -mavx2

m-queens

Time To Solve

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.1Time To Solve128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads150300450600750SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 318.6292.08712.6747.31357.6926.30180.671. (CXX) g++ options: -fopenmp -O2 -march=native

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.13b1ATPase Simulation - 327,506 Atoms128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads510152025SE +/- 0.00061, N = 3SE +/- 0.00238, N = 3SE +/- 0.00958, N = 3SE +/- 0.00106, N = 3SE +/- 0.01087, N = 3SE +/- 0.00252, N = 3SE +/- 0.00206, N = 30.450932.5015619.701731.259969.917300.512654.98168

OpenSSL

RSA 4096-bit Performance

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit Performance128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads2K4K6K8K10KSE +/- 6.29, N = 3SE +/- 1.42, N = 3SE +/- 0.15, N = 3SE +/- 3.61, N = 3SE +/- 0.38, N = 3SE +/- 5.90, N = 3SE +/- 0.28, N = 3923821492694296538863210751. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

PostgreSQL pgbench

Scaling: Buffer Test - Test: Normal Load - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 10.3Scaling: Buffer Test - Test: Normal Load - Mode: Read Only128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads110K220K330K440K550KSE +/- 2806.07, N = 3SE +/- 391.56, N = 3SE +/- 189.48, N = 3SE +/- 1520.48, N = 3SE +/- 78.74, N = 3SE +/- 1037.30, N = 3SE +/- 1161.88, N = 35340131152901737520898435792388357650141. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -lcrypt -ldl -lm

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace Time128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads100200300400500SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.43, N = 3SE +/- 0.14, N = 3SE +/- 0.26, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 312.5462.22478.7033.72239.2217.63120.791. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

Primesieve

1e12 Prime Number Generation

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 7.11e12 Prime Number Generation128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads50100150200250SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.31, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 3SE +/- 0.01, N = 3SE +/- 0.09, N = 35.8929.30232.1914.83116.556.2558.391. (CXX) g++ options: -O3 -lpthread

Rodinia

Test: OpenMP LavaMD

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP LavaMD128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads2004006008001000SE +/- 0.29, N = 3SE +/- 0.16, N = 3SE +/- 0.62, N = 3SE +/- 0.02, N = 3SE +/- 0.49, N = 3SE +/- 0.18, N = 3SE +/- 0.66, N = 316.92107.09825.6553.96419.1720.53211.361. (CXX) g++ options: -O2 -lOpenCL

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 2.4Test: OpenMP CFD Solver128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads306090120150SE +/- 0.39, N = 12SE +/- 0.64, N = 12SE +/- 2.88, N = 3SE +/- 0.26, N = 12SE +/- 0.15, N = 3SE +/- 1.51, N = 10SE +/- 0.12, N = 311.9727.53145.8518.0474.6513.6742.861. (CXX) g++ options: -O2 -lOpenCL

Rust Mandelbrot

Time To Complete Serial/Parallel Mandelbrot

OpenBenchmarking.orgSeconds, Fewer Is BetterRust MandelbrotTime To Complete Serial/Parallel Mandelbrot128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads50100150200250SE +/- 0.08, N = 3SE +/- 0.25, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 3SE +/- 0.55, N = 3SE +/- 0.80, N = 3SE +/- 0.01, N = 347.6273.09234.4461.37142.9759.1895.871. (CC) gcc options: -m64 -pie -nodefaultlibs

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads100200300400500SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 3SE +/- 0.33, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 39.0359.35465.6030.42233.6012.35117.511. (CXX) g++ options: -fopenmp

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 9Total Time128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads20M40M60M80M100MSE +/- 611996.89, N = 3SE +/- 46467.43, N = 3SE +/- 16074.95, N = 3SE +/- 188666.06, N = 3SE +/- 13521.46, N = 3SE +/- 472851.76, N = 3SE +/- 27895.57, N = 31033519361724714424348463476537146581517394588589102421. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++11 -pedantic -O3 -msse -msse3 -mpopcnt -flto

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.98.9Total Time128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads1326395265SE +/- 0.03, N = 12SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 12SE +/- 0.23, N = 31.477.0455.703.5927.801.9514.171. (CC) gcc options: -m32 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

Tensorflow

Build: Cifar10

OpenBenchmarking.orgSeconds, Fewer Is BetterTensorflow 2017-02-03Build: Cifar10128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads70140210280350SE +/- 0.42, N = 3SE +/- 0.05, N = 3SE +/- 0.25, N = 3SE +/- 0.12, N = 3SE +/- 0.14, N = 3SE +/- 0.30, N = 3SE +/- 0.20, N = 365.1897.78304.9978.29225.2459.05131.67

Tesseract OCR

Time To OCR 7 Images

OpenBenchmarking.orgSeconds, Fewer Is BetterTesseract OCR 4.0.0-beta.1Time To OCR 7 Images128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads20406080100SE +/- 0.36, N = 3SE +/- 0.61, N = 3SE +/- 0.30, N = 3SE +/- 0.38, N = 3SE +/- 0.22, N = 3SE +/- 0.67, N = 3SE +/- 0.36, N = 341.9053.3393.1352.0258.8448.9257.98

Timed ImageMagick Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed ImageMagick Compilation 6.9.0Time To Compile128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads4080120160200SE +/- 0.41, N = 3SE +/- 0.12, N = 3SE +/- 0.22, N = 3SE +/- 0.20, N = 3SE +/- 0.40, N = 3SE +/- 0.26, N = 3SE +/- 0.47, N = 324.1843.62194.8133.97108.0827.1562.80

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 4.18Time To Compile128 Threads16 Threads2 Threads32 Threads64 Threads8 Threads130260390520650SE +/- 0.26, N = 12SE +/- 1.26, N = 3SE +/- 1.65, N = 3SE +/- 0.86, N = 4SE +/- 0.34, N = 12SE +/- 1.41, N = 325.4787.06592.7150.4430.87160.03

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To Compile128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads800160024003200400015447235322722048163893

TTSIOD 3D Renderer

Phong Rendering With Soft-Shadow Mapping

OpenBenchmarking.orgFPS, More Is BetterTTSIOD 3D Renderer 2.3bPhong Rendering With Soft-Shadow Mapping128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads80160240320400SE +/- 8.58, N = 9SE +/- 0.49, N = 3SE +/- 0.10, N = 3SE +/- 2.26, N = 3SE +/- 0.12, N = 3SE +/- 6.16, N = 12SE +/- 0.14, N = 3365.00159.0033.14246.0053.96386.0095.351. (CXX) g++ options: -O3 -fomit-frame-pointer -ffast-math -mtune=native -flto -msse -mrecip -mfpmath=sse -msse2 -mssse3 -lSDL -fopenmp -fwhole-program -lstdc++

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-07-28H.264 Video Encoding128 Threads16 Threads2 Threads32 Threads4 Threads64 Threads8 Threads306090120150SE +/- 0.57, N = 3SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.01, N = 3SE +/- 0.26, N = 3SE +/- 0.05, N = 3148.0064.3411.06119.0017.71156.0033.721. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize


Phoronix Test Suite v10.8.4