EPYC 7F72

2 x AMD EPYC 7F72 24-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and ASPEED on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012196-HA-EPYC7F72759
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Chess Test Suite 3 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 12 Tests
CPU Massive 19 Tests
Creator Workloads 13 Tests
Database Test Suite 5 Tests
Encoding 4 Tests
Fortran Tests 4 Tests
HPC - High Performance Computing 15 Tests
Imaging 3 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 8 Tests
Molecular Dynamics 3 Tests
MPI Benchmarks 4 Tests
Multi-Core 16 Tests
OpenMPI Tests 4 Tests
Programmer / Developer System Benchmarks 4 Tests
Python 2 Tests
Scientific Computing 6 Tests
Server 6 Tests
Server CPU Tests 11 Tests
Single-Threaded 6 Tests
Speech 2 Tests
Video Encoding 4 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Perf-Per
Dollar
Date
Triggered
  Test
  Duration
EPYC 7F72
December 10 2020
  12 Hours, 13 Minutes
AMD 7F72
December 11 2020
  11 Hours, 50 Minutes
AMD EPYC 7F72
December 11 2020
  11 Hours, 39 Minutes
AMD EPYC 7F72 2P
December 16 2020
  1 Day, 26 Minutes
EPYC 7F72 2P
December 17 2020
  23 Hours, 54 Minutes
7F72 2P
December 18 2020
  1 Day, 4 Hours, 52 Minutes
Invert Hiding All Results Option
  18 Hours, 49 Minutes
Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):


EPYC 7F72ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2PAMD EPYC 7F72 24-Core @ 3.20GHz (24 Cores / 48 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse64GB1000GB Western Digital WD_BLACK SN850 1TBllvmpipeVE2282 x Intel 10G X550TUbuntu 20.105.8.0-29-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)GCC 10.2.0ext41920x10802 x AMD EPYC 7F72 24-Core @ 3.20GHz (48 Cores / 96 Threads)126GBASPEEDOpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2PResult OverviewPhoronix Test Suite 10.2.2100%138%175%213%251%NCNNLevelDBHigh Performance Conjugate GradientNAMDBRL-CADGROMACSStockfishasmFishLAMMPS Molecular Dynamics SimulatorFFTETimed Linux Kernel CompilationKvazaarKeyDBTimed LLVM CompilationoneDNNTimed HMMer SearchAI Benchmark AlphaHuginBasis Universalx265InfluxDBLibRawMlpack BenchmarkRedisx264PostgreSQL pgbenchHPC ChallengeTimed Clash CompilationTNNBYTE Unix Benchmarkrav1eLZ4 CompressionNumpy BenchmarkPHPBenchCraftyWebP Image EncodeHierarchical INTegrationRNNoiseeSpeak-NG Speech EngineTensorFlow Lite

EPYC 7F72leveldb: Hot Readleveldb: Fill Syncleveldb: Fill Syncleveldb: Overwriteleveldb: Overwriteleveldb: Rand Fillleveldb: Rand Fillleveldb: Rand Readleveldb: Seek Randleveldb: Rand Deleteleveldb: Seq Fillleveldb: Seq Fillyquake2: Software CPU - 1920 x 1080hpcg: hpcc: G-HPLhpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthnamd: ATPase Simulation - 327,506 Atomsffte: N=256, 3D Complex FFT Routinehmmer: Pfam Database Searchlammps: 20k Atomslammps: Rhodopsin Proteinwebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressionbyte: Dhrystone 2compress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedlibraw: Post-Processing Benchmarkcrafty: Elapsed Timeonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Ultra Fastrav1e: 1rav1e: 5rav1e: 6rav1e: 10x264: H.264 Video Encodingx265: Bosphorus 4Kx265: Bosphorus 1080pstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthbuild-clash: Time To Compilebuild-linux-kernel: Time To Compilebuild-llvm: Time To Compilenumpy: espeak: Text-To-Speech Synthesisrnnoise: keydb: gromacs: Water Benchmarktensorflow-lite: SqueezeNettensorflow-lite: Inception V4tensorflow-lite: NASNet Mobiletensorflow-lite: Mobilenet Floattensorflow-lite: Mobilenet Quanttensorflow-lite: Inception ResNet V2pgbench: 1 - 1 - Read Onlypgbench: 1 - 1 - Read Only - Average Latencypgbench: 1 - 1 - Read Writepgbench: 1 - 1 - Read Write - Average Latencypgbench: 1 - 50 - Read Onlypgbench: 1 - 50 - Read Only - Average Latencypgbench: 1 - 100 - Read Onlypgbench: 1 - 100 - Read Only - Average Latencypgbench: 1 - 50 - Read Writepgbench: 1 - 50 - Read Write - Average Latencypgbench: 100 - 1 - Read Onlypgbench: 100 - 1 - Read Only - Average Latencypgbench: 1 - 100 - Read Writepgbench: 1 - 100 - Read Write - Average Latencypgbench: 100 - 1 - Read Writepgbench: 100 - 1 - Read Write - Average Latencypgbench: 100 - 50 - Read Onlypgbench: 100 - 50 - Read Only - Average Latencypgbench: 100 - 100 - Read Onlypgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 50 - Read Writepgbench: 100 - 50 - Read Write - Average Latencypgbench: 100 - 100 - Read Writepgbench: 100 - 100 - Read Write - Average Latencybasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3basis: UASTC Level 2 + RDO Post-Processinghugin: Panorama Photo Assistant + Stitching Timeredis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETncnn: CPU - squeezenetncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinytnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1indigobench: CPU - Bedroomindigobench: CPU - Supercarhint: FLOATai-benchmark: Device Inference Scoreai-benchmark: Device Training Scoreai-benchmark: Device AI Scorephpbench: PHP Benchmark Suitemlpack: scikit_icamlpack: scikit_qdamlpack: scikit_svmmlpack: scikit_linearridgeregressionbrl-cad: VGR Performance Metricinfluxdb: 4 - 10000 - 2,5000,1 - 10000influxdb: 64 - 10000 - 2,5000,1 - 10000EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P39.7724.51168.58023.1229.12123.3228.03839.86764.337209.85324.1220.32714.514.999587.254509.8723836.141007.757393.116270.031441.168692.710769566.4960.88169111181.73685310142.11515.78912.1291.6112.59619.0428.53839.28037455257.29777.4311308.950.7810606.349.7010630.935.1174063091.730512.778151.437820.5928042.849362.359463.150475.303206.377252.309611611.21925.9761613.91934.6860.5782261616.57931.2751.4023510.7310.9536.2137.1424.0041.9983.06142.440.3491.0371.3853.039178.7923.5360.615259754464349837462.28338.863298.145324.1332.79521.137424090.962.84789588.1134287710451760165.361482.01175663294190.03420190.4958472310.0596984520.143242120.663243360.041214546.68318050.5545516230.0915293180.189214792.329294863.39549.3607.57816.90127.182694.86950.0062134890.151658090.041295998.652038392.691495615.5921.2621.4410.049.5210.459.5812.294.0922.0736.4813.8210.3824.9531.05295.193275.5224.76910.309321206770.0579419601513347356879651.7440.0524.451.613379241197999.81339487.540.9904.51163.45523.2228.69723.2228.60740.38363.751210.65124.0220.80423.714.973187.310778.4312136.765538.175533.382200.030511.162762.7251510110.2590.87532111004.65873611142.13115.81611.7001.6022.58519.0188.54939.17037547272.99740.1611271.450.5410661.548.3310616.035.0173829161.733882.750041.426450.8390252.903422.341523.144995.359506.413902.306331613.70930.7141610.95930.7110.5752541606.75924.0241.3865110.7010.9136.0936.9723.9842.1382.46141.930.3491.0361.3863.032177.8623.5960.335342151764448741462.32339.054289.654321.8132.75921.126426284.572.83089393.6134740010364159959.661260.71179913293910.03420270.4938526120.0596987780.143243120.571243450.041214446.69418010.5565553200.0905305810.189215212.324295373.38949.5117.73316.98027.287693.89750.3251376509.231757206.721332610.561814383.801500640.3121.1022.2610.069.6110.359.5112.614.1321.5536.5613.8610.5124.5331.33292.901275.7034.77510.339321880722.6368920091523353256789751.8239.4924.491.593323941199592.41339130.140.6234.51162.71423.2228.70823.2228.31740.03064.871210.33424.0220.86714.515.445186.882979.4189536.438808.285553.299230.030371.157172.7303510456.2860.89277113220.19747808142.08015.77911.7051.6122.59119.0158.54839.34237624546.99780.3811314.649.3110548.848.9810685.734.9673592171.727652.782141.426690.5917022.836482.357263.138785.363306.328842.296721611.74929.8861605.90928.1150.5766571602.59932.4891.4034610.7310.9236.1537.0823.9342.1783.29142.030.3461.0371.3853.026177.7123.6060.255270402363564766461.62438.903294.113318.4732.71121.168424640.302.84689812.7134489310436260331.661596.21178590294620.03420090.4988420220.0596953440.144242420.630244460.041214246.74218070.5535514840.0915290660.189214812.328295523.38849.5797.60616.92627.210694.99550.4611379285.461723521.291317592.971897703.881475849.6821.8722.2010.089.9410.459.5012.544.1622.0137.4814.1010.5225.0330.75294.653275.3814.77010.314322432634.1313220071520352757356751.8439.8625.021.623418851197198.01297582.5112.4976.61594.93714.4736.37714.5731.876114.896191.741690.96315707.54430.3537100.7003320.8577039.2003015.918233.414320.043832.024180.8016711661.7760.44651186579.06690691187.89124.92422.5651.6112.57319.0108.51439.01338124375.59657.0411021.849.9410360.548.2110448.231.4274176211.557670.7846522.072470.7423230.8606792.305642.319475.492202.025801.436711177.12823.7451201.41838.9900.5182691109.29839.8520.88175215.9016.1956.4657.5634.3853.26126.24181.720.3481.0361.3782.972194.1920.4354.7098232488115398550483.40425.696209.247318.0832.79921.120296614.545.28763691.387101811764840228.641754.2758566286810.03520160.4968167560.06114132470.071223922.343231710.043202049.56517840.5616192190.0817082340.141233732.140277303.61251.9047.74612.39117.658687.08456.7851388985.481818648.351365777.401888968.341498592.4368.4346.4633.1831.7331.5336.4540.1811.4042.9554.1921.2411.0746.8945.78316.823274.327322779429.0787416621063272557290551.7254.5624.611.78632495951085.71328030.9112.6956.51630.93614.4736.55814.5734.818112.076191.127690.86814.9710.33330.1860100.3716720.6295739.0294016.456733.423790.043962.055120.826499370.0980.44642185994.09603175186.39624.92022.3851.6042.58018.7688.52339.27437519864.59652.0011114.349.6710409.948.8310473.331.1474190021.523950.8163062.030420.7522900.8688642.262692.365724.875231.991721.468411166.56848.2491182.17827.2230.5291861222.20822.7710.89033315.9416.1356.2957.3734.3153.22126.79180.390.3461.0301.3692.967193.4020.2054.6397582753116524954483.96425.796212.024322.4932.83921.080301799.885.27863239.287280911597640591.041926.2754906292570.03420010.5007975660.06314072710.071228221.917232950.043200749.87617940.5586255120.0807150450.140231212.164277203.61451.8547.80812.41417.601689.47156.4261380069.401814208.701368468.711996216.741580536.8139.1647.4432.6332.1829.7333.4547.1510.8440.6555.5530.4810.8944.5550.92299.447274.129322198292.3995516951078277357779653.0254.1824.511.75638663949471.81345908.1112.4046.41643.49014.4737.22714.5733.819113.380189.297670.01815.1702.12328.116199.0647020.0794335.9138716.123573.217820.042792.040330.802618695.0620.45475163435.01632332188.26224.54919.9621.6112.58319.1988.55639.35238477706.39632.3911168.749.2410402.748.0810602.431.9373859931.672890.8917762.090220.7921530.9281382.445382.307683.650622.107001.501201290.07907.0711334.58873.5900.5674971407.98882.0550.95990415.8316.0456.4357.3133.8452.76126.88181.260.3431.0161.3582.927194.9719.9953.8397376595116189232482.27525.950211.787320.6532.82921.139308314.455.25176438.696329614826345501.849006.2823112294520.03420070.4988254480.06113884650.072227621.981233960.043200250.00617660.5666336360.0797290930.137233602.141275203.63952.0497.78912.42117.704689.79958.6591508834.921784823.371345722.501852789.071519952.9343.0660.7834.7033.5032.4739.1243.8613.1657.4669.5624.8012.1456.4849.62322.302274.450322561482.0050816541075272957283353.7351.4624.551.70629706944430.61337280.4OpenBenchmarking.org

LevelDB

LevelDB is a key-value storage library developed by Google that supports making use of Snappy for data compression and has other modern features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot ReadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150SE +/- 0.13, N = 3SE +/- 0.50, N = 4SE +/- 0.14, N = 3SE +/- 1.04, N = 15SE +/- 1.54, N = 3SE +/- 1.40, N = 339.7740.9940.62112.50112.70112.401. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot ReadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P20406080100Min: 39.53 / Avg: 39.77 / Max: 39.97Min: 39.79 / Avg: 40.99 / Max: 42.09Min: 40.36 / Avg: 40.62 / Max: 40.85Min: 106.07 / Avg: 112.5 / Max: 121.45Min: 111.04 / Avg: 112.7 / Max: 115.77Min: 110.25 / Avg: 112.4 / Max: 115.031. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Fill SyncEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.07, N = 15SE +/- 0.06, N = 8SE +/- 0.09, N = 34.54.54.56.66.56.41. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Fill SyncEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P3691215Min: 4.5 / Avg: 4.5 / Max: 4.5Min: 4.5 / Avg: 4.5 / Max: 4.5Min: 4.5 / Avg: 4.5 / Max: 4.5Min: 6.2 / Avg: 6.61 / Max: 7.1Min: 6.3 / Avg: 6.46 / Max: 6.8Min: 6.3 / Avg: 6.43 / Max: 6.61. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Fill SyncEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P400800120016002000SE +/- 2.23, N = 3SE +/- 4.35, N = 3SE +/- 8.92, N = 3SE +/- 17.18, N = 15SE +/- 15.97, N = 8SE +/- 19.99, N = 31168.581163.461162.711594.941630.941643.491. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Fill SyncEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30060090012001500Min: 1164.76 / Avg: 1168.58 / Max: 1172.5Min: 1154.97 / Avg: 1163.46 / Max: 1169.36Min: 1144.88 / Avg: 1162.71 / Max: 1171.66Min: 1479.7 / Avg: 1594.94 / Max: 1697.79Min: 1535.64 / Avg: 1630.94 / Max: 1676.25Min: 1605.44 / Avg: 1643.49 / Max: 1673.121. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: OverwriteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P612182430SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 323.123.223.214.414.414.41. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: OverwriteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025Min: 23.1 / Avg: 23.13 / Max: 23.2Min: 23.1 / Avg: 23.2 / Max: 23.3Min: 23.1 / Avg: 23.2 / Max: 23.3Min: 14.4 / Avg: 14.43 / Max: 14.5Min: 14.4 / Avg: 14.4 / Max: 14.4Min: 14.4 / Avg: 14.4 / Max: 14.41. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: OverwriteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P160320480640800SE +/- 0.36, N = 3SE +/- 0.71, N = 3SE +/- 0.57, N = 3SE +/- 1.17, N = 3SE +/- 0.59, N = 3SE +/- 0.43, N = 3229.12228.70228.71736.38736.56737.231. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: OverwriteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P130260390520650Min: 228.43 / Avg: 229.12 / Max: 229.66Min: 227.46 / Avg: 228.7 / Max: 229.9Min: 227.57 / Avg: 228.71 / Max: 229.35Min: 734.2 / Avg: 736.38 / Max: 738.2Min: 735.96 / Avg: 736.56 / Max: 737.73Min: 736.45 / Avg: 737.23 / Max: 737.951. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Random FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P612182430SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 323.323.223.214.514.514.51. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Random FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025Min: 23.2 / Avg: 23.27 / Max: 23.3Min: 23.2 / Avg: 23.2 / Max: 23.2Min: 23.2 / Avg: 23.2 / Max: 23.2Min: 14.5 / Avg: 14.5 / Max: 14.5Min: 14.4 / Avg: 14.47 / Max: 14.5Min: 14.4 / Avg: 14.47 / Max: 14.61. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P160320480640800SE +/- 0.24, N = 3SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 1.35, N = 3SE +/- 2.04, N = 3SE +/- 3.93, N = 3228.04228.61228.32731.88734.82733.821. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P130260390520650Min: 227.7 / Avg: 228.04 / Max: 228.51Min: 228.28 / Avg: 228.61 / Max: 228.91Min: 228.13 / Avg: 228.32 / Max: 228.54Min: 729.22 / Avg: 731.88 / Max: 733.62Min: 731.97 / Avg: 734.82 / Max: 738.78Min: 726.01 / Avg: 733.82 / Max: 738.461. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random ReadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150SE +/- 0.18, N = 3SE +/- 0.44, N = 4SE +/- 0.21, N = 3SE +/- 1.04, N = 15SE +/- 0.91, N = 3SE +/- 1.08, N = 1539.8740.3840.03114.90112.08113.381. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random ReadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P20406080100Min: 39.63 / Avg: 39.87 / Max: 40.23Min: 39.65 / Avg: 40.38 / Max: 41.6Min: 39.66 / Avg: 40.03 / Max: 40.39Min: 106.18 / Avg: 114.9 / Max: 123.24Min: 110.46 / Avg: 112.08 / Max: 113.62Min: 107.54 / Avg: 113.38 / Max: 121.141. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek RandomEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P4080120160200SE +/- 0.66, N = 3SE +/- 0.14, N = 3SE +/- 0.52, N = 3SE +/- 1.65, N = 3SE +/- 0.59, N = 3SE +/- 0.57, N = 364.3463.7564.87191.74191.13189.301. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek RandomEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P4080120160200Min: 63.57 / Avg: 64.34 / Max: 65.65Min: 63.5 / Avg: 63.75 / Max: 63.98Min: 63.83 / Avg: 64.87 / Max: 65.4Min: 188.96 / Avg: 191.74 / Max: 194.68Min: 190.36 / Avg: 191.13 / Max: 192.28Min: 188.21 / Avg: 189.3 / Max: 190.121. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random DeleteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P150300450600750SE +/- 0.57, N = 3SE +/- 0.41, N = 3SE +/- 0.20, N = 3SE +/- 1.87, N = 3SE +/- 7.14, N = 3SE +/- 3.35, N = 3209.85210.65210.33690.96690.87670.021. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random DeleteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P120240360480600Min: 208.72 / Avg: 209.85 / Max: 210.59Min: 210.14 / Avg: 210.65 / Max: 211.45Min: 210.04 / Avg: 210.33 / Max: 210.71Min: 688.75 / Avg: 690.96 / Max: 694.69Min: 676.58 / Avg: 690.87 / Max: 698.1Min: 663.38 / Avg: 670.02 / Max: 674.151. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P612182430SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 324.124.024.015.014.915.11. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P612182430Min: 24.1 / Avg: 24.1 / Max: 24.1Min: 23.9 / Avg: 24 / Max: 24.1Min: 24 / Avg: 24.03 / Max: 24.1Min: 14.8 / Avg: 14.93 / Max: 15Min: 15 / Avg: 15.1 / Max: 15.21. (CXX) g++ options: -O3 -lsnappy -lpthread

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P150300450600750SE +/- 0.18, N = 3SE +/- 0.58, N = 3SE +/- 0.16, N = 3SE +/- 0.53, N = 3SE +/- 2.35, N = 3SE +/- 2.29, N = 3220.33220.80220.87707.54710.33702.121. (CXX) g++ options: -O3 -lsnappy -lpthread
OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential FillEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P130260390520650Min: 219.99 / Avg: 220.33 / Max: 220.6Min: 219.82 / Avg: 220.8 / Max: 221.82Min: 220.55 / Avg: 220.87 / Max: 221.04Min: 706.82 / Avg: 707.54 / Max: 708.58Min: 707.08 / Avg: 710.33 / Max: 714.89Min: 697.79 / Avg: 702.12 / Max: 705.61. (CXX) g++ options: -O3 -lsnappy -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 1080EPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.00, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 314.523.714.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC
OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 1080EPYC 7F72AMD 7F72AMD EPYC 7F72612182430Min: 14.5 / Avg: 14.5 / Max: 14.5Min: 23.6 / Avg: 23.73 / Max: 23.9Min: 14.5 / Avg: 14.53 / Max: 14.61. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

High Performance Conjugate Gradient

HPCG is the High Performance Conjugate Gradient and is a new scientific benchmark from Sandia National Lans focused for super-computer testing with modern real-world workloads compared to HPCC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P714212835SE +/- 0.25, N = 12SE +/- 0.32, N = 12SE +/- 0.17, N = 3SE +/- 0.30, N = 3SE +/- 0.02, N = 3SE +/- 0.41, N = 1215.0014.9715.4530.3530.1928.121. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi
OpenBenchmarking.orgGFLOP/s Per Core, More Is BetterHigh Performance Conjugate Gradient 3.1Performance Per CoreEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.14480.28960.43440.57920.7240.62500.62390.64350.63240.62890.58581. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGFLOP/s Per Thread, More Is BetterHigh Performance Conjugate Gradient 3.1Performance Per ThreadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.07240.14480.21720.28960.3620.31250.31190.32180.31620.31440.29291. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P714212835Min: 13.09 / Avg: 15 / Max: 15.67Min: 12.94 / Avg: 14.97 / Max: 15.76Min: 15.1 / Avg: 15.45 / Max: 15.65Min: 30.04 / Avg: 30.35 / Max: 30.94Min: 30.15 / Avg: 30.19 / Max: 30.22Min: 25.34 / Avg: 28.12 / Max: 30.131. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P20406080100SE +/- 0.39, N = 3SE +/- 0.30, N = 3SE +/- 0.42, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.20, N = 387.2587.3186.88100.70100.3799.061. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGFLOPS Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: G-HPLEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.8191.6382.4573.2764.0953.643.643.622.102.092.061. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGFLOPS Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: G-HPLEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.40950.8191.22851.6382.04751.821.821.811.051.051.031. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P20406080100Min: 86.63 / Avg: 87.25 / Max: 87.98Min: 86.73 / Avg: 87.31 / Max: 87.73Min: 86.05 / Avg: 86.88 / Max: 87.42Min: 100.62 / Avg: 100.7 / Max: 100.75Min: 100.18 / Avg: 100.37 / Max: 100.52Min: 98.84 / Avg: 99.06 / Max: 99.451. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025SE +/- 1.15527, N = 3SE +/- 0.56601, N = 3SE +/- 0.73072, N = 3SE +/- 0.18996, N = 3SE +/- 0.25898, N = 3SE +/- 0.86449, N = 39.872388.431219.4189520.8577020.6295720.079431. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGFLOPS Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: G-FfteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.09780.19560.29340.39120.4890.41130.35130.39250.43450.42980.41831. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGFLOPS Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: G-FfteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.04890.09780.14670.19560.24450.20570.17570.19620.21730.21490.20921. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025Min: 7.56 / Avg: 9.87 / Max: 11.05Min: 7.51 / Avg: 8.43 / Max: 9.46Min: 8.31 / Avg: 9.42 / Max: 10.8Min: 20.51 / Avg: 20.86 / Max: 21.16Min: 20.12 / Avg: 20.63 / Max: 20.96Min: 18.35 / Avg: 20.08 / Max: 20.971. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P918273645SE +/- 0.60, N = 3SE +/- 0.85, N = 3SE +/- 0.48, N = 3SE +/- 0.14, N = 3SE +/- 0.36, N = 3SE +/- 0.19, N = 336.1436.7736.4439.2039.0335.911. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGFLOPS Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: EP-DGEMMEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.34430.68861.03291.37721.72151.51001.53001.52000.81670.81310.74821. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGFLOPS Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: EP-DGEMMEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.17230.34460.51690.68920.86150.75290.76590.75910.40830.40660.37411. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P816243240Min: 35.08 / Avg: 36.14 / Max: 37.16Min: 35.57 / Avg: 36.77 / Max: 38.41Min: 35.49 / Avg: 36.44 / Max: 37.09Min: 38.93 / Avg: 39.2 / Max: 39.42Min: 38.32 / Avg: 39.03 / Max: 39.48Min: 35.53 / Avg: 35.91 / Max: 36.111. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P48121620SE +/- 0.43146, N = 3SE +/- 0.42362, N = 3SE +/- 0.32995, N = 3SE +/- 0.20382, N = 3SE +/- 0.32220, N = 3SE +/- 0.83051, N = 37.757398.175538.2855515.9182316.4567316.123571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGB/s Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: G-PtransEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.07770.15540.23310.31080.38850.32320.34060.34520.33160.34280.33591. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGB/s Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: G-PtransEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.03880.07760.11640.15520.1940.16160.17030.17260.16580.17140.16801. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P48121620Min: 7.29 / Avg: 7.76 / Max: 8.62Min: 7.33 / Avg: 8.18 / Max: 8.61Min: 7.63 / Avg: 8.29 / Max: 8.63Min: 15.64 / Avg: 15.92 / Max: 16.31Min: 15.92 / Avg: 16.46 / Max: 17.04Min: 14.46 / Avg: 16.12 / Max: 17.011. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.77041.54082.31123.08163.852SE +/- 0.13922, N = 3SE +/- 0.00767, N = 3SE +/- 0.06266, N = 3SE +/- 0.01458, N = 3SE +/- 0.00319, N = 3SE +/- 0.13098, N = 33.116273.382203.299233.414323.423793.217821. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGB/s Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: EP-STREAM TriadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.03170.06340.09510.12680.15850.12980.14090.13750.07110.07130.06701. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGB/s Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: EP-STREAM TriadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.01590.03180.04770.06360.07950.06490.07050.06870.03560.03570.03351. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.94 / Avg: 3.12 / Max: 3.39Min: 3.37 / Avg: 3.38 / Max: 3.39Min: 3.17 / Avg: 3.3 / Max: 3.36Min: 3.39 / Avg: 3.41 / Max: 3.43Min: 3.42 / Avg: 3.42 / Max: 3.43Min: 2.96 / Avg: 3.22 / Max: 3.361. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.00990.01980.02970.03960.0495SE +/- 0.00015, N = 3SE +/- 0.00105, N = 3SE +/- 0.00070, N = 3SE +/- 0.00017, N = 3SE +/- 0.00002, N = 3SE +/- 0.00060, N = 30.031440.030510.030370.043830.043960.042791. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGUP/s Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: G-Random AccessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.00030.00060.00090.00120.00150.00130.00130.00130.00090.00090.00091. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGUP/s Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: G-Random AccessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.00020.00040.00060.00080.0010.00070.00060.00060.00050.00050.00041. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P12345Min: 0.03 / Avg: 0.03 / Max: 0.03Min: 0.03 / Avg: 0.03 / Max: 0.03Min: 0.03 / Avg: 0.03 / Max: 0.03Min: 0.04 / Avg: 0.04 / Max: 0.04Min: 0.04 / Avg: 0.04 / Max: 0.04Min: 0.04 / Avg: 0.04 / Max: 0.041. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.46240.92481.38721.84962.312SE +/- 0.00865, N = 3SE +/- 0.01886, N = 3SE +/- 0.01464, N = 3SE +/- 0.01426, N = 3SE +/- 0.03528, N = 3SE +/- 0.01822, N = 31.168691.162761.157172.024182.055122.040331. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgusecs x Core, Fewer Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: Random Ring LatencyEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2040608010028.0527.9127.7797.1698.6597.941. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgusecs x Thread, Fewer Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: Random Ring LatencyEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P408012016020056.1055.8155.54194.32197.29195.871. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 1.16 / Avg: 1.17 / Max: 1.19Min: 1.13 / Avg: 1.16 / Max: 1.2Min: 1.13 / Avg: 1.16 / Max: 1.18Min: 2 / Avg: 2.02 / Max: 2.05Min: 1.99 / Avg: 2.06 / Max: 2.11Min: 2.01 / Avg: 2.04 / Max: 2.071. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.61431.22861.84292.45723.0715SE +/- 0.01911, N = 3SE +/- 0.06549, N = 3SE +/- 0.03283, N = 3SE +/- 0.00576, N = 3SE +/- 0.02292, N = 3SE +/- 0.00700, N = 32.710762.725152.730350.801670.826490.802611. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGB/s Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: Random Ring BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.02560.05120.07680.10240.1280.11290.11350.11380.01670.01720.01671. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgGB/s Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: Random Ring BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.01280.02560.03840.05120.0640.05650.05680.05690.00840.00860.00841. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.67 / Avg: 2.71 / Max: 2.74Min: 2.6 / Avg: 2.73 / Max: 2.81Min: 2.66 / Avg: 2.73 / Max: 2.76Min: 0.8 / Avg: 0.8 / Max: 0.81Min: 0.79 / Avg: 0.83 / Max: 0.87Min: 0.79 / Avg: 0.8 / Max: 0.811. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KSE +/- 564.95, N = 3SE +/- 760.03, N = 3SE +/- 1135.29, N = 3SE +/- 491.47, N = 3SE +/- 437.93, N = 3SE +/- 296.59, N = 39566.5010110.2610456.2911661.789370.108695.061. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgMB/s Per Core, More Is BetterHPC Challenge 1.5.0Performance Per Core - Test / Class: Max Ping Pong BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P90180270360450398.60421.26435.68242.95195.21181.151. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterHPC Challenge 1.5.0Performance Per Thread - Test / Class: Max Ping Pong BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250199.30210.63217.84121.4897.6190.571. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KMin: 8667.35 / Avg: 9566.5 / Max: 10608.64Min: 8702.82 / Avg: 10110.26 / Max: 11311.26Min: 8498.65 / Avg: 10456.29 / Max: 12431.28Min: 10682.36 / Avg: 11661.78 / Max: 12223.45Min: 8581.08 / Avg: 9370.1 / Max: 10093.9Min: 8135.84 / Avg: 8695.06 / Max: 9145.971. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.20090.40180.60270.80361.0045SE +/- 0.00562, N = 3SE +/- 0.00402, N = 3SE +/- 0.01077, N = 3SE +/- 0.00014, N = 3SE +/- 0.00027, N = 3SE +/- 0.00250, N = 30.881690.875320.892770.446510.446420.45475
OpenBenchmarking.orgdays/ns x Core, Fewer Is BetterNAMD 2.14Performance Per Core - ATPase Simulation - 327,506 AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P51015202521.1621.0121.4321.4321.4321.831. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgdays/ns x Thread, Fewer Is BetterNAMD 2.14Performance Per Thread - ATPase Simulation - 327,506 AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P102030405042.3242.0242.8542.8742.8643.661. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 0.87 / Avg: 0.88 / Max: 0.89Min: 0.87 / Avg: 0.88 / Max: 0.88Min: 0.88 / Avg: 0.89 / Max: 0.91Min: 0.45 / Avg: 0.45 / Max: 0.45Min: 0.45 / Avg: 0.45 / Max: 0.45Min: 0.45 / Avg: 0.45 / Max: 0.46

FFTE

FFTE is a package by Daisuke Takahashi to compute Discrete Fourier Transforms of 1-, 2- and 3- dimensional sequences of length (2^p)*(3^q)*(5^r). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P40K80K120K160K200KSE +/- 1183.09, N = 3SE +/- 1197.97, N = 4SE +/- 1183.24, N = 3SE +/- 2264.92, N = 4SE +/- 2641.44, N = 3SE +/- 2147.54, N = 15111181.74111004.66113220.20186579.07185994.10163435.021. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp
OpenBenchmarking.orgMFLOPS Per Core, More Is BetterFFTE 7.0Performance Per Core - N=256, 3D Complex FFT RoutineEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P100020003000400050004632.574625.194717.513887.063874.883404.901. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMFLOPS Per Thread, More Is BetterFFTE 7.0Performance Per Thread - N=256, 3D Complex FFT RoutineEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50010001500200025002316.292312.602358.751943.531937.441702.451. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30K60K90K120K150KMin: 109161.14 / Avg: 111181.74 / Max: 113258.33Min: 109385.83 / Avg: 111004.66 / Max: 114487.26Min: 111881.55 / Avg: 113220.2 / Max: 115579.54Min: 182616.87 / Avg: 186579.07 / Max: 192633.36Min: 182318.5 / Avg: 185994.1 / Max: 191118.08Min: 146573.22 / Avg: 163435.02 / Max: 174724.31. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P4080120160200SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.64, N = 3SE +/- 0.78, N = 3SE +/- 0.09, N = 3142.12142.13142.08187.89186.40188.261. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds x Core, Fewer Is BetterTimed HMMer Search 3.3.1Performance Per Core - Pfam Database SearchEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10K3410.763411.143409.929018.778947.019036.581. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgSeconds x Thread, Fewer Is BetterTimed HMMer Search 3.3.1Performance Per Thread - Pfam Database SearchEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P4K8K12K16K20K6821.526822.296819.8418037.5417894.0218073.151. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150Min: 141.96 / Avg: 142.11 / Max: 142.33Min: 142.09 / Avg: 142.13 / Max: 142.15Min: 141.82 / Avg: 142.08 / Max: 142.3Min: 186.64 / Avg: 187.89 / Max: 188.79Min: 185.31 / Avg: 186.4 / Max: 187.91Min: 188.11 / Avg: 188.26 / Max: 188.41. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P612182430SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 315.7915.8215.7824.9224.9224.551. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day Per Core, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Performance Per Core - Model: 20k AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.14830.29660.44490.59320.74150.65790.65900.65750.51930.51920.51141. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgns/day Per Thread, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Performance Per Thread - Model: 20k AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.07410.14820.22230.29640.37050.32890.32950.32870.25960.25960.25571. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P612182430Min: 15.75 / Avg: 15.79 / Max: 15.86Min: 15.71 / Avg: 15.82 / Max: 15.96Min: 15.76 / Avg: 15.78 / Max: 15.8Min: 24.88 / Avg: 24.92 / Max: 25Min: 24.81 / Avg: 24.92 / Max: 25Min: 24.37 / Avg: 24.55 / Max: 24.681. (CXX) g++ options: -O3 -pthread -lm

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.43, N = 15SE +/- 0.25, N = 3SE +/- 0.22, N = 512.1311.7011.7122.5722.3919.961. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day Per Core, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Performance Per Core - Model: Rhodopsin ProteinEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.11370.22740.34110.45480.56850.50540.48750.48770.47010.46640.41591. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgns/day Per Thread, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Performance Per Thread - Model: Rhodopsin ProteinEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.05690.11380.17070.22760.28450.25270.24380.24390.23510.23320.20791. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025Min: 11.96 / Avg: 12.13 / Max: 12.22Min: 11.55 / Avg: 11.7 / Max: 11.78Min: 11.67 / Avg: 11.71 / Max: 11.73Min: 20.17 / Avg: 22.57 / Max: 24.68Min: 22.1 / Avg: 22.39 / Max: 22.88Min: 19.52 / Avg: 19.96 / Max: 20.791. (CXX) g++ options: -O3 -pthread -lm

WebP Image Encode

This is a test of Google's libwebp with the cwebp image encode utility and using a sample 6000x4000 pixel JPEG image as the input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.36270.72541.08811.45081.8135SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 31.6111.6021.6121.6111.6041.6111. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds x Core, Fewer Is BetterWebP Image Encode 1.1Performance Per Core - Encode Settings: DefaultEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2040608010038.6638.4538.6977.3376.9977.331. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgEncode Time - Seconds x Thread, Fewer Is BetterWebP Image Encode 1.1Performance Per Thread - Encode Settings: DefaultEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30609012015077.3376.9077.38154.66153.98154.661. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 1.6 / Avg: 1.61 / Max: 1.62Min: 1.6 / Avg: 1.6 / Max: 1.61Min: 1.61 / Avg: 1.61 / Max: 1.62Min: 1.61 / Avg: 1.61 / Max: 1.61Min: 1.6 / Avg: 1.6 / Max: 1.61Min: 1.61 / Avg: 1.61 / Max: 1.611. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.58411.16821.75232.33642.9205SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.003, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 32.5962.5852.5912.5732.5802.5831. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds x Core, Fewer Is BetterWebP Image Encode 1.1Performance Per Core - Encode Settings: Quality 100EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30609012015062.3062.0462.18123.50123.84123.981. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgEncode Time - Seconds x Thread, Fewer Is BetterWebP Image Encode 1.1Performance Per Thread - Encode Settings: Quality 100EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250124.61124.08124.37247.01247.68247.971. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.59 / Avg: 2.6 / Max: 2.6Min: 2.58 / Avg: 2.58 / Max: 2.59Min: 2.58 / Avg: 2.59 / Max: 2.6Min: 2.57 / Avg: 2.57 / Max: 2.57Min: 2.58 / Avg: 2.58 / Max: 2.58Min: 2.58 / Avg: 2.58 / Max: 2.591. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 319.0419.0219.0219.0118.7719.201. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds x Core, Fewer Is BetterWebP Image Encode 1.1Performance Per Core - Encode Settings: Quality 100, LosslessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2004006008001000457.01456.43456.36912.48900.86921.501. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgEncode Time - Seconds x Thread, Fewer Is BetterWebP Image Encode 1.1Performance Per Thread - Encode Settings: Quality 100, LosslessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P400800120016002000914.02912.86912.721824.961801.731843.011. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P510152025Min: 19.02 / Avg: 19.04 / Max: 19.08Min: 19 / Avg: 19.02 / Max: 19.05Min: 19.01 / Avg: 19.02 / Max: 19.03Min: 18.92 / Avg: 19.01 / Max: 19.08Min: 18.62 / Avg: 18.77 / Max: 19.01Min: 19.07 / Avg: 19.2 / Max: 19.371. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810SE +/- 0.004, N = 3SE +/- 0.021, N = 3SE +/- 0.001, N = 3SE +/- 0.006, N = 3SE +/- 0.009, N = 3SE +/- 0.027, N = 38.5388.5498.5488.5148.5238.5561. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds x Core, Fewer Is BetterWebP Image Encode 1.1Performance Per Core - Encode Settings: Quality 100, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P90180270360450204.91205.18205.15408.67409.10410.691. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgEncode Time - Seconds x Thread, Fewer Is BetterWebP Image Encode 1.1Performance Per Thread - Encode Settings: Quality 100, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2004006008001000409.82410.35410.30817.34818.21821.381. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P3691215Min: 8.53 / Avg: 8.54 / Max: 8.55Min: 8.53 / Avg: 8.55 / Max: 8.59Min: 8.55 / Avg: 8.55 / Max: 8.55Min: 8.51 / Avg: 8.51 / Max: 8.53Min: 8.51 / Avg: 8.52 / Max: 8.54Min: 8.53 / Avg: 8.56 / Max: 8.611. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P918273645SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 339.2839.1739.3439.0139.2739.351. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff
OpenBenchmarking.orgEncode Time - Seconds x Core, Fewer Is BetterWebP Image Encode 1.1Performance Per Core - Encode Settings: Quality 100, Lossless, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P400800120016002000942.72940.08944.211872.621885.151888.901. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgEncode Time - Seconds x Thread, Fewer Is BetterWebP Image Encode 1.1Performance Per Thread - Encode Settings: Quality 100, Lossless, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P80016002400320040001885.441880.161888.423745.253770.303777.791. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P816243240Min: 39.1 / Avg: 39.28 / Max: 39.38Min: 39.09 / Avg: 39.17 / Max: 39.33Min: 39.17 / Avg: 39.34 / Max: 39.49Min: 38.96 / Avg: 39.01 / Max: 39.11Min: 39.25 / Avg: 39.27 / Max: 39.32Min: 39.1 / Avg: 39.35 / Max: 39.571. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

BYTE Unix Benchmark

This is a test of BYTE. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P8M16M24M32M40MSE +/- 295946.74, N = 12SE +/- 231062.98, N = 3SE +/- 93255.90, N = 3SE +/- 218663.85, N = 3SE +/- 455276.37, N = 3SE +/- 523722.43, N = 337455257.237547272.937624546.938124375.537519864.538477706.3
OpenBenchmarking.orgLPS Per Core, More Is BetterBYTE Unix Benchmark 3.6Performance Per Core - Computational Test: Dhrystone 2EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P300K600K900K1200K1500K1560635.721564469.701567689.45794257.82781663.84801618.881. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgLPS Per Thread, More Is BetterBYTE Unix Benchmark 3.6Performance Per Thread - Computational Test: Dhrystone 2EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P200K400K600K800K1000K780317.86782234.85783844.73397128.91390831.92400809.441. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2EPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P7M14M21M28M35MMin: 36026099.6 / Avg: 37455257.24 / Max: 39383837.8Min: 37101722.6 / Avg: 37547272.9 / Max: 37876274.2Min: 37523480.1 / Avg: 37624546.93 / Max: 37810834.2Min: 37764703.2 / Avg: 38124375.47 / Max: 38519661.2Min: 36709387.9 / Avg: 37519864.53 / Max: 38284512.7Min: 37786879.9 / Avg: 38477706.33 / Max: 39504973.4

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KSE +/- 29.36, N = 3SE +/- 22.56, N = 3SE +/- 17.93, N = 3SE +/- 55.92, N = 3SE +/- 79.54, N = 3SE +/- 45.18, N = 39777.439740.169780.389657.049652.009632.391. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s Per Core, More Is BetterLZ4 Compression 1.9.3Performance Per Core - Compression Level: 1 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P90180270360450407.39405.84407.52201.19201.08200.671. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterLZ4 Compression 1.9.3Performance Per Thread - Compression Level: 1 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P4080120160200203.70202.92203.76100.59100.54100.341. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KMin: 9722.63 / Avg: 9777.43 / Max: 9823.08Min: 9710.44 / Avg: 9740.16 / Max: 9784.42Min: 9751.48 / Avg: 9780.38 / Max: 9813.23Min: 9566.8 / Avg: 9657.04 / Max: 9759.38Min: 9547.98 / Avg: 9652 / Max: 9808.23Min: 9545.13 / Avg: 9632.39 / Max: 9696.331. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KSE +/- 35.72, N = 3SE +/- 3.46, N = 3SE +/- 28.90, N = 3SE +/- 47.03, N = 3SE +/- 54.20, N = 3SE +/- 80.64, N = 311308.911271.411314.611021.811114.311168.71. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s Per Core, More Is BetterLZ4 Compression 1.9.3Performance Per Core - Compression Level: 1 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P100200300400500471.20469.64471.44229.62231.55232.681. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterLZ4 Compression 1.9.3Performance Per Thread - Compression Level: 1 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250235.60234.82235.72114.81115.77116.341. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KMin: 11273.1 / Avg: 11308.87 / Max: 11380.3Min: 11266.3 / Avg: 11271.4 / Max: 11278Min: 11258.5 / Avg: 11314.6 / Max: 11354.7Min: 10963.6 / Avg: 11021.8 / Max: 11114.9Min: 11007 / Avg: 11114.3 / Max: 11181.3Min: 11014.5 / Avg: 11168.7 / Max: 11286.71. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1122334455SE +/- 0.55, N = 4SE +/- 0.36, N = 3SE +/- 0.05, N = 3SE +/- 0.49, N = 6SE +/- 0.61, N = 4SE +/- 0.56, N = 1550.7850.5449.3149.9449.6749.241. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s Per Core, More Is BetterLZ4 Compression 1.9.3Performance Per Core - Compression Level: 3 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.4770.9541.4311.9082.3852.122.112.051.041.031.031. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterLZ4 Compression 1.9.3Performance Per Thread - Compression Level: 3 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.23850.4770.71550.9541.19251.06001.05001.03000.52020.51740.51291. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1020304050Min: 49.99 / Avg: 50.78 / Max: 52.35Min: 49.82 / Avg: 50.54 / Max: 50.94Min: 49.22 / Avg: 49.31 / Max: 49.36Min: 49.12 / Avg: 49.94 / Max: 51.59Min: 49 / Avg: 49.67 / Max: 51.5Min: 44.07 / Avg: 49.24 / Max: 51.231. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KSE +/- 37.48, N = 4SE +/- 1.64, N = 3SE +/- 18.01, N = 3SE +/- 35.35, N = 6SE +/- 72.96, N = 4SE +/- 35.55, N = 1510606.310661.510548.810360.510409.910402.71. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s Per Core, More Is BetterLZ4 Compression 1.9.3Performance Per Core - Compression Level: 3 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P100200300400500441.93444.23439.53215.84216.87216.721. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterLZ4 Compression 1.9.3Performance Per Thread - Compression Level: 3 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250220.96222.11219.77107.92108.44108.361. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KMin: 10520 / Avg: 10606.33 / Max: 10683.5Min: 10658.3 / Avg: 10661.47 / Max: 10663.8Min: 10513.1 / Avg: 10548.77 / Max: 10571Min: 10307.4 / Avg: 10360.48 / Max: 10530.7Min: 10265.2 / Avg: 10409.85 / Max: 10537.8Min: 10135.6 / Avg: 10402.73 / Max: 10629.11. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1122334455SE +/- 0.51, N = 5SE +/- 0.54, N = 5SE +/- 0.42, N = 3SE +/- 0.41, N = 3SE +/- 0.62, N = 3SE +/- 0.31, N = 349.7048.3348.9848.2148.8348.081. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s Per Core, More Is BetterLZ4 Compression 1.9.3Performance Per Core - Compression Level: 9 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.46580.93161.39741.86322.3292.072.012.041.001.021.001. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterLZ4 Compression 1.9.3Performance Per Thread - Compression Level: 9 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.2340.4680.7020.9361.171.04001.01001.02000.50220.50860.50081. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1020304050Min: 48.53 / Avg: 49.7 / Max: 50.9Min: 46.91 / Avg: 48.33 / Max: 49.98Min: 48.14 / Avg: 48.98 / Max: 49.48Min: 47.75 / Avg: 48.21 / Max: 49.03Min: 47.77 / Avg: 48.83 / Max: 49.91Min: 47.46 / Avg: 48.08 / Max: 48.471. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KSE +/- 28.09, N = 5SE +/- 22.43, N = 5SE +/- 8.87, N = 3SE +/- 50.17, N = 3SE +/- 99.24, N = 3SE +/- 45.57, N = 310630.910616.010685.710448.210473.310602.41. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s Per Core, More Is BetterLZ4 Compression 1.9.3Performance Per Core - Compression Level: 9 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P100200300400500442.95442.33445.24217.67218.19220.881. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMB/s Per Thread, More Is BetterLZ4 Compression 1.9.3Performance Per Thread - Compression Level: 9 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250221.48221.17222.62108.84109.10110.441. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2K4K6K8K10KMin: 10579.5 / Avg: 10630.88 / Max: 10710.9Min: 10577.3 / Avg: 10616.04 / Max: 10671Min: 10668 / Avg: 10685.7 / Max: 10695.7Min: 10355.3 / Avg: 10448.2 / Max: 10527.5Min: 10305.6 / Avg: 10473.27 / Max: 10649.1Min: 10514.3 / Avg: 10602.43 / Max: 10666.61. (CC) gcc options: -O3

LibRaw

LibRaw is a RAW image decoder for digital camera photos. This test profile runs LibRaw's post-processing benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P816243240SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.32, N = 3SE +/- 0.18, N = 335.1135.0134.9631.4231.1431.931. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
OpenBenchmarking.orgMpix/sec Per Core, More Is BetterLibRaw 0.20Performance Per Core - Post-Processing BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.32850.6570.98551.3141.64251.46001.46001.46000.65460.64880.66521. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgMpix/sec Per Thread, More Is BetterLibRaw 0.20Performance Per Thread - Post-Processing BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.16460.32920.49380.65840.8230.73150.72940.72830.32730.32440.33261. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P816243240Min: 34.99 / Avg: 35.11 / Max: 35.32Min: 34.83 / Avg: 35.01 / Max: 35.18Min: 34.84 / Avg: 34.96 / Max: 35.07Min: 31.23 / Avg: 31.42 / Max: 31.68Min: 30.51 / Avg: 31.14 / Max: 31.54Min: 31.63 / Avg: 31.93 / Max: 32.241. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1.6M3.2M4.8M6.4M8MSE +/- 14432.27, N = 3SE +/- 15235.97, N = 3SE +/- 62574.81, N = 3SE +/- 1368.48, N = 3SE +/- 4184.78, N = 3SE +/- 15876.84, N = 37406309738291673592177417621741900273859931. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
OpenBenchmarking.orgNodes Per Second Per Core, More Is BetterCrafty 25.2Performance Per Core - Elapsed TimeEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P70K140K210K280K350K308596.21307621.50306634.04154533.77154562.54153874.851. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgNodes Per Second Per Thread, More Is BetterCrafty 25.2Performance Per Thread - Elapsed TimeEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30K60K90K120K150K154298.10153810.75153317.0277266.8977281.2776937.431. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1.3M2.6M3.9M5.2M6.5MMin: 7383037 / Avg: 7406309.33 / Max: 7432733Min: 7361528 / Avg: 7382916.33 / Max: 7412407Min: 7235678 / Avg: 7359216.67 / Max: 7438320Min: 7415815 / Avg: 7417621 / Max: 7420305Min: 7410787 / Avg: 7419001.67 / Max: 7424497Min: 7354988 / Avg: 7385993.33 / Max: 74074311. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.39010.78021.17031.56041.9505SE +/- 0.00365, N = 3SE +/- 0.01016, N = 3SE +/- 0.00210, N = 3SE +/- 0.01626, N = 5SE +/- 0.01312, N = 3SE +/- 0.01869, N = 51.730511.733881.727651.557671.523951.67289MIN: 1.58MIN: 1.57MIN: 1.58MIN: 1.3MIN: 1.3MIN: 1.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2040608010041.5341.6141.4674.7773.1580.301. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P408012016020083.0683.2382.93149.54146.30160.601. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 1.73 / Avg: 1.73 / Max: 1.74Min: 1.72 / Avg: 1.73 / Max: 1.75Min: 1.73 / Avg: 1.73 / Max: 1.73Min: 1.53 / Avg: 1.56 / Max: 1.62Min: 1.5 / Avg: 1.52 / Max: 1.55Min: 1.62 / Avg: 1.67 / Max: 1.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.6261.2521.8782.5043.13SE +/- 0.011681, N = 3SE +/- 0.017798, N = 3SE +/- 0.013730, N = 3SE +/- 0.007882, N = 6SE +/- 0.009922, N = 3SE +/- 0.008788, N = 32.7781502.7500402.7821400.7846520.8163060.891776MIN: 2.48MIN: 2.48MIN: 2.48MIN: 0.67MIN: 0.69MIN: 0.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P153045607566.6866.0066.7737.6639.1842.811. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150133.35132.00133.5475.3378.3785.611. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.76 / Avg: 2.78 / Max: 2.8Min: 2.73 / Avg: 2.75 / Max: 2.79Min: 2.76 / Avg: 2.78 / Max: 2.81Min: 0.76 / Avg: 0.78 / Max: 0.82Min: 0.8 / Avg: 0.82 / Max: 0.83Min: 0.87 / Avg: 0.89 / Max: 0.91. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.47030.94061.41091.88122.3515SE +/- 0.00957, N = 3SE +/- 0.00516, N = 3SE +/- 0.00056, N = 3SE +/- 0.02566, N = 4SE +/- 0.01974, N = 6SE +/- 0.01869, N = 71.437821.426451.426692.072472.030422.09022MIN: 1.33MIN: 1.33MIN: 1.32MIN: 1.66MIN: 1.66MIN: 1.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2040608010034.5134.2434.2499.4897.46100.331. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P408012016020069.0268.4768.48198.96194.92200.661. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 1.42 / Avg: 1.44 / Max: 1.45Min: 1.42 / Avg: 1.43 / Max: 1.44Min: 1.43 / Avg: 1.43 / Max: 1.43Min: 2.02 / Avg: 2.07 / Max: 2.14Min: 1.96 / Avg: 2.03 / Max: 2.08Min: 2.01 / Avg: 2.09 / Max: 2.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.18880.37760.56640.75520.944SE +/- 0.002974, N = 3SE +/- 0.017578, N = 12SE +/- 0.000920, N = 3SE +/- 0.002965, N = 3SE +/- 0.009511, N = 3SE +/- 0.001315, N = 30.5928040.8390250.5917020.7423230.7522900.792153MIN: 0.51MIN: 0.64MIN: 0.5MIN: 0.67MIN: 0.67MIN: 0.661. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P91827364514.2320.1414.2035.6336.1138.021. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2040608010028.4640.2728.4071.2672.2276.051. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 0.59 / Avg: 0.59 / Max: 0.6Min: 0.74 / Avg: 0.84 / Max: 0.93Min: 0.59 / Avg: 0.59 / Max: 0.59Min: 0.74 / Avg: 0.74 / Max: 0.75Min: 0.73 / Avg: 0.75 / Max: 0.76Min: 0.79 / Avg: 0.79 / Max: 0.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.65331.30661.95992.61323.2665SE +/- 0.008471, N = 3SE +/- 0.008767, N = 3SE +/- 0.018414, N = 3SE +/- 0.011932, N = 3SE +/- 0.002168, N = 3SE +/- 0.005045, N = 32.8493602.9034202.8364800.8606790.8688640.928138MIN: 2.46MIN: 2.52MIN: 2.45MIN: 0.79MIN: 0.79MIN: 0.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P163248648068.3969.6868.0841.3141.7144.551. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150136.77139.36136.1582.6383.4189.101. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.83 / Avg: 2.85 / Max: 2.86Min: 2.89 / Avg: 2.9 / Max: 2.92Min: 2.81 / Avg: 2.84 / Max: 2.87Min: 0.84 / Avg: 0.86 / Max: 0.88Min: 0.86 / Avg: 0.87 / Max: 0.87Min: 0.92 / Avg: 0.93 / Max: 0.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.55021.10041.65062.20082.751SE +/- 0.00293, N = 3SE +/- 0.02163, N = 3SE +/- 0.01986, N = 3SE +/- 0.03458, N = 12SE +/- 0.02846, N = 15SE +/- 0.03614, N = 152.359462.341522.357262.305642.262692.44538MIN: 2.12MIN: 2.1MIN: 2.1MIN: 1.86MIN: 1.86MIN: 1.831. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30609012015056.6356.2056.57110.67108.61117.381. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250113.25112.39113.15221.34217.22234.761. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.35 / Avg: 2.36 / Max: 2.36Min: 2.3 / Avg: 2.34 / Max: 2.38Min: 2.34 / Avg: 2.36 / Max: 2.4Min: 2.1 / Avg: 2.31 / Max: 2.44Min: 2.08 / Avg: 2.26 / Max: 2.44Min: 2.12 / Avg: 2.45 / Max: 2.611. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.70891.41782.12672.83563.5445SE +/- 0.01172, N = 3SE +/- 0.01746, N = 3SE +/- 0.01523, N = 3SE +/- 0.05452, N = 12SE +/- 0.02839, N = 3SE +/- 0.03863, N = 153.150473.144993.138782.319472.365722.30768MIN: 2.98MIN: 2.97MIN: 2.98MIN: 1.96MIN: 2.01MIN: 1.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30609012015075.6175.4875.33111.34113.56110.771. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P50100150200250151.22150.96150.66222.67227.11221.541. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 3.13 / Avg: 3.15 / Max: 3.17Min: 3.11 / Avg: 3.14 / Max: 3.16Min: 3.11 / Avg: 3.14 / Max: 3.17Min: 2.09 / Avg: 2.32 / Max: 2.7Min: 2.32 / Avg: 2.37 / Max: 2.42Min: 2.13 / Avg: 2.31 / Max: 2.571. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P1.23572.47143.70714.94286.1785SE +/- 0.01527, N = 3SE +/- 0.02370, N = 3SE +/- 0.03533, N = 3SE +/- 0.51653, N = 14SE +/- 0.48144, N = 12SE +/- 0.04554, N = 35.303205.359505.363305.492204.875233.65062MIN: 4.97MIN: 4.96MIN: 4.96MIN: 2.85MIN: 2.89MIN: 2.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P60120180240300127.28128.63128.72263.63234.01175.231. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P110220330440550254.55257.26257.44527.25468.02350.461. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 5.27 / Avg: 5.3 / Max: 5.33Min: 5.31 / Avg: 5.36 / Max: 5.38Min: 5.3 / Avg: 5.36 / Max: 5.42Min: 3.26 / Avg: 5.49 / Max: 7.99Min: 3.2 / Avg: 4.88 / Max: 7.49Min: 3.59 / Avg: 3.65 / Max: 3.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810SE +/- 0.00849, N = 3SE +/- 0.01141, N = 3SE +/- 0.03710, N = 3SE +/- 0.01946, N = 3SE +/- 0.01945, N = 3SE +/- 0.01009, N = 36.377256.413906.328842.025801.991722.10700MIN: 5.79MIN: 5.78MIN: 5.76MIN: 1.87MIN: 1.86MIN: 1.861. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150153.05153.93151.8997.2495.60101.141. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P70140210280350306.11307.87303.78194.48191.21202.271. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P3691215Min: 6.37 / Avg: 6.38 / Max: 6.39Min: 6.39 / Avg: 6.41 / Max: 6.43Min: 6.26 / Avg: 6.33 / Max: 6.38Min: 1.99 / Avg: 2.03 / Max: 2.05Min: 1.95 / Avg: 1.99 / Max: 2.02Min: 2.09 / Avg: 2.11 / Max: 2.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P0.51971.03941.55912.07882.5985SE +/- 0.00115, N = 3SE +/- 0.00978, N = 3SE +/- 0.00103, N = 3SE +/- 0.00707, N = 3SE +/- 0.01842, N = 15SE +/- 0.01121, N = 152.309612.306332.296721.436711.468411.50120MIN: 2.18MIN: 2.18MIN: 2.18MIN: 1.28MIN: 1.27MIN: 1.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P163248648055.4355.3555.1268.9670.4872.061. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P306090120150110.86110.70110.24137.92140.97144.121. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P246810Min: 2.31 / Avg: 2.31 / Max: 2.31Min: 2.29 / Avg: 2.31 / Max: 2.32Min: 2.29 / Avg: 2.3 / Max: 2.3Min: 1.42 / Avg: 1.44 / Max: 1.44Min: 1.39 / Avg: 1.47 / Max: 1.64Min: 1.46 / Avg: 1.5 / Max: 1.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30060090012001500SE +/- 2.75, N = 3SE +/- 10.75, N = 3SE +/- 3.90, N = 3SE +/- 20.62, N = 14SE +/- 20.53, N = 15SE +/- 11.72, N = 31611.211613.701611.741177.121166.561290.07MIN: 1572.74MIN: 1565.85MIN: 1572.94MIN: 1074.6MIN: 1069.01MIN: 1195.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P13K26K39K52K65K38669.0438728.8038681.7656501.7655994.8861923.361. EPYC 7F72: Detected core count of 242. AMD 7F72: Detected core count of 243. AMD EPYC 7F72: Detected core count of 244. AMD EPYC 7F72 2P: Detected core count of 485. EPYC 7F72 2P: Detected core count of 486. 7F72 2P: Detected core count of 48
OpenBenchmarking.orgms x Thread, Fewer Is BetteroneDNN 2.0Performance Per Thread - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30K60K90K120K150K77338.0877457.6077363.52113003.52111989.76123846.721. EPYC 7F72: Detected thread count of 482. AMD 7F72: Detected thread count of 483. AMD EPYC 7F72: Detected thread count of 484. AMD EPYC 7F72 2P: Detected thread count of 965. EPYC 7F72 2P: Detected thread count of 966. 7F72 2P: Detected thread count of 96
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P30060090012001500Min: 1606.72 / Avg: 1611.21 / Max: 1616.2Min: 1593.97 / Avg: 1613.7 / Max: 1630.96Min: 1607.18 / Avg: 1611.74 / Max: 1619.5Min: 1106.82 / Avg: 1177.12 / Max: 1397.87Min: 1105.59 / Avg: 1166.56 / Max: 1377.85Min: 1278.32 / Avg: 1290.07 / Max: 1313.511. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P2004006008001000SE +/- 1.31, N = 3SE +/- 0.29, N = 3SE +/- 7.20, N = 3SE +/- 14.28, N = 15SE +/- 16.69, N = 15SE +/- 15.65, N = 15925.98930.71929.89823.75848.25907.07MIN: 896.94MIN: 898.24MIN: 896.55MIN: 700.39MIN: 699.46MIN: 758.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms x Core, Fewer Is BetteroneDNN 2.0Performance Per Core - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 2PEPYC 7F72 2P7F72 2P9K18K27K36K45K