EPYC 7F72

AMD EPYC 7F72 24-Core testing with a Supermicro H11DSi-NT v2.00 (2.1 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012129-HA-EPYC7F72896&grr.

EPYC 7F72ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen ResolutionEPYC 7F72AMD 7F72AMD EPYC 7F72AMD EPYC 7F72 24-Core @ 3.20GHz (24 Cores / 48 Threads)Supermicro H11DSi-NT v2.00 (2.1 BIOS)AMD Starship/Matisse64GB1000GB Western Digital WD_BLACK SN850 1TBllvmpipeVE2282 x Intel 10G X550TUbuntu 20.105.8.0-29-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034 Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7F72hpcc: G-HPLbasis: UASTC Level 2 + RDO Post-Processinglammps: 20k Atomshpcg: build-clash: Time To Compileai-benchmark: Device AI Scoreai-benchmark: Device Training Scoreai-benchmark: Device Inference Scorebuild-llvm: Time To Compilebrl-cad: VGR Performance Metricnumpy: hint: FLOATbyte: Dhrystone 2mlpack: scikit_qdaasmfish: 1024 Hash Memory, 26 Depthstockfish: Total Timehmmer: Pfam Database Searchleveldb: Seq Fillleveldb: Seq Fillleveldb: Rand Deleteinfluxdb: 4 - 10000 - 2,5000,1 - 10000compress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedinfluxdb: 64 - 10000 - 2,5000,1 - 10000ncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: CPU - squeezenetonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUkeydb: gromacs: Water Benchmarktensorflow-lite: Inception V4compress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedleveldb: Seek Randindigobench: CPU - Bedroomtensorflow-lite: Inception ResNet V2indigobench: CPU - Supercartensorflow-lite: NASNet Mobiletensorflow-lite: SqueezeNetbuild-linux-kernel: Time To Compiletensorflow-lite: Mobilenet Quanttensorflow-lite: Mobilenet Floatmlpack: scikit_linearridgeregressionrav1e: 5rav1e: 1kvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediummlpack: scikit_icaredis: GETredis: LPOPredis: LPUSHredis: SEThugin: Panorama Photo Assistant + Stitching Timeredis: SADDbasis: ETC1Snamd: ATPase Simulation - 327,506 Atomsleveldb: Hot Readleveldb: Rand Readespeak: Text-To-Speech Synthesisrav1e: 6yquake2: Software CPU - 1920 x 1080webp: Quality 100, Lossless, Highest Compressionpgbench: 100 - 1 - Read Only - Average Latencypgbench: 100 - 1 - Read Onlypgbench: 100 - 50 - Read Only - Average Latencypgbench: 100 - 50 - Read Onlypgbench: 100 - 1 - Read Write - Average Latencypgbench: 100 - 1 - Read Writepgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 100 - Read Onlypgbench: 100 - 50 - Read Write - Average Latencypgbench: 100 - 50 - Read Writepgbench: 100 - 100 - Read Write - Average Latencypgbench: 100 - 100 - Read Writephpbench: PHP Benchmark Suitelibraw: Post-Processing Benchmarkcompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedrav1e: 10mlpack: scikit_svmpgbench: 1 - 1 - Read Only - Average Latencypgbench: 1 - 1 - Read Onlybasis: UASTC Level 3crafty: Elapsed Timex265: Bosphorus 4Kpgbench: 1 - 100 - Read Write - Average Latencypgbench: 1 - 100 - Read Writepgbench: 1 - 50 - Read Write - Average Latencypgbench: 1 - 50 - Read Writepgbench: 1 - 100 - Read Only - Average Latencypgbench: 1 - 100 - Read Onlypgbench: 1 - 50 - Read Only - Average Latencypgbench: 1 - 50 - Read Onlypgbench: 1 - 1 - Read Write - Average Latencypgbench: 1 - 1 - Read Writekvazaar: Bosphorus 4K - Very Fastleveldb: Overwriteleveldb: Overwriteleveldb: Rand Fillleveldb: Rand Fillonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUrnnoise: onednn: Deconvolution Batch shapes_1d - f32 - CPUtnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1webp: Quality 100, Losslessonednn: IP Shapes 3D - u8s8f32 - CPUbasis: UASTC Level 2kvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUkvazaar: Bosphorus 4K - Ultra Fastonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUx265: Bosphorus 1080ponednn: IP Shapes 3D - f32 - CPUwebp: Quality 100, Highest Compressionbasis: UASTC Level 0kvazaar: Bosphorus 1080p - Very Fastonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUkvazaar: Bosphorus 1080p - Ultra Fastx264: H.264 Video Encodingonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUwebp: Quality 100ffte: N=256, 3D Complex FFT Routinelammps: Rhodopsin Proteinwebp: Defaultleveldb: Fill Syncleveldb: Fill Synchpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-FfteEPYC 7F72AMD 7F72AMD EPYC 7F7287.25450694.86915.78914.9995462.283347315131960298.145337924324.13321206770.0579437455257.240.056434983752597544142.115220.32724.1209.8531197999.810630.949.701339487.531.0524.9510.3813.8236.4822.074.0912.299.5810.459.5210.0421.4421.261611.211613.911616.57934.686925.976931.275424090.962.847134287710606.350.7864.3374.769117566310.30910451789588.138.86361482.060165.31.611.0370.34910.7310.9551.742038392.692134890.151295998.651495615.5950.0061658090.0449.3600.8816939.77239.86732.7951.38514.539.2800.041243360.0915516230.55418050.1895293182.329214793.3952948656879635.1111308.99777.433.03924.450.0342941927.182740630923.5346.683214520.66324210.1436984520.0598472310.495201924.00229.12123.1228.03823.36.3772521.1372.35946295.193275.52219.0420.59280416.90136.2137.141.730511.4378241.990.5782261.4023560.612.778158.5387.57883.065.303202.84936142.44178.792.309613.150472.596111181.7368531012.1291.6111168.5804.59566.4962.710761.168690.031443.116277.7573936.141009.8723887.31077693.89715.81614.9731462.323353215232009289.654332394321.81321880722.6368937547272.939.496444874153421517142.131220.80424.0210.6511199592.410616.048.331339130.131.3324.5310.5113.8636.5621.554.1312.619.5110.359.6110.0622.2621.101613.701610.951606.75930.711930.714924.024426284.572.830134740010661.550.5463.7514.775117991310.33910364189393.639.05461260.759959.61.591.0360.34910.7010.9151.821814383.801376509.231332610.561500640.3150.3251757206.7249.5110.8753240.99040.38332.7591.38623.739.1700.041243450.0905553200.55618010.1895305812.324215213.3892953756789735.0111271.49740.163.03224.490.0342939127.287738291623.5946.694214420.57124310.1436987780.0598526120.493202723.98228.69723.2228.60723.26.4139021.1262.34152292.901275.70319.0180.83902516.98036.0936.971.733881.4264542.130.5752541.3865160.332.750048.5497.73382.465.359502.90342141.93177.862.306333.144992.585111004.6587361111.7001.6021163.4554.510110.2592.725151.162760.030513.382208.1755336.765538.4312186.88297694.99515.77915.4451461.624352715202007294.113341885318.47322432634.1313237624546.939.866356476652704023142.080220.86724.0210.3341197198.010685.748.981297582.530.7525.0310.5214.1037.4822.014.1612.549.5010.459.9410.0822.2021.871611.741605.901602.59928.115929.886932.489424640.302.846134489310548.849.3164.8714.770117859010.31410436289812.738.90361596.260331.61.621.0370.34610.7310.9251.841897703.881379285.461317592.971475849.6850.4611723521.2949.5790.8927740.62340.03032.7111.38514.539.3420.041244460.0915514840.55318070.1895290662.328214813.3882955257356734.9611314.69780.383.02625.020.0342946227.210735921723.6046.742214220.63024240.1446953440.0598420220.498200923.93228.70823.2228.31723.26.3288421.1682.35726294.653275.38119.0150.59170216.92636.1537.081.727651.4266942.170.5766571.4034660.252.782148.5487.60683.295.363302.83648142.03177.712.296723.138782.591113220.1974780811.7051.6121162.7144.510456.2862.730351.157170.030373.299238.2855536.438809.41895OpenBenchmarking.org

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLEPYC 7F72AMD 7F72AMD EPYC 7F7220406080100SE +/- 0.39, N = 3SE +/- 0.30, N = 3SE +/- 0.42, N = 387.2587.3186.881. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-ProcessingEPYC 7F72AMD 7F72AMD EPYC 7F72150300450600750SE +/- 2.14, N = 3SE +/- 0.27, N = 3SE +/- 1.41, N = 3694.87693.90695.001. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k AtomsEPYC 7F72AMD 7F72AMD EPYC 7F7248121620SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 315.7915.8215.781. (CXX) g++ options: -O3 -pthread -lm

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.1EPYC 7F72AMD 7F72AMD EPYC 7F7248121620SE +/- 0.25, N = 12SE +/- 0.32, N = 12SE +/- 0.17, N = 315.0014.9715.451. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -pthread -lmpi_cxx -lmpi

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To CompileEPYC 7F72AMD 7F72AMD EPYC 7F72100200300400500SE +/- 0.12, N = 3SE +/- 1.06, N = 3SE +/- 0.52, N = 3462.28462.32461.62

AI Benchmark Alpha

Device AI Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreEPYC 7F72AMD 7F72AMD EPYC 7F728001600240032004000347335323527

AI Benchmark Alpha

Device Training Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreEPYC 7F72AMD 7F72AMD EPYC 7F7230060090012001500151315231520

AI Benchmark Alpha

Device Inference Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreEPYC 7F72AMD 7F72AMD EPYC 7F72400800120016002000196020092007

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 10.0Time To CompileEPYC 7F72AMD 7F72AMD EPYC 7F7260120180240300SE +/- 1.69, N = 3SE +/- 2.80, N = 3SE +/- 4.06, N = 3298.15289.65294.11

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricEPYC 7F72AMD 7F72AMD EPYC 7F7270K140K210K280K350K3379243323943418851. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F7270140210280350SE +/- 0.44, N = 3SE +/- 2.42, N = 3SE +/- 0.87, N = 3324.13321.81318.47

Hierarchical INTegration

Test: FLOAT

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATEPYC 7F72AMD 7F72AMD EPYC 7F7270M140M210M280M350MSE +/- 739610.19, N = 3SE +/- 81095.29, N = 3SE +/- 256408.73, N = 3321206770.06321880722.64322432634.131. (CC) gcc options: -O3 -march=native -lm

BYTE Unix Benchmark

Computational Test: Dhrystone 2

OpenBenchmarking.orgLPS, More Is BetterBYTE Unix Benchmark 3.6Computational Test: Dhrystone 2EPYC 7F72AMD 7F72AMD EPYC 7F728M16M24M32M40MSE +/- 295946.74, N = 12SE +/- 231062.98, N = 3SE +/- 93255.90, N = 337455257.237547272.937624546.9

Mlpack Benchmark

Benchmark: scikit_qda

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_qdaEPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.08, N = 3SE +/- 0.18, N = 3SE +/- 0.31, N = 1040.0539.4939.86

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthEPYC 7F72AMD 7F72AMD EPYC 7F7214M28M42M56M70MSE +/- 282969.84, N = 3SE +/- 380311.53, N = 3SE +/- 644607.06, N = 3643498376444874163564766

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeEPYC 7F72AMD 7F72AMD EPYC 7F7211M22M33M44M55MSE +/- 458927.44, N = 15SE +/- 678703.71, N = 3SE +/- 356097.54, N = 155259754453421517527040231. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchEPYC 7F72AMD 7F72AMD EPYC 7F72306090120150SE +/- 0.11, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 3142.12142.13142.081. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LevelDB

Benchmark: Sequential Fill

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Sequential FillEPYC 7F72AMD 7F72AMD EPYC 7F7250100150200250SE +/- 0.18, N = 3SE +/- 0.58, N = 3SE +/- 0.16, N = 3220.33220.80220.871. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Sequential Fill

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Sequential FillEPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 324.124.024.01. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Random Delete

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random DeleteEPYC 7F72AMD 7F72AMD EPYC 7F7250100150200250SE +/- 0.57, N = 3SE +/- 0.41, N = 3SE +/- 0.20, N = 3209.85210.65210.331. (CXX) g++ options: -O3 -lsnappy -lpthread

InfluxDB

Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 4 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000EPYC 7F72AMD 7F72AMD EPYC 7F72300K600K900K1200K1500KSE +/- 2428.05, N = 3SE +/- 3879.14, N = 3SE +/- 2015.92, N = 31197999.81199592.41197198.0

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F722K4K6K8K10KSE +/- 28.09, N = 5SE +/- 22.43, N = 5SE +/- 8.87, N = 310630.910616.010685.71. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F721122334455SE +/- 0.51, N = 5SE +/- 0.54, N = 5SE +/- 0.42, N = 349.7048.3348.981. (CC) gcc options: -O3

InfluxDB

Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000

OpenBenchmarking.orgval/sec, More Is BetterInfluxDB 1.8.2Concurrent Streams: 64 - Batch Size: 10000 - Tags: 2,5000,1 - Points Per Series: 10000EPYC 7F72AMD 7F72AMD EPYC 7F72300K600K900K1200K1500KSE +/- 1881.74, N = 3SE +/- 811.21, N = 3SE +/- 1608.37, N = 31339487.51339130.11297582.5

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: yolov4-tinyEPYC 7F72AMD 7F72AMD EPYC 7F72714212835SE +/- 0.18, N = 3SE +/- 0.43, N = 3SE +/- 0.29, N = 331.0531.3330.75MIN: 28.99 / MAX: 116.2MIN: 28.52 / MAX: 71.5MIN: 28.27 / MAX: 1341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet50EPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.13, N = 3SE +/- 0.54, N = 3SE +/- 0.20, N = 324.9524.5325.03MIN: 22.93 / MAX: 118.48MIN: 22.55 / MAX: 91.3MIN: 23.18 / MAX: 107.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: alexnetEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.18, N = 310.3810.5110.52MIN: 9.08 / MAX: 21.34MIN: 9.04 / MAX: 16.98MIN: 9.15 / MAX: 90.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: resnet18EPYC 7F72AMD 7F72AMD EPYC 7F7248121620SE +/- 0.08, N = 3SE +/- 0.27, N = 3SE +/- 0.35, N = 313.8213.8614.10MIN: 12.13 / MAX: 30.86MIN: 11.95 / MAX: 34.76MIN: 12.23 / MAX: 36.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: vgg16EPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.23, N = 3SE +/- 0.88, N = 3SE +/- 0.64, N = 336.4836.5637.48MIN: 33.2 / MAX: 130.64MIN: 33.32 / MAX: 133.5MIN: 33.78 / MAX: 126.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: googlenetEPYC 7F72AMD 7F72AMD EPYC 7F72510152025SE +/- 0.24, N = 3SE +/- 0.20, N = 3SE +/- 0.41, N = 322.0721.5522.01MIN: 20.38 / MAX: 109.3MIN: 20.16 / MAX: 115.96MIN: 20.09 / MAX: 121.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: blazefaceEPYC 7F72AMD 7F72AMD EPYC 7F720.9361.8722.8083.7444.68SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 34.094.134.16MIN: 3.49 / MAX: 10.46MIN: 3.53 / MAX: 17.37MIN: 3.55 / MAX: 15.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: efficientnet-b0EPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.25, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 312.2912.6112.54MIN: 10.71 / MAX: 71.52MIN: 10.98 / MAX: 64.21MIN: 11.02 / MAX: 93.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mnasnetEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.33, N = 3SE +/- 0.24, N = 3SE +/- 0.18, N = 39.589.519.50MIN: 8.12 / MAX: 77.12MIN: 8.23 / MAX: 82.21MIN: 8.28 / MAX: 26.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: shufflenet-v2EPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.49, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 310.4510.3510.45MIN: 9 / MAX: 67.14MIN: 8.97 / MAX: 20.34MIN: 9.13 / MAX: 68.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v3-v3 - Model: mobilenet-v3EPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.20, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 39.529.619.94MIN: 8.4 / MAX: 19.2MIN: 8.63 / MAX: 62.41MIN: 8.64 / MAX: 101.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU-v2-v2 - Model: mobilenet-v2EPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.17, N = 3SE +/- 0.17, N = 3SE +/- 0.04, N = 310.0410.0610.08MIN: 8.78 / MAX: 67.63MIN: 8.8 / MAX: 95.72MIN: 8.99 / MAX: 20.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: mobilenetEPYC 7F72AMD 7F72AMD EPYC 7F72510152025SE +/- 0.66, N = 3SE +/- 0.49, N = 3SE +/- 0.56, N = 321.4422.2622.20MIN: 18.65 / MAX: 82.99MIN: 18.47 / MAX: 54.35MIN: 18.67 / MAX: 87.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20200916Target: CPU - Model: squeezenetEPYC 7F72AMD 7F72AMD EPYC 7F72510152025SE +/- 0.28, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 321.2621.1021.87MIN: 18.08 / MAX: 111.42MIN: 18.23 / MAX: 105.85MIN: 18.11 / MAX: 113.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F7230060090012001500SE +/- 2.75, N = 3SE +/- 10.75, N = 3SE +/- 3.90, N = 31611.211613.701611.74MIN: 1572.74MIN: 1565.85MIN: 1572.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F7230060090012001500SE +/- 7.42, N = 3SE +/- 6.46, N = 3SE +/- 6.62, N = 31613.911610.951605.90MIN: 1557.21MIN: 1556.69MIN: 1559.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F7230060090012001500SE +/- 3.74, N = 3SE +/- 2.70, N = 3SE +/- 2.00, N = 31616.571606.751602.59MIN: 1572.33MIN: 1564.52MIN: 1564.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F722004006008001000SE +/- 2.42, N = 3SE +/- 2.82, N = 3SE +/- 3.30, N = 3934.69930.71928.12MIN: 900.24MIN: 900MIN: 898.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F722004006008001000SE +/- 1.31, N = 3SE +/- 0.29, N = 3SE +/- 7.20, N = 3925.98930.71929.89MIN: 896.94MIN: 898.24MIN: 896.551. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F722004006008001000SE +/- 6.19, N = 3SE +/- 3.80, N = 3SE +/- 4.03, N = 3931.28924.02932.49MIN: 898.52MIN: 894.61MIN: 897.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

KeyDB

OpenBenchmarking.orgOps/sec, More Is BetterKeyDB 6.0.16EPYC 7F72AMD 7F72AMD EPYC 7F7290K180K270K360K450KSE +/- 1068.73, N = 3SE +/- 4696.92, N = 3SE +/- 2330.52, N = 3424090.96426284.57424640.301. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F720.64061.28121.92182.56243.203SE +/- 0.004, N = 3SE +/- 0.009, N = 3SE +/- 0.003, N = 32.8472.8302.8461. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

TensorFlow Lite

Model: Inception V4

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception V4EPYC 7F72AMD 7F72AMD EPYC 7F72300K600K900K1200K1500KSE +/- 3318.77, N = 3SE +/- 2838.53, N = 3SE +/- 3249.51, N = 3134287713474001344893

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F722K4K6K8K10KSE +/- 37.48, N = 4SE +/- 1.64, N = 3SE +/- 18.01, N = 310606.310661.510548.81. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F721122334455SE +/- 0.55, N = 4SE +/- 0.36, N = 3SE +/- 0.05, N = 350.7850.5449.311. (CC) gcc options: -O3

LevelDB

Benchmark: Seek Random

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Seek RandomEPYC 7F72AMD 7F72AMD EPYC 7F721428425670SE +/- 0.66, N = 3SE +/- 0.14, N = 3SE +/- 0.52, N = 364.3463.7564.871. (CXX) g++ options: -O3 -lsnappy -lpthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomEPYC 7F72AMD 7F72AMD EPYC 7F721.07442.14883.22324.29765.372SE +/- 0.029, N = 3SE +/- 0.013, N = 3SE +/- 0.006, N = 34.7694.7754.770

TensorFlow Lite

Model: Inception ResNet V2

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Inception ResNet V2EPYC 7F72AMD 7F72AMD EPYC 7F72300K600K900K1200K1500KSE +/- 2727.68, N = 3SE +/- 2105.81, N = 3SE +/- 3330.08, N = 3117566311799131178590

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 310.3110.3410.31

TensorFlow Lite

Model: NASNet Mobile

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: NASNet MobileEPYC 7F72AMD 7F72AMD EPYC 7F7220K40K60K80K100KSE +/- 165.43, N = 3SE +/- 239.98, N = 3SE +/- 60.45, N = 3104517103641104362

TensorFlow Lite

Model: SqueezeNet

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: SqueezeNetEPYC 7F72AMD 7F72AMD EPYC 7F7220K40K60K80K100KSE +/- 44.82, N = 3SE +/- 45.73, N = 3SE +/- 87.10, N = 389588.189393.689812.7

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileEPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.47, N = 3SE +/- 0.41, N = 5SE +/- 0.37, N = 638.8639.0538.90

TensorFlow Lite

Model: Mobilenet Quant

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet QuantEPYC 7F72AMD 7F72AMD EPYC 7F7213K26K39K52K65KSE +/- 90.15, N = 3SE +/- 111.88, N = 3SE +/- 119.05, N = 361482.061260.761596.2

TensorFlow Lite

Model: Mobilenet Float

OpenBenchmarking.orgMicroseconds, Fewer Is BetterTensorFlow Lite 2020-08-23Model: Mobilenet FloatEPYC 7F72AMD 7F72AMD EPYC 7F7213K26K39K52K65KSE +/- 43.27, N = 3SE +/- 39.74, N = 3SE +/- 126.43, N = 360165.359959.660331.6

Mlpack Benchmark

Benchmark: scikit_linearridgeregression

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_linearridgeregressionEPYC 7F72AMD 7F72AMD EPYC 7F720.36450.7291.09351.4581.8225SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.611.591.62

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 5EPYC 7F72AMD 7F72AMD EPYC 7F720.23330.46660.69990.93321.1665SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 31.0371.0361.037

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 1EPYC 7F72AMD 7F72AMD EPYC 7F720.07850.1570.23550.3140.3925SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.3490.3490.346

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: SlowEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.7310.7010.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: MediumEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.9510.9110.921. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Mlpack Benchmark

Benchmark: scikit_ica

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_icaEPYC 7F72AMD 7F72AMD EPYC 7F721224364860SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 351.7451.8251.84

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETEPYC 7F72AMD 7F72AMD EPYC 7F72400K800K1200K1600K2000KSE +/- 51555.46, N = 15SE +/- 29329.60, N = 15SE +/- 51919.80, N = 152038392.691814383.801897703.881. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOPEPYC 7F72AMD 7F72AMD EPYC 7F72500K1000K1500K2000K2500KSE +/- 44675.79, N = 15SE +/- 22561.14, N = 15SE +/- 61951.41, N = 132134890.151376509.231379285.461. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHEPYC 7F72AMD 7F72AMD EPYC 7F72300K600K900K1200K1500KSE +/- 23366.84, N = 15SE +/- 24808.96, N = 12SE +/- 23963.24, N = 151295998.651332610.561317592.971. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETEPYC 7F72AMD 7F72AMD EPYC 7F72300K600K900K1200K1500KSE +/- 37419.69, N = 12SE +/- 35684.24, N = 15SE +/- 27970.44, N = 151495615.591500640.311475849.681. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Hugin

Panorama Photo Assistant + Stitching Time

OpenBenchmarking.orgSeconds, Fewer Is BetterHuginPanorama Photo Assistant + Stitching TimeEPYC 7F72AMD 7F72AMD EPYC 7F721122334455SE +/- 0.51, N = 3SE +/- 0.39, N = 3SE +/- 0.24, N = 350.0150.3350.46

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDEPYC 7F72AMD 7F72AMD EPYC 7F72400K800K1200K1600K2000KSE +/- 37861.09, N = 12SE +/- 39190.67, N = 15SE +/- 39087.64, N = 151658090.041757206.721723521.291. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1SEPYC 7F72AMD 7F72AMD EPYC 7F721122334455SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 349.3649.5149.581. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 AtomsEPYC 7F72AMD 7F72AMD EPYC 7F720.20090.40180.60270.80361.0045SE +/- 0.00562, N = 3SE +/- 0.00402, N = 3SE +/- 0.01077, N = 30.881690.875320.89277

LevelDB

Benchmark: Hot Read

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Hot ReadEPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.13, N = 3SE +/- 0.50, N = 4SE +/- 0.14, N = 339.7740.9940.621. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Random Read

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random ReadEPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.18, N = 3SE +/- 0.44, N = 4SE +/- 0.21, N = 339.8740.3840.031. (CXX) g++ options: -O3 -lsnappy -lpthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisEPYC 7F72AMD 7F72AMD EPYC 7F72816243240SE +/- 0.11, N = 4SE +/- 0.11, N = 4SE +/- 0.09, N = 432.8032.7632.711. (CC) gcc options: -O2 -std=c99

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 6EPYC 7F72AMD 7F72AMD EPYC 7F720.31190.62380.93571.24761.5595SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 31.3851.3861.385

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 1080EPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.00, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 314.523.714.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 339.2839.1739.341. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.00920.01840.02760.03680.046SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.0410.0410.0411. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read OnlyEPYC 7F72AMD 7F72AMD EPYC 7F725K10K15K20K25KSE +/- 171.06, N = 3SE +/- 144.26, N = 3SE +/- 325.52, N = 32433624345244461. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read Only - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.02050.0410.06150.0820.1025SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0910.0900.0911. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 50 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read OnlyEPYC 7F72AMD 7F72AMD EPYC 7F72120K240K360K480K600KSE +/- 5773.06, N = 3SE +/- 2634.03, N = 3SE +/- 1352.44, N = 35516235553205514841. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.12510.25020.37530.50040.6255SE +/- 0.001, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 30.5540.5560.5531. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read WriteEPYC 7F72AMD 7F72AMD EPYC 7F72400800120016002000SE +/- 1.53, N = 3SE +/- 10.89, N = 3SE +/- 5.85, N = 31805180118071. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.04250.0850.12750.170.2125SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1890.1890.1891. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read OnlyEPYC 7F72AMD 7F72AMD EPYC 7F72110K220K330K440K550KSE +/- 602.59, N = 3SE +/- 493.86, N = 3SE +/- 613.62, N = 35293185305815290661. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read Write - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.5241.0481.5722.0962.62SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 32.3292.3242.3281. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 50 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 50 - Mode: Read WriteEPYC 7F72AMD 7F72AMD EPYC 7F725K10K15K20K25KSE +/- 24.14, N = 3SE +/- 9.19, N = 3SE +/- 31.97, N = 32147921521214811. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.76391.52782.29173.05563.8195SE +/- 0.006, N = 3SE +/- 0.013, N = 3SE +/- 0.003, N = 33.3953.3893.3881. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read WriteEPYC 7F72AMD 7F72AMD EPYC 7F726K12K18K24K30KSE +/- 54.62, N = 3SE +/- 108.48, N = 3SE +/- 34.89, N = 32948629537295521. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteEPYC 7F72AMD 7F72AMD EPYC 7F72120K240K360K480K600KSE +/- 5796.24, N = 3SE +/- 4364.31, N = 3SE +/- 3310.62, N = 3568796567897573567

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkEPYC 7F72AMD 7F72AMD EPYC 7F72816243240SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 335.1135.0134.961. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F722K4K6K8K10KSE +/- 35.72, N = 3SE +/- 3.46, N = 3SE +/- 28.90, N = 311308.911271.411314.61. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedEPYC 7F72AMD 7F72AMD EPYC 7F722K4K6K8K10KSE +/- 29.36, N = 3SE +/- 22.56, N = 3SE +/- 17.93, N = 39777.439740.169780.381. (CC) gcc options: -O3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 10EPYC 7F72AMD 7F72AMD EPYC 7F720.68381.36762.05142.73523.419SE +/- 0.003, N = 3SE +/- 0.008, N = 3SE +/- 0.002, N = 33.0393.0323.026

Mlpack Benchmark

Benchmark: scikit_svm

OpenBenchmarking.orgSeconds, Fewer Is BetterMlpack BenchmarkBenchmark: scikit_svmEPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.15, N = 324.4524.4925.02

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Only - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.00770.01540.02310.03080.0385SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 40.0340.0340.0341. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read OnlyEPYC 7F72AMD 7F72AMD EPYC 7F726K12K18K24K30KSE +/- 161.30, N = 3SE +/- 325.35, N = 3SE +/- 335.87, N = 42941929391294621. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3EPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 327.1827.2927.211. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeEPYC 7F72AMD 7F72AMD EPYC 7F721.6M3.2M4.8M6.4M8MSE +/- 14432.27, N = 3SE +/- 15235.97, N = 3SE +/- 62574.81, N = 37406309738291673592171. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KEPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 323.5323.5923.601. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Write - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F721122334455SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 346.6846.6946.741. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read WriteEPYC 7F72AMD 7F72AMD EPYC 7F725001000150020002500SE +/- 4.79, N = 3SE +/- 6.61, N = 3SE +/- 3.93, N = 32145214421421. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Write - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F72510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 320.6620.5720.631. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read WriteEPYC 7F72AMD 7F72AMD EPYC 7F725001000150020002500SE +/- 2.57, N = 3SE +/- 1.25, N = 3SE +/- 5.02, N = 32421243124241. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read Only - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.03240.06480.09720.12960.162SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1430.1430.1441. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 100 - Mode: Read OnlyEPYC 7F72AMD 7F72AMD EPYC 7F72150K300K450K600K750KSE +/- 1961.28, N = 3SE +/- 755.05, N = 3SE +/- 1176.24, N = 36984526987786953441. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read Only - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.01330.02660.03990.05320.0665SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.0590.0590.0591. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 50 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 50 - Mode: Read OnlyEPYC 7F72AMD 7F72AMD EPYC 7F72200K400K600K800K1000KSE +/- 11183.06, N = 3SE +/- 4242.40, N = 3SE +/- 9353.87, N = 38472318526128420221. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.11210.22420.33630.44840.5605SE +/- 0.000, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 30.4950.4930.4981. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 1 - Clients: 1 - Mode: Read WriteEPYC 7F72AMD 7F72AMD EPYC 7F72400800120016002000SE +/- 0.78, N = 3SE +/- 15.86, N = 3SE +/- 7.88, N = 32019202720091. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastEPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 324.0023.9823.931. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LevelDB

Benchmark: Overwrite

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: OverwriteEPYC 7F72AMD 7F72AMD EPYC 7F7250100150200250SE +/- 0.36, N = 3SE +/- 0.71, N = 3SE +/- 0.57, N = 3229.12228.70228.711. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Overwrite

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: OverwriteEPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 323.123.223.21. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Random Fill

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Random FillEPYC 7F72AMD 7F72AMD EPYC 7F7250100150200250SE +/- 0.24, N = 3SE +/- 0.18, N = 3SE +/- 0.12, N = 3228.04228.61228.321. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Random Fill

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Random FillEPYC 7F72AMD 7F72AMD EPYC 7F72612182430SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 323.323.223.21. (CXX) g++ options: -O3 -lsnappy -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F72246810SE +/- 0.00849, N = 3SE +/- 0.01141, N = 3SE +/- 0.03710, N = 36.377256.413906.32884MIN: 5.79MIN: 5.78MIN: 5.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28EPYC 7F72AMD 7F72AMD EPYC 7F72510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 321.1421.1321.171. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.53091.06181.59272.12362.6545SE +/- 0.00293, N = 3SE +/- 0.02163, N = 3SE +/- 0.01986, N = 32.359462.341522.35726MIN: 2.12MIN: 2.1MIN: 2.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2EPYC 7F72AMD 7F72AMD EPYC 7F7260120180240300SE +/- 0.66, N = 3SE +/- 0.39, N = 3SE +/- 0.88, N = 3295.19292.90294.65MIN: 281.63 / MAX: 334.08MIN: 281.47 / MAX: 324.04MIN: 281.79 / MAX: 332.391. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1EPYC 7F72AMD 7F72AMD EPYC 7F7260120180240300SE +/- 0.13, N = 3SE +/- 0.24, N = 3SE +/- 0.38, N = 3275.52275.70275.38MIN: 274.32 / MAX: 281.55MIN: 274.17 / MAX: 289.54MIN: 273.95 / MAX: 287.651. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessEPYC 7F72AMD 7F72AMD EPYC 7F72510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 319.0419.0219.021. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.18880.37760.56640.75520.944SE +/- 0.002974, N = 3SE +/- 0.017578, N = 12SE +/- 0.000920, N = 30.5928040.8390250.591702MIN: 0.51MIN: 0.64MIN: 0.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2EPYC 7F72AMD 7F72AMD EPYC 7F7248121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 316.9016.9816.931. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: SlowEPYC 7F72AMD 7F72AMD EPYC 7F72816243240SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 336.2136.0936.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: MediumEPYC 7F72AMD 7F72AMD EPYC 7F72918273645SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 337.1436.9737.081. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.39010.78021.17031.56041.9505SE +/- 0.00365, N = 3SE +/- 0.01016, N = 3SE +/- 0.00210, N = 31.730511.733881.72765MIN: 1.58MIN: 1.57MIN: 1.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.32350.6470.97051.2941.6175SE +/- 0.00957, N = 3SE +/- 0.00516, N = 3SE +/- 0.00056, N = 31.437821.426451.42669MIN: 1.33MIN: 1.33MIN: 1.321. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastEPYC 7F72AMD 7F72AMD EPYC 7F721020304050SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.16, N = 341.9942.1342.171. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.13010.26020.39030.52040.6505SE +/- 0.001786, N = 3SE +/- 0.002676, N = 3SE +/- 0.001616, N = 30.5782260.5752540.576657MIN: 0.52MIN: 0.52MIN: 0.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.31580.63160.94741.26321.579SE +/- 0.00911, N = 3SE +/- 0.00538, N = 3SE +/- 0.00520, N = 31.402351.386511.40346MIN: 1.29MIN: 1.29MIN: 1.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pEPYC 7F72AMD 7F72AMD EPYC 7F721428425670SE +/- 0.19, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 360.6160.3360.251. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.6261.2521.8782.5043.13SE +/- 0.01168, N = 3SE +/- 0.01780, N = 3SE +/- 0.01373, N = 32.778152.750042.78214MIN: 2.48MIN: 2.48MIN: 2.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionEPYC 7F72AMD 7F72AMD EPYC 7F72246810SE +/- 0.004, N = 3SE +/- 0.021, N = 3SE +/- 0.001, N = 38.5388.5498.5481. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 0EPYC 7F72AMD 7F72AMD EPYC 7F72246810SE +/- 0.004, N = 3SE +/- 0.015, N = 3SE +/- 0.021, N = 37.5787.7337.6061. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastEPYC 7F72AMD 7F72AMD EPYC 7F7220406080100SE +/- 0.22, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 383.0682.4683.291. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F721.20672.41343.62014.82686.0335SE +/- 0.01527, N = 3SE +/- 0.02370, N = 3SE +/- 0.03533, N = 35.303205.359505.36330MIN: 4.97MIN: 4.96MIN: 4.961. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.65331.30661.95992.61323.2665SE +/- 0.00847, N = 3SE +/- 0.00877, N = 3SE +/- 0.01841, N = 32.849362.903422.83648MIN: 2.46MIN: 2.52MIN: 2.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastEPYC 7F72AMD 7F72AMD EPYC 7F72306090120150SE +/- 0.27, N = 3SE +/- 0.44, N = 3SE +/- 0.12, N = 3142.44141.93142.031. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingEPYC 7F72AMD 7F72AMD EPYC 7F724080120160200SE +/- 1.85, N = 3SE +/- 1.47, N = 3SE +/- 1.51, N = 3178.79177.86177.711. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.51971.03941.55912.07882.5985SE +/- 0.00115, N = 3SE +/- 0.00978, N = 3SE +/- 0.00103, N = 32.309612.306332.29672MIN: 2.18MIN: 2.18MIN: 2.181. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUEPYC 7F72AMD 7F72AMD EPYC 7F720.70891.41782.12672.83563.5445SE +/- 0.01172, N = 3SE +/- 0.01746, N = 3SE +/- 0.01523, N = 33.150473.144993.13878MIN: 2.98MIN: 2.97MIN: 2.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100EPYC 7F72AMD 7F72AMD EPYC 7F720.58411.16821.75232.33642.9205SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.003, N = 32.5962.5852.5911. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

FFTE

N=256, 3D Complex FFT Routine

OpenBenchmarking.orgMFLOPS, More Is BetterFFTE 7.0N=256, 3D Complex FFT RoutineEPYC 7F72AMD 7F72AMD EPYC 7F7220K40K60K80K100KSE +/- 1183.09, N = 3SE +/- 1197.97, N = 4SE +/- 1183.24, N = 3111181.74111004.66113220.201. (F9X) gfortran options: -O3 -fomit-frame-pointer -fopenmp

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 312.1311.7011.711. (CXX) g++ options: -O3 -pthread -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultEPYC 7F72AMD 7F72AMD EPYC 7F720.36270.72541.08811.45081.8135SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.004, N = 31.6111.6021.6121. (CC) gcc options: -fvisibility=hidden -O2 -pthread -lm -ljpeg -lpng16 -ltiff

LevelDB

Benchmark: Fill Sync

OpenBenchmarking.orgMicroseconds Per Op, Fewer Is BetterLevelDB 1.22Benchmark: Fill SyncEPYC 7F72AMD 7F72AMD EPYC 7F7230060090012001500SE +/- 2.23, N = 3SE +/- 4.35, N = 3SE +/- 8.92, N = 31168.581163.461162.711. (CXX) g++ options: -O3 -lsnappy -lpthread

LevelDB

Benchmark: Fill Sync

OpenBenchmarking.orgMB/s, More Is BetterLevelDB 1.22Benchmark: Fill SyncEPYC 7F72AMD 7F72AMD EPYC 7F721.01252.0253.03754.055.0625SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.54.54.51. (CXX) g++ options: -O3 -lsnappy -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F722K4K6K8K10KSE +/- 564.95, N = 3SE +/- 760.03, N = 3SE +/- 1135.29, N = 39566.5010110.2610456.291. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthEPYC 7F72AMD 7F72AMD EPYC 7F720.61431.22861.84292.45723.0715SE +/- 0.01911, N = 3SE +/- 0.06549, N = 3SE +/- 0.03283, N = 32.710762.725152.730351. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyEPYC 7F72AMD 7F72AMD EPYC 7F720.2630.5260.7891.0521.315SE +/- 0.00865, N = 3SE +/- 0.01886, N = 3SE +/- 0.01464, N = 31.168691.162761.157171. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessEPYC 7F72AMD 7F72AMD EPYC 7F720.00710.01420.02130.02840.0355SE +/- 0.00015, N = 3SE +/- 0.00105, N = 3SE +/- 0.00070, N = 30.031440.030510.030371. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadEPYC 7F72AMD 7F72AMD EPYC 7F720.7611.5222.2833.0443.805SE +/- 0.13922, N = 3SE +/- 0.00767, N = 3SE +/- 0.06266, N = 33.116273.382203.299231. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransEPYC 7F72AMD 7F72AMD EPYC 7F72246810SE +/- 0.43146, N = 3SE +/- 0.42362, N = 3SE +/- 0.32995, N = 37.757398.175538.285551. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMEPYC 7F72AMD 7F72AMD EPYC 7F72816243240SE +/- 0.60, N = 3SE +/- 0.85, N = 3SE +/- 0.48, N = 336.1436.7736.441. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteEPYC 7F72AMD 7F72AMD EPYC 7F723691215SE +/- 1.15527, N = 3SE +/- 0.56601, N = 3SE +/- 0.73072, N = 39.872388.431219.418951. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4