2990wx-2021-amd

AMD Ryzen Threadripper 2990WX 32-Core testing with a ASUS ROG ZENITH EXTREME (1701 BIOS) and Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101259-HA-2990WX20208&sor&gru.

2990wx-2021-amd ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution11a2345AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1701 BIOS)AMD 17h32GBSamsung SSD 970 EVO 500GB + 250GB Western Digital WDS250G2X0C-00L350Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB (1244/1750MHz)Realtek ALC1220LG Ultra HDIntel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adUbuntu 20.105.8.0-34-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 20.2.1 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- 1, 2, 3, 4, 5: NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820dPython Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected Kernel Details- 4, 5: Transparent Huge Pages: madvise

2990wx-2021-amd amg: dav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitrav1e: 1rav1e: 5rav1e: 6rav1e: 10onnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUcpuminer-opt: Magicpuminer-opt: x25xcpuminer-opt: Deepcoincpuminer-opt: Ringcoincpuminer-opt: Blake-2 Scpuminer-opt: Garlicoincpuminer-opt: Skeincoincpuminer-opt: Myriad-Groestlcpuminer-opt: LBC, LBRY Creditscpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: Triple SHA-256, Onecoinior: 2MB - Default Test Directoryior: 4MB - Default Test Directoryior: 8MB - Default Test Directoryior: 16MB - Default Test Directoryior: 32MB - Default Test Directorylzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Zstd 8 - Decompressionlzbench: Crush 0 - Compressionlzbench: Crush 0 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Brotli 2 - Decompressionlzbench: Libdeflate 1 - Compressionquantlib: etcpak: DXT1etcpak: ETC1etcpak: ETC2etcpak: ETC1 + Ditheringlammps: 20k Atomslammps: Rhodopsin Proteinkripke: npb: CG.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: IS.Dnpb: LU.Cnpb: MG.Csynthmark: VoiceMark_100mnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3tnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1financebench: Repo OpenMPfinancebench: Bonds OpenMPcloverleaf: Lagrangian-Eulerian Hydrodynamicscp2k: Fayalite-FIST Dataopenfoam: Motorbike 30Mopenfoam: Motorbike 60Mqe: AUSURF112relion: Basic - CPUbuild-godot: Time To Compilecython-bench: N-Queensgcrypt: qmcpack: simple-H2O11a2345401359967533.91206.02555.39117.160.3401.0021.3222.9571551225450112211958.85623.83524.14492.78492.811599.218247.713154.297227.80215.35213.00826877810594.5619.45938.3295.6724.33948.514281.081251.689119.541474.83176.57726.901705.651823.68883.67536.3051172.19819.47170592936.885774104590.081370401012345567166940221130371085261583941765944814985771966622032320.77685.881733.781740.4321668.10751.0040721.5117335.8242858.38151058307.90494826.033216.058391813467540.87204.27548.65117.180.3370.9931.3072.88515712854548722231149.83839.46168832940.545710134639.971356431013045570165833218553781.51592.02535.92490.50480.75371095261572941786944814895691966642042286.01615.224248.786154.896230.99315.43712.766273585037203.141741.251740.5521182.51770.1039921.1316587.15593.5459.20837.8125.6704.33648.290279.843251.54042328.95312558878.983073125.4340.811710.861793.74182.31725.862217.83337.346396403133542.26202.23552.84117.720.3380.9951.3082.911753.94582.93529.36496.04471.831607.366247.557154.923229.77014.79912.475132.071687.651798.42737.455398053633540.92208.11554.60117.850.3431.0141.3292.92418321359640023621172.74830.66169202929.725758334521.831314371021046943165943220147791.04687.41594.69808.18813.49371095251572941787954804925721976662032288.61617.869248.611155.118230.94215.01312.589257108387217.861750.211732.0722314.71793.8339758.7917192.72587.9659.04437.9355.6214.40847.444289.279251.41741316.53645856415.07812586.581447.5791677.481767.92581.89426.152216.24635.900386077433539.62209.54552.65118.300.3441.0081.3202.90818121661649823411171.44830.89169472950.685657674520.301321401012745451166993220797961.66654.19536.25527.67506.83371085251533941784944814945721966402062296.21619.436248.733154.303229.60715.06112.832258576237441.031743.291737.3322032.23811.9841730.7416629.67589.7789.12637.8355.6324.34048.246288.426251.33941135.45052156829.99218786.151673.031781.07382.10525.821216.76835.624OpenBenchmarking.org

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.21432590M180M270M360M450MSE +/- 4525834.83, N = 3SE +/- 277098.08, N = 3SE +/- 2489918.59, N = 3SE +/- 1988092.57, N = 3SE +/- 3176655.21, N = 34013599673980536333964031333918134673860774331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p34251120240360480600SE +/- 1.05, N = 3SE +/- 2.60, N = 3SE +/- 1.71, N = 3SE +/- 2.78, N = 3SE +/- 8.93, N = 3542.26540.92540.87539.62533.91MIN: 422.86 / MAX: 672.41MIN: 423.15 / MAX: 672.98MIN: 423.28 / MAX: 669.12MIN: 421.29 / MAX: 668.77MIN: 411.82 / MAX: 669.311. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K5412350100150200250SE +/- 0.49, N = 3SE +/- 2.27, N = 7SE +/- 2.02, N = 15SE +/- 2.03, N = 15SE +/- 1.90, N = 15209.54208.11206.02204.27202.23MIN: 140.28 / MAX: 222.38MIN: 133.38 / MAX: 225.17MIN: 129.35 / MAX: 225.85MIN: 127.4 / MAX: 222.46MIN: 126.91 / MAX: 221.771. (CC) gcc options: -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p14352120240360480600SE +/- 2.78, N = 3SE +/- 1.32, N = 3SE +/- 2.25, N = 3SE +/- 2.21, N = 3SE +/- 2.92, N = 3555.39554.60552.84552.65548.65MIN: 320.77 / MAX: 613.37MIN: 342.81 / MAX: 607.82MIN: 322.61 / MAX: 608.3MIN: 338.43 / MAX: 607.43MIN: 328.88 / MAX: 602.051. (CC) gcc options: -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit54321306090120150SE +/- 0.22, N = 3SE +/- 0.13, N = 3SE +/- 0.26, N = 3SE +/- 0.07, N = 3SE +/- 0.47, N = 3118.30117.85117.72117.18117.16MIN: 81.84 / MAX: 191.83MIN: 81.72 / MAX: 196.43MIN: 81.13 / MAX: 195.69MIN: 80.97 / MAX: 191.46MIN: 80.9 / MAX: 196.361. (CC) gcc options: -pthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1541320.07740.15480.23220.30960.387SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.3440.3430.3400.3380.337

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5451320.22820.45640.68460.91281.141SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 31.0141.0081.0020.9950.993

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6415320.2990.5980.8971.1961.495SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 31.3291.3221.3201.3081.307

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10143520.66531.33061.99592.66123.3265SE +/- 0.011, N = 3SE +/- 0.010, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.009, N = 32.9572.9242.9112.9082.885

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU45214080120160200SE +/- 2.10, N = 12SE +/- 1.96, N = 3SE +/- 2.09, N = 4SE +/- 1.78, N = 121831811571551. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU542150100150200250SE +/- 2.59, N = 12SE +/- 1.92, N = 3SE +/- 1.50, N = 3SE +/- 2.08, N = 122162131281221. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU54211428425670SE +/- 0.58, N = 3SE +/- 0.17, N = 3SE +/- 0.17, N = 3615954541. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU542114002800420056007000SE +/- 65.22, N = 3SE +/- 69.07, N = 12SE +/- 55.38, N = 3SE +/- 167.82, N = 1264986400548750111. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU45215001000150020002500SE +/- 7.60, N = 3SE +/- 22.29, N = 3SE +/- 8.92, N = 3SE +/- 4.69, N = 323622341222322111. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Cpuminer-Opt

Algorithm: Magi

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Magi41a5230060090012001500SE +/- 1.16, N = 3SE +/- 0.84, N = 3SE +/- 3.50, N = 3SE +/- 14.98, N = 41172.741172.191171.441149.831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: x25x

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: x25x2541a2004006008001000SE +/- 5.74, N = 3SE +/- 1.08, N = 3SE +/- 1.18, N = 3SE +/- 7.85, N = 3839.46830.89830.66819.471. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Deepcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Deepcoin1a5424K8K12K16K20KSE +/- 119.53, N = 14SE +/- 49.78, N = 3SE +/- 11.55, N = 3SE +/- 27.28, N = 3170591694716920168831. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Ringcoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Ringcoin521a46001200180024003000SE +/- 11.03, N = 3SE +/- 8.81, N = 3SE +/- 6.53, N = 3SE +/- 8.05, N = 32950.682940.542936.882929.721. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Blake-2 S

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Blake-2 S1a425120K240K360K480K600KSE +/- 3940.00, N = 3SE +/- 3934.48, N = 3SE +/- 3628.00, N = 3SE +/- 5261.67, N = 35774105758335710135657671. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Garlicoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Garlicoin21a4510002000300040005000SE +/- 65.81, N = 3SE +/- 68.57, N = 4SE +/- 2.46, N = 3SE +/- 0.83, N = 34639.974590.084521.834520.301. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Skeincoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Skeincoin1a25430K60K90K120K150KSE +/- 918.82, N = 3SE +/- 1055.21, N = 3SE +/- 441.63, N = 3SE +/- 1707.45, N = 31370401356431321401314371. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Myriad-Groestl

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Myriad-Groestl4251a2K4K6K8K10KSE +/- 66.58, N = 3SE +/- 17.32, N = 3SE +/- 12.02, N = 3SE +/- 8.82, N = 3102101013010127101231. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: LBC, LBRY Credits

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: LBC, LBRY Credits421a510K20K30K40K50KSE +/- 536.95, N = 3SE +/- 502.13, N = 3SE +/- 668.84, N = 3SE +/- 381.65, N = 15469434557045567454511. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Quad SHA-256, Pyrite

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Quad SHA-256, Pyrite51a4240K80K120K160K200KSE +/- 1729.38, N = 3SE +/- 222.71, N = 3SE +/- 1373.44, N = 3SE +/- 1021.96, N = 31669931669401659431658331. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

Cpuminer-Opt

Algorithm: Triple SHA-256, Onecoin

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.15.5Algorithm: Triple SHA-256, Onecoin1a54250K100K150K200K250KSE +/- 469.18, N = 3SE +/- 877.54, N = 3SE +/- 568.46, N = 3SE +/- 612.27, N = 32211302207972201472185531. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp

IOR

Block Size: 2MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test Directory514232004006008001000SE +/- 3.23, N = 3SE +/- 3.76, N = 3SE +/- 6.02, N = 3SE +/- 8.48, N = 3SE +/- 10.17, N = 3961.66958.85791.04781.51753.94MIN: 792.79 / MAX: 1076.37MIN: 837.5 / MAX: 1071.89MIN: 324.26 / MAX: 1057.74MIN: 347.79 / MAX: 1068.81MIN: 291.39 / MAX: 1030.851. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR

Block Size: 4MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 4MB - Disk Target: Default Test Directory45123150300450600750SE +/- 3.96, N = 3SE +/- 35.87, N = 15SE +/- 37.33, N = 15SE +/- 6.62, N = 12SE +/- 4.85, N = 15687.41654.19623.83592.02582.93MIN: 279.64 / MAX: 1117.59MIN: 328.97 / MAX: 1093.06MIN: 310.99 / MAX: 1067.3MIN: 332.2 / MAX: 1089.17MIN: 366.23 / MAX: 1049.91. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR

Block Size: 8MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test Directory45231130260390520650SE +/- 7.50, N = 3SE +/- 4.89, N = 15SE +/- 8.80, N = 3SE +/- 6.43, N = 6SE +/- 8.56, N = 3594.69536.25535.92529.36524.14MIN: 330.92 / MAX: 1124.59MIN: 376.61 / MAX: 1090.13MIN: 424.77 / MAX: 1080.17MIN: 393.92 / MAX: 1033.73MIN: 399.31 / MAX: 1031.121. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR

Block Size: 16MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 16MB - Disk Target: Default Test Directory453122004006008001000SE +/- 3.29, N = 3SE +/- 2.84, N = 3SE +/- 1.33, N = 3SE +/- 3.19, N = 3SE +/- 6.50, N = 4808.18527.67496.04492.78490.50MIN: 297.46 / MAX: 1133.39MIN: 422.53 / MAX: 1035.04MIN: 422.66 / MAX: 988.54MIN: 391.54 / MAX: 1031.18MIN: 367.13 / MAX: 1024.211. (CC) gcc options: -O2 -lm -pthread -lmpi

IOR

Block Size: 32MB - Disk Target: Default Test Directory

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 32MB - Disk Target: Default Test Directory451232004006008001000SE +/- 6.78, N = 3SE +/- 6.72, N = 5SE +/- 4.96, N = 12SE +/- 5.02, N = 7SE +/- 4.61, N = 3813.49506.83492.81480.75471.83MIN: 299.5 / MAX: 1128.35MIN: 420 / MAX: 1027.38MIN: 391.29 / MAX: 1073.97MIN: 396.65 / MAX: 1002.46MIN: 411.76 / MAX: 928.831. (CC) gcc options: -O2 -lm -pthread -lmpi

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression5421a918273645SE +/- 0.33, N = 3373737371. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression4251a204060801001091091081081. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression21a54110220330440550SE +/- 0.67, N = 3SE +/- 1.15, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 35265265255251. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression1a42530060090012001500SE +/- 1.15, N = 3SE +/- 2.65, N = 3SE +/- 0.33, N = 3SE +/- 40.67, N = 315831572157215331. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression5421a20406080100949494941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression4251a400800120016002000SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 2.33, N = 317871786178417651. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression4521a20406080100SE +/- 0.88, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3959494941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression521a4100200300400500SE +/- 0.58, N = 3SE +/- 0.33, N = 34814814814801. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression1a542110220330440550SE +/- 3.18, N = 3SE +/- 1.15, N = 3SE +/- 1.20, N = 3SE +/- 3.00, N = 34984944924891. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression1a542120240360480600SE +/- 4.33, N = 3SE +/- 2.00, N = 25775725725691. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression4521a4080120160200SE +/- 0.33, N = 31971961961961. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression421a5140280420560700SE +/- 0.67, N = 3SE +/- 1.20, N = 3SE +/- 4.26, N = 3SE +/- 23.67, N = 36666646626401. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression5241a50100150200250SE +/- 2.49, N = 15SE +/- 3.06, N = 3SE +/- 2.87, N = 4SE +/- 3.06, N = 32062042032031. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.211a5425001000150020002500SE +/- 4.31, N = 3SE +/- 27.47, N = 6SE +/- 27.83, N = 6SE +/- 23.91, N = 82320.72296.22288.62286.01. (CXX) g++ options: -O3 -march=native -rdynamic

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT15423130060090012001500SE +/- 1.41, N = 3SE +/- 0.57, N = 3SE +/- 2.00, N = 3SE +/- 9.61, N = 3SE +/- 0.37, N = 31619.441617.871615.221607.371599.221. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC12541350100150200250SE +/- 0.06, N = 3SE +/- 0.45, N = 3SE +/- 0.57, N = 3SE +/- 1.00, N = 3SE +/- 1.16, N = 3248.79248.73248.61247.71247.561. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC243251306090120150SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.63, N = 3155.12154.92154.90154.30154.301. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering2435150100150200250SE +/- 0.07, N = 3SE +/- 0.34, N = 3SE +/- 1.28, N = 3SE +/- 0.32, N = 3SE +/- 2.59, N = 6230.99230.94229.77229.61227.801. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms2154348121620SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 315.4415.3515.0615.0114.801. (CXX) g++ options: -O3 -pthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein152433691215SE +/- 0.22, N = 13SE +/- 0.22, N = 15SE +/- 0.25, N = 15SE +/- 0.22, N = 15SE +/- 0.16, N = 1513.0112.8312.7712.5912.481. (CXX) g++ options: -O3 -pthread -lm

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.421546M12M18M24M30MSE +/- 496569.70, N = 12SE +/- 46576.08, N = 3SE +/- 621600.77, N = 12SE +/- 432230.86, N = 12273585032687781025857623257108381. (CXX) g++ options: -O3 -fopenmp

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C1a54216003200480064008000SE +/- 214.27, N = 15SE +/- 198.36, N = 15SE +/- 203.10, N = 15SE +/- 230.46, N = 157685.887441.037217.867203.141. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C4521a400800120016002000SE +/- 6.67, N = 3SE +/- 4.65, N = 3SE +/- 0.68, N = 3SE +/- 7.09, N = 31750.211743.291741.251733.781. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D21a54400800120016002000SE +/- 0.69, N = 3SE +/- 0.38, N = 3SE +/- 2.63, N = 3SE +/- 2.98, N = 31740.551740.431737.331732.071. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C451a25K10K15K20K25KSE +/- 137.92, N = 3SE +/- 47.70, N = 3SE +/- 397.10, N = 15SE +/- 443.87, N = 1522314.7122032.2321668.1021182.511. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D5421a2004006008001000SE +/- 9.93, N = 3SE +/- 8.96, N = 15SE +/- 14.63, N = 12SE +/- 17.71, N = 15811.98793.83770.10751.001. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C51a249K18K27K36K45KSE +/- 555.68, N = 15SE +/- 705.58, N = 15SE +/- 824.34, N = 15SE +/- 904.51, N = 1541730.7440721.5139921.1339758.791. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C1a4524K8K12K16K20KSE +/- 398.98, N = 15SE +/- 312.61, N = 15SE +/- 531.61, N = 15SE +/- 500.68, N = 1517335.8217192.7216629.6716587.151. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_1001254130260390520650SE +/- 1.64, N = 3SE +/- 1.23, N = 3SE +/- 1.79, N = 3SE +/- 1.86, N = 3594.56593.55589.78587.971. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.045213691215SE +/- 0.149, N = 3SE +/- 0.086, N = 15SE +/- 0.104, N = 15SE +/- 0.119, N = 159.0449.1269.2089.459MIN: 8.26 / MAX: 12.92MIN: 8.22 / MAX: 19.08MIN: 8.24 / MAX: 22.52MIN: 8.23 / MAX: 19.921. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-502541918273645SE +/- 0.26, N = 15SE +/- 0.30, N = 15SE +/- 0.29, N = 3SE +/- 0.25, N = 1537.8137.8437.9438.33MIN: 34.87 / MAX: 128.92MIN: 35 / MAX: 102.86MIN: 36.11 / MAX: 89.49MIN: 35.25 / MAX: 98.221. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_22445211.27622.55243.82865.10486.381SE +/- 0.138, N = 3SE +/- 0.055, N = 15SE +/- 0.073, N = 15SE +/- 0.062, N = 155.6215.6325.6705.672MIN: 5.33 / MAX: 6.27MIN: 5.17 / MAX: 6.36MIN: 5.12 / MAX: 13.18MIN: 5.22 / MAX: 6.841. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.021540.99181.98362.97543.96724.959SE +/- 0.067, N = 15SE +/- 0.050, N = 15SE +/- 0.036, N = 15SE +/- 0.099, N = 34.3364.3394.3404.408MIN: 3.43 / MAX: 36.21MIN: 3.45 / MAX: 35.08MIN: 3.82 / MAX: 35.09MIN: 3.88 / MAX: 23.421. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v345211122334455SE +/- 0.12, N = 3SE +/- 0.45, N = 15SE +/- 0.35, N = 15SE +/- 0.32, N = 1547.4448.2548.2948.51MIN: 45.7 / MAX: 117.61MIN: 43.95 / MAX: 109.39MIN: 44.15 / MAX: 138.3MIN: 44.81 / MAX: 103.241. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2215460120180240300SE +/- 0.81, N = 3SE +/- 0.22, N = 3SE +/- 0.44, N = 3SE +/- 0.81, N = 3279.84281.08288.43289.28MIN: 264.8 / MAX: 315.98MIN: 263.7 / MAX: 327.87MIN: 263.56 / MAX: 332.62MIN: 265.62 / MAX: 312.791. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1542150100150200250SE +/- 0.07, N = 3SE +/- 0.31, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 3251.34251.42251.54251.69MIN: 250.59 / MAX: 254.17MIN: 250.45 / MAX: 260.46MIN: 250.62 / MAX: 254.15MIN: 250.86 / MAX: 254.161. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP5421a9K18K27K36K45KSE +/- 260.65, N = 3SE +/- 57.95, N = 3SE +/- 566.70, N = 3SE +/- 192.21, N = 341135.4541316.5442328.9542858.381. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP451a213K26K39K52K65KSE +/- 106.37, N = 3SE +/- 43.63, N = 3SE +/- 196.31, N = 3SE +/- 203.39, N = 356415.0856829.9958307.9058878.981. (CXX) g++ options: -O3 -march=native -fopenmp

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics54123306090120150SE +/- 1.40, N = 15SE +/- 2.71, N = 12SE +/- 0.72, N = 3SE +/- 4.21, N = 9SE +/- 3.21, N = 986.1586.58119.54125.43132.071. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data41300600900120015001447.581474.83

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M2120406080100SE +/- 18.67, N = 3SE +/- 10.16, N = 840.8176.571. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: Motorbike 60M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 60M1160320480640800SE +/- 10.63, N = 9726.901. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF11254312400800120016002000SE +/- 5.57, N = 3SE +/- 23.00, N = 4SE +/- 3.23, N = 3SE +/- 19.15, N = 7SE +/- 26.20, N = 31673.031677.481687.651705.651710.861. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

RELION

Test: Basic - Device: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPU45231400800120016002000SE +/- 9.74, N = 3SE +/- 5.46, N = 3SE +/- 2.64, N = 3SE +/- 5.61, N = 3SE +/- 15.01, N = 31767.931781.071793.741798.431823.691. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -pthread -lmpi_cxx -lmpi

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile452120406080100SE +/- 0.19, N = 3SE +/- 0.38, N = 3SE +/- 0.43, N = 2SE +/- 0.87, N = 381.8982.1182.3283.68

Cython Benchmark

Test: N-Queens

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-Queens521a4612182430SE +/- 0.08, N = 3SE +/- 0.25, N = 3SE +/- 0.13, N = 3SE +/- 0.19, N = 325.8225.8626.0326.15

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.91a45250100150200250SE +/- 0.45, N = 3SE +/- 0.72, N = 3SE +/- 1.12, N = 3SE +/- 2.05, N = 3216.06216.25216.77217.831. (CC) gcc options: -O2 -fvisibility=hidden

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O54123918273645SE +/- 0.39, N = 7SE +/- 0.44, N = 15SE +/- 0.57, N = 3SE +/- 0.53, N = 3SE +/- 0.49, N = 1535.6235.9036.3137.3537.461. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm


Phoronix Test Suite v10.8.4