GCE c3d-standard-60

KVM testing on Ubuntu 22.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2310148-NE-2310093NE34
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Disable Color Branding
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
c3d-standard-60 AMD Genoa
October 03 2023
  10 Hours, 30 Minutes
t2d-standard-60 AMD Milan
October 03 2023
  13 Hours, 23 Minutes
c6g.16xlarge
October 05 2023
  9 Hours, 27 Minutes
m7a.16xlarge
October 06 2023
  9 Hours, 1 Minute
c7g.16xlarge
October 08 2023
  6 Hours, 28 Minutes
c2d-standard-56
October 13 2023
  12 Hours, 50 Minutes
Invert Behavior (Only Show Selected Data)
  10 Hours, 17 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


GCE c3d-standard-60ProcessorMotherboardChipsetMemoryDiskNetworkOSKernelVulkanCompilerFile-SystemSystem Layerc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargem7a.16xlargec7g.16xlargec2d-standard-56AMD EPYC 9B14 (30 Cores / 60 Threads)Google Compute Engine c3d-standard-60Intel 440FX 82441FX PMC240GB215GB nvme_card-pdGoogle Compute Engine VirtualUbuntu 22.046.2.0-1014-gcp (x86_64)1.3.238GCC 11.4.0ext4KVMAMD EPYC 7B13 (60 Cores)Google Compute Engine t2d-standard-60215GB PersistentDiskRed Hat Virtio deviceARMv8 Neoverse-N1 (64 Cores)Amazon EC2 c6g.16xlarge (1.0 BIOS)Amazon Device 0200128GB215GB Amazon Elastic Block StoreAmazon Elastic5.19.0-1025-aws (aarch64)amazonAMD EPYC 9R14 (64 Cores)Amazon EC2 m7a.16xlarge (1.0 BIOS)Intel 440FX 82441FX PMC256GB5.19.0-1025-aws (x86_64)ARMv8 Neoverse-V1 (64 Cores)Amazon EC2 c7g.16xlarge (1.0 BIOS)Amazon Device 0200128GB5.19.0-1025-aws (aarch64)AMD EPYC 7B13 (28 Cores / 56 Threads)Google Compute Engine c2d-standard-56Intel 440FX 82441FX PMC224GB215GB PersistentDiskRed Hat Virtio device6.2.0-1014-gcp (x86_64)KVMOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- c3d-standard-60 AMD Genoa: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - t2d-standard-60 AMD Milan: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c6g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - m7a.16xlarge: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - c7g.16xlarge: --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v - c2d-standard-56: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-XeT9lY/gcc-11-11.4.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- c3d-standard-60 AMD Genoa: CPU Microcode: 0xffffffff- t2d-standard-60 AMD Milan: CPU Microcode: 0xffffffff- m7a.16xlarge: CPU Microcode: 0xa10113e- c2d-standard-56: CPU Microcode: 0xffffffffJava Details- OpenJDK Runtime Environment (build 11.0.20.1+1-post-Ubuntu-0ubuntu122.04)Python Details- Python 3.10.12Security Details- c3d-standard-60 AMD Genoa: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - t2d-standard-60 AMD Milan: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c6g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - m7a.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected - c7g.16xlarge: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected - c2d-standard-56: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargem7a.16xlargec7g.16xlargec2d-standard-56Logarithmic Result OverviewPhoronix Test SuiteNAS Parallel BenchmarksOpenSSLnekRSGROMACSlibavif avifenclibxsmmRemhosXcompact3d Incompact3dRodiniaLAMMPS Molecular Dynamics SimulatorAlgebraic Multi-Grid BenchmarkOpenVINOTimed Node.js CompilationCoremarkApache CassandraStockfishnginxHeFFTe - Highly Efficient FFT for ExascaleTimed Gem5 CompilationLaghos7-Zip Compression

GCE c3d-standard-60pgbench: 100 - 800 - Read Write - Average Latencyblender: Barbershop - CPU-Onlylammps: 20k Atomspgbench: 100 - 800 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencyapache-iotdb: 800 - 100 - 800 - 400apache-iotdb: 800 - 100 - 800 - 400openradioss: Chrysler Neon 1Mrodinia: OpenMP HotSpot3Dbuild-linux-kernel: allmodconfigpgbench: 100 - 1000 - Read Writestockfish: Total Timenekrs: Kershawnekrs: TurboPipe Periodicapache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 800 - 100 - 500 - 400apache-iotdb: 500 - 100 - 800 - 400apache-iotdb: 500 - 100 - 800 - 400build-nodejs: Time To Compileopenradioss: Bird Strike on Windshieldcassandra: Writesbuild-gem5: Time To Compileopenssl: AES-256-GCMopenssl: ChaCha20openssl: AES-128-GCMopenssl: ChaCha20-Poly1305openssl: SHA512openssl: SHA256openradioss: Bumper Beambrl-cad: VGR Performance Metricapache-iotdb: 500 - 100 - 500 - 400apache-iotdb: 500 - 100 - 500 - 400pgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 1000 - Read Only - Average Latencyopenradioss: Rubber O-Ring Seal Installationtensorflow: CPU - 64 - ResNet-50avifenc: 0pgbench: 100 - 800 - Read Onlypgbench: 100 - 1000 - Read Onlyopenradioss: Cell Phone Drop Testblender: Pabellon Barcelona - CPU-Onlyopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUlaghos: Sedov Blast Wave, ube_922_hex.meshblender: Classroom - CPU-Onlynginx: 1000nginx: 500openvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUtensorflow: CPU - 32 - ResNet-50avifenc: 2openvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUnpb: EP.Dopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Road Segmentation ADAS FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Handwritten English Recognition FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Face Detection Retail FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUnpb: LU.Copenssl: RSA4096openssl: RSA4096npb: SP.Crodinia: OpenMP LavaMDbuild-linux-kernel: defconfiglaghos: Triple Point Problemnpb: IS.Dnpb: BT.Cgromacs: MPI CPU - water_GMX50_bareblender: Fishy Cat - CPU-Onlytensorflow: CPU - 16 - ResNet-50blender: BMW27 - CPU-Onlyincompact3d: input.i3d 193 Cells Per Directionrodinia: OpenMP Leukocytecoremark: CoreMark Size 666 - Iterations Per Secondcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingamg: remhos: Sample Remap Examplelibxsmm: 64libxsmm: 32npb: FT.Cnpb: CG.Crodinia: OpenMP Streamclusterrodinia: OpenMP CFD Solveravifenc: 6, Losslessincompact3d: input.i3d 129 Cells Per Directionnpb: MG.Cavifenc: 6heffte: c2c - FFTW - double - 128lammps: Rhodopsin Proteinheffte: r2c - FFTW - double - 128heffte: r2c - FFTW - float - 128heffte: c2c - FFTW - float - 128openradioss: INIVOL and Fluid Structure Interaction Drop Containerc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargem7a.16xlargec7g.16xlargec2d-standard-5619.776682.1235359884337.7084.16610589445742898583334723940000447.683433223734762565623.89198.394147.31228640176.767293328048497173980949893343095284440123909304773147022705734621182131392.87510819418.593326815889.6569.6878.06838.825.862043.0583.99142.7583.91142.90259.55180537.84187350.44340.2935.1962.7441.538648.8118.3918.56645.743783.604.536605.1264.70185.296.791764.2120.77576.9439.38761.1431.08964.468.621389.698.203650.5715.981875.282.874166.390.454971.260.5243607.0473563.13493077.620079.539919.7164.862209.002422.4096257.484.39150.9928.019687745.4981445843.52155222621127179596288983333.362489.7255.439647.4719597.866.44810.0256.8895.8715788542701.833.25057.311617.42393.6005148.57588.6301140.916351.5826.7345682172.717709.4635068557327.8888.535333.351579311295878836819358332730620000433.963412381034925899633.58191.706123.61187169170.930216025967640180249145770234604082610119647720337222448041835088499710375.68629363415.08334668040.3990.49872.0620.9078.3502003784200818630.12112.649.901512.57208.4773.74193.4578.96364.6489.35155609.04162957.75568.7726.2820.3641.9891393.5610.7326.50565.344935.683.524239.52155.1496.5823.64633.7666.46225.4876.72390.7281.00370.0340.67368.3911.322646.5814.761014.7011.651285.470.6144049.150.9929668.1294247.77860844.612973.043228.1150.97433.399222.301752.62122720.615.28945.2218.2934.2724.572118142.0101730658.44944024725527897392042776716.326554.2289.254846.1816649.376.4237.3687.6395.6305732747291.963.20560.034327.828106.029196.948109.676168.19125.0594784210.608409.09747768180770617588600002221710000286.201217355224.414129198197600673247783601587885109704671512648714384917863422885139730.7671.026270.06810432679750316990.100.14947.591.06947.861.06321.29158700.36162553.8522391.860.04167.9469996.560.16773.310.152213.762186.810.46735.581.36135.877.36382.472.61423.952.36394.942.53153.126.53181.945.50119.178.3948.0820.797.33136.165.58178.8218807.75215683.22640.09716.9962.301102.216179.52915.8024229.142.76625.87483281259870.716902234046239735103289366720.816589.5312.721386.3713343.3514.2125.9838.8795.6181168625661.044.46732.357526.04179.0156202.445129.172150.601276.2331.4715312188.676598.4944699315190.7974.778267.965530013541916976678466674774796667355.134264390344502333521.98154.441115.96278585153.800522113080527308308045083592545362740216773475533264815068206225386119766.27788704340.11408992100.2740.34759.18100.1565.4472923009288094026.1491.884.353666.8956.21284.4256.41283.40409.7371.51224859.09233014.72261.1161.1987.1935.805503.3831.7213.071222.997501.763.0710382.2250.72315.145.073146.0415.221049.6627.601158.4322.531419.116.602417.345.166177.9410.193132.482.217222.100.2792100.800.3881996.81210544.87996017.531583.8102392.4043.28627.709218.864085.20193219.127.65537.1269.5527.7411.591308634.6612158639.274883282593330633184344433313.8671201.8643.4103413.0242007.575.9306.4805.6782.89602661121293.802.64971.109532.785124.363190.602121.505149.59736.7255348187.188316.005534211950247332521666673991736667236.482316573179.9992831878262301032346086833323451119837430651576332045616820544385551700.6090.770170.587131431712985273957.740.25554.281.81553.521.81409.66255540.03254864.9312852.080.08102.0815639.780.183871.330.263648.431255.020.8431.902.3298.5710.15221.504.51410.002.44395.912.5389.3611.19109.999.0969.8914.3130.0833.235.52180.793.86258.5928356.70713624.410181.117223.9543.90580.347231.851797.7638934.154.19413.79193881608877.078630285523310517176734900014.258785.0494.239830.1822031.2711.4574.3535.9143.1587883749799.573.17852.477937.462127.985291.966175.729140.410468.2018.3695702171.947679.8635377376363.6090.736452.74858208401701981952933335431083333414.403462589935460713629.00231.384146.56178227192.60111840259574012082530595312954876668782221279643131045990073982579484098.45410738408.13333959680.5300.66969.4321.8288.8101509987149444536.84150.0612.831089.83169.5782.47169.6282.47239.48125.73161127.09160196.24868.3516.0321.4846.5411914.277.2529.99466.412579.924.363195.58149.9793.2313.661022.8143.39322.2792.77301.6099.81280.2619.94701.0017.691581.2320.28689.566.302215.740.635877.141.2521951.0378848.08468650.67181.448423.5270.15741.168199.802190.3994143.853.95159.2220.0049.0125.415339146.7411212209.81928020819223043098187726724.643581.1306.347216.0523089.426.6589.3438.0806.1440919254301.253.80261.184618.02899.6480164.720104.534OpenBenchmarking.org

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latencyc6g.16xlargem7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-564080120160200SE +/- 3.85, N = 12SE +/- 0.39, N = 3SE +/- 0.83, N = 3SE +/- 1.23, N = 12SE +/- 1.15, N = 12168.19150.60149.60140.92140.411. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlyc2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge100200300400500SE +/- 0.71, N = 3SE +/- 0.60, N = 3SE +/- 0.49, N = 3468.20351.58276.23

Blend File: Barbershop - Compute: CPU-Only

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k Atomsc2d-standard-56c3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milanm7a.16xlargec7g.16xlarge816243240SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 318.3719.7825.0626.7331.4736.73-lm-lm-lm-lm1. (CXX) g++ options: -O3 -ldl

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Writec6g.16xlargem7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-5612002400360048006000SE +/- 109.40, N = 12SE +/- 13.63, N = 3SE +/- 29.72, N = 3SE +/- 49.28, N = 12SE +/- 47.23, N = 12478453125348568257021. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 800 - Mode: Read Write

c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latencyc6g.16xlargem7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-5650100150200250SE +/- 6.00, N = 9SE +/- 0.59, N = 3SE +/- 0.83, N = 3SE +/- 1.44, N = 8SE +/- 1.46, N = 12210.61188.68187.19172.72171.951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Apache IoTDB

Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56m7a.16xlarge150300450600750SE +/- 25.87, N = 3SE +/- 7.53, N = 3SE +/- 1.38, N = 3SE +/- 4.86, N = 3709.46682.12679.86598.49MAX: 113264.06MAX: 98294.84MAX: 79453.07MAX: 62432.53

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56m7a.16xlarge10M20M30M40M50MSE +/- 354923.23, N = 3SE +/- 267152.12, N = 3SE +/- 248722.01, N = 3SE +/- 99649.31, N = 335068557353598843537737644699315

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1Mc2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge80160240320400SE +/- 0.14, N = 3SE +/- 2.03, N = 3SE +/- 1.48, N = 3SE +/- 0.61, N = 3363.60337.70327.88190.79

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP HotSpot3Dc2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge20406080100SE +/- 1.43, N = 15SE +/- 1.83, N = 12SE +/- 0.86, N = 15SE +/- 0.74, N = 1590.7488.5484.1774.781. (CXX) g++ options: -O2 -lOpenCL

Test: OpenMP HotSpot3D

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigc2d-standard-56c6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge100200300400500SE +/- 1.50, N = 3SE +/- 2.41, N = 3SE +/- 1.20, N = 3SE +/- 0.81, N = 3SE +/- 0.37, N = 3452.75409.10333.35316.01267.97

Build: allmodconfig

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status. E: linux-6.1/tools/objtool/include/objtool/elf.h:10:10: fatal error: gelf.h: No such file or directory

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Writec6g.16xlargem7a.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-5612002400360048006000SE +/- 124.50, N = 9SE +/- 16.57, N = 3SE +/- 23.65, N = 3SE +/- 49.93, N = 8SE +/- 50.46, N = 12477653005342579358201. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 1000 - Mode: Read Write

c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total Timec6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge30M60M90M120M150MSE +/- 1645401.81, N = 15SE +/- 705199.36, N = 15SE +/- 1450871.09, N = 3SE +/- 1618403.29, N = 14SE +/- 1921303.63, N = 15SE +/- 1001447.21, N = 38180770684017019105894457112958788119502473135419169-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2-m64 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi21. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlargec2d-standard-562000M4000M6000M8000M10000MSE +/- 2970005.61, N = 3SE +/- 2525353.66, N = 3SE +/- 84802173.02, N = 12SE +/- 57202190.49, N = 12SE +/- 49077561.86, N = 3SE +/- 88231529.95, N = 31758860000325216666736819358334289858333766784666781952933331. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodicc6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargec3d-standard-60 AMD Genoam7a.16xlargec2d-standard-561200M2400M3600M4800M6000MSE +/- 1790009.31, N = 3SE +/- 481352.26, N = 3SE +/- 1792766.33, N = 3SE +/- 201939132.13, N = 12SE +/- 6657808.28, N = 3SE +/- 4494968.79, N = 32221710000273062000039917366674723940000477479666754310833331. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Apache IoTDB

Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56m7a.16xlarge100200300400500SE +/- 27.08, N = 3SE +/- 35.57, N = 3SE +/- 4.37, N = 3SE +/- 7.15, N = 3447.68433.96414.40355.13MAX: 95136.87MAX: 103381.73MAX: 91109.59MAX: 67149.77

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56m7a.16xlarge9M18M27M36M45MSE +/- 130588.20, N = 3SE +/- 66565.66, N = 3SE +/- 177230.11, N = 3SE +/- 111073.27, N = 334123810343322373462589942643903

Device Count: 800 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56m7a.16xlarge10M20M30M40M50MSE +/- 188621.29, N = 3SE +/- 221111.93, N = 3SE +/- 80227.43, N = 3SE +/- 224779.59, N = 334762565349258993546071344502333

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400t2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge140280420560700SE +/- 12.58, N = 3SE +/- 13.51, N = 3SE +/- 3.18, N = 3SE +/- 0.65, N = 3633.58629.00623.89521.98MAX: 54749.7MAX: 48846.34MAX: 41831.2MAX: 34235.44

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 800 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 19.8.1Time To Compilec6g.16xlargec7g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge60120180240300SE +/- 0.11, N = 3SE +/- 0.31, N = 3SE +/- 0.26, N = 3SE +/- 0.29, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3286.20236.48231.38198.39191.71154.44

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on Windshieldc3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge306090120150SE +/- 3.26, N = 9SE +/- 0.41, N = 3SE +/- 0.13, N = 3SE +/- 0.18, N = 3147.31146.56123.61115.96

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writesc2d-standard-56t2d-standard-60 AMD Milanc6g.16xlargec3d-standard-60 AMD Genoam7a.16xlargec7g.16xlarge70K140K210K280K350KSE +/- 1718.54, N = 3SE +/- 681.76, N = 3SE +/- 3249.94, N = 12SE +/- 1129.40, N = 3SE +/- 352.65, N = 3SE +/- 2261.70, N = 3178227187169217355228640278585316573

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compilec6g.16xlargec2d-standard-56c7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge50100150200250SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.25, N = 3SE +/- 0.07, N = 3SE +/- 0.27, N = 3SE +/- 0.39, N = 3224.41192.60180.00176.77170.93153.80

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMc2d-standard-56c6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargec3d-standard-60 AMD Genoam7a.16xlarge110000M220000M330000M440000M550000MSE +/- 18032723.86, N = 3SE +/- 2100313.05, N = 3SE +/- 178290221.71, N = 3SE +/- 10793448.32, N = 3SE +/- 71241287.97, N = 3SE +/- 1691817104.57, N = 3118402595740129198197600216025967640283187826230293328048497522113080527-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20c6g.16xlargec7g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge70000M140000M210000M280000M350000MSE +/- 372419.81, N = 3SE +/- 926708.87, N = 3SE +/- 13175767.44, N = 3SE +/- 12326205.97, N = 3SE +/- 47698640.87, N = 3SE +/- 280681661.27, N = 367324778360103234608683120825305953173980949893180249145770308308045083-lssl -lcrypto-m64-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMc2d-standard-56c6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargec3d-standard-60 AMD Genoam7a.16xlarge130000M260000M390000M520000M650000MSE +/- 2205754.73, N = 3SE +/- 5537993.15, N = 3SE +/- 376720190.05, N = 3SE +/- 38386470.30, N = 3SE +/- 342949201.09, N = 3SE +/- 1818538727.73, N = 3129548766687158788510970234604082610332345111983343095284440592545362740-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305c6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge50000M100000M150000M200000M250000MSE +/- 2259404.37, N = 3SE +/- 814473.51, N = 3SE +/- 25534569.58, N = 3SE +/- 198663058.81, N = 3SE +/- 3664727.47, N = 3SE +/- 85099636.22, N = 3467151264877430651576382221279643119647720337123909304773216773475533-lssl -lcrypto-m64-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512c2d-standard-56c6g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlargec7g.16xlarge7000M14000M21000M28000M35000MSE +/- 631939.37, N = 3SE +/- 6214593.12, N = 3SE +/- 4399663.13, N = 3SE +/- 108274834.92, N = 3SE +/- 29763937.73, N = 3SE +/- 9797429.91, N = 3131045990071438491786314702270573222448041832648150682032045616820-m64-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256c2d-standard-56c6g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge13000M26000M39000M52000M65000MSE +/- 7310870.38, N = 3SE +/- 192235444.29, N = 3SE +/- 9562123.40, N = 3SE +/- 20491615.60, N = 3SE +/- 1709212.75, N = 3SE +/- 161718701.08, N = 3398257948404228851397346211821313508849971035443855517062253861197-m64-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper Beamc2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge20406080100SE +/- 0.85, N = 3SE +/- 1.56, N = 15SE +/- 0.10, N = 3SE +/- 0.05, N = 398.4592.8775.6866.27

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metricc2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge200K400K600K800K1000K4107385108196293637887041. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

VGR Performance Metric

c6g.16xlarge: The test quit with a non-zero exit status. E: ERROR: Could not find the BRL-CAD raytracer

c7g.16xlarge: The test quit with a non-zero exit status. E: ERROR: Could not find the BRL-CAD raytracer

Apache IoTDB

Apache IotDB is a time series database and this benchmark is facilitated using the IoT Benchmaark [https://github.com/thulab/iot-benchmark/]. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgAverage Latency, Fewer Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56m7a.16xlarge90180270360450SE +/- 2.14, N = 3SE +/- 4.08, N = 3SE +/- 2.53, N = 3SE +/- 4.92, N = 3418.59415.08408.13340.11MAX: 31920.67MAX: 28810.4MAX: 33039.23MAX: 37899.66

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

OpenBenchmarking.orgpoint/sec, More Is BetterApache IoTDB 1.2Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400c3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge9M18M27M36M45MSE +/- 154587.14, N = 3SE +/- 247504.71, N = 3SE +/- 303573.79, N = 3SE +/- 457186.43, N = 333268158333959683346680440899210

Device Count: 500 - Batch Size Per Write: 100 - Sensor Count: 500 - Client Number: 400

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latencyc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge0.17260.34520.51780.69040.863SE +/- 0.009, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 30.7670.6090.5300.3990.2741. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latencyc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge0.23090.46180.69270.92361.1545SE +/- 0.013, N = 3SE +/- 0.007, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 3SE +/- 0.000, N = 31.0260.7700.6690.4980.3471. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal Installationc3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56m7a.16xlarge20406080100SE +/- 4.01, N = 12SE +/- 0.22, N = 3SE +/- 0.24, N = 3SE +/- 0.73, N = 389.6572.0669.4359.18

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50t2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge20406080100SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 320.9021.8269.68100.15

Device: CPU - Batch Size: 64 - Model: ResNet-50

c6g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

c7g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 0c6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge60120180240300SE +/- 0.34, N = 3SE +/- 0.42, N = 3SE +/- 0.31, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3270.07170.5988.8178.3578.0765.451. (CXX) g++ options: -O3 -fPIC -lm

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 800 - Mode: Read Onlyc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge600K1200K1800K2400K3000KSE +/- 12058.27, N = 3SE +/- 14241.12, N = 3SE +/- 20718.28, N = 3SE +/- 20558.90, N = 3SE +/- 13309.30, N = 3104326713143171509987200378429230091. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Onlyc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge600K1200K1800K2400K3000KSE +/- 12606.16, N = 3SE +/- 11830.85, N = 3SE +/- 9220.12, N = 3SE +/- 22497.42, N = 3SE +/- 2874.04, N = 397503112985271494445200818628809401. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ./pgbench: 21: pg_/bin/pgbench: not found

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop Testc3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge918273645SE +/- 1.27, N = 15SE +/- 0.12, N = 3SE +/- 0.39, N = 15SE +/- 0.08, N = 338.8236.8430.1226.14

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlyc2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge306090120150SE +/- 0.13, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3150.06112.6491.88

Blend File: Pabellon Barcelona - Compute: CPU-Only

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

OpenVINO

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge15003000450060007500SE +/- 27.09, N = 15SE +/- 1.04, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 36990.103957.7412.839.905.864.35-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6842.82 / MAX: 7088.53-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 3952.99 / MAX: 3967.181. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge8001600240032004000SE +/- 0.00, N = 15SE +/- 0.00, N = 3SE +/- 1.05, N = 3SE +/- 1.12, N = 3SE +/- 3.16, N = 3SE +/- 1.69, N = 30.140.251089.831512.572043.053666.89-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge2004006008001000SE +/- 0.37, N = 3SE +/- 0.71, N = 3SE +/- 9.17, N = 15SE +/- 0.29, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 3947.59554.28208.47169.5783.9956.21-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 943.57 / MAX: 951.68-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 550.22 / MAX: 563.55-pie - MIN: 119.65 / MAX: 316.01-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 94.4 / MAX: 218.71. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge60120180240300SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 3.12, N = 15SE +/- 0.13, N = 3SE +/- 0.23, N = 3SE +/- 0.53, N = 31.061.8173.7482.47142.75284.42-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge2004006008001000SE +/- 0.55, N = 3SE +/- 0.11, N = 3SE +/- 7.79, N = 15SE +/- 0.29, N = 3SE +/- 0.29, N = 3SE +/- 0.06, N = 3947.86553.52193.45169.6283.9156.41-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 944.57 / MAX: 958.26-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 549.89 / MAX: 562.06-pie - MIN: 113.44 / MAX: 315.54-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 104.86 / MAX: 209.351. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Detection FP32 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge60120180240300SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 2.74, N = 15SE +/- 0.14, N = 3SE +/- 0.48, N = 3SE +/- 0.29, N = 31.061.8178.9682.47142.90283.40-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

Laghos

Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshc2d-standard-56c3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge90180270360450SE +/- 0.02, N = 3SE +/- 0.31, N = 3SE +/- 0.79, N = 3SE +/- 0.62, N = 3SE +/- 0.57, N = 3SE +/- 1.29, N = 3239.48259.55321.29364.64409.66409.731. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlyc2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge306090120150SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.01, N = 3125.7389.3571.51

Blend File: Classroom - Compute: CPU-Only

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000t2d-standard-60 AMD Milanc6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoam7a.16xlargec7g.16xlarge50K100K150K200K250KSE +/- 156.32, N = 3SE +/- 132.24, N = 3SE +/- 157.70, N = 3SE +/- 688.79, N = 3SE +/- 260.28, N = 3SE +/- 141.46, N = 3155609.04158700.36161127.09180537.84224859.09255540.031. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500c2d-standard-56c6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlargec7g.16xlarge50K100K150K200K250KSE +/- 513.44, N = 3SE +/- 249.05, N = 3SE +/- 394.72, N = 3SE +/- 126.15, N = 3SE +/- 451.60, N = 3SE +/- 757.47, N = 3160196.24162553.85162957.75187350.44233014.72254864.931. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenVINO

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge5K10K15K20K25KSE +/- 15.60, N = 3SE +/- 5.72, N = 3SE +/- 0.38, N = 3SE +/- 0.35, N = 3SE +/- 0.21, N = 3SE +/- 0.03, N = 322391.8612852.08868.35568.77340.29261.11-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 22364.17 / MAX: 22423.42-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 12821.62 / MAX: 12882.021. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge1428425670SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 30.040.0816.0326.2835.1961.19-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50t2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge20406080100SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 320.3621.4862.7487.19

Device: CPU - Batch Size: 32 - Model: ResNet-50

c6g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

c7g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 2c6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge4080120160200SE +/- 0.22, N = 3SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.22, N = 3167.95102.0846.5441.9941.5435.811. (CXX) g++ options: -O3 -fPIC -lm

OpenVINO

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge2K4K6K8K10KSE +/- 1.02, N = 3SE +/- 1.45, N = 3SE +/- 3.85, N = 3SE +/- 1.32, N = 3SE +/- 0.17, N = 3SE +/- 0.12, N = 39996.565639.781914.271393.56648.81503.38-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 9993.58 / MAX: 10001.76-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 5635.22 / MAX: 5646.39-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 1788.62 / MAX: 2033.221. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge714212835SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 30.100.187.2510.7318.3931.72-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge15003000450060007500SE +/- 76.95, N = 3SE +/- 0.51, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36773.313871.3329.9926.5018.5613.07-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6616.83 / MAX: 6859.99-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 3868.49 / MAX: 3876.891. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge30060090012001500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 1.07, N = 3SE +/- 0.37, N = 3SE +/- 0.79, N = 3SE +/- 0.51, N = 30.150.26466.41565.34645.741222.99-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.Dc6g.16xlargec2d-standard-56c7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge16003200480064008000SE +/- 7.28, N = 3SE +/- 36.32, N = 3SE +/- 25.41, N = 13SE +/- 30.67, N = 3SE +/- 51.21, N = 5SE +/- 8.54, N = 32213.762579.923648.433783.604935.687501.761. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenVINO

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge5001000150020002500SE +/- 27.29, N = 3SE +/- 0.23, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32186.811255.024.534.363.523.07-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 2135.06 / MAX: 2233.34-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 1251.83 / MAX: 1261.91. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge2K4K6K8K10KSE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 2.82, N = 3SE +/- 2.08, N = 3SE +/- 6.52, N = 3SE +/- 3.60, N = 30.460.803195.584239.526605.1210382.22-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge160320480640800SE +/- 0.26, N = 3SE +/- 0.24, N = 3SE +/- 0.58, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.02, N = 3735.58431.90155.14149.9764.7050.72-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 734.34 / MAX: 738.23-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 430.12 / MAX: 435.25-pie - MIN: 114.75 / MAX: 224.64-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 81.05 / MAX: 193.41. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Machine Translation EN To DE FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge70140210280350SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.37, N = 3SE +/- 0.25, N = 3SE +/- 0.11, N = 31.362.3293.2396.58185.29315.14-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge306090120150SE +/- 0.21, N = 3SE +/- 0.21, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3135.8798.5723.6413.666.795.07-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 132.35 / MAX: 148.64-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 94.4 / MAX: 118.88-pie - MIN: 9.57 / MAX: 42.221. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Person Vehicle Bike Detection FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge7001400210028003500SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 3.66, N = 3SE +/- 1.21, N = 3SE +/- 4.38, N = 3SE +/- 1.10, N = 37.3610.15633.761022.811764.213146.04-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge80160240320400SE +/- 0.20, N = 3SE +/- 0.13, N = 3SE +/- 0.27, N = 3SE +/- 0.39, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3382.47221.5066.4643.3920.7715.22-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 381.67 / MAX: 383.43-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 220.78 / MAX: 223.35-pie - MIN: 25.85 / MAX: 122.21. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Road Segmentation ADAS FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge2004006008001000SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.93, N = 3SE +/- 2.90, N = 3SE +/- 1.04, N = 3SE +/- 0.11, N = 32.614.51225.48322.27576.941049.66-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge90180270360450SE +/- 0.26, N = 3SE +/- 3.34, N = 3SE +/- 0.65, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3423.95410.0092.7776.7239.3827.60-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 411.17 / MAX: 440.57-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 391.1 / MAX: 424.93-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 51.49 / MAX: 118.72-pie - MIN: 58.8 / MAX: 121.691. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge2004006008001000SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 2.14, N = 3SE +/- 1.37, N = 3SE +/- 1.75, N = 3SE +/- 0.18, N = 32.362.44301.60390.72761.141158.43-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUc7g.16xlargec6g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge90180270360450SE +/- 1.65, N = 3SE +/- 2.18, N = 3SE +/- 0.13, N = 3SE +/- 0.21, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3395.91394.9499.8181.0031.0822.53-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 380.35 / MAX: 415.38-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 380.82 / MAX: 408.15-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 55.12 / MAX: 150.04-pie - MIN: 64.65 / MAX: 134.641. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Handwritten English Recognition FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge30060090012001500SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.37, N = 3SE +/- 0.96, N = 3SE +/- 0.60, N = 3SE +/- 0.70, N = 32.532.53280.26370.03964.461419.11-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge306090120150SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3153.1289.3640.6719.948.626.60-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 152.69 / MAX: 153.75-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 88.98 / MAX: 90.65-pie - MIN: 12.07 / MAX: 59.44-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 10.34 / MAX: 37.941. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Vehicle Detection FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge5001000150020002500SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 1.55, N = 3SE +/- 0.69, N = 3SE +/- 6.33, N = 3SE +/- 1.65, N = 36.5311.19368.39701.001389.692417.34-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge4080120160200SE +/- 0.24, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3181.94109.9917.6911.328.205.16-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 180.71 / MAX: 184.11-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 108.37 / MAX: 112.831. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge13002600390052006500SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.68, N = 3SE +/- 3.26, N = 3SE +/- 3.28, N = 3SE +/- 4.43, N = 35.509.091581.232646.583650.576177.94-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge306090120150SE +/- 0.07, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3119.1769.8920.2815.9814.7610.19-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 118.73 / MAX: 120.21-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 69.44 / MAX: 71.41-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 14.46 / MAX: 30.61. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Weld Porosity Detection FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge7001400210028003500SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.77, N = 3SE +/- 0.56, N = 3SE +/- 0.47, N = 3SE +/- 0.31, N = 38.3914.31689.561014.701875.283132.48-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge1122334455SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.14, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 348.0830.0811.656.302.872.21-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 47.55 / MAX: 50.49-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 29.37 / MAX: 32.17-pie - MIN: 3.76 / MAX: 29.1-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 3.57 / MAX: 21.51. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Face Detection Retail FP16 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge15003000450060007500SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 15.96, N = 3SE +/- 3.06, N = 3SE +/- 8.74, N = 3SE +/- 4.07, N = 320.7933.231285.472215.744166.397222.10-pie-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlarget2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge246810SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 37.335.520.610.600.400.27-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 6.54 / MAX: 9.95-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 4.5 / MAX: 9.531. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge20K40K60K80K100KSE +/- 0.48, N = 3SE +/- 0.71, N = 3SE +/- 26.90, N = 3SE +/- 332.93, N = 3SE +/- 45.46, N = 3SE +/- 453.62, N = 3136.16180.7935877.1444049.1554971.2692100.80-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge1.25552.5113.76655.0226.2775SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.583.861.250.990.520.38-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF -shared - MIN: 5.45 / MAX: 6.48-isystem -fPIC -fvisibility=hidden -std=c++14 -MD -MT -MF - MIN: 3.63 / MAX: 5.36-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem - MIN: 0.71 / MAX: 15.89-pie - MIN: 0.8 / MAX: 13.761. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.1Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge20K40K60K80K100KSE +/- 0.39, N = 3SE +/- 0.88, N = 3SE +/- 3.74, N = 3SE +/- 14.46, N = 3SE +/- 18.42, N = 3SE +/- 31.79, N = 3178.82258.5921951.0329668.1243607.0481996.81-fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF -isystem-pie-pie-pie1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.Cc6g.16xlargec7g.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge50K100K150K200K250KSE +/- 7.52, N = 3SE +/- 30.15, N = 3SE +/- 293.57, N = 3SE +/- 22.13, N = 3SE +/- 1463.52, N = 15SE +/- 661.61, N = 318807.7528356.7073563.1378848.0894247.77210544.871. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096c6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoac7g.16xlarget2d-standard-60 AMD Milanm7a.16xlarge200K400K600K800K1000KSE +/- 6.55, N = 3SE +/- 418.00, N = 3SE +/- 50.91, N = 3SE +/- 189.99, N = 3SE +/- 644.45, N = 3SE +/- 367.82, N = 3215683.2468650.6493077.6713624.4860844.6996017.5-lssl -lcrypto-m64-m64 -lssl -lcrypto-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096c6g.16xlargec2d-standard-56c7g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoam7a.16xlarge7K14K21K28K35KSE +/- 0.09, N = 3SE +/- 0.44, N = 3SE +/- 1.56, N = 3SE +/- 11.58, N = 3SE +/- 14.62, N = 3SE +/- 24.57, N = 32640.07181.410181.112973.020079.531583.8-m64-lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto-m64 -lssl -lcrypto1. (CC) gcc options: -pthread -O3 -ldl

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.Cc6g.16xlargec7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56m7a.16xlarge20K40K60K80K100KSE +/- 0.76, N = 3SE +/- 40.05, N = 3SE +/- 45.01, N = 3SE +/- 555.92, N = 3SE +/- 113.84, N = 3SE +/- 91.17, N = 39716.9917223.9539919.7143228.1148423.52102392.401. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP LavaMDc2d-standard-56c3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge1632486480SE +/- 0.13, N = 3SE +/- 0.18, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.14, N = 3SE +/- 0.24, N = 370.1664.8662.3050.9743.9143.291. (CXX) g++ options: -O2 -lOpenCL

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigc6g.16xlargec7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge20406080100SE +/- 0.82, N = 3SE +/- 0.65, N = 3SE +/- 0.54, N = 3SE +/- 0.37, N = 5SE +/- 0.29, N = 5102.2280.3541.1733.4027.71

Build: defconfig

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status. E: linux-6.1/tools/objtool/include/objtool/elf.h:10:10: fatal error: gelf.h: No such file or directory

Laghos

Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemc6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarget2d-standard-60 AMD Milanc7g.16xlarge50100150200250SE +/- 0.50, N = 3SE +/- 0.61, N = 3SE +/- 0.22, N = 3SE +/- 1.67, N = 3SE +/- 1.77, N = 3SE +/- 0.90, N = 3179.52199.80209.00218.86222.30231.851. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.Dc6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargec2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge9001800270036004500SE +/- 0.58, N = 3SE +/- 142.62, N = 12SE +/- 1.25, N = 3SE +/- 3.14, N = 3SE +/- 36.45, N = 15SE +/- 3.14, N = 3915.801752.621797.762190.392422.404085.201. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.Cc6g.16xlargec7g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge40K80K120K160K200KSE +/- 7.69, N = 3SE +/- 19.48, N = 3SE +/- 80.45, N = 3SE +/- 122.23, N = 3SE +/- 42.77, N = 3SE +/- 560.75, N = 324229.1438934.1594143.8596257.48122720.61193219.121. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_barec6g.16xlargec2d-standard-56c7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge246810SE +/- 0.001, N = 3SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.035, N = 32.7663.9514.1944.3915.2897.6551. (CXX) g++ options: -O3

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlyc2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge1326395265SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 359.2245.2237.12

Blend File: Fishy Cat - Compute: CPU-Only

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries if desired for complementary metrics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50t2d-standard-60 AMD Milanc2d-standard-56c3d-standard-60 AMD Genoam7a.16xlarge1530456075SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.20, N = 318.2920.0050.9969.55

Device: CPU - Batch Size: 16 - Model: ResNet-50

c6g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

c7g.16xlarge: The test quit with a non-zero exit status. E: ModuleNotFoundError: No module named 'absl'

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlyc2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge1122334455SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 349.0134.2727.74

Blend File: BMW27 - Compute: CPU-Only

c3d-standard-60 AMD Genoa: The test quit with a non-zero exit status.

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionc3d-standard-60 AMD Genoac6g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge714212835SE +/- 0.25, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.31, N = 12SE +/- 0.03, N = 3SE +/- 0.04, N = 328.0225.8725.4224.5713.7911.591. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Leukocytec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge1122334455SE +/- 0.24, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.32, N = 346.7445.5042.0134.661. (CXX) g++ options: -O2 -lOpenCL

Test: OpenMP Leukocyte

c6g.16xlarge: The test quit with a non-zero exit status.

c7g.16xlarge: The test quit with a non-zero exit status.

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondc2d-standard-56c6g.16xlargec3d-standard-60 AMD Genoac7g.16xlarget2d-standard-60 AMD Milanm7a.16xlarge500K1000K1500K2000K2500KSE +/- 833.48, N = 3SE +/- 635.29, N = 3SE +/- 1295.68, N = 3SE +/- 11587.63, N = 15SE +/- 9191.61, N = 3SE +/- 1437.63, N = 31212209.821259870.721445843.521608877.081730658.452158639.271. (CC) gcc options: -O2 -lrt" -lrt

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingc2d-standard-56c3d-standard-60 AMD Genoac6g.16xlarget2d-standard-60 AMD Milanm7a.16xlargec7g.16xlarge60K120K180K240K300KSE +/- 140.45, N = 3SE +/- 519.21, N = 3SE +/- 57.33, N = 3SE +/- 347.74, N = 3SE +/- 342.37, N = 3SE +/- 141.62, N = 32081922262112340462472552825932855231. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingc2d-standard-56c6g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge70K140K210K280K350KSE +/- 214.91, N = 3SE +/- 359.95, N = 3SE +/- 346.51, N = 3SE +/- 388.74, N = 3SE +/- 291.75, N = 3SE +/- 400.99, N = 32304302397352717952789733105173306331. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac2d-standard-56c6g.16xlargec7g.16xlargem7a.16xlarge400M800M1200M1600M2000MSE +/- 1088162.98, N = 3SE +/- 2060519.90, N = 3SE +/- 999273.29, N = 3SE +/- 176147.98, N = 3SE +/- 536620.29, N = 3SE +/- 1428129.35, N = 39204277679628898339818772671032893667176734900018434443331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Examplec3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarget2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge816243240SE +/- 0.17, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 333.3624.6420.8216.3314.2613.871. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

libxsmm

Libxsmm is an open-source library for specialized dense and sparse matrix operations and deep learning primitives. Libxsmm supports making use of Intel AMX, AVX-512, and other modern CPU instruction set capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 64c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c6g.16xlargec7g.16xlargem7a.16xlarge30060090012001500SE +/- 0.12, N = 3SE +/- 0.25, N = 3SE +/- 1.99, N = 3SE +/- 0.96, N = 3SE +/- 0.03, N = 3SE +/- 0.52, N = 3489.7554.2581.1589.5785.01201.8-lquadmath -msse4.2-lquadmath -msse4.2-lquadmath -msse4.2-march=armv8.1-a-march=armv8.1-a-lquadmath -msse4.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

OpenBenchmarking.orgGFLOPS/s, More Is Betterlibxsmm 2-1.17-3645M N K: 32c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56c6g.16xlargec7g.16xlargem7a.16xlarge140280420560700SE +/- 0.19, N = 3SE +/- 3.60, N = 4SE +/- 0.38, N = 3SE +/- 0.47, N = 3SE +/- 0.07, N = 3SE +/- 0.40, N = 3255.4289.2306.3312.7494.2643.4-lquadmath -msse4.2-lquadmath -msse4.2-lquadmath -msse4.2-march=armv8.1-a-march=armv8.1-a-lquadmath -msse4.21. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.Cc6g.16xlargec3d-standard-60 AMD Genoac7g.16xlargec2d-standard-56t2d-standard-60 AMD Milanm7a.16xlarge20K40K60K80K100KSE +/- 2.85, N = 3SE +/- 600.19, N = 15SE +/- 13.54, N = 3SE +/- 57.93, N = 3SE +/- 137.77, N = 3SE +/- 446.85, N = 321386.3739647.4739830.1847216.0554846.18103413.021. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.Cc6g.16xlarget2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargec2d-standard-56m7a.16xlarge9K18K27K36K45KSE +/- 23.52, N = 3SE +/- 1215.82, N = 15SE +/- 77.92, N = 3SE +/- 25.33, N = 3SE +/- 206.16, N = 7SE +/- 178.46, N = 313343.3516649.3719597.8622031.2723089.4242007.571. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

Rodinia

Rodinia is a suite focused upon accelerating compute-intensive applications with accelerators. CUDA, OpenMP, and OpenCL parallel models are supported by the included applications. This profile utilizes select OpenCL, NVIDIA CUDA and OpenMP test binaries at the moment. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamclusterc6g.16xlargec7g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanm7a.16xlarge48121620SE +/- 0.017, N = 3SE +/- 0.124, N = 3SE +/- 0.009, N = 3SE +/- 0.104, N = 15SE +/- 0.009, N = 3SE +/- 0.049, N = 314.21211.4576.6586.4486.4235.9301. (CXX) g++ options: -O2 -lOpenCL

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solverc3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlargec6g.16xlargec7g.16xlarge3691215SE +/- 0.013, N = 3SE +/- 0.013, N = 3SE +/- 0.034, N = 3SE +/- 0.007, N = 3SE +/- 0.001, N = 3SE +/- 0.011, N = 310.0259.3437.3686.4805.9834.3531. (CXX) g++ options: -O2 -lOpenCL

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6, Losslessc6g.16xlargec2d-standard-56t2d-standard-60 AMD Milanc3d-standard-60 AMD Genoac7g.16xlargem7a.16xlarge246810SE +/- 0.032, N = 3SE +/- 0.015, N = 3SE +/- 0.031, N = 3SE +/- 0.099, N = 3SE +/- 0.009, N = 3SE +/- 0.008, N = 38.8798.0807.6396.8895.9145.6781. (CXX) g++ options: -O3 -fPIC -lm

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionc2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc6g.16xlargec7g.16xlargem7a.16xlarge246810SE +/- 0.02575719, N = 3SE +/- 0.04970425, N = 3SE +/- 0.02210564, N = 3SE +/- 0.01888616, N = 3SE +/- 0.01236463, N = 3SE +/- 0.03993251, N = 36.144091925.871578855.630573275.618116863.158788372.896026611. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.Cc6g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargec2d-standard-56m7a.16xlarge30K60K90K120K150KSE +/- 10.99, N = 3SE +/- 29.69, N = 3SE +/- 145.06, N = 3SE +/- 10.47, N = 3SE +/- 99.78, N = 3SE +/- 526.14, N = 325661.0442701.8347291.9649799.5754301.25121293.801. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.2

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 1.0Encoder Speed: 6c6g.16xlargec2d-standard-56c3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc7g.16xlargem7a.16xlarge1.00512.01023.01534.02045.0255SE +/- 0.014, N = 3SE +/- 0.010, N = 3SE +/- 0.007, N = 3SE +/- 0.013, N = 3SE +/- 0.004, N = 3SE +/- 0.015, N = 34.4673.8023.2503.2053.1782.6491. (CXX) g++ options: -O3 -fPIC -lm

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128c6g.16xlargec7g.16xlargec3d-standard-60 AMD Genoat2d-standard-60 AMD Milanc2d-standard-56m7a.16xlarge1632486480SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 1.29, N = 15SE +/- 0.66, N = 3SE +/- 0.78, N = 15SE +/- 0.63, N = 1532.3652.4857.3160.0361.1871.111. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: Rhodopsin Proteinc3d-standard-60 AMD Genoac2d-standard-56c6g.16xlarget2d-standard-60 AMD Milanm7a.16xlargec7g.16xlarge918273645SE +/- 0.54, N = 12SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.17, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 317.4218.0326.0427.8332.7937.46-lm-lm-lm-lm1. (CXX) g++ options: -O3 -ldl

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128c6g.16xlargec3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlargec7g.16xlarge306090120150SE +/- 0.71, N = 3SE +/- 1.78, N = 12SE +/- 1.39, N = 3SE +/- 0.91, N = 3SE +/- 1.18, N = 15SE +/- 0.10, N = 379.0293.6099.65106.03124.36127.991. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128c3d-standard-60 AMD Genoac2d-standard-56m7a.16xlarget2d-standard-60 AMD Milanc6g.16xlargec7g.16xlarge60120180240300SE +/- 2.19, N = 12SE +/- 1.51, N = 6SE +/- 2.16, N = 15SE +/- 0.82, N = 3SE +/- 0.57, N = 3SE +/- 0.59, N = 3148.58164.72190.60196.95202.45291.971. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.3Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128c3d-standard-60 AMD Genoac2d-standard-56t2d-standard-60 AMD Milanm7a.16xlargec6g.16xlargec7g.16xlarge4080120160200SE +/- 0.52, N = 3SE +/- 1.06, N = 3SE +/- 0.78, N = 3SE +/- 1.44, N = 15SE +/- 0.11, N = 3SE +/- 0.44, N = 388.63104.53109.68121.51129.17175.731. (CXX) g++ options: -O3

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

Model: INIVOL and Fluid Structure Interaction Drop Container

c3d-standard-60 AMD Genoa: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND

t2d-standard-60 AMD Milan: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND

m7a.16xlarge: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container_0001.rad NOT FOUND

c2d-standard-56: The test run did not produce a result. E: ** ERROR: INPUT FILE /fsi_drop_container NOT FOUND

119 Results Shown

PostgreSQL
Blender
LAMMPS Molecular Dynamics Simulator
PostgreSQL:
  100 - 800 - Read Write
  100 - 1000 - Read Write - Average Latency
Apache IoTDB:
  800 - 100 - 800 - 400:
    Average Latency
    point/sec
OpenRadioss
Rodinia
Timed Linux Kernel Compilation
PostgreSQL
Stockfish
nekRS:
  Kershaw
  TurboPipe Periodic
Apache IoTDB:
  800 - 100 - 500 - 400:
    Average Latency
    point/sec
  500 - 100 - 800 - 400:
    point/sec
    Average Latency
Timed Node.js Compilation
OpenRadioss
Apache Cassandra
Timed Gem5 Compilation
OpenSSL:
  AES-256-GCM
  ChaCha20
  AES-128-GCM
  ChaCha20-Poly1305
  SHA512
  SHA256
OpenRadioss
BRL-CAD
Apache IoTDB:
  500 - 100 - 500 - 400:
    Average Latency
    point/sec
PostgreSQL:
  100 - 800 - Read Only - Average Latency
  100 - 1000 - Read Only - Average Latency
OpenRadioss
TensorFlow
libavif avifenc
PostgreSQL:
  100 - 800 - Read Only
  100 - 1000 - Read Only
OpenRadioss
Blender
OpenVINO:
  Vehicle Detection FP16-INT8 - CPU:
    ms
    FPS
  Person Detection FP16 - CPU:
    ms
    FPS
  Person Detection FP32 - CPU:
    ms
    FPS
Laghos
Blender
nginx:
  1000
  500
OpenVINO:
  Face Detection FP16-INT8 - CPU:
    ms
    FPS
TensorFlow
libavif avifenc
OpenVINO:
  Face Detection FP16 - CPU:
    ms
    FPS
  Road Segmentation ADAS FP16-INT8 - CPU:
    ms
    FPS
NAS Parallel Benchmarks
OpenVINO:
  Face Detection Retail FP16-INT8 - CPU:
    ms
    FPS
  Machine Translation EN To DE FP16 - CPU:
    ms
    FPS
  Person Vehicle Bike Detection FP16 - CPU:
    ms
    FPS
  Road Segmentation ADAS FP16 - CPU:
    ms
    FPS
  Handwritten English Recognition FP16-INT8 - CPU:
    ms
    FPS
  Handwritten English Recognition FP16 - CPU:
    ms
    FPS
  Vehicle Detection FP16 - CPU:
    ms
    FPS
  Weld Porosity Detection FP16-INT8 - CPU:
    ms
    FPS
  Weld Porosity Detection FP16 - CPU:
    ms
    FPS
  Face Detection Retail FP16 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU:
    ms
    FPS
  Age Gender Recognition Retail 0013 FP16 - CPU:
    ms
    FPS
NAS Parallel Benchmarks
OpenSSL:
  RSA4096:
    verify/s
    sign/s
NAS Parallel Benchmarks
Rodinia
Timed Linux Kernel Compilation
Laghos
NAS Parallel Benchmarks:
  IS.D
  BT.C
GROMACS
Blender
TensorFlow
Blender
Xcompact3d Incompact3d
Rodinia
Coremark
7-Zip Compression:
  Decompression Rating
  Compression Rating
Algebraic Multi-Grid Benchmark
Remhos
libxsmm:
  64
  32
NAS Parallel Benchmarks:
  FT.C
  CG.C
Rodinia:
  OpenMP Streamcluster
  OpenMP CFD Solver
libavif avifenc
Xcompact3d Incompact3d
NAS Parallel Benchmarks
libavif avifenc
HeFFTe - Highly Efficient FFT for Exascale
LAMMPS Molecular Dynamics Simulator
HeFFTe - Highly Efficient FFT for Exascale:
  r2c - FFTW - double - 128
  r2c - FFTW - float - 128
  c2c - FFTW - float - 128