7773x

Tests for a future article. 2 x AMD EPYC 7573X 32-Core testing with a AMD DAYTONA_X (RYM1009B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2305044-NE-7773X849132&grr&rdt.

7773x ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolutionab5 a5 b5 2p a5 2p bAMD EPYC 7773X 64-Core @ 2.20GHz (64 Cores / 128 Threads)AMD DAYTONA_X (RYM1009B BIOS)AMD Starship/Matisse256GB3841GB Micron_9300_MTFDHAL3T8TDPASPEEDVE2282 x Mellanox MT27710Ubuntu 22.045.15.0-47-generic (x86_64)GNOME Shell 42.4X Server 1.21.1.31.2.204GCC 11.2.0ext41920x1080AMD EPYC 7573X 32-Core @ 2.80GHz (32 Cores / 64 Threads)2 x AMD EPYC 7573X 32-Core @ 2.80GHz (64 Cores / 128 Threads)512GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001229Python Details- Python 3.10.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

7773x petsc: Streamsopenvkl: vklBenchmark ISPCopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeblender: Barbershop - CPU-Onlylczero: Eigenlczero: BLASbuild-llvm: Unix Makefilesopencv: Graph APIffmpeg: libx265 - Platformffmpeg: libx265 - Platformffmpeg: libx265 - Video On Demandffmpeg: libx265 - Video On Demandncnn: CPU - FastestDetncnn: CPU - vision_transformerncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetffmpeg: libx265 - Uploadffmpeg: libx265 - Uploadbuild-llvm: Ninjaopencv: Stitchingaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingclickhouse: 100M Rows Hits Dataset, Third Runclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, First Run / Cold Cachevvenc: Bosphorus 4K - Fastblender: Pabellon Barcelona - CPU-Onlyopencv: Coreblender: Classroom - CPU-Onlyffmpeg: libx265 - Liveffmpeg: libx265 - Liveonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUjohn-the-ripper: MD5john-the-ripper: HMAC-SHA512compress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedvvenc: Bosphorus 4K - Fasteropenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUcompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 12 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 3 - Compression Speedopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopencv: DNN - Deep Neural Networkvvenc: Bosphorus 1080p - Fastblender: Fishy Cat - CPU-Onlysvt-av1: Preset 4 - Bosphorus 4Kopencv: Object Detectionspecfem3d: Layered Halfspacegromacs: MPI CPU - water_GMX50_barespecfem3d: Water-layered Halfspaceblender: BMW27 - CPU-Onlyjohn-the-ripper: WPA PSKjohn-the-ripper: bcryptjohn-the-ripper: Blowfishquantlib: svt-av1: Preset 8 - Bosphorus 4Kcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingvvenc: Bosphorus 1080p - Fasterembree: Pathtracer - Asian Dragon Objembree: Pathtracer ISPC - Asian Dragon Objespeak: Text-To-Speech Synthesismt-dgemm: Sustained Floating-Point Ratespecfem3d: Homogeneous Halfspacebuild-ffmpeg: Time To Compilesvt-av1: Preset 4 - Bosphorus 1080pspecfem3d: Tomographic Modelonednn: Deconvolution Batch shapes_1d - f32 - CPUaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingspecfem3d: Mount St. Helensembree: Pathtracer ISPC - Crownincompact3d: input.i3d 193 Cells Per Directionembree: Pathtracer - Crownonednn: IP Shapes 1D - f32 - CPUembree: Pathtracer - Asian Dragonembree: Pathtracer ISPC - Asian Dragonaskap: Hogbom Clean OpenMPcloverleaf: Lagrangian-Eulerian Hydrodynamicslulesh: svt-av1: Preset 8 - Bosphorus 1080ppennant: sedovbigdraco: Church Facadeonednn: IP Shapes 3D - f32 - CPUaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingdraco: Lionsvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Konednn: Convolution Batch Shapes Auto - f32 - CPUpennant: leblancbigincompact3d: input.i3d 129 Cells Per Directiononednn: Deconvolution Batch shapes_3d - f32 - CPUsvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080pab5 a5 b5 2p a5 2p b47040.48165225.348583260.83246.95743.73173.2343.67173.4721.59116.93165.7076.28688.7672.45105.7847.74553400013679600011.36740.45056824.72995716.55234.774.86330.5189126617.29028.08154095027.6120138886230864602746.573.44230.00670.523864.511729.38349617.20934908216.92810.29513.86220230011.67841461663.371169.107778.475173.8446109.98457934867222.837195.924604.842552.24156185.7504469363.7359120.39337259.8751425554248.48123581043.90172.5695669443.90172.53393950619.08133.3953.5219.3223.9219.587.1310.8924.0419.098.1215.4411.7115.610.2911.317.8421.62116.80915814164.4492011608308.335493.35437.35439.07428.516.12888.737706172.02106.9847.2067748891183.63749.62255430001351650001190.89.311.3643911.978.0540.24442825.59608539278.041276.817.62608.3412.161175.0127.03230.27138.841300.8752.61434.2744.51445.9270.81390988.51277.3281916.931888.216.931889.0526.541205.1126.681198.4523.922674.291.7435737.821.6238091.463975616.37434.614.8263191030.1158997047.29927.46754027327.620295791276873602692.467.54841121339692830.11464.640928.15729.0456916.81708024617.04210.23413.5301793946.9214536827.532799.511.7533214663.364717.21184731.2610773.828806.45211.3422257.137109.7119.5636158331.2461819018.320481.24885214.933198.1350.8276225.1040784.438041213.12487601.984549.83431992.6068340428.6592117.4378408.7912381419290.53920697046.70162.1946.89161.549.25126.2626.3414.8519.9814.85.488.1221.0814.113.829.036.087.736.196.4613.722.95110.01230.3781840329611.056599.68456.30457.81437.206.406136.5868343113.31110.4945.711182.61679.6453621000960590001229.19.5411.9612881.045.4850.74590422.3373962886.125.471304.218.21935.68.23886.4117.92172.5392.611361.5854.21460.1792.11478.1289.11438.81067.81326.52856.612.771252.0512.841244.8619841.4919.45822.1918.11767.131.2625209.621.1627340.963999716.84652.684.7432773949.1361494695.02245.7094271342.5213084860192603252756.965.33724422027191831.83743.958939.194127.82617.874524.98744586720.37810.7619.0896648574.4956524990.120991.718.8621990740.240819.630500843.94921.3799648.522844.9118869.56512.0920920.487107.38214.0011157180.6259319018.317750.44760220.813199.691.146567.9740225.067564012.76234629.3571.084339427.71276117.6045297.22322.93110.14231.0916.39110.2945.7936050009492400012.0250.69975222.70639917.2534.7648.73867645444.33457433513107260152603452820.470.85631.79544.188439.223618.640325.08962279720.6410.82819.63009634219.35555309240.119943.978748.507445.1054108.364226.733196.063622.732560.565452205.4639299.384145213.682868284219.40344.31170.94271146344.97168.45694118640.45144.31100.6537.1738.180.8118.638.9433.5593.2914.6535.1238.4933.2721.2627.777.7722.38112.819760537138.71712506.77738.59461.53463.68445.185.54871.6458.5592.5254.581390.361212.846845000790020001222.99.579.5742975.1510.5433.00107922.5639462946.9710.641315.6182013.5215.86914.2434.83182.1175.481332.3808.51454.7702.81482.7283.61437.510241306.72783.712.662523.5712.762504.3619.581632.8120.061594.1418.263501.691.3739455.8614.81227.964.78425.2210067458.28523.93606599722.592594131185021185792760.962.83645399640373924.49268.692264.409127.86831.34451514.49141946915.45510.4111.22476781310.522149980.141983.39.60945386873.330110.624130281.45082.6780176.806773.9612436.68115.6742030.038107.17.65636956820.9118061664112102.54734169.112169.6580.6743364.2947312.486542941.85167540.729525.15474130.1789452203.6511899.93268213.4678167991221.78941970243.64173.5600295843.77173.05225728353.25145.49128.3535.8145.89120.2728.5551.832.81112.4324.5660.7949.4637.8928.8838.67115.6822.10114.261775562138.56528749211068.87176.41448.03467.46440.565.76871.7323644558.57107.2947.071385.331305.966683000732600001227.19.59.9062948.3910.6332.97431623.3352752952.7610.641312182020.2115.76912.4234.91180.91176.761336.5741.51445.1702.21498.9282.81441.11013.41309.52775.912.662524.0512.792498.7419.691623.8320.071592.7118.253504.151.438800.341.3340352.488713315.04427.94.8428855325.4721902918.22223.26803264122.512584571175421177712850.869.53644074039684424.26868.702264.044128.13731.76087614.41593115815.34410.49511.79752558510.368551199.141983.39.5947693573.260210.762441681.09132.8845176.628773.7779434.78315.5142551.898104.1697.79332557290.91722615662.112678.94818176.107172.5730.6693494.4180122.483869081.89859565.057546.034OpenBenchmarking.org

PETSc

Test: Streams

OpenBenchmarking.orgMB/s, More Is BetterPETSc 3.19Test: Streamsb5 a5 2p b16K32K48K64K80K56185.7531992.6174130.181. (CC) gcc options: -fPIC -O3 -O2 -lpthread -ludev -lpciaccess -lm

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPCab5 a5 b5 2p a5 2p b100200300400500SE +/- 0.33, N = 3470469340339452452MIN: 84 / MAX: 2616MIN: 84 / MAX: 2565MIN: 55 / MAX: 2309MIN: 54 / MAX: 2307MIN: 98 / MAX: 1875MIN: 99 / MAX: 2013

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Timeab5 a5 b5 2p a5 2p b9018027036045040.48363.74428.66427.71205.46203.651. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Timeab5 a5 b5 2p a5 2p b30609012015025.35120.39117.44117.6099.3899.931. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Barbershop - Compute: CPU-Onlyab5 a5 2p a5 2p b90180270360450SE +/- 0.09, N = 3260.83259.87408.79213.60213.46

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigenb5 a5 2p a5 2p b2K4K6K8K10K51421238828678161. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASb5 a5 2p a5 2p b2K4K6K8K10K55541419828479911. (CXX) g++ options: -flto -pthread

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix Makefilesab5 a5 b5 2p a5 2p b60120180240300SE +/- 0.96, N = 3246.96248.48290.54297.22219.40221.79

OpenCV

Test: Graph API

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Graph APIb5 a5 2p b90K180K270K360K450K2358102069704197021. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

FFmpeg

Encoder: libx265 - Scenario: Platform

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Platformab5 a5 2p a5 2p b1122334455SE +/- 0.02, N = 343.7343.9046.7044.3143.641. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Platform

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Platformab5 a5 2p a5 2p b4080120160200SE +/- 0.08, N = 3173.23172.57162.19170.94173.561. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Video On Demand

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Video On Demandab5 a5 2p a5 2p b1122334455SE +/- 0.04, N = 343.6743.9046.8944.9743.771. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Video On Demand

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Video On Demandab5 a5 2p a5 2p b4080120160200SE +/- 0.15, N = 3173.47172.53161.54168.46173.051. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetb5 a5 2p a5 2p b122436486019.089.2540.4553.25MIN: 13.65 / MAX: 21.7MIN: 9.12 / MAX: 9.81MIN: 27.4 / MAX: 462.07MIN: 30.07 / MAX: 66.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerb5 a5 2p a5 2p b306090120150133.39126.26144.31145.49MIN: 129.7 / MAX: 252.77MIN: 125.42 / MAX: 132.03MIN: 140.5 / MAX: 157.63MIN: 141.27 / MAX: 245.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mb5 a5 2p a5 2p b30609012015053.5226.34100.65128.35MIN: 50.93 / MAX: 71.06MIN: 25.99 / MAX: 28.26MIN: 97.81 / MAX: 136.12MIN: 111.43 / MAX: 240.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: squeezenet_ssdb5 a5 2p a5 2p b91827364519.3214.8537.1735.81MIN: 18.86 / MAX: 22.39MIN: 14.49 / MAX: 25.32MIN: 30.62 / MAX: 51.54MIN: 29.14 / MAX: 58.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: yolov4-tinyb5 a5 2p a5 2p b102030405023.9219.9838.1045.89MIN: 23.17 / MAX: 30.48MIN: 19.53 / MAX: 23.19MIN: 29.08 / MAX: 101.85MIN: 34.08 / MAX: 64.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50b5 a5 2p a5 2p b30609012015019.5814.8080.81120.27MIN: 19.15 / MAX: 44.79MIN: 14.6 / MAX: 16.84MIN: 62.51 / MAX: 112.38MIN: 42.99 / MAX: 192.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: alexnetb5 a5 2p a5 2p b7142128357.135.4818.6028.55MIN: 6.97 / MAX: 7.76MIN: 5.36 / MAX: 6.34MIN: 17.59 / MAX: 34.64MIN: 12.6 / MAX: 62.181. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet18b5 a5 2p a5 2p b122436486010.898.1238.9451.80MIN: 10.68 / MAX: 11.74MIN: 8.01 / MAX: 10.04MIN: 16.05 / MAX: 124.98MIN: 16.09 / MAX: 93.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vgg16b5 a5 2p a5 2p b81624324024.0421.0833.5532.81MIN: 23.49 / MAX: 30.66MIN: 20.78 / MAX: 24.35MIN: 28.65 / MAX: 42.53MIN: 30.06 / MAX: 44.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetb5 a5 2p a5 2p b30609012015019.0914.1193.29112.43MIN: 18.77 / MAX: 25.76MIN: 13.96 / MAX: 17.22MIN: 52.46 / MAX: 137.19MIN: 29.99 / MAX: 148.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceb5 a5 2p a5 2p b6121824308.123.8214.6524.56MIN: 6.94 / MAX: 11.1MIN: 3.45 / MAX: 79.74MIN: 11.34 / MAX: 73.36MIN: 20.64 / MAX: 91.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0b5 a5 2p a5 2p b142842567015.449.0335.1260.79MIN: 13.63 / MAX: 18.67MIN: 8.93 / MAX: 11.07MIN: 33.83 / MAX: 41.96MIN: 47.67 / MAX: 141.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetb5 a5 2p a5 2p b112233445511.716.0838.4949.46MIN: 9.43 / MAX: 20.81MIN: 6.01 / MAX: 6.54MIN: 25.51 / MAX: 75.24MIN: 36.42 / MAX: 176.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: shufflenet-v2b5 a5 2p a5 2p b91827364515.607.7333.2737.89MIN: 12.95 / MAX: 19.73MIN: 7.58 / MAX: 9.71MIN: 29.37 / MAX: 96.69MIN: 34.63 / MAX: 113.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v3-v3 - Model: mobilenet-v3b5 a5 2p a5 2p b71421283510.296.1921.2628.88MIN: 9.51 / MAX: 11.93MIN: 6.05 / MAX: 6.93MIN: 20.79 / MAX: 28.25MIN: 23.66 / MAX: 168.431. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v2-v2 - Model: mobilenet-v2b5 a5 2p a5 2p b91827364511.306.4627.7038.67MIN: 9.74 / MAX: 14.96MIN: 6.35 / MAX: 8.73MIN: 23.18 / MAX: 43.54MIN: 28.02 / MAX: 119.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mobilenetb5 a5 2p a5 2p b30609012015017.8413.7077.77115.68MIN: 17.55 / MAX: 25.78MIN: 13.55 / MAX: 14.38MIN: 67.11 / MAX: 156MIN: 64.45 / MAX: 159.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

FFmpeg

Encoder: libx265 - Scenario: Upload

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Uploadab5 a5 b5 2p a5 2p b510152025SE +/- 0.01, N = 321.5921.6222.9522.9322.3822.101. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Upload

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Uploadab5 a5 b5 2p a5 2p b306090120150SE +/- 0.05, N = 3116.93116.81110.01110.14112.82114.261. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninjaab5 a5 b5 2p a5 2p b50100150200250SE +/- 0.08, N = 3165.71164.45230.38231.09138.72138.57

OpenCV

Test: Stitching

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Stitchingb5 a5 2p b60K120K180K240K300K2011601840322874921. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degriddingb5 a5 2p a5 2p b3K6K9K12K15K8308.339611.0512506.7011068.801. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Griddingb5 a5 2p a5 2p b170034005100680085005493.356599.687738.597176.411. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ClickHouse

100M Rows Hits Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third Runb5 a5 2p a5 2p b100200300400500437.35456.30461.53448.03MIN: 35.59 / MAX: 5454.55MIN: 24.65 / MAX: 5454.55MIN: 41.49 / MAX: 3000MIN: 41.38 / MAX: 2608.7

ClickHouse

100M Rows Hits Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second Runb5 a5 2p a5 2p b100200300400500439.07457.81463.68467.46MIN: 35.82 / MAX: 6000MIN: 24.13 / MAX: 5454.55MIN: 41.64 / MAX: 4615.38MIN: 40.6 / MAX: 4615.38

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold Cacheb5 a5 2p a5 2p b100200300400500428.51437.20445.18440.56MIN: 34.8 / MAX: 5454.55MIN: 24.3 / MAX: 6000MIN: 41.18 / MAX: 3157.89MIN: 40.98 / MAX: 4000

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 4K - Video Preset: Fastab5 a5 b5 2p a5 2p b246810SE +/- 0.010, N = 36.2866.1286.4066.3905.5485.7681. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Pabellon Barcelona - Compute: CPU-Onlyab5 a5 2p a5 2p b306090120150SE +/- 0.10, N = 388.7688.73136.5871.6471.73

OpenCV

Test: Core

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Coreb5 a5 2p b50K100K150K200K250K77061683432364451. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Classroom - Compute: CPU-Onlyab5 a5 2p a5 2p b306090120150SE +/- 0.17, N = 372.4572.02113.3158.5558.57

FFmpeg

Encoder: libx265 - Scenario: Live

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Liveab5 a5 b5 2p a5 2p b20406080100SE +/- 0.29, N = 3105.78106.98110.49110.2992.52107.291. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Live

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Liveab5 a5 b5 2p a5 2p b1224364860SE +/- 0.13, N = 347.7447.2145.7145.7954.5847.071. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b300600900120015001183.631182.611390.361385.33MIN: 1162.73MIN: 1161.83MIN: 1287.43MIN: 1314.761. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b30060090012001500749.62679.651212.841305.96MIN: 731.75MIN: 664.64MIN: 1070.52MIN: 1060.611. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5ab5 a5 b5 2p a5 2p b1.5M3M4.5M6M7.5MSE +/- 1000.00, N = 35534000554300036210003605000684500066830001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512ab5 a5 b5 2p a5 2p b30M60M90M120M150MSE +/- 266743.32, N = 3136796000135165000960590009492400079002000732600001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression Speedb5 a5 2p a5 2p b300600900120015001190.81229.11222.91227.11. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression Speedb5 a5 2p a5 2p b36912159.309.549.579.501. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 4K - Video Preset: Fasterab5 a5 b5 2p a5 2p b3691215SE +/- 0.022, N = 311.36711.36411.96112.0209.5749.9061. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUb5 a5 2p a5 2p b80016002400320040003911.972881.042975.152948.39MIN: 3337.4 / MAX: 4451.63MIN: 1536.21 / MAX: 3142.35MIN: 2241.37 / MAX: 3616.82MIN: 1547.54 / MAX: 3537.31. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPUb5 a5 2p a5 2p b36912158.055.4810.5410.631. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Timeab5 a5 b5 2p a5 2p b112233445540.4540.2450.7550.7033.0032.971. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Timeab5 a5 b5 2p a5 2p b61218243024.7325.6022.3422.7122.5623.341. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUb5 a5 2p a5 2p b80016002400320040003927.002886.122946.972952.76MIN: 3402.58 / MAX: 4474.47MIN: 1694.68 / MAX: 3104.67MIN: 2004.15 / MAX: 3534.38MIN: 2193.59 / MAX: 3652.321. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPUb5 a5 2p a5 2p b36912158.045.4710.6410.641. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression Speedb5 a5 2p a5 2p b300600900120015001276.81304.21315.61312.01. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression Speedb5 a5 2p a5 2p b4812162017.618.218.018.01. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUb5 a5 2p a5 2p b60012001800240030002608.341935.602013.522020.21MIN: 2421.38 / MAX: 2754.89MIN: 1852.05 / MAX: 1974.19MIN: 1890.96 / MAX: 2802.82MIN: 1823.96 / MAX: 3111.511. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPUb5 a5 2p a5 2p b4812162012.168.2315.8615.761. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUb5 a5 2p a5 2p b300600900120015001175.01886.41914.24912.42MIN: 982.69 / MAX: 1202.21MIN: 851.09 / MAX: 900.44MIN: 797.83 / MAX: 966.84MIN: 878.63 / MAX: 988.741. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPUb5 a5 2p a5 2p b81624324027.0317.9234.8334.911. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUb5 a5 2p a5 2p b50100150200250230.27172.53182.10180.91MIN: 166.99 / MAX: 311.89MIN: 81.35 / MAX: 207.95MIN: 117.14 / MAX: 548.18MIN: 124.49 / MAX: 288.011. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPUb5 a5 2p a5 2p b4080120160200138.8492.61175.48176.761. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3, Long Mode - Decompression Speedb5 a5 2p a5 2p b300600900120015001300.81361.51332.31336.51. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3, Long Mode - Compression Speedb5 a5 2p a5 2p b2004006008001000752.6854.2808.5741.51. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8, Long Mode - Decompression Speedb5 a5 2p a5 2p b300600900120015001434.21460.11454.71445.11. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8, Long Mode - Compression Speedb5 a5 2p a5 2p b2004006008001000744.5792.1702.8702.21. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression Speedb5 a5 2p a5 2p b300600900120015001445.91478.11482.71498.91. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression Speedb5 a5 2p a5 2p b60120180240300270.8289.1283.6282.81. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8 - Decompression Speedb5 a5 2p a5 2p b300600900120015001390.01438.81437.51441.11. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8 - Compression Speedb5 a5 2p a5 2p b2004006008001000988.51067.81024.01013.41. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3 - Decompression Speedb5 a5 2p a5 2p b300600900120015001277.31326.51306.71309.51. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3 - Compression Speedb5 a5 2p a5 2p b60012001800240030002819.02856.62783.72775.91. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUb5 a5 2p a5 2p b4812162016.9312.7712.6612.66MIN: 13.88 / MAX: 33.11MIN: 7.39 / MAX: 23.62MIN: 7.58 / MAX: 53.45MIN: 8.38 / MAX: 48.881. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPUb5 a5 2p a5 2p b50010001500200025001888.201252.052523.572524.051. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUb5 a5 2p a5 2p b4812162016.9312.8412.7612.79MIN: 10.73 / MAX: 31.24MIN: 6.96 / MAX: 23.28MIN: 7.65 / MAX: 43.58MIN: 7.68 / MAX: 43.181. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPUb5 a5 2p a5 2p b50010001500200025001889.051244.862504.362498.741. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUb5 a5 2p a5 2p b61218243026.5419.0019.5819.69MIN: 14.19 / MAX: 63.19MIN: 13.62 / MAX: 32.75MIN: 11.25 / MAX: 73.87MIN: 11.03 / MAX: 75.831. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPUb5 a5 2p a5 2p b4008001200160020001205.11841.491632.811623.831. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUb5 a5 2p a5 2p b61218243026.6819.4520.0620.07MIN: 15.32 / MAX: 46.54MIN: 11.98 / MAX: 28.4MIN: 11.41 / MAX: 47.48MIN: 13.26 / MAX: 75.831. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPUb5 a5 2p a5 2p b300600900120015001198.45822.191594.141592.711. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUb5 a5 2p a5 2p b61218243023.9218.1018.2618.25MIN: 15.02 / MAX: 35.74MIN: 9.23 / MAX: 28.11MIN: 10.53 / MAX: 60.19MIN: 8.84 / MAX: 40.731. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPUb5 a5 2p a5 2p b80016002400320040002674.291767.133501.693504.151. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUb5 a5 2p a5 2p b0.39150.7831.17451.5661.95751.741.261.371.40MIN: 0.84 / MAX: 14.57MIN: 0.69 / MAX: 12.96MIN: 0.67 / MAX: 28.86MIN: 0.68 / MAX: 42.11. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPUb5 a5 2p a5 2p b8K16K24K32K40K35737.8225209.6239455.8638800.341. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUb5 a5 2p b0.36450.7291.09351.4581.82251.621.161.33MIN: 0.69 / MAX: 13.34MIN: 0.66 / MAX: 12.23MIN: 0.64 / MAX: 26.341. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUb5 a5 2p b9K18K27K36K45K38091.4627340.9640352.481. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenCV

Test: DNN - Deep Neural Network

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: DNN - Deep Neural Networkb5 a5 2p b20K40K60K80K100K3975639997871331. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 1080p - Video Preset: Fastab5 a5 b5 2p a5 2p b48121620SE +/- 0.10, N = 316.5516.3716.8517.2514.8115.041. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Fishy Cat - Compute: CPU-Onlyab5 a5 2p a5 2p b1224364860SE +/- 0.04, N = 334.7734.6152.6827.9627.90

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 4Kab5 a5 b5 2p a5 2p b1.09422.18843.28264.37685.471SE +/- 0.021, N = 34.8634.8264.7434.7604.7844.8421. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenCV

Test: Object Detection

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Object Detectionb5 a5 2p b20K40K60K80K100K3191027739885531. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered Halfspaceab5 a5 b5 2p a5 2p b1122334455SE +/- 0.29, N = 330.5230.1249.1448.7425.2225.471. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_bareab5 a5 2p a5 2p b246810SE +/- 0.026, N = 37.2907.2995.0228.2858.2221. (CXX) g++ options: -O3

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered Halfspaceab5 a5 b5 2p a5 2p b1020304050SE +/- 0.13, N = 328.0827.4745.7144.3323.9423.271. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: BMW27 - Compute: CPU-Onlyab5 a5 2p a5 2p b1020304050SE +/- 0.03, N = 327.6127.6042.5222.5922.51

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKab5 a5 b5 2p a5 2p b60K120K180K240K300KSE +/- 297.59, N = 32013882029571308481310722594132584571. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptab5 a5 b5 2p a5 2p b30K60K90K120K150KSE +/- 99.80, N = 3862309127660192601521185021175421. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: Blowfishab5 a5 b5 2p a5 2p b30K60K90K120K150KSE +/- 185.66, N = 3864608736060325603451185791177711. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.30ab5 a5 b5 2p a5 2p b6001200180024003000SE +/- 25.52, N = 32746.52692.42756.92820.42760.92850.81. (CXX) g++ options: -O3 -march=native -fPIE -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 4Kab5 a5 b5 2p a5 2p b1632486480SE +/- 0.56, N = 1273.4467.5565.3470.8662.8469.541. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Ratingb5 a5 2p a5 2p b100K200K300K400K500K4112132442204539964407401. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Ratingb5 a5 2p a5 2p b90K180K270K360K450K3969282719184037393968441. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 1080p - Video Preset: Fasterab5 a5 b5 2p a5 2p b714212835SE +/- 0.03, N = 330.0130.1131.8431.8024.4924.271. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer - Model: Asian Dragon Obja5 a5 b5 2p a5 2p b1632486480SE +/- 0.08, N = 370.5243.9644.1968.6968.70MIN: 69.67 / MAX: 71.94MIN: 43.73 / MAX: 44.3MIN: 43.94 / MAX: 44.51MIN: 68.03 / MAX: 70.15MIN: 67.96 / MAX: 69.64

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer ISPC - Model: Asian Dragon Objab5 a5 b5 2p a5 2p b1428425670SE +/- 0.03, N = 364.5164.6439.1939.2264.4164.04MIN: 63.96 / MAX: 66.18MIN: 64.13 / MAX: 65.46MIN: 38.97 / MAX: 39.59MIN: 39 / MAX: 39.63MIN: 63.78 / MAX: 65.49MIN: 63.29 / MAX: 64.92

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesisb5 a5 2p a5 2p b71421283528.1627.8327.8728.141. (CC) gcc options: -O2 -std=c99

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rateab5 a5 b5 2p a5 2p b714212835SE +/- 0.25, N = 1529.3829.0517.8718.6431.3431.761. (CC) gcc options: -O3 -march=native -fopenmp

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous Halfspaceab5 a5 b5 2p a5 2p b612182430SE +/- 0.20, N = 317.2116.8224.9925.0914.4914.421. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 6.0Time To Compileab5 a5 b5 2p a5 2p b510152025SE +/- 0.03, N = 316.9317.0420.3820.6415.4615.34

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 1080pab5 a5 b5 2p a5 2p b3691215SE +/- 0.02, N = 310.3010.2310.7610.8310.4110.501. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic Modelab5 a5 b5 2p a5 2p b510152025SE +/- 0.15, N = 313.8613.5319.0919.6311.2211.801. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b36912156.921454.4956510.5221010.36850MIN: 6.32MIN: 3.9MIN: 8.88MIN: 8.541. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Griddingb5 a5 2p a5 2p b11K22K33K44K55K36827.524990.149980.151199.11. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degriddingb5 a5 2p a5 2p b9K18K27K36K45K32799.520991.741983.341983.31. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. Helensab5 a5 b5 2p a5 2p b510152025SE +/- 0.038048290, N = 311.67841461611.75332146018.86219907019.3555530929.6094538689.5947693501. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer ISPC - Model: Crownab5 a5 b5 2p a5 2p b1632486480SE +/- 0.11, N = 363.3763.3640.2440.1273.3373.26MIN: 62.18 / MAX: 66.58MIN: 62.44 / MAX: 66.52MIN: 39.64 / MAX: 40.95MIN: 39.64 / MAX: 40.56MIN: 71.99 / MAX: 75.06MIN: 72.13 / MAX: 74.82

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionb5 a5 2p a5 2p b51015202517.2119.6310.6210.761. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer - Model: Crowna5 a5 b5 2p a5 2p b20406080100SE +/- 0.05, N = 369.1143.9543.9881.4581.09MIN: 68.18 / MAX: 71.75MIN: 43.49 / MAX: 44.42MIN: 43.29 / MAX: 44.67MIN: 80.49 / MAX: 83.06MIN: 80.03 / MAX: 82.56

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b0.6491.2981.9472.5963.2451.261071.379962.678012.88451MIN: 1.06MIN: 1.25MIN: 1.89MIN: 1.791. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer - Model: Asian Dragona5 a5 b5 2p a5 2p b20406080100SE +/- 0.14, N = 378.4848.5248.5176.8176.63MIN: 77.64 / MAX: 80.46MIN: 48.32 / MAX: 48.91MIN: 48.29 / MAX: 48.79MIN: 76.07 / MAX: 77.73MIN: 75.97 / MAX: 78.17

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer ISPC - Model: Asian Dragonab5 a5 b5 2p a5 2p b1632486480SE +/- 0.01, N = 373.8473.8344.9145.1173.9673.78MIN: 73.31 / MAX: 75.9MIN: 73.34 / MAX: 74.66MIN: 44.68 / MAX: 45.26MIN: 44.88 / MAX: 45.48MIN: 73.12 / MAX: 75.27MIN: 72.91 / MAX: 74.82

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMPb5 a5 2p a5 2p b2004006008001000806.45869.57436.68434.781. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamicsb5 a5 2p a5 2p b4812162011.3412.0915.6715.511. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.3b5 a5 2p a5 2p b9K18K27K36K45K22257.1420920.4942030.0442551.901. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 1080pab5 a5 b5 2p a5 2p b20406080100SE +/- 1.23, N = 5109.98109.71107.38108.36107.10104.171. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbigb5 a5 2p a5 2p b481216209.56361014.0011107.6563697.7933251. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Church Facadeab5 a5 2p a5 2p b13002600390052006500SE +/- 3.21, N = 3579358335718568257291. (CXX) g++ options: -O3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b0.28040.56080.84121.12161.4021.2461800.6259300.9118060.917226MIN: 1.13MIN: 0.57MIN: 0.78MIN: 0.771. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degriddingb5 a5 2p a5 2p b4K8K12K16K20K19018.319018.316641.015662.11. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Griddingb5 a5 2p a5 2p b4K8K12K16K20K20481.217750.412102.512678.91. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Lionab5 a5 2p a5 2p b10002000300040005000SE +/- 7.02, N = 3486748854760473448181. (CXX) g++ options: -O3

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 4Kab5 a5 b5 2p a5 2p b50100150200250SE +/- 0.73, N = 3222.84214.93220.81226.73169.11176.111. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 4Kab5 a5 b5 2p a5 2p b4080120160200SE +/- 0.70, N = 3195.92198.14199.69196.06169.66172.571. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b0.2580.5160.7741.0321.290.8276221.1465600.6743360.669349MIN: 0.78MIN: 1.09MIN: 0.62MIN: 0.61. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbigb5 a5 2p a5 2p b2468105.1040787.9740224.2947314.4180121. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionb5 a5 2p a5 2p b1.14022.28043.42064.56085.7014.438041215.067564012.486542942.483869081. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUb5 a5 2p a5 2p b0.70311.40622.10932.81243.51553.124872.762341.851671.89859MIN: 2.06MIN: 2.59MIN: 1.61MIN: 1.581. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 1080pab5 a5 b5 2p a5 2p b140280420560700SE +/- 2.97, N = 3604.84601.98629.30622.73540.73565.061. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 1080pab5 a5 b5 2p a5 2p b120240360480600SE +/- 5.77, N = 3552.24549.83571.08560.57525.15546.031. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq


Phoronix Test Suite v10.8.5