7773x

Tests for a future article. 2 x AMD EPYC 7573X 32-Core testing with a AMD DAYTONA_X (RYM1009B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2305044-NE-7773X849132&sro&grr.

7773x ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolutionab5 a5 b5 2p a5 2p bAMD EPYC 7773X 64-Core @ 2.20GHz (64 Cores / 128 Threads)AMD DAYTONA_X (RYM1009B BIOS)AMD Starship/Matisse256GB3841GB Micron_9300_MTFDHAL3T8TDPASPEEDVE2282 x Mellanox MT27710Ubuntu 22.045.15.0-47-generic (x86_64)GNOME Shell 42.4X Server 1.21.1.31.2.204GCC 11.2.0ext41920x1080AMD EPYC 7573X 32-Core @ 2.80GHz (32 Cores / 64 Threads)2 x AMD EPYC 7573X 32-Core @ 2.80GHz (64 Cores / 128 Threads)512GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001229Python Details- Python 3.10.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

7773x petsc: Streamsopenvkl: vklBenchmark ISPCopenfoam: drivaerFastback, Medium Mesh Size - Execution Timeopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeblender: Barbershop - CPU-Onlylczero: Eigenlczero: BLASbuild-llvm: Unix Makefilesopencv: Graph APIffmpeg: libx265 - Platformffmpeg: libx265 - Platformffmpeg: libx265 - Video On Demandffmpeg: libx265 - Video On Demandncnn: CPU - FastestDetncnn: CPU - vision_transformerncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetffmpeg: libx265 - Uploadffmpeg: libx265 - Uploadbuild-llvm: Ninjaopencv: Stitchingaskap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingclickhouse: 100M Rows Hits Dataset, Third Runclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, First Run / Cold Cachevvenc: Bosphorus 4K - Fastblender: Pabellon Barcelona - CPU-Onlyopencv: Coreblender: Classroom - CPU-Onlyffmpeg: libx265 - Liveffmpeg: libx265 - Liveonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUjohn-the-ripper: MD5john-the-ripper: HMAC-SHA512compress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedvvenc: Bosphorus 4K - Fasteropenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenfoam: drivaerFastback, Small Mesh Size - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUcompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUcompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 12 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 3 - Compression Speedopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopencv: DNN - Deep Neural Networkvvenc: Bosphorus 1080p - Fastblender: Fishy Cat - CPU-Onlysvt-av1: Preset 4 - Bosphorus 4Kopencv: Object Detectionspecfem3d: Layered Halfspacegromacs: MPI CPU - water_GMX50_barespecfem3d: Water-layered Halfspaceblender: BMW27 - CPU-Onlyjohn-the-ripper: WPA PSKjohn-the-ripper: bcryptjohn-the-ripper: Blowfishquantlib: svt-av1: Preset 8 - Bosphorus 4Kcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingvvenc: Bosphorus 1080p - Fasterembree: Pathtracer - Asian Dragon Objembree: Pathtracer ISPC - Asian Dragon Objespeak: Text-To-Speech Synthesismt-dgemm: Sustained Floating-Point Ratespecfem3d: Homogeneous Halfspacebuild-ffmpeg: Time To Compilesvt-av1: Preset 4 - Bosphorus 1080pspecfem3d: Tomographic Modelonednn: Deconvolution Batch shapes_1d - f32 - CPUaskap: tConvolve MPI - Griddingaskap: tConvolve MPI - Degriddingspecfem3d: Mount St. Helensembree: Pathtracer ISPC - Crownincompact3d: input.i3d 193 Cells Per Directionembree: Pathtracer - Crownonednn: IP Shapes 1D - f32 - CPUembree: Pathtracer - Asian Dragonembree: Pathtracer ISPC - Asian Dragonaskap: Hogbom Clean OpenMPcloverleaf: Lagrangian-Eulerian Hydrodynamicslulesh: svt-av1: Preset 8 - Bosphorus 1080ppennant: sedovbigdraco: Church Facadeonednn: IP Shapes 3D - f32 - CPUaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingdraco: Lionsvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Konednn: Convolution Batch Shapes Auto - f32 - CPUpennant: leblancbigincompact3d: input.i3d 129 Cells Per Directiononednn: Deconvolution Batch shapes_3d - f32 - CPUsvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080pab5 a5 b5 2p a5 2p b47040.48165225.348583260.83246.95743.73173.2343.67173.4721.59116.93165.7076.28688.7672.45105.7847.74553400013679600011.36740.45056824.72995716.55234.774.86330.5189126617.29028.08154095027.6120138886230864602746.573.44230.00670.523864.511729.38349617.20934908216.92810.29513.86220230011.67841461663.371169.107778.475173.8446109.98457934867222.837195.924604.842552.24156185.7504469363.7359120.39337259.8751425554248.48123581043.90172.5695669443.90172.53393950619.08133.3953.5219.3223.9219.587.1310.8924.0419.098.1215.4411.7115.610.2911.317.8421.62116.80915814164.4492011608308.335493.35437.35439.07428.516.12888.737706172.02106.9847.2067748891183.63749.62255430001351650001190.89.311.3643911.978.0540.24442825.59608539278.041276.817.62608.3412.161175.0127.03230.27138.841300.8752.61434.2744.51445.9270.81390988.51277.3281916.931888.216.931889.0526.541205.1126.681198.4523.922674.291.7435737.821.6238091.463975616.37434.614.8263191030.1158997047.29927.46754027327.620295791276873602692.467.54841121339692830.11464.640928.15729.0456916.81708024617.04210.23413.5301793946.9214536827.532799.511.7533214663.364717.21184731.2610773.828806.45211.3422257.137109.7119.5636158331.2461819018.320481.24885214.933198.1350.8276225.1040784.438041213.12487601.984549.83431992.6068340428.6592117.4378408.7912381419290.53920697046.70162.1946.89161.549.25126.2626.3414.8519.9814.85.488.1221.0814.113.829.036.087.736.196.4613.722.95110.01230.3781840329611.056599.68456.30457.81437.206.406136.5868343113.31110.4945.711182.61679.6453621000960590001229.19.5411.9612881.045.4850.74590422.3373962886.125.471304.218.21935.68.23886.4117.92172.5392.611361.5854.21460.1792.11478.1289.11438.81067.81326.52856.612.771252.0512.841244.8619841.4919.45822.1918.11767.131.2625209.621.1627340.963999716.84652.684.7432773949.1361494695.02245.7094271342.5213084860192603252756.965.33724422027191831.83743.958939.194127.82617.874524.98744586720.37810.7619.0896648574.4956524990.120991.718.8621990740.240819.630500843.94921.3799648.522844.9118869.56512.0920920.487107.38214.0011157180.6259319018.317750.44760220.813199.691.146567.9740225.067564012.76234629.3571.084339427.71276117.6045297.22322.93110.14231.0916.39110.2945.7936050009492400012.0250.69975222.70639917.2534.7648.73867645444.33457433513107260152603452820.470.85631.79544.188439.223618.640325.08962279720.6410.82819.63009634219.35555309240.119943.978748.507445.1054108.364226.733196.063622.732560.565452205.4639299.384145213.682868284219.40344.31170.94271146344.97168.45694118640.45144.31100.6537.1738.180.8118.638.9433.5593.2914.6535.1238.4933.2721.2627.777.7722.38112.819760537138.71712506.77738.59461.53463.68445.185.54871.6458.5592.5254.581390.361212.846845000790020001222.99.579.5742975.1510.5433.00107922.5639462946.9710.641315.6182013.5215.86914.2434.83182.1175.481332.3808.51454.7702.81482.7283.61437.510241306.72783.712.662523.5712.762504.3619.581632.8120.061594.1418.263501.691.3739455.8614.81227.964.78425.2210067458.28523.93606599722.592594131185021185792760.962.83645399640373924.49268.692264.409127.86831.34451514.49141946915.45510.4111.22476781310.522149980.141983.39.60945386873.330110.624130281.45082.6780176.806773.9612436.68115.6742030.038107.17.65636956820.9118061664112102.54734169.112169.6580.6743364.2947312.486542941.85167540.729525.15474130.1789452203.6511899.93268213.4678167991221.78941970243.64173.5600295843.77173.05225728353.25145.49128.3535.8145.89120.2728.5551.832.81112.4324.5660.7949.4637.8928.8838.67115.6822.10114.261775562138.56528749211068.87176.41448.03467.46440.565.76871.7323644558.57107.2947.071385.331305.966683000732600001227.19.59.9062948.3910.6332.97431623.3352752952.7610.641312182020.2115.76912.4234.91180.91176.761336.5741.51445.1702.21498.9282.81441.11013.41309.52775.912.662524.0512.792498.7419.691623.8320.071592.7118.253504.151.438800.341.3340352.488713315.04427.94.8428855325.4721902918.22223.26803264122.512584571175421177712850.869.53644074039684424.26868.702264.044128.13731.76087614.41593115815.34410.49511.79752558510.368551199.141983.39.5947693573.260210.762441681.09132.8845176.628773.7779434.78315.5142551.898104.1697.79332557290.91722615662.112678.94818176.107172.5730.6693494.4180122.483869081.89859565.057546.034OpenBenchmarking.org

PETSc

Test: Streams

OpenBenchmarking.orgMB/s, More Is BetterPETSc 3.19Test: Streams5 2p b5 ab16K32K48K64K80K74130.1831992.6156185.751. (CC) gcc options: -fPIC -O3 -O2 -lpthread -ludev -lpciaccess -lm

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPC5 2p a5 2p b5 a5 bab100200300400500SE +/- 0.33, N = 3452452340339470469MIN: 98 / MAX: 1875MIN: 99 / MAX: 2013MIN: 55 / MAX: 2309MIN: 54 / MAX: 2307MIN: 84 / MAX: 2616MIN: 84 / MAX: 2565

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time5 2p a5 2p b5 a5 bab90180270360450205.46203.65428.66427.7140.48363.741. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh Time5 2p a5 2p b5 a5 bab30609012015099.3899.93117.44117.6025.35120.391. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Barbershop - Compute: CPU-Only5 2p a5 2p b5 aab90180270360450SE +/- 0.09, N = 3213.60213.46408.79260.83259.87

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: Eigen5 2p a5 2p b5 ab2K4K6K8K10K82867816123851421. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLAS5 2p a5 2p b5 ab2K4K6K8K10K82847991141955541. (CXX) g++ options: -flto -pthread

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix Makefiles5 2p a5 2p b5 a5 bab60120180240300SE +/- 0.96, N = 3219.40221.79290.54297.22246.96248.48

OpenCV

Test: Graph API

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Graph API5 2p b5 ab90K180K270K360K450K4197022069702358101. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

FFmpeg

Encoder: libx265 - Scenario: Platform

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Platform5 2p a5 2p b5 aab1122334455SE +/- 0.02, N = 344.3143.6446.7043.7343.901. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Platform

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Platform5 2p a5 2p b5 aab4080120160200SE +/- 0.08, N = 3170.94173.56162.19173.23172.571. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Video On Demand

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Video On Demand5 2p a5 2p b5 aab1122334455SE +/- 0.04, N = 344.9743.7746.8943.6743.901. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Video On Demand

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Video On Demand5 2p a5 2p b5 aab4080120160200SE +/- 0.15, N = 3168.46173.05161.54173.47172.531. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDet5 2p a5 2p b5 ab122436486040.4553.259.2519.08MIN: 27.4 / MAX: 462.07MIN: 30.07 / MAX: 66.77MIN: 9.12 / MAX: 9.81MIN: 13.65 / MAX: 21.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformer5 2p a5 2p b5 ab306090120150144.31145.49126.26133.39MIN: 140.5 / MAX: 157.63MIN: 141.27 / MAX: 245.34MIN: 125.42 / MAX: 132.03MIN: 129.7 / MAX: 252.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400m5 2p a5 2p b5 ab306090120150100.65128.3526.3453.52MIN: 97.81 / MAX: 136.12MIN: 111.43 / MAX: 240.27MIN: 25.99 / MAX: 28.26MIN: 50.93 / MAX: 71.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: squeezenet_ssd5 2p a5 2p b5 ab91827364537.1735.8114.8519.32MIN: 30.62 / MAX: 51.54MIN: 29.14 / MAX: 58.45MIN: 14.49 / MAX: 25.32MIN: 18.86 / MAX: 22.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: yolov4-tiny5 2p a5 2p b5 ab102030405038.1045.8919.9823.92MIN: 29.08 / MAX: 101.85MIN: 34.08 / MAX: 64.66MIN: 19.53 / MAX: 23.19MIN: 23.17 / MAX: 30.481. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet505 2p a5 2p b5 ab30609012015080.81120.2714.8019.58MIN: 62.51 / MAX: 112.38MIN: 42.99 / MAX: 192.41MIN: 14.6 / MAX: 16.84MIN: 19.15 / MAX: 44.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: alexnet5 2p a5 2p b5 ab71421283518.6028.555.487.13MIN: 17.59 / MAX: 34.64MIN: 12.6 / MAX: 62.18MIN: 5.36 / MAX: 6.34MIN: 6.97 / MAX: 7.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet185 2p a5 2p b5 ab122436486038.9451.808.1210.89MIN: 16.05 / MAX: 124.98MIN: 16.09 / MAX: 93.56MIN: 8.01 / MAX: 10.04MIN: 10.68 / MAX: 11.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vgg165 2p a5 2p b5 ab81624324033.5532.8121.0824.04MIN: 28.65 / MAX: 42.53MIN: 30.06 / MAX: 44.76MIN: 20.78 / MAX: 24.35MIN: 23.49 / MAX: 30.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenet5 2p a5 2p b5 ab30609012015093.29112.4314.1119.09MIN: 52.46 / MAX: 137.19MIN: 29.99 / MAX: 148.59MIN: 13.96 / MAX: 17.22MIN: 18.77 / MAX: 25.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazeface5 2p a5 2p b5 ab61218243014.6524.563.828.12MIN: 11.34 / MAX: 73.36MIN: 20.64 / MAX: 91.2MIN: 3.45 / MAX: 79.74MIN: 6.94 / MAX: 11.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b05 2p a5 2p b5 ab142842567035.1260.799.0315.44MIN: 33.83 / MAX: 41.96MIN: 47.67 / MAX: 141.85MIN: 8.93 / MAX: 11.07MIN: 13.63 / MAX: 18.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnet5 2p a5 2p b5 ab112233445538.4949.466.0811.71MIN: 25.51 / MAX: 75.24MIN: 36.42 / MAX: 176.46MIN: 6.01 / MAX: 6.54MIN: 9.43 / MAX: 20.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: shufflenet-v25 2p a5 2p b5 ab91827364533.2737.897.7315.60MIN: 29.37 / MAX: 96.69MIN: 34.63 / MAX: 113.32MIN: 7.58 / MAX: 9.71MIN: 12.95 / MAX: 19.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v3-v3 - Model: mobilenet-v35 2p a5 2p b5 ab71421283521.2628.886.1910.29MIN: 20.79 / MAX: 28.25MIN: 23.66 / MAX: 168.43MIN: 6.05 / MAX: 6.93MIN: 9.51 / MAX: 11.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU-v2-v2 - Model: mobilenet-v25 2p a5 2p b5 ab91827364527.7038.676.4611.30MIN: 23.18 / MAX: 43.54MIN: 28.02 / MAX: 119.64MIN: 6.35 / MAX: 8.73MIN: 9.74 / MAX: 14.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mobilenet5 2p a5 2p b5 ab30609012015077.77115.6813.7017.84MIN: 67.11 / MAX: 156MIN: 64.45 / MAX: 159.93MIN: 13.55 / MAX: 14.38MIN: 17.55 / MAX: 25.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

FFmpeg

Encoder: libx265 - Scenario: Upload

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Upload5 2p a5 2p b5 a5 bab510152025SE +/- 0.01, N = 322.3822.1022.9522.9321.5921.621. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Upload

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Upload5 2p a5 2p b5 a5 bab306090120150SE +/- 0.05, N = 3112.82114.26110.01110.14116.93116.811. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Ninja5 2p a5 2p b5 a5 bab50100150200250SE +/- 0.08, N = 3138.72138.57230.38231.09165.71164.45

OpenCV

Test: Stitching

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Stitching5 2p b5 ab60K120K180K240K300K2874921840322011601. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding5 2p a5 2p b5 ab3K6K9K12K15K12506.7011068.809611.058308.331. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding5 2p a5 2p b5 ab170034005100680085007738.597176.416599.685493.351. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ClickHouse

100M Rows Hits Dataset, Third Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third Run5 2p a5 2p b5 ab100200300400500461.53448.03456.30437.35MIN: 41.49 / MAX: 3000MIN: 41.38 / MAX: 2608.7MIN: 24.65 / MAX: 5454.55MIN: 35.59 / MAX: 5454.55

ClickHouse

100M Rows Hits Dataset, Second Run

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second Run5 2p a5 2p b5 ab100200300400500463.68467.46457.81439.07MIN: 41.64 / MAX: 4615.38MIN: 40.6 / MAX: 4615.38MIN: 24.13 / MAX: 5454.55MIN: 35.82 / MAX: 6000

ClickHouse

100M Rows Hits Dataset, First Run / Cold Cache

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold Cache5 2p a5 2p b5 ab100200300400500445.18440.56437.20428.51MIN: 41.18 / MAX: 3157.89MIN: 40.98 / MAX: 4000MIN: 24.3 / MAX: 6000MIN: 34.8 / MAX: 5454.55

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 4K - Video Preset: Fast5 2p a5 2p b5 a5 bab246810SE +/- 0.010, N = 35.5485.7686.4066.3906.2866.1281. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Pabellon Barcelona - Compute: CPU-Only5 2p a5 2p b5 aab306090120150SE +/- 0.10, N = 371.6471.73136.5888.7688.73

OpenCV

Test: Core

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Core5 2p b5 ab50K100K150K200K250K23644568343770611. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Classroom - Compute: CPU-Only5 2p a5 2p b5 aab306090120150SE +/- 0.17, N = 358.5558.57113.3172.4572.02

FFmpeg

Encoder: libx265 - Scenario: Live

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Live5 2p a5 2p b5 a5 bab20406080100SE +/- 0.29, N = 392.52107.29110.49110.29105.78106.981. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

FFmpeg

Encoder: libx265 - Scenario: Live

OpenBenchmarking.orgSeconds, Fewer Is BetterFFmpeg 6.0Encoder: libx265 - Scenario: Live5 2p a5 2p b5 a5 bab1224364860SE +/- 0.13, N = 354.5847.0745.7145.7947.7447.211. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab300600900120015001390.361385.331182.611183.63MIN: 1287.43MIN: 1314.76MIN: 1161.83MIN: 1162.731. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab300600900120015001212.841305.96679.65749.62MIN: 1070.52MIN: 1060.61MIN: 664.64MIN: 731.751. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD55 2p a5 2p b5 a5 bab1.5M3M4.5M6M7.5MSE +/- 1000.00, N = 36845000668300036210003605000553400055430001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: HMAC-SHA512

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA5125 2p a5 2p b5 a5 bab30M60M90M120M150MSE +/- 266743.32, N = 3790020007326000096059000949240001367960001351650001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression Speed5 2p a5 2p b5 ab300600900120015001222.91227.11229.11190.81. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression Speed5 2p a5 2p b5 ab36912159.579.509.549.301. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 4K - Video Preset: Faster5 2p a5 2p b5 a5 bab3691215SE +/- 0.022, N = 39.5749.90611.96112.02011.36711.3641. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU5 2p a5 2p b5 ab80016002400320040002975.152948.392881.043911.97MIN: 2241.37 / MAX: 3616.82MIN: 1547.54 / MAX: 3537.3MIN: 1536.21 / MAX: 3142.35MIN: 3337.4 / MAX: 4451.631. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU5 2p a5 2p b5 ab369121510.5410.635.488.051. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution Time5 2p a5 2p b5 a5 bab112233445533.0032.9750.7550.7040.4540.241. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh Time5 2p a5 2p b5 a5 bab61218243022.5623.3422.3422.7124.7325.601. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPU5 2p a5 2p b5 ab80016002400320040002946.972952.762886.123927.00MIN: 2004.15 / MAX: 3534.38MIN: 2193.59 / MAX: 3652.32MIN: 1694.68 / MAX: 3104.67MIN: 3402.58 / MAX: 4474.471. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPU5 2p a5 2p b5 ab369121510.6410.645.478.041. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression Speed5 2p a5 2p b5 ab300600900120015001315.61312.01304.21276.81. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression Speed5 2p a5 2p b5 ab4812162018.018.018.217.61. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPU5 2p a5 2p b5 ab60012001800240030002013.522020.211935.602608.34MIN: 1890.96 / MAX: 2802.82MIN: 1823.96 / MAX: 3111.51MIN: 1852.05 / MAX: 1974.19MIN: 2421.38 / MAX: 2754.891. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPU5 2p a5 2p b5 ab4812162015.8615.768.2312.161. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU5 2p a5 2p b5 ab30060090012001500914.24912.42886.411175.01MIN: 797.83 / MAX: 966.84MIN: 878.63 / MAX: 988.74MIN: 851.09 / MAX: 900.44MIN: 982.69 / MAX: 1202.211. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU5 2p a5 2p b5 ab81624324034.8334.9117.9227.031. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU5 2p a5 2p b5 ab50100150200250182.10180.91172.53230.27MIN: 117.14 / MAX: 548.18MIN: 124.49 / MAX: 288.01MIN: 81.35 / MAX: 207.95MIN: 166.99 / MAX: 311.891. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU5 2p a5 2p b5 ab4080120160200175.48176.7692.61138.841. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3, Long Mode - Decompression Speed5 2p a5 2p b5 ab300600900120015001332.31336.51361.51300.81. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3, Long Mode - Compression Speed5 2p a5 2p b5 ab2004006008001000808.5741.5854.2752.61. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8, Long Mode - Decompression Speed5 2p a5 2p b5 ab300600900120015001454.71445.11460.11434.21. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8, Long Mode - Compression Speed5 2p a5 2p b5 ab2004006008001000702.8702.2792.1744.51. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 12 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression Speed5 2p a5 2p b5 ab300600900120015001482.71498.91478.11445.91. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 12 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression Speed5 2p a5 2p b5 ab60120180240300283.6282.8289.1270.81. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8 - Decompression Speed5 2p a5 2p b5 ab300600900120015001437.51441.11438.81390.01. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 8 - Compression Speed5 2p a5 2p b5 ab20040060080010001024.01013.41067.8988.51. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3 - Decompression Speed5 2p a5 2p b5 ab300600900120015001306.71309.51326.51277.31. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 3 - Compression Speed5 2p a5 2p b5 ab60012001800240030002783.72775.92856.62819.01. (CC) gcc options: -O3 -pthread -lz -llzma -llz4

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU5 2p a5 2p b5 ab4812162012.6612.6612.7716.93MIN: 7.58 / MAX: 53.45MIN: 8.38 / MAX: 48.88MIN: 7.39 / MAX: 23.62MIN: 13.88 / MAX: 33.111. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU5 2p a5 2p b5 ab50010001500200025002523.572524.051252.051888.201. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU5 2p a5 2p b5 ab4812162012.7612.7912.8416.93MIN: 7.65 / MAX: 43.58MIN: 7.68 / MAX: 43.18MIN: 6.96 / MAX: 23.28MIN: 10.73 / MAX: 31.241. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU5 2p a5 2p b5 ab50010001500200025002504.362498.741244.861889.051. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPU5 2p a5 2p b5 ab61218243019.5819.6919.0026.54MIN: 11.25 / MAX: 73.87MIN: 11.03 / MAX: 75.83MIN: 13.62 / MAX: 32.75MIN: 14.19 / MAX: 63.191. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPU5 2p a5 2p b5 ab4008001200160020001632.811623.83841.491205.111. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPU5 2p a5 2p b5 ab61218243020.0620.0719.4526.68MIN: 11.41 / MAX: 47.48MIN: 13.26 / MAX: 75.83MIN: 11.98 / MAX: 28.4MIN: 15.32 / MAX: 46.541. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPU5 2p a5 2p b5 ab300600900120015001594.141592.71822.191198.451. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU5 2p a5 2p b5 ab61218243018.2618.2518.1023.92MIN: 10.53 / MAX: 60.19MIN: 8.84 / MAX: 40.73MIN: 9.23 / MAX: 28.11MIN: 15.02 / MAX: 35.741. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU5 2p a5 2p b5 ab80016002400320040003501.693504.151767.132674.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU5 2p a5 2p b5 ab0.39150.7831.17451.5661.95751.371.401.261.74MIN: 0.67 / MAX: 28.86MIN: 0.68 / MAX: 42.1MIN: 0.69 / MAX: 12.96MIN: 0.84 / MAX: 14.571. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU5 2p a5 2p b5 ab8K16K24K32K40K39455.8638800.3425209.6235737.821. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU5 2p b5 ab0.36450.7291.09351.4581.82251.331.161.62MIN: 0.64 / MAX: 26.34MIN: 0.66 / MAX: 12.23MIN: 0.69 / MAX: 13.341. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU5 2p b5 ab9K18K27K36K45K40352.4827340.9638091.461. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenCV

Test: DNN - Deep Neural Network

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: DNN - Deep Neural Network5 2p b5 ab20K40K60K80K100K8713339997397561. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 1080p - Video Preset: Fast5 2p a5 2p b5 a5 bab48121620SE +/- 0.10, N = 314.8115.0416.8517.2516.5516.371. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Fishy Cat - Compute: CPU-Only5 2p a5 2p b5 aab1224364860SE +/- 0.04, N = 327.9627.9052.6834.7734.61

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 4K5 2p a5 2p b5 a5 bab1.09422.18843.28264.37685.471SE +/- 0.021, N = 34.7844.8424.7434.7604.8634.8261. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenCV

Test: Object Detection

OpenBenchmarking.orgms, Fewer Is BetterOpenCV 4.7Test: Object Detection5 2p b5 ab20K40K60K80K100K8855327739319101. (CXX) g++ options: -fPIC -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -shared

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered Halfspace5 2p a5 2p b5 a5 bab1122334455SE +/- 0.29, N = 325.2225.4749.1448.7430.5230.121. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_bare5 2p a5 2p b5 aab246810SE +/- 0.026, N = 38.2858.2225.0227.2907.2991. (CXX) g++ options: -O3

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered Halfspace5 2p a5 2p b5 a5 bab1020304050SE +/- 0.13, N = 323.9423.2745.7144.3328.0827.471. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: BMW27 - Compute: CPU-Only5 2p a5 2p b5 aab1020304050SE +/- 0.03, N = 322.5922.5142.5227.6127.60

John The Ripper

Test: WPA PSK

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSK5 2p a5 2p b5 a5 bab60K120K180K240K300KSE +/- 297.59, N = 32594132584571308481310722013882029571. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: bcrypt

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcrypt5 2p a5 2p b5 a5 bab30K60K90K120K150KSE +/- 99.80, N = 3118502117542601926015286230912761. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: Blowfish5 2p a5 2p b5 a5 bab30K60K90K120K150KSE +/- 185.66, N = 3118579117771603256034586460873601. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.305 2p a5 2p b5 a5 bab6001200180024003000SE +/- 25.52, N = 32760.92850.82756.92820.42746.52692.41. (CXX) g++ options: -O3 -march=native -fPIE -pie

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 4K5 2p a5 2p b5 a5 bab1632486480SE +/- 0.56, N = 1262.8469.5465.3470.8673.4467.551. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Rating5 2p a5 2p b5 ab100K200K300K400K500K4539964407402442204112131. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Rating5 2p a5 2p b5 ab90K180K270K360K450K4037393968442719183969281. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 1080p - Video Preset: Faster5 2p a5 2p b5 a5 bab714212835SE +/- 0.03, N = 324.4924.2731.8431.8030.0130.111. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer - Model: Asian Dragon Obj5 2p a5 2p b5 a5 ba1632486480SE +/- 0.08, N = 368.6968.7043.9644.1970.52MIN: 68.03 / MAX: 70.15MIN: 67.96 / MAX: 69.64MIN: 43.73 / MAX: 44.3MIN: 43.94 / MAX: 44.51MIN: 69.67 / MAX: 71.94

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer ISPC - Model: Asian Dragon Obj5 2p a5 2p b5 a5 bab1428425670SE +/- 0.03, N = 364.4164.0439.1939.2264.5164.64MIN: 63.78 / MAX: 65.49MIN: 63.29 / MAX: 64.92MIN: 38.97 / MAX: 39.59MIN: 39 / MAX: 39.63MIN: 63.96 / MAX: 66.18MIN: 64.13 / MAX: 65.46

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis5 2p a5 2p b5 ab71421283527.8728.1427.8328.161. (CC) gcc options: -O2 -std=c99

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate5 2p a5 2p b5 a5 bab714212835SE +/- 0.25, N = 1531.3431.7617.8718.6429.3829.051. (CC) gcc options: -O3 -march=native -fopenmp

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous Halfspace5 2p a5 2p b5 a5 bab612182430SE +/- 0.20, N = 314.4914.4224.9925.0917.2116.821. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 6.0Time To Compile5 2p a5 2p b5 a5 bab510152025SE +/- 0.03, N = 315.4615.3420.3820.6416.9317.04

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 4 - Input: Bosphorus 1080p5 2p a5 2p b5 a5 bab3691215SE +/- 0.02, N = 310.4110.5010.7610.8310.3010.231. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic Model5 2p a5 2p b5 a5 bab510152025SE +/- 0.15, N = 311.2211.8019.0919.6313.8613.531. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab369121510.5221010.368504.495656.92145MIN: 8.88MIN: 8.54MIN: 3.9MIN: 6.321. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding5 2p a5 2p b5 ab11K22K33K44K55K49980.151199.124990.136827.51. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding5 2p a5 2p b5 ab9K18K27K36K45K41983.341983.320991.732799.51. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. Helens5 2p a5 2p b5 a5 bab510152025SE +/- 0.038048290, N = 39.6094538689.59476935018.86219907019.35555309211.67841461611.7533214601. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer ISPC - Model: Crown5 2p a5 2p b5 a5 bab1632486480SE +/- 0.11, N = 373.3373.2640.2440.1263.3763.36MIN: 71.99 / MAX: 75.06MIN: 72.13 / MAX: 74.82MIN: 39.64 / MAX: 40.95MIN: 39.64 / MAX: 40.56MIN: 62.18 / MAX: 66.58MIN: 62.44 / MAX: 66.52

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction5 2p a5 2p b5 ab51015202510.6210.7619.6317.211. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer - Model: Crown5 2p a5 2p b5 a5 ba20406080100SE +/- 0.05, N = 381.4581.0943.9543.9869.11MIN: 80.49 / MAX: 83.06MIN: 80.03 / MAX: 82.56MIN: 43.49 / MAX: 44.42MIN: 43.29 / MAX: 44.67MIN: 68.18 / MAX: 71.75

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab0.6491.2981.9472.5963.2452.678012.884511.379961.26107MIN: 1.89MIN: 1.79MIN: 1.25MIN: 1.061. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer - Model: Asian Dragon5 2p a5 2p b5 a5 ba20406080100SE +/- 0.14, N = 376.8176.6348.5248.5178.48MIN: 76.07 / MAX: 77.73MIN: 75.97 / MAX: 78.17MIN: 48.32 / MAX: 48.91MIN: 48.29 / MAX: 48.79MIN: 77.64 / MAX: 80.46

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.0.1Binary: Pathtracer ISPC - Model: Asian Dragon5 2p a5 2p b5 a5 bab1632486480SE +/- 0.01, N = 373.9673.7844.9145.1173.8473.83MIN: 73.12 / MAX: 75.27MIN: 72.91 / MAX: 74.82MIN: 44.68 / MAX: 45.26MIN: 44.88 / MAX: 45.48MIN: 73.31 / MAX: 75.9MIN: 73.34 / MAX: 74.66

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP5 2p a5 2p b5 ab2004006008001000436.68434.78869.57806.451. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics5 2p a5 2p b5 ab4812162015.6715.5112.0911.341. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.35 2p a5 2p b5 ab9K18K27K36K45K42030.0442551.9020920.4922257.141. (CXX) g++ options: -O3 -fopenmp -lm -lmpi_cxx -lmpi

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 8 - Input: Bosphorus 1080p5 2p a5 2p b5 a5 bab20406080100SE +/- 1.23, N = 5107.10104.17107.38108.36109.98109.711. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Pennant

Test: sedovbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig5 2p a5 2p b5 ab481216207.6563697.79332514.0011109.5636101. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Church Facade5 2p a5 2p b5 aab13002600390052006500SE +/- 3.21, N = 3568257295718579358331. (CXX) g++ options: -O3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab0.28040.56080.84121.12161.4020.9118060.9172260.6259301.246180MIN: 0.78MIN: 0.77MIN: 0.57MIN: 1.131. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding5 2p a5 2p b5 ab4K8K12K16K20K16641.015662.119018.319018.31. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding5 2p a5 2p b5 ab4K8K12K16K20K12102.512678.917750.420481.21. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.5.6Model: Lion5 2p a5 2p b5 aab10002000300040005000SE +/- 7.02, N = 3473448184760486748851. (CXX) g++ options: -O3

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 4K5 2p a5 2p b5 a5 bab50100150200250SE +/- 0.73, N = 3169.11176.11220.81226.73222.84214.931. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 4K5 2p a5 2p b5 a5 bab4080120160200SE +/- 0.70, N = 3169.66172.57199.69196.06195.92198.141. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab0.2580.5160.7741.0321.290.6743360.6693491.1465600.827622MIN: 0.62MIN: 0.6MIN: 1.09MIN: 0.781. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Pennant

Test: leblancbig

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig5 2p a5 2p b5 ab2468104.2947314.4180127.9740225.1040781. (CXX) g++ options: -fopenmp -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction5 2p a5 2p b5 ab1.14022.28043.42064.56085.7012.486542942.483869085.067564014.438041211. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU5 2p a5 2p b5 ab0.70311.40622.10932.81243.51551.851671.898592.762343.12487MIN: 1.61MIN: 1.58MIN: 2.59MIN: 2.061. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 12 - Input: Bosphorus 1080p5 2p a5 2p b5 a5 bab140280420560700SE +/- 2.97, N = 3540.73565.06629.30622.73604.84601.981. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.5Encoder Mode: Preset 13 - Input: Bosphorus 1080p5 2p a5 2p b5 a5 bab120240360480600SE +/- 5.77, N = 3525.15546.03571.08560.57552.24549.831. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq


Phoronix Test Suite v10.8.5