7713 2P Tests for a future article. 2 x AMD EPYC 7303 16-Core testing with a AMD DAYTONA_X (RYM1009B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310231-NE-77132P99738&sor&grw .
7713 2P Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution a b c AMD EPYC 7303 16-Core d e f g 2 x AMD EPYC 7713 64-Core @ 2.00GHz (128 Cores / 256 Threads) AMD DAYTONA_X (RYM1009B BIOS) AMD Starship/Matisse 256GB 3841GB Micron_9300_MTFDHAL3T8TDP ASPEED VE228 2 x Mellanox MT27710 Ubuntu 22.04 5.15.0-47-generic (x86_64) GNOME Shell 42.4 X Server 1.21.1.3 1.2.204 GCC 11.2.0 ext4 1920x1080 AMD EPYC 7303 16-Core @ 2.40GHz (16 Cores / 32 Threads) 2 x AMD EPYC 7203 8-Core @ 2.80GHz (16 Cores / 32 Threads) 512GB 1024x768 2 x AMD EPYC 7303 16-Core @ 2.40GHz (32 Cores / 64 Threads) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 Python Details - Python 3.10.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
7713 2P openradioss: Bumper Beam remhos: Sample Remap Example openradioss: Chrysler Neon 1M openradioss: Cell Phone Drop Test openradioss: Bird Strike on Windshield openradioss: Rubber O-Ring Seal Installation openradioss: INIVOL and Fluid Structure Interaction Drop Container laghos: Sedov Blast Wave, ube_922_hex.mesh laghos: Triple Point Problem onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU libxsmm: 64 onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU openvino: Face Detection FP16-INT8 - CPU easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 openvino: Face Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU compress-7zip: Compression Rating compress-7zip: Decompression Rating build-linux-kernel: defconfig build-linux-kernel: allmodconfig blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only avifenc: 0 avifenc: 2 avifenc: 6 avifenc: 6, Lossless avifenc: 10, Lossless embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer - Crown embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj embree: Pathtracer ISPC - Crown oidn: RT.hdr_alb_nrm.3840x2160 - CPU-Only oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only oidn: RTLightmap.hdr.4096x4096 - CPU-Only openvkl: vklBenchmarkCPU Scalar openvkl: vklBenchmarkCPU ISPC easywave: e2Asean Grid + BengkuluSept2007 Source - 240 openvino: Person Vehicle Bike Detection FP16 - CPU easywave: e2Asean Grid + BengkuluSept2007 Source - 2400 libxsmm: 32 a b c AMD EPYC 7303 16-Core d e f g 115.89 11.284 174.49 26.54 147.64 130.4 132.37 282.55 164.19 202.68 157.75 980.5 51.12 623.9 3308.28 9.66 9543.23 3.34 1342.68 23.81 226.15 141.33 5333.24 23.98 2687.19 899.15 142.22 98494.41 0.85 507611 643823 23.875 182.491 16.08 40.28 20.44 153.78 49.66 75.603 41.424 3.248 6.786 5.474 11.89 540.3 116.73 11.02 173.59 26.35 146.17 134.61 129.61 282.08 162.65 203.14 157.4 923.9 51.17 623.64 3302.83 9.68 9544.95 3.34 1340.29 23.85 226.27 141.24 5334.87 23.98 2655.09 897.77 142.42 101883.13 0.85 505036 647995 23.974 179.986 16.08 40.21 20.73 152.94 49.32 75.778 40.837 3.331 6.881 5.485 12.03 471 116.3 11.709 174.45 26.53 146.31 131.25 132.8 279.20 167.28 203.96 156.74 920.8 51.2 623.6 3303.59 9.67 9527.76 3.35 1340.12 23.86 222.73 143.55 5335.15 23.97 2694.27 898.69 142.29 101985.21 0.86 499347 678765 23.939 179.539 16.19 40.04 20.45 152.5 48.96 75.836 42.41 3.309 6.929 5.535 11.86 467.3 118.33 37.307 551.24 65.55 202.93 102.8 357.27 148.49 132.65 42.11 189.92 406.9 8.66 921.14 570.58 14 1754.67 4.55 237.79 33.62 47.46 168.3 858.21 18.63 520.89 155.5 102.82 23110.2 0.68 135892 116102 67.408 815.395 89.45 226.06 109.21 871.17 280.03 120.85 60.19 5.282 9.186 5.779 15.34 273.2 134.32 39.71 489.75 63.71 235.98 129.36 429.22 132.46 114.18 2.67723 4.27338 5.54136 1.94173 4.63185 1.92811 2.34565 1.6509 4.51333 0.768449 3019.31 3037.72 3028.36 1451.2 1445.86 37.06 107.81 442.5 1466.25 8.35 65.733 478.92 470.46 8.49 1232.95 3.24 195.5 20.44 43.18 92.53 856.63 18.65 369.7 171.61 93.16 23725.57 0.66 113384 106826 68.675 830.492 94.86 237.87 116.66 974.94 296.74 125.918 64.146 5.771 10.133 6.186 17.4784 15.7439 16.1253 15.8702 13.8445 14.3991 0.56 0.56 0.28 144 267 3.291 10.8 149.203 356.5 105.44 23.463 304.03 41.05 192.24 118.07 302.03 234.135345612 182.23 1.74244 4.10412 5.67018 1.93352 3.14371 1.16856 1.89405 1.44749 3.84916 1.05324 3494.51 3379.25 3216.13 1114.67 1161.8 72.71 109.89 727.2 1137.11 17.2 55.336 463.57 1096.25 7.29 3153.9 2.53 415.32 19.25 85.51 93.45 1727.34 18.46 873.34 303.8 105.22 46823.73 0.67 210620 214511 43.41 462.138 50.21 116.68 62.52 505.43 155.52 92.447 48.724 3.965 7.801 5.855 32.9696 29.6195 30.1028 29.9805 26.0698 26.5233 0.95 0.95 0.48 273 495 2.831 9.13 140.297 484.7 105.68 23.488 306.37 41.01 192.62 120.33 305.55 233.54 181.55 1.69817 3.83995 5.66435 1.9522 3.06111 1.16545 1.77064 1.92348 4.18974 1.14379 3341.93 3281.24 3462.9 1139.54 1153.45 73.23 109.16 737.3 1128.3 17.27 60.049 462.78 1095.27 7.29 3148.82 2.53 415.12 19.25 85.46 93.5 1708.45 18.67 865.28 303.83 105.23 45126.45 0.7 212139 216258 43.477 462.599 50.25 117.05 62.18 504.18 157.08 91.913 48.844 3.976 7.824 5.817 33.0392 29.5206 30.3378 30.1314 26.2737 26.6112 0.95 0.95 0.48 275 497 2.672 9.21 130.447 482.9 107.04 23.723 303.58 40.85 192.49 122.81 307.24 233.047063651 180.93 1.84652 4.05768 5.63319 1.95024 3.14105 1.17023 2.1207 2.28469 3.94733 1.08661 3451.09 3556.44 3272.36 1129.54 1145.62 72.14 110.71 723.4 1149.15 17.28 56.337 462.57 1095.68 7.29 3148.97 2.53 415.15 19.25 84.86 94.17 1725.37 18.48 875.64 303.59 105.33 47926.34 0.66 209887 218417 43.318 462.696 50.09 118.34 62.31 506.55 157.59 92.528 48.731 3.962 7.825 5.815 32.9662 29.5908 30.2552 30.0261 26.1574 26.3986 0.95 0.95 0.48 274 495 2.766 9.11 133.177 470.9 OpenBenchmarking.org
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam e f g a c b AMD EPYC 7303 16-Core d 30 60 90 120 150 105.44 105.68 107.04 115.89 116.30 116.73 118.33 134.32
Remhos Test: Sample Remap Example OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example b a c e f g AMD EPYC 7303 16-Core d 9 18 27 36 45 11.02 11.28 11.71 23.46 23.49 23.72 37.31 39.71 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenRadioss Model: Chrysler Neon 1M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M b c a g e f d AMD EPYC 7303 16-Core 120 240 360 480 600 173.59 174.45 174.49 303.58 304.03 306.37 489.75 551.24
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test b c a g f e d AMD EPYC 7303 16-Core 15 30 45 60 75 26.35 26.53 26.54 40.85 41.01 41.05 63.71 65.55
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield b c a e g f AMD EPYC 7303 16-Core d 50 100 150 200 250 146.17 146.31 147.64 192.24 192.49 192.62 202.93 235.98
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation AMD EPYC 7303 16-Core e f g d a c b 30 60 90 120 150 102.80 118.07 120.33 122.81 129.36 130.40 131.25 134.61
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container b a c e f g AMD EPYC 7303 16-Core d 90 180 270 360 450 129.61 132.37 132.80 302.03 305.55 307.24 357.27 429.22
Laghos Test: Sedov Blast Wave, ube_922_hex.mesh OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh a b c e f g AMD EPYC 7303 16-Core d 60 120 180 240 300 282.55 282.08 279.20 234.14 233.54 233.05 148.49 132.46 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Laghos Test: Triple Point Problem OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem e f g c a b AMD EPYC 7303 16-Core d 40 80 120 160 200 182.23 181.55 180.93 167.28 164.19 162.65 132.65 114.18 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU f e g d 0.6024 1.2048 1.8072 2.4096 3.012 1.69817 1.74244 1.84652 2.67723 MIN: 1.41 MIN: 1.45 MIN: 1.33 MIN: 2.34 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU f g e d 0.9615 1.923 2.8845 3.846 4.8075 3.83995 4.05768 4.10412 4.27338 MIN: 3.1 MIN: 2.86 MIN: 3.26 MIN: 3.64 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU d g f e 1.2758 2.5516 3.8274 5.1032 6.379 5.54136 5.63319 5.66435 5.67018 MIN: 4.33 MIN: 4.7 MIN: 4.73 MIN: 4.65 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU e d g f 0.4392 0.8784 1.3176 1.7568 2.196 1.93352 1.94173 1.95024 1.95220 MIN: 1.72 MIN: 1.77 MIN: 1.78 MIN: 1.79 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU f g e d 1.0422 2.0844 3.1266 4.1688 5.211 3.06111 3.14105 3.14371 4.63185 MIN: 2.86 MIN: 2.86 MIN: 2.85 MIN: 4.26 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU f e g d 0.4338 0.8676 1.3014 1.7352 2.169 1.16545 1.16856 1.17023 1.92811 MIN: 1.11 MIN: 1.1 MIN: 1.1 MIN: 1.74 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU f e g d 0.5278 1.0556 1.5834 2.1112 2.639 1.77064 1.89405 2.12070 2.34565 MIN: 1.42 MIN: 1.51 MIN: 1.65 MIN: 2.11 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU e d f g 0.5141 1.0282 1.5423 2.0564 2.5705 1.44749 1.65090 1.92348 2.28469 MIN: 1.01 MIN: 1.3 MIN: 1.39 MIN: 1.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU e g f d 1.0155 2.031 3.0465 4.062 5.0775 3.84916 3.94733 4.18974 4.51333 MIN: 3.54 MIN: 3.55 MIN: 3.58 MIN: 2.92 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU d e g f 0.2574 0.5148 0.7722 1.0296 1.287 0.768449 1.053240 1.086610 1.143790 MIN: 0.6 MIN: 0.81 MIN: 0.86 MIN: 0.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU d f g e 700 1400 2100 2800 3500 3019.31 3341.93 3451.09 3494.51 MIN: 2971.6 MIN: 3231.53 MIN: 3397.7 MIN: 3430.83 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU d f e g 800 1600 2400 3200 4000 3037.72 3281.24 3379.25 3556.44 MIN: 2984.39 MIN: 3185.12 MIN: 3265.73 MIN: 3506.87 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU d e g f 700 1400 2100 2800 3500 3028.36 3216.13 3272.36 3462.90 MIN: 2984.7 MIN: 3111.44 MIN: 3161.82 MIN: 3392.2 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU e g f d 300 600 900 1200 1500 1114.67 1129.54 1139.54 1451.20 MIN: 1074.44 MIN: 1079.67 MIN: 1087.78 MIN: 1412.26 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU g f e d 300 600 900 1200 1500 1145.62 1153.45 1161.80 1445.86 MIN: 1102.84 MIN: 1089.33 MIN: 1098.59 MIN: 1399.09 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU c b a f e g AMD EPYC 7303 16-Core d 40 80 120 160 200 203.96 203.14 202.68 73.23 72.71 72.14 42.11 37.06 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU d f e g c b a AMD EPYC 7303 16-Core 40 80 120 160 200 107.81 109.16 109.89 110.71 156.74 157.40 157.75 189.92 MIN: 102.18 / MAX: 124.72 MIN: 99.46 / MAX: 143.57 MIN: 93.05 / MAX: 140.85 MIN: 98.56 / MAX: 144.8 MIN: 131.77 / MAX: 297.92 MIN: 128.42 / MAX: 281.86 MIN: 108.19 / MAX: 274.81 MIN: 169.75 / MAX: 200.94 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 a b c f e g d AMD EPYC 7303 16-Core 200 400 600 800 1000 980.5 923.9 920.8 737.3 727.2 723.4 442.5 406.9 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU f e g d 300 600 900 1200 1500 1128.30 1137.11 1149.15 1466.25 MIN: 1073.3 MIN: 1093.13 MIN: 1097.47 MIN: 1422.8 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU c b a g f e AMD EPYC 7303 16-Core d 12 24 36 48 60 51.20 51.17 51.12 17.28 17.27 17.20 8.66 8.35 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 e g f d 15 30 45 60 75 55.34 56.34 60.05 65.73 1. (CXX) g++ options: -O3 -fopenmp
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU g f e d c b a AMD EPYC 7303 16-Core 200 400 600 800 1000 462.57 462.78 463.57 478.92 623.60 623.64 623.90 921.14 MIN: 455.99 / MAX: 481.91 MIN: 456.11 / MAX: 479.6 MIN: 456.35 / MAX: 480.41 MIN: 474.19 / MAX: 500.74 MIN: 599.33 / MAX: 668.6 MIN: 593.36 / MAX: 672.04 MIN: 594.87 / MAX: 669.12 MIN: 883.65 / MAX: 932.87 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU a c b e g f AMD EPYC 7303 16-Core d 700 1400 2100 2800 3500 3308.28 3303.59 3302.83 1096.25 1095.68 1095.27 570.58 470.46 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU e f g d a c b AMD EPYC 7303 16-Core 4 8 12 16 20 7.29 7.29 7.29 8.49 9.66 9.67 9.68 14.00 MIN: 7.14 / MAX: 16.48 MIN: 7.14 / MAX: 16.05 MIN: 7.15 / MAX: 16.28 MIN: 7.96 / MAX: 14.96 MIN: 7.98 / MAX: 45.57 MIN: 7.95 / MAX: 41.68 MIN: 7.94 / MAX: 44.33 MIN: 7.54 / MAX: 24.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU b a c e g f AMD EPYC 7303 16-Core d 2K 4K 6K 8K 10K 9544.95 9543.23 9527.76 3153.90 3148.97 3148.82 1754.67 1232.95 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU e f g d a b c AMD EPYC 7303 16-Core 1.0238 2.0476 3.0714 4.0952 5.119 2.53 2.53 2.53 3.24 3.34 3.34 3.35 4.55 MIN: 2.48 / MAX: 9.57 MIN: 2.48 / MAX: 10.15 MIN: 2.47 / MAX: 10.12 MIN: 2.94 / MAX: 8.15 MIN: 2.82 / MAX: 23.64 MIN: 2.79 / MAX: 25.65 MIN: 2.81 / MAX: 23.74 MIN: 2.67 / MAX: 16.04 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b c e g f AMD EPYC 7303 16-Core d 300 600 900 1200 1500 1342.68 1340.29 1340.12 415.32 415.15 415.12 237.79 195.50 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU e f g d a b c AMD EPYC 7303 16-Core 8 16 24 32 40 19.25 19.25 19.25 20.44 23.81 23.85 23.86 33.62 MIN: 17.55 / MAX: 32.52 MIN: 17.46 / MAX: 28.16 MIN: 17.49 / MAX: 27.05 MIN: 18.84 / MAX: 37.99 MIN: 20.24 / MAX: 65.85 MIN: 20.69 / MAX: 64.82 MIN: 20.96 / MAX: 60.47 MIN: 25.37 / MAX: 42.95 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU b a c e f g AMD EPYC 7303 16-Core d 50 100 150 200 250 226.27 226.15 222.73 85.51 85.46 84.86 47.46 43.18 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU d e f g b a c AMD EPYC 7303 16-Core 40 80 120 160 200 92.53 93.45 93.50 94.17 141.24 141.33 143.55 168.30 MIN: 87.83 / MAX: 184.88 MIN: 77.59 / MAX: 139.64 MIN: 82.68 / MAX: 155.42 MIN: 85.17 / MAX: 160.36 MIN: 114.93 / MAX: 488.84 MIN: 113.19 / MAX: 547.49 MIN: 114.18 / MAX: 525.63 MIN: 141.17 / MAX: 184.72 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU c b a e g f AMD EPYC 7303 16-Core d 1100 2200 3300 4400 5500 5335.15 5334.87 5333.24 1727.34 1725.37 1708.45 858.21 856.63 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU e g AMD EPYC 7303 16-Core d f c a b 6 12 18 24 30 18.46 18.48 18.63 18.65 18.67 23.97 23.98 23.98 MIN: 17.85 / MAX: 32.35 MIN: 17.82 / MAX: 33.34 MIN: 8.99 / MAX: 30.48 MIN: 17.81 / MAX: 33.21 MIN: 17.84 / MAX: 33.76 MIN: 21.4 / MAX: 33.21 MIN: 21.19 / MAX: 37.06 MIN: 21.31 / MAX: 35.8 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU c a b g e f AMD EPYC 7303 16-Core d 600 1200 1800 2400 3000 2694.27 2687.19 2655.09 875.64 873.34 865.28 520.89 369.70 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU a c b f e g d AMD EPYC 7303 16-Core 200 400 600 800 1000 899.15 898.69 897.77 303.83 303.80 303.59 171.61 155.50 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU d AMD EPYC 7303 16-Core e f g a c b 30 60 90 120 150 93.16 102.82 105.22 105.23 105.33 142.22 142.29 142.42 MIN: 86.16 / MAX: 102.67 MIN: 69.31 / MAX: 117.98 MIN: 91.67 / MAX: 111.86 MIN: 92.98 / MAX: 112.15 MIN: 94.87 / MAX: 111.06 MIN: 111.28 / MAX: 182.41 MIN: 110.33 / MAX: 173.87 MIN: 106.02 / MAX: 184.44 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU c b a g e f d AMD EPYC 7303 16-Core 20K 40K 60K 80K 100K 101985.21 101883.13 98494.41 47926.34 46823.73 45126.45 23725.57 23110.20 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU d g e AMD EPYC 7303 16-Core f a b c 0.1935 0.387 0.5805 0.774 0.9675 0.66 0.66 0.67 0.68 0.70 0.85 0.85 0.86 MIN: 0.61 / MAX: 6.63 MIN: 0.61 / MAX: 10.25 MIN: 0.61 / MAX: 10.58 MIN: 0.4 / MAX: 11.17 MIN: 0.61 / MAX: 10.2 MIN: 0.72 / MAX: 27.72 MIN: 0.72 / MAX: 21.71 MIN: 0.71 / MAX: 40.39 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating a b c f e g AMD EPYC 7303 16-Core d 110K 220K 330K 440K 550K 507611 505036 499347 212139 210620 209887 135892 113384 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating c b a g f e AMD EPYC 7303 16-Core d 150K 300K 450K 600K 750K 678765 647995 643823 218417 216258 214511 116102 106826 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig a c b g e f AMD EPYC 7303 16-Core d 15 30 45 60 75 23.88 23.94 23.97 43.32 43.41 43.48 67.41 68.68
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig c b a e f g AMD EPYC 7303 16-Core d 200 400 600 800 1000 179.54 179.99 182.49 462.14 462.60 462.70 815.40 830.49
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only a b c g e f AMD EPYC 7303 16-Core d 20 40 60 80 100 16.08 16.08 16.19 50.09 50.21 50.25 89.45 94.86
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only c b a e f g AMD EPYC 7303 16-Core d 50 100 150 200 250 40.04 40.21 40.28 116.68 117.05 118.34 226.06 237.87
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only a c b f g e AMD EPYC 7303 16-Core d 30 60 90 120 150 20.44 20.45 20.73 62.18 62.31 62.52 109.21 116.66
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only c b a f e g AMD EPYC 7303 16-Core d 200 400 600 800 1000 152.50 152.94 153.78 504.18 505.43 506.55 871.17 974.94
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only c b a e f g AMD EPYC 7303 16-Core d 60 120 180 240 300 48.96 49.32 49.66 155.52 157.08 157.59 280.03 296.74
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 a b c f e g AMD EPYC 7303 16-Core d 30 60 90 120 150 75.60 75.78 75.84 91.91 92.45 92.53 120.85 125.92 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 b a c e g f AMD EPYC 7303 16-Core d 14 28 42 56 70 40.84 41.42 42.41 48.72 48.73 48.84 60.19 64.15 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 a c b g e f AMD EPYC 7303 16-Core d 1.2985 2.597 3.8955 5.194 6.4925 3.248 3.309 3.331 3.962 3.965 3.976 5.282 5.771 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless a b c e f g AMD EPYC 7303 16-Core d 3 6 9 12 15 6.786 6.881 6.929 7.801 7.824 7.825 9.186 10.133 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless a b c AMD EPYC 7303 16-Core g f e d 2 4 6 8 10 5.474 5.485 5.535 5.779 5.815 5.817 5.855 6.186 1. (CXX) g++ options: -O3 -fPIC -lm
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon f e g d 8 16 24 32 40 33.04 32.97 32.97 17.48 MIN: 32.83 / MAX: 33.53 MIN: 32.77 / MAX: 33.49 MIN: 32.78 / MAX: 33.29 MIN: 17.31 / MAX: 17.65
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon Obj e g f d 7 14 21 28 35 29.62 29.59 29.52 15.74 MIN: 29.37 / MAX: 30.21 MIN: 29.36 / MAX: 30.31 MIN: 29.27 / MAX: 30.16 MIN: 15.64 / MAX: 15.92
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Crown f g e d 7 14 21 28 35 30.34 30.26 30.10 16.13 MIN: 30.01 / MAX: 30.87 MIN: 29.92 / MAX: 30.97 MIN: 29.78 / MAX: 30.77 MIN: 15.95 / MAX: 16.45
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon f g e d 7 14 21 28 35 30.13 30.03 29.98 15.87 MIN: 29.94 / MAX: 30.64 MIN: 29.84 / MAX: 30.34 MIN: 29.79 / MAX: 30.37 MIN: 15.72 / MAX: 16.2
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Obj f g e d 6 12 18 24 30 26.27 26.16 26.07 13.84 MIN: 26.05 / MAX: 26.62 MIN: 25.98 / MAX: 26.75 MIN: 25.87 / MAX: 26.79 MIN: 13.7 / MAX: 14.06
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown f e g d 6 12 18 24 30 26.61 26.52 26.40 14.40 MIN: 26.29 / MAX: 27.11 MIN: 26.18 / MAX: 27.13 MIN: 26.05 / MAX: 27.08 MIN: 14.25 / MAX: 14.69
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only g f e d 0.2138 0.4276 0.6414 0.8552 1.069 0.95 0.95 0.95 0.56
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only g f e d 0.2138 0.4276 0.6414 0.8552 1.069 0.95 0.95 0.95 0.56
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only g f e d 0.108 0.216 0.324 0.432 0.54 0.48 0.48 0.48 0.28
OpenVKL Benchmark: vklBenchmarkCPU Scalar OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU Scalar f g e d 60 120 180 240 300 275 274 273 144 MIN: 21 / MAX: 5065 MIN: 21 / MAX: 5053 MIN: 21 / MAX: 5037 MIN: 11 / MAX: 2667
OpenVKL Benchmark: vklBenchmarkCPU ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC f g e d 110 220 330 440 550 497 495 495 267 MIN: 43 / MAX: 6162 MIN: 43 / MAX: 6166 MIN: 44 / MAX: 6142 MIN: 23 / MAX: 3404
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 f g e d 0.7405 1.481 2.2215 2.962 3.7025 2.672 2.766 2.831 3.291 1. (CXX) g++ options: -O3 -fopenmp
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU g e f d c a b AMD EPYC 7303 16-Core 4 8 12 16 20 9.11 9.13 9.21 10.80 11.86 11.89 12.03 15.34 MIN: 8.35 / MAX: 22.58 MIN: 8.44 / MAX: 24.1 MIN: 8.24 / MAX: 23.34 MIN: 9.8 / MAX: 19.62 MIN: 9.49 / MAX: 57.72 MIN: 10.03 / MAX: 55.71 MIN: 10.4 / MAX: 55.41 MIN: 9.22 / MAX: 27.56 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 f g e d 30 60 90 120 150 130.45 133.18 140.30 149.20 1. (CXX) g++ options: -O3 -fopenmp
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 a e f b g c d AMD EPYC 7303 16-Core 120 240 360 480 600 540.3 484.7 482.9 471.0 470.9 467.3 356.5 273.2 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Phoronix Test Suite v10.8.5