7713 2P Tests for a future article. 2 x AMD EPYC 7303 16-Core testing with a AMD DAYTONA_X (RYM1009B BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310231-NE-77132P99738&rdt .
7713 2P Processor Motherboard Chipset Memory Disk Graphics Monitor Network OS Kernel Desktop Display Server Vulkan Compiler File-System Screen Resolution a b c AMD EPYC 7303 16-Core d e f g 2 x AMD EPYC 7713 64-Core @ 2.00GHz (128 Cores / 256 Threads) AMD DAYTONA_X (RYM1009B BIOS) AMD Starship/Matisse 256GB 3841GB Micron_9300_MTFDHAL3T8TDP ASPEED VE228 2 x Mellanox MT27710 Ubuntu 22.04 5.15.0-47-generic (x86_64) GNOME Shell 42.4 X Server 1.21.1.3 1.2.204 GCC 11.2.0 ext4 1920x1080 AMD EPYC 7303 16-Core @ 2.40GHz (16 Cores / 32 Threads) 2 x AMD EPYC 7203 8-Core @ 2.80GHz (16 Cores / 32 Threads) 512GB 1024x768 2 x AMD EPYC 7303 16-Core @ 2.40GHz (32 Cores / 64 Threads) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa001173 Python Details - Python 3.10.6 Security Details - itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
7713 2P libxsmm: 32 libxsmm: 64 laghos: Triple Point Problem laghos: Sedov Blast Wave, ube_922_hex.mesh openradioss: Bumper Beam openradioss: Chrysler Neon 1M openradioss: Cell Phone Drop Test openradioss: Bird Strike on Windshield openradioss: Rubber O-Ring Seal Installation openradioss: INIVOL and Fluid Structure Interaction Drop Container remhos: Sample Remap Example compress-7zip: Compression Rating compress-7zip: Decompression Rating avifenc: 0 avifenc: 2 avifenc: 6 avifenc: 6, Lossless avifenc: 10, Lossless build-linux-kernel: defconfig build-linux-kernel: allmodconfig blender: BMW27 - CPU-Only blender: Classroom - CPU-Only blender: Fishy Cat - CPU-Only blender: Barbershop - CPU-Only blender: Pabellon Barcelona - CPU-Only openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU embree: Pathtracer - Asian Dragon embree: Pathtracer - Asian Dragon Obj embree: Pathtracer - Crown embree: Pathtracer ISPC - Asian Dragon embree: Pathtracer ISPC - Asian Dragon Obj embree: Pathtracer ISPC - Crown openvkl: vklBenchmarkCPU Scalar openvkl: vklBenchmarkCPU ISPC oidn: RT.hdr_alb_nrm.3840x2160 - CPU-Only oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only oidn: RTLightmap.hdr.4096x4096 - CPU-Only onednn: Convolution Batch Shapes Auto - f32 - CPU onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU onednn: Deconvolution Batch shapes_1d - f32 - CPU onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU onednn: Deconvolution Batch shapes_3d - f32 - CPU onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU onednn: IP Shapes 1D - f32 - CPU onednn: IP Shapes 1D - u8s8f32 - CPU onednn: IP Shapes 3D - f32 - CPU onednn: IP Shapes 3D - u8s8f32 - CPU onednn: Recurrent Neural Network Training - f32 - CPU onednn: Recurrent Neural Network Training - u8s8f32 - CPU onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Inference - u8s8f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU easywave: e2Asean Grid + BengkuluSept2007 Source - 240 easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 easywave: e2Asean Grid + BengkuluSept2007 Source - 2400 a b c AMD EPYC 7303 16-Core d e f g 540.3 980.5 164.19 282.55 115.89 174.49 26.54 147.64 130.4 132.37 11.284 507611 643823 75.603 41.424 3.248 6.786 5.474 23.875 182.491 16.08 40.28 20.44 153.78 49.66 202.68 157.75 51.12 623.9 3308.28 9.66 9543.23 3.34 1342.68 23.81 226.15 141.33 5333.24 23.98 2687.19 11.89 899.15 142.22 98494.41 0.85 471 923.9 162.65 282.08 116.73 173.59 26.35 146.17 134.61 129.61 11.02 505036 647995 75.778 40.837 3.331 6.881 5.485 23.974 179.986 16.08 40.21 20.73 152.94 49.32 203.14 157.4 51.17 623.64 3302.83 9.68 9544.95 3.34 1340.29 23.85 226.27 141.24 5334.87 23.98 2655.09 12.03 897.77 142.42 101883.13 0.85 467.3 920.8 167.28 279.20 116.3 174.45 26.53 146.31 131.25 132.8 11.709 499347 678765 75.836 42.41 3.309 6.929 5.535 23.939 179.539 16.19 40.04 20.45 152.5 48.96 203.96 156.74 51.2 623.6 3303.59 9.67 9527.76 3.35 1340.12 23.86 222.73 143.55 5335.15 23.97 2694.27 11.86 898.69 142.29 101985.21 0.86 273.2 406.9 132.65 148.49 118.33 551.24 65.55 202.93 102.8 357.27 37.307 135892 116102 120.85 60.19 5.282 9.186 5.779 67.408 815.395 89.45 226.06 109.21 871.17 280.03 42.11 189.92 8.66 921.14 570.58 14 1754.67 4.55 237.79 33.62 47.46 168.3 858.21 18.63 520.89 15.34 155.5 102.82 23110.2 0.68 356.5 442.5 114.18 132.46 134.32 489.75 63.71 235.98 129.36 429.22 39.71 113384 106826 125.918 64.146 5.771 10.133 6.186 68.675 830.492 94.86 237.87 116.66 974.94 296.74 37.06 107.81 8.35 478.92 470.46 8.49 1232.95 3.24 195.5 20.44 43.18 92.53 856.63 18.65 369.7 10.8 171.61 93.16 23725.57 0.66 17.4784 15.7439 16.1253 15.8702 13.8445 14.3991 144 267 0.56 0.56 0.28 2.67723 4.27338 5.54136 1.94173 4.63185 1.92811 2.34565 1.6509 4.51333 0.768449 3019.31 3037.72 3028.36 1451.2 1445.86 1466.25 3.291 65.733 149.203 484.7 727.2 182.23 234.135345612 105.44 304.03 41.05 192.24 118.07 302.03 23.463 210620 214511 92.447 48.724 3.965 7.801 5.855 43.41 462.138 50.21 116.68 62.52 505.43 155.52 72.71 109.89 17.2 463.57 1096.25 7.29 3153.9 2.53 415.32 19.25 85.51 93.45 1727.34 18.46 873.34 9.13 303.8 105.22 46823.73 0.67 32.9696 29.6195 30.1028 29.9805 26.0698 26.5233 273 495 0.95 0.95 0.48 1.74244 4.10412 5.67018 1.93352 3.14371 1.16856 1.89405 1.44749 3.84916 1.05324 3494.51 3379.25 3216.13 1114.67 1161.8 1137.11 2.831 55.336 140.297 482.9 737.3 181.55 233.54 105.68 306.37 41.01 192.62 120.33 305.55 23.488 212139 216258 91.913 48.844 3.976 7.824 5.817 43.477 462.599 50.25 117.05 62.18 504.18 157.08 73.23 109.16 17.27 462.78 1095.27 7.29 3148.82 2.53 415.12 19.25 85.46 93.5 1708.45 18.67 865.28 9.21 303.83 105.23 45126.45 0.7 33.0392 29.5206 30.3378 30.1314 26.2737 26.6112 275 497 0.95 0.95 0.48 1.69817 3.83995 5.66435 1.9522 3.06111 1.16545 1.77064 1.92348 4.18974 1.14379 3341.93 3281.24 3462.9 1139.54 1153.45 1128.3 2.672 60.049 130.447 470.9 723.4 180.93 233.047063651 107.04 303.58 40.85 192.49 122.81 307.24 23.723 209887 218417 92.528 48.731 3.962 7.825 5.815 43.318 462.696 50.09 118.34 62.31 506.55 157.59 72.14 110.71 17.28 462.57 1095.68 7.29 3148.97 2.53 415.15 19.25 84.86 94.17 1725.37 18.48 875.64 9.11 303.59 105.33 47926.34 0.66 32.9662 29.5908 30.2552 30.0261 26.1574 26.3986 274 495 0.95 0.95 0.48 1.84652 4.05768 5.63319 1.95024 3.14105 1.17023 2.1207 2.28469 3.94733 1.08661 3451.09 3556.44 3272.36 1129.54 1145.62 1149.15 2.766 56.337 133.177 OpenBenchmarking.org
libxsmm M N K: 32 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 32 a b c AMD EPYC 7303 16-Core d e f g 120 240 360 480 600 540.3 471.0 467.3 273.2 356.5 484.7 482.9 470.9 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
libxsmm M N K: 64 OpenBenchmarking.org GFLOPS/s, More Is Better libxsmm 2-1.17-3645 M N K: 64 a b c AMD EPYC 7303 16-Core d e f g 200 400 600 800 1000 980.5 923.9 920.8 406.9 442.5 727.2 737.3 723.4 1. (CXX) g++ options: -dynamic -Bstatic -static-libgcc -lgomp -lm -lrt -ldl -lquadmath -lstdc++ -pthread -fPIC -std=c++14 -O2 -fopenmp-simd -funroll-loops -ftree-vectorize -fdata-sections -ffunction-sections -fvisibility=hidden -msse4.2
Laghos Test: Triple Point Problem OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem a b c AMD EPYC 7303 16-Core d e f g 40 80 120 160 200 164.19 162.65 167.28 132.65 114.18 182.23 181.55 180.93 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Laghos Test: Sedov Blast Wave, ube_922_hex.mesh OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh a b c AMD EPYC 7303 16-Core d e f g 60 120 180 240 300 282.55 282.08 279.20 148.49 132.46 234.14 233.54 233.05 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam a b c AMD EPYC 7303 16-Core d e f g 30 60 90 120 150 115.89 116.73 116.30 118.33 134.32 105.44 105.68 107.04
OpenRadioss Model: Chrysler Neon 1M OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Chrysler Neon 1M a b c AMD EPYC 7303 16-Core d e f g 120 240 360 480 600 174.49 173.59 174.45 551.24 489.75 304.03 306.37 303.58
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test a b c AMD EPYC 7303 16-Core d e f g 15 30 45 60 75 26.54 26.35 26.53 65.55 63.71 41.05 41.01 40.85
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield a b c AMD EPYC 7303 16-Core d e f g 50 100 150 200 250 147.64 146.17 146.31 202.93 235.98 192.24 192.62 192.49
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation a b c AMD EPYC 7303 16-Core d e f g 30 60 90 120 150 130.40 134.61 131.25 102.80 129.36 118.07 120.33 122.81
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container a b c AMD EPYC 7303 16-Core d e f g 90 180 270 360 450 132.37 129.61 132.80 357.27 429.22 302.03 305.55 307.24
Remhos Test: Sample Remap Example OpenBenchmarking.org Seconds, Fewer Is Better Remhos 1.0 Test: Sample Remap Example a b c AMD EPYC 7303 16-Core d e f g 9 18 27 36 45 11.28 11.02 11.71 37.31 39.71 23.46 23.49 23.72 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating a b c AMD EPYC 7303 16-Core d e f g 110K 220K 330K 440K 550K 507611 505036 499347 135892 113384 210620 212139 209887 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating a b c AMD EPYC 7303 16-Core d e f g 150K 300K 450K 600K 750K 643823 647995 678765 116102 106826 214511 216258 218417 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
libavif avifenc Encoder Speed: 0 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 0 a b c AMD EPYC 7303 16-Core d e f g 30 60 90 120 150 75.60 75.78 75.84 120.85 125.92 92.45 91.91 92.53 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 a b c AMD EPYC 7303 16-Core d e f g 14 28 42 56 70 41.42 40.84 42.41 60.19 64.15 48.72 48.84 48.73 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 a b c AMD EPYC 7303 16-Core d e f g 1.2985 2.597 3.8955 5.194 6.4925 3.248 3.331 3.309 5.282 5.771 3.965 3.976 3.962 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless a b c AMD EPYC 7303 16-Core d e f g 3 6 9 12 15 6.786 6.881 6.929 9.186 10.133 7.801 7.824 7.825 1. (CXX) g++ options: -O3 -fPIC -lm
libavif avifenc Encoder Speed: 10, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 10, Lossless a b c AMD EPYC 7303 16-Core d e f g 2 4 6 8 10 5.474 5.485 5.535 5.779 6.186 5.855 5.817 5.815 1. (CXX) g++ options: -O3 -fPIC -lm
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig a b c AMD EPYC 7303 16-Core d e f g 15 30 45 60 75 23.88 23.97 23.94 67.41 68.68 43.41 43.48 43.32
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig a b c AMD EPYC 7303 16-Core d e f g 200 400 600 800 1000 182.49 179.99 179.54 815.40 830.49 462.14 462.60 462.70
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only a b c AMD EPYC 7303 16-Core d e f g 20 40 60 80 100 16.08 16.08 16.19 89.45 94.86 50.21 50.25 50.09
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Classroom - Compute: CPU-Only a b c AMD EPYC 7303 16-Core d e f g 50 100 150 200 250 40.28 40.21 40.04 226.06 237.87 116.68 117.05 118.34
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only a b c AMD EPYC 7303 16-Core d e f g 30 60 90 120 150 20.44 20.73 20.45 109.21 116.66 62.52 62.18 62.31
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Barbershop - Compute: CPU-Only a b c AMD EPYC 7303 16-Core d e f g 200 400 600 800 1000 153.78 152.94 152.50 871.17 974.94 505.43 504.18 506.55
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Pabellon Barcelona - Compute: CPU-Only a b c AMD EPYC 7303 16-Core d e f g 60 120 180 240 300 49.66 49.32 48.96 280.03 296.74 155.52 157.08 157.59
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 40 80 120 160 200 202.68 203.14 203.96 42.11 37.06 72.71 73.23 72.14 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Detection FP16 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 40 80 120 160 200 157.75 157.40 156.74 189.92 107.81 109.89 109.16 110.71 MIN: 108.19 / MAX: 274.81 MIN: 128.42 / MAX: 281.86 MIN: 131.77 / MAX: 297.92 MIN: 169.75 / MAX: 200.94 MIN: 102.18 / MAX: 124.72 MIN: 93.05 / MAX: 140.85 MIN: 99.46 / MAX: 143.57 MIN: 98.56 / MAX: 144.8 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 12 24 36 48 60 51.12 51.17 51.20 8.66 8.35 17.20 17.27 17.28 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 200 400 600 800 1000 623.90 623.64 623.60 921.14 478.92 463.57 462.78 462.57 MIN: 594.87 / MAX: 669.12 MIN: 593.36 / MAX: 672.04 MIN: 599.33 / MAX: 668.6 MIN: 883.65 / MAX: 932.87 MIN: 474.19 / MAX: 500.74 MIN: 456.35 / MAX: 480.41 MIN: 456.11 / MAX: 479.6 MIN: 455.99 / MAX: 481.91 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 700 1400 2100 2800 3500 3308.28 3302.83 3303.59 570.58 470.46 1096.25 1095.27 1095.68 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Vehicle Detection FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 4 8 12 16 20 9.66 9.68 9.67 14.00 8.49 7.29 7.29 7.29 MIN: 7.98 / MAX: 45.57 MIN: 7.94 / MAX: 44.33 MIN: 7.95 / MAX: 41.68 MIN: 7.54 / MAX: 24.09 MIN: 7.96 / MAX: 14.96 MIN: 7.14 / MAX: 16.48 MIN: 7.14 / MAX: 16.05 MIN: 7.15 / MAX: 16.28 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 2K 4K 6K 8K 10K 9543.23 9544.95 9527.76 1754.67 1232.95 3153.90 3148.82 3148.97 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Face Detection Retail FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 1.0238 2.0476 3.0714 4.0952 5.119 3.34 3.34 3.35 4.55 3.24 2.53 2.53 2.53 MIN: 2.82 / MAX: 23.64 MIN: 2.79 / MAX: 25.65 MIN: 2.81 / MAX: 23.74 MIN: 2.67 / MAX: 16.04 MIN: 2.94 / MAX: 8.15 MIN: 2.48 / MAX: 9.57 MIN: 2.48 / MAX: 10.15 MIN: 2.47 / MAX: 10.12 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 300 600 900 1200 1500 1342.68 1340.29 1340.12 237.79 195.50 415.32 415.12 415.15 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 8 16 24 32 40 23.81 23.85 23.86 33.62 20.44 19.25 19.25 19.25 MIN: 20.24 / MAX: 65.85 MIN: 20.69 / MAX: 64.82 MIN: 20.96 / MAX: 60.47 MIN: 25.37 / MAX: 42.95 MIN: 18.84 / MAX: 37.99 MIN: 17.55 / MAX: 32.52 MIN: 17.46 / MAX: 28.16 MIN: 17.49 / MAX: 27.05 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 50 100 150 200 250 226.15 226.27 222.73 47.46 43.18 85.51 85.46 84.86 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Machine Translation EN To DE FP16 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 40 80 120 160 200 141.33 141.24 143.55 168.30 92.53 93.45 93.50 94.17 MIN: 113.19 / MAX: 547.49 MIN: 114.93 / MAX: 488.84 MIN: 114.18 / MAX: 525.63 MIN: 141.17 / MAX: 184.72 MIN: 87.83 / MAX: 184.88 MIN: 77.59 / MAX: 139.64 MIN: 82.68 / MAX: 155.42 MIN: 85.17 / MAX: 160.36 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 1100 2200 3300 4400 5500 5333.24 5334.87 5335.15 858.21 856.63 1727.34 1708.45 1725.37 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Weld Porosity Detection FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 6 12 18 24 30 23.98 23.98 23.97 18.63 18.65 18.46 18.67 18.48 MIN: 21.19 / MAX: 37.06 MIN: 21.31 / MAX: 35.8 MIN: 21.4 / MAX: 33.21 MIN: 8.99 / MAX: 30.48 MIN: 17.81 / MAX: 33.21 MIN: 17.85 / MAX: 32.35 MIN: 17.84 / MAX: 33.76 MIN: 17.82 / MAX: 33.34 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 600 1200 1800 2400 3000 2687.19 2655.09 2694.27 520.89 369.70 873.34 865.28 875.64 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Person Vehicle Bike Detection FP16 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 4 8 12 16 20 11.89 12.03 11.86 15.34 10.80 9.13 9.21 9.11 MIN: 10.03 / MAX: 55.71 MIN: 10.4 / MAX: 55.41 MIN: 9.49 / MAX: 57.72 MIN: 9.22 / MAX: 27.56 MIN: 9.8 / MAX: 19.62 MIN: 8.44 / MAX: 24.1 MIN: 8.24 / MAX: 23.34 MIN: 8.35 / MAX: 22.58 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 200 400 600 800 1000 899.15 897.77 898.69 155.50 171.61 303.80 303.83 303.59 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 30 60 90 120 150 142.22 142.42 142.29 102.82 93.16 105.22 105.23 105.33 MIN: 111.28 / MAX: 182.41 MIN: 106.02 / MAX: 184.44 MIN: 110.33 / MAX: 173.87 MIN: 69.31 / MAX: 117.98 MIN: 86.16 / MAX: 102.67 MIN: 91.67 / MAX: 111.86 MIN: 92.98 / MAX: 112.15 MIN: 94.87 / MAX: 111.06 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 20K 40K 60K 80K 100K 98494.41 101883.13 101985.21 23110.20 23725.57 46823.73 45126.45 47926.34 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.1 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b c AMD EPYC 7303 16-Core d e f g 0.1935 0.387 0.5805 0.774 0.9675 0.85 0.85 0.86 0.68 0.66 0.67 0.70 0.66 MIN: 0.72 / MAX: 27.72 MIN: 0.72 / MAX: 21.71 MIN: 0.71 / MAX: 40.39 MIN: 0.4 / MAX: 11.17 MIN: 0.61 / MAX: 6.63 MIN: 0.61 / MAX: 10.58 MIN: 0.61 / MAX: 10.2 MIN: 0.61 / MAX: 10.25 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl
Embree Binary: Pathtracer - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon d e f g 8 16 24 32 40 17.48 32.97 33.04 32.97 MIN: 17.31 / MAX: 17.65 MIN: 32.77 / MAX: 33.49 MIN: 32.83 / MAX: 33.53 MIN: 32.78 / MAX: 33.29
Embree Binary: Pathtracer - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Asian Dragon Obj d e f g 7 14 21 28 35 15.74 29.62 29.52 29.59 MIN: 15.64 / MAX: 15.92 MIN: 29.37 / MAX: 30.21 MIN: 29.27 / MAX: 30.16 MIN: 29.36 / MAX: 30.31
Embree Binary: Pathtracer - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer - Model: Crown d e f g 7 14 21 28 35 16.13 30.10 30.34 30.26 MIN: 15.95 / MAX: 16.45 MIN: 29.78 / MAX: 30.77 MIN: 30.01 / MAX: 30.87 MIN: 29.92 / MAX: 30.97
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon d e f g 7 14 21 28 35 15.87 29.98 30.13 30.03 MIN: 15.72 / MAX: 16.2 MIN: 29.79 / MAX: 30.37 MIN: 29.94 / MAX: 30.64 MIN: 29.84 / MAX: 30.34
Embree Binary: Pathtracer ISPC - Model: Asian Dragon Obj OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Obj d e f g 6 12 18 24 30 13.84 26.07 26.27 26.16 MIN: 13.7 / MAX: 14.06 MIN: 25.87 / MAX: 26.79 MIN: 26.05 / MAX: 26.62 MIN: 25.98 / MAX: 26.75
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown d e f g 6 12 18 24 30 14.40 26.52 26.61 26.40 MIN: 14.25 / MAX: 14.69 MIN: 26.18 / MAX: 27.13 MIN: 26.29 / MAX: 27.11 MIN: 26.05 / MAX: 27.08
OpenVKL Benchmark: vklBenchmarkCPU Scalar OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU Scalar d e f g 60 120 180 240 300 144 273 275 274 MIN: 11 / MAX: 2667 MIN: 21 / MAX: 5037 MIN: 21 / MAX: 5065 MIN: 21 / MAX: 5053
OpenVKL Benchmark: vklBenchmarkCPU ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC d e f g 110 220 330 440 550 267 495 497 495 MIN: 23 / MAX: 3404 MIN: 44 / MAX: 6142 MIN: 43 / MAX: 6162 MIN: 43 / MAX: 6166
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only d e f g 0.2138 0.4276 0.6414 0.8552 1.069 0.56 0.95 0.95 0.95
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only d e f g 0.2138 0.4276 0.6414 0.8552 1.069 0.56 0.95 0.95 0.95
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only d e f g 0.108 0.216 0.324 0.432 0.54 0.28 0.48 0.48 0.48
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU d e f g 0.6024 1.2048 1.8072 2.4096 3.012 2.67723 1.74244 1.69817 1.84652 MIN: 2.34 MIN: 1.45 MIN: 1.41 MIN: 1.33 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU d e f g 0.9615 1.923 2.8845 3.846 4.8075 4.27338 4.10412 3.83995 4.05768 MIN: 3.64 MIN: 3.26 MIN: 3.1 MIN: 2.86 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU d e f g 1.2758 2.5516 3.8274 5.1032 6.379 5.54136 5.67018 5.66435 5.63319 MIN: 4.33 MIN: 4.65 MIN: 4.73 MIN: 4.7 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU d e f g 0.4392 0.8784 1.3176 1.7568 2.196 1.94173 1.93352 1.95220 1.95024 MIN: 1.77 MIN: 1.72 MIN: 1.79 MIN: 1.78 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU d e f g 1.0422 2.0844 3.1266 4.1688 5.211 4.63185 3.14371 3.06111 3.14105 MIN: 4.26 MIN: 2.85 MIN: 2.86 MIN: 2.86 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU d e f g 0.4338 0.8676 1.3014 1.7352 2.169 1.92811 1.16856 1.16545 1.17023 MIN: 1.74 MIN: 1.1 MIN: 1.11 MIN: 1.1 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU d e f g 0.5278 1.0556 1.5834 2.1112 2.639 2.34565 1.89405 1.77064 2.12070 MIN: 2.11 MIN: 1.51 MIN: 1.42 MIN: 1.65 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU d e f g 0.5141 1.0282 1.5423 2.0564 2.5705 1.65090 1.44749 1.92348 2.28469 MIN: 1.3 MIN: 1.01 MIN: 1.39 MIN: 1.48 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU d e f g 1.0155 2.031 3.0465 4.062 5.0775 4.51333 3.84916 4.18974 3.94733 MIN: 2.92 MIN: 3.54 MIN: 3.58 MIN: 3.55 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU d e f g 0.2574 0.5148 0.7722 1.0296 1.287 0.768449 1.053240 1.143790 1.086610 MIN: 0.6 MIN: 0.81 MIN: 0.83 MIN: 0.86 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU d e f g 700 1400 2100 2800 3500 3019.31 3494.51 3341.93 3451.09 MIN: 2971.6 MIN: 3430.83 MIN: 3231.53 MIN: 3397.7 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU d e f g 800 1600 2400 3200 4000 3037.72 3379.25 3281.24 3556.44 MIN: 2984.39 MIN: 3265.73 MIN: 3185.12 MIN: 3506.87 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU d e f g 700 1400 2100 2800 3500 3028.36 3216.13 3462.90 3272.36 MIN: 2984.7 MIN: 3111.44 MIN: 3392.2 MIN: 3161.82 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU d e f g 300 600 900 1200 1500 1451.20 1114.67 1139.54 1129.54 MIN: 1412.26 MIN: 1074.44 MIN: 1087.78 MIN: 1079.67 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU d e f g 300 600 900 1200 1500 1445.86 1161.80 1153.45 1145.62 MIN: 1399.09 MIN: 1098.59 MIN: 1089.33 MIN: 1102.84 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU d e f g 300 600 900 1200 1500 1466.25 1137.11 1128.30 1149.15 MIN: 1422.8 MIN: 1093.13 MIN: 1073.3 MIN: 1097.47 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 d e f g 0.7405 1.481 2.2215 2.962 3.7025 3.291 2.831 2.672 2.766 1. (CXX) g++ options: -O3 -fopenmp
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 d e f g 15 30 45 60 75 65.73 55.34 60.05 56.34 1. (CXX) g++ options: -O3 -fopenmp
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 d e f g 30 60 90 120 150 149.20 140.30 130.45 133.18 1. (CXX) g++ options: -O3 -fopenmp
Phoronix Test Suite v10.8.5