8490h april

2 x Intel Xeon Platinum 8490H testing with a Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) and ASPEED on Ubuntu 22.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2304136-NE-8490HAPRI45&sor&grr.

8490h aprilProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolutionabcde2 x Intel Xeon Platinum 8490H @ 3.50GHz (120 Cores / 240 Threads)Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS)Intel Device 1bce16 x 64 GB 4800MT/s Samsung M321R8GA0BB0-CQKEG2 x 1920GB SAMSUNG MZWLJ1T9HBJR-00007 + 960GB INTEL SSDSC2KG96ASPEEDVGA HDMI4 x Intel E810-C for QSFP + 2 x Intel X710 for 10GBASE-TUbuntu 22.046.2.0-060200rc7daily20230208-generic (x86_64)GNOME Shell 42.2X Server 1.21.1.31.2.204GCC 11.3.0 + Clang 14.0.0-1ubuntu1ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-xKiWfi/gcc-11-11.3.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0000c0Python Details- Python 3.10.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

8490h aprilsrsran: PUSCH Processor Benchmark, Throughput Totaltensorflow: CPU - 512 - ResNet-50srsran: PUSCH Processor Benchmark, Throughput Threadtensorflow: CPU - 256 - ResNet-50onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUvvenc: Bosphorus 4K - Fasterblender: Barbershop - CPU-Onlytensorflow: CPU - 512 - GoogLeNetonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUvvenc: Bosphorus 4K - Fastnginx: 500apache: 500srsran: Downlink Processor Benchmarktensorflow: CPU - 64 - ResNet-50tensorflow: CPU - 256 - GoogLeNetonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUblender: Pabellon Barcelona - CPU-Onlytensorflow: CPU - 512 - AlexNettensorflow: CPU - 32 - ResNet-50onednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 1D - f32 - CPUvvenc: Bosphorus 1080p - Fastblender: Classroom - CPU-Onlyonednn: IP Shapes 1D - u8s8f32 - CPUtensorflow: CPU - 16 - ResNet-50tensorflow: CPU - 256 - AlexNetonednn: IP Shapes 3D - bf16bf16bf16 - CPUvvenc: Bosphorus 1080p - Fastertensorflow: CPU - 64 - GoogLeNetonednn: Deconvolution Batch shapes_1d - f32 - CPUblender: Fishy Cat - CPU-Onlyonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUblender: BMW27 - CPU-Onlytensorflow: CPU - 32 - GoogLeNettensorflow: CPU - 16 - GoogLeNettensorflow: CPU - 32 - AlexNettensorflow: CPU - 64 - AlexNetonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUtensorflow: CPU - 16 - AlexNetonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUnginx: 200abcde7122.4135.8829.9130.441205.29873.147881.2321304.571216.9910.065147.25472.26861.1486.308250533.3780395.59326.5103.48444.170.43465848.811227.6983.130.4703365.884723.5675917.14736.55.1937964.281091.423.1677928.6634814.266719.360.2399614.03257.33173.64531.68743.732.497570.872476372.880.4087110.2287410.4642690.2289710.7244136898.6134.3429.8128.891184.14832.338840.9561232.871155.7710.055147.73465.31878.4896.332246156.1183834.81324.2102.21442.930.41002947.841231.8584.420.4513935.387343.0500017.39636.664.6247864.311077.573.0463830.988342.2614.628419.700.31442714.20267.02185.78556.34741.872.676990.978428386.550.4029830.2231420.4660450.2251970.7187466547.4135.2229.7127.521228.77844.358852.5771081.71209.399.967146.59467.33904.2686.314246619.5477777.03326.7104.52437.970.39743547.651214.3684.980.4578935.420713.6358517.24436.795.3076963.781063.472.9138130.211334.1114.544419.940.43352314.21265.08176.84557.68751.672.528481.15332370.670.4056770.217420.4579960.2193410.7164197079.5134.7628.8128.81184.12832.574731.0951205.381182.329.956147.18462.37888.7326.443247581.6484694.76320.8103.14441.290.39195747.731225.5484.170.440414.977183.5048517.21136.314.8754863.971071.622.8375427.619346.1114.489120.130.29615214.04249.74184.8536.63739.022.374790.981361391.880.4084160.219490.4625890.2257420.7113066774.5133.928.9128.231112.04845.726848.6521200.191120.6410.067148.11469.14818.4386.388248416.8585357.84324.1102.87441.440.41373547.431230.383.450.4462325.5583.4467716.78936.365.3475564.961062.063.0218830.369346.214.221219.540.30550314.3270.31185.22564.79745.332.808690.989308386.340.4003250.222020.4538850.2193480.712248OpenBenchmarking.org

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Total

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.3Test: PUSCH Processor Benchmark, Throughput Totaladbec15003000450060007500SE +/- 87.81, N = 97122.47079.56898.66774.56547.4MIN: 4599.2 / MAX: 12734.9MIN: 4942.3 / MAX: 12824.3MIN: 2932.3 / MAX: 13017.6MIN: 3650.8 / MAX: 12618.4MIN: 3614.7 / MAX: 127221. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -march=native -mfma -lgtest

TensorFlow

Device: CPU - Batch Size: 512 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: ResNet-50acdbe306090120150SE +/- 1.30, N = 3135.88135.22134.76134.34133.90

srsRAN Project

Test: PUSCH Processor Benchmark, Throughput Thread

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.3Test: PUSCH Processor Benchmark, Throughput Threadabced714212835SE +/- 0.22, N = 329.929.829.728.928.8MIN: 19.5 / MAX: 52.7MIN: 18.3 / MAX: 53.3MIN: 18.8 / MAX: 52.3MIN: 18.9 / MAX: 52.7MIN: 15.8 / MAX: 52.31. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -march=native -mfma -lgtest

TensorFlow

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: ResNet-50abdec306090120150SE +/- 0.05, N = 3130.44128.89128.80128.23127.52

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUedbac30060090012001500SE +/- 17.42, N = 151112.041184.121184.141205.291228.77MIN: 1093.03MIN: 1154.2MIN: 1007.85MIN: 1166.28MIN: 1195.531. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUbdcea2004006008001000SE +/- 14.33, N = 15832.34832.57844.36845.73873.15MIN: 744.45MIN: 807.52MIN: 819.84MIN: 832.31MIN: 841.291. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUdbeca2004006008001000SE +/- 10.27, N = 15731.10840.96848.65852.58881.23MIN: 715.12MIN: 756.78MIN: 823.74MIN: 818.16MIN: 840.731. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUcedba30060090012001500SE +/- 23.27, N = 141081.701200.191205.381232.871304.57MIN: 1010MIN: 1170.19MIN: 1177.67MIN: 1015.69MIN: 1219.161. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUebdca30060090012001500SE +/- 38.79, N = 121120.641155.771182.321209.391216.99MIN: 1089.24MIN: 781.24MIN: 1123.65MIN: 1153.33MIN: 1149.611. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 4K - Video Preset: Fastereabcd3691215SE +/- 0.068, N = 1310.06710.06510.0559.9679.9561. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Barbershop - Compute: CPU-Onlycdabe306090120150SE +/- 0.81, N = 3146.59147.18147.25147.73148.11

TensorFlow

Device: CPU - Batch Size: 512 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: GoogLeNetaecbd100200300400500SE +/- 4.06, N = 3472.26469.14467.33465.31462.37

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUeabdc2004006008001000SE +/- 9.64, N = 5818.44861.15878.49888.73904.27MIN: 804.26MIN: 828.35MIN: 833.55MIN: 874.25MIN: 846.061. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 4K - Video Preset: Fastdebca246810SE +/- 0.037, N = 36.4436.3886.3326.3146.3081. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500aedcb50K100K150K200K250KSE +/- 1323.62, N = 3250533.37248416.85247581.64246619.54246156.111. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

Concurrent Requests: 500

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 500edbac20K40K60K80K100KSE +/- 98.05, N = 385357.8484694.7683834.8180395.5977777.031. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

srsRAN Project

Test: Downlink Processor Benchmark

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 23.3Test: Downlink Processor Benchmarkcabed70140210280350SE +/- 2.03, N = 3326.7326.5324.2324.1320.8MIN: 72.5 / MAX: 731.1MIN: 71.2 / MAX: 731.7MIN: 68.9 / MAX: 734.8MIN: 69.9 / MAX: 729.7MIN: 71.3 / MAX: 723.11. (CXX) g++ options: -O3 -fno-trapping-math -fno-math-errno -march=native -mfma -lgtest

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: ResNet-50cadeb20406080100SE +/- 0.44, N = 3104.52103.48103.14102.87102.21

TensorFlow

Device: CPU - Batch Size: 256 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: GoogLeNetabedc100200300400500SE +/- 3.52, N = 3444.17442.93441.44441.29437.97

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUdcbea0.09780.19560.29340.39120.489SE +/- 0.003330, N = 150.3919570.3974350.4100290.4137350.434658MIN: 0.32MIN: 0.32MIN: 0.31MIN: 0.33MIN: 0.331. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Pabellon Barcelona - Compute: CPU-Onlyecdba1122334455SE +/- 0.10, N = 347.4347.6547.7347.8448.81

TensorFlow

Device: CPU - Batch Size: 512 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 512 - Model: AlexNetbeadc30060090012001500SE +/- 2.69, N = 31231.851230.301227.691225.541214.36

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: ResNet-50cbdea20406080100SE +/- 0.33, N = 384.9884.4284.1783.4583.13

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUdebca0.10580.21160.31740.42320.529SE +/- 0.003398, N = 110.4404100.4462320.4513930.4578930.470336MIN: 0.35MIN: 0.35MIN: 0.34MIN: 0.35MIN: 0.361. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUdbcea1.32412.64823.97235.29646.6205SE +/- 0.08402, N = 154.977185.387345.420715.558005.88472MIN: 3.92MIN: 3.77MIN: 4.25MIN: 4.37MIN: 4.651. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUbedac0.81811.63622.45433.27244.0905SE +/- 0.17695, N = 153.050003.446773.504853.567593.63585MIN: 1.6MIN: 3.04MIN: 2.9MIN: 3.02MIN: 3.111. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 1080p - Video Preset: Fastbcdae48121620SE +/- 0.04, N = 317.4017.2417.2117.1516.791. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Classroom - Compute: CPU-Onlydeabc816243240SE +/- 0.30, N = 336.3136.3636.5036.6636.79

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUbdace1.20322.40643.60964.81286.016SE +/- 0.18778, N = 124.624784.875485.193795.307695.34755MIN: 2.46MIN: 3.78MIN: 3.98MIN: 3.99MIN: 4.191. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: ResNet-50ebadc1428425670SE +/- 0.45, N = 364.9664.3164.2863.9763.78

TensorFlow

Device: CPU - Batch Size: 256 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 256 - Model: AlexNetabdce2004006008001000SE +/- 5.13, N = 31091.421077.571071.621063.471062.06

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUdceba0.71281.42562.13842.85123.564SE +/- 0.03679, N = 152.837542.913813.021883.046383.16779MIN: 2.21MIN: 2.28MIN: 2.44MIN: 2.17MIN: 2.491. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.8Video Input: Bosphorus 1080p - Video Preset: Fasterbecad714212835SE +/- 0.17, N = 330.9930.3730.2128.6627.621. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

TensorFlow

Device: CPU - Batch Size: 64 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: GoogLeNetaedbc80160240320400SE +/- 2.89, N = 3348.00346.20346.11342.26334.11

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUeadcb48121620SE +/- 0.05, N = 314.2214.2714.4914.5414.63MIN: 12.7MIN: 12.67MIN: 12.72MIN: 12.86MIN: 12.831. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: Fishy Cat - Compute: CPU-Onlyaebcd510152025SE +/- 0.09, N = 319.3619.5419.7019.9420.13

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUadebc0.09750.1950.29250.390.4875SE +/- 0.019200, N = 150.2399600.2961520.3055030.3144270.433523MIN: 0.18MIN: 0.18MIN: 0.18MIN: 0.17MIN: 0.181. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.5Blend File: BMW27 - Compute: CPU-Onlyadbce48121620SE +/- 0.15, N = 414.0314.0414.2014.2114.30

TensorFlow

Device: CPU - Batch Size: 32 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: GoogLeNetebcad60120180240300SE +/- 0.97, N = 3270.31267.02265.08257.33249.74

TensorFlow

Device: CPU - Batch Size: 16 - Model: GoogLeNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: GoogLeNetbedca4080120160200SE +/- 1.60, N = 3185.78185.22184.80176.84173.64

TensorFlow

Device: CPU - Batch Size: 32 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 32 - Model: AlexNetecbda120240360480600SE +/- 5.22, N = 6564.79557.68556.34536.63531.68

TensorFlow

Device: CPU - Batch Size: 64 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 64 - Model: AlexNetceabd160320480640800SE +/- 6.00, N = 3751.67745.33743.73741.87739.02

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUdacbe0.6321.2641.8962.5283.16SE +/- 0.03552, N = 32.374792.497572.528482.676992.80869MIN: 1.92MIN: 2.05MIN: 2.05MIN: 2.13MIN: 2.241. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUabdec0.25950.5190.77851.0381.2975SE +/- 0.002492, N = 30.8724760.9784280.9813610.9893081.153320MIN: 0.67MIN: 0.77MIN: 0.78MIN: 0.78MIN: 0.921. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

TensorFlow

Device: CPU - Batch Size: 16 - Model: AlexNet

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.12Device: CPU - Batch Size: 16 - Model: AlexNetdbeac90180270360450SE +/- 3.12, N = 3391.88386.55386.34372.88370.67

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUebcda0.0920.1840.2760.3680.46SE +/- 0.000551, N = 30.4003250.4029830.4056770.4084160.408711MIN: 0.36MIN: 0.36MIN: 0.36MIN: 0.36MIN: 0.361. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUcdeba0.05150.1030.15450.2060.2575SE +/- 0.002565, N = 30.2174200.2194900.2220200.2231420.228741MIN: 0.19MIN: 0.2MIN: 0.2MIN: 0.19MIN: 0.191. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUecdab0.10490.20980.31470.41960.5245SE +/- 0.002656, N = 30.4538850.4579960.4625890.4642690.466045MIN: 0.4MIN: 0.39MIN: 0.37MIN: 0.38MIN: 0.381. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUcebda0.05150.1030.15450.2060.2575SE +/- 0.001233, N = 30.2193410.2193480.2251970.2257420.228971MIN: 0.2MIN: 0.21MIN: 0.2MIN: 0.21MIN: 0.21. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.1Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUdecba0.1630.3260.4890.6520.815SE +/- 0.002808, N = 30.7113060.7122480.7164190.7187460.724413MIN: 0.65MIN: 0.66MIN: 0.66MIN: 0.66MIN: 0.661. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread


Phoronix Test Suite v10.8.4