new mtl framework

Intel Core Ultra 7 155H testing with a Framework Laptop 13 (Intel Core Ultra 1) FRANMECP05 (03.01 BIOS) and Intel Arc MTL 8GB on Ubuntu 24.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2408139-NE-NEWMTLFRA01&grs&sor&rro.

new mtl frameworkProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionabcIntel Core Ultra 7 155H @ 4.50GHz (16 Cores / 22 Threads)Framework Laptop 13 (Intel Core Ultra 1) FRANMECP05 (03.01 BIOS)Intel Device 7e7f2 x 8GB DDR5-5600MT/s A-DATA AD5S56008G-SFW512GB Western Digital WD PC SN740 SDDPNQD-512GIntel Arc MTL 8GBRealtek ALC285MEDIATEK MT7922 802.11ax PCIUbuntu 24.046.10.0-061000rc4daily20240621-generic (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406250600.5cb15a~oibaf~n (git-5cb15a6 2024-06-25 noble-oibaf-ppa)GCC 13.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x1e - Thermald 2.5.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected

new mtl frameworkxnnpack: FP16MobileNetV2xnnpack: FP32MobileNetV3Largexnnpack: QU8MobileNetV3Largemnn: mobilenetV3xnnpack: QU8MobileNetV3Smallxnnpack: QU8MobileNetV2xnnpack: FP16MobileNetV3Smallxnnpack: FP16MobileNetV3Largexnnpack: FP32MobileNetV3Smallxnnpack: FP32MobileNetV2stockfish: Chess Benchmarklczero: Eigenlczero: BLASx265: Bosphorus 1080pmnn: SqueezeNetV1.0mnn: mobilenet-v1-1.0simdjson: Kostyaetcpak: Multi-Threaded - ETC2y-cruncher: 500Mpovray: Trace Timemnn: squeezenetv1.1mnn: MobileNetV2_224mnn: nasnety-cruncher: 1Bbuild2: Time To Compilesimdjson: LargeRandmnn: resnet-v2-50mnn: inception-v3x265: Bosphorus 4Kgromacs: water_GMX50_baresimdjson: TopTweetsimdjson: DistinctUserIDmt-dgemm: Sustained Floating-Point Ratesimdjson: PartialTweetsabc4251662736053.11121323737238552472246438810688485252651.069.8146.2164.2220.52431.85157.7615.8616.69223.72674.994297.7571.4648.54669.54712.10.5796.816.8152.0274916.624377569737473.15722643554252348172362407710530329242749.519.6696.2534.21223.2831.87258.3985.8816.70123.63375.403298.5841.4748.74669.19612.080.5816.816.8352.0675466.625409535644393.77218964238227747842160439711305323252650.619.96.1524.26222.06831.50257.9515.9216.75723.84175.633299.9531.4648.79669.41212.130.5816.796.8352.0940876.62OpenBenchmarking.org

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV2cba120024003600480060005409437742511. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Largeabc140028004200560070006627569753561. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Largecba100020003000400050004439374736051. (CXX) g++ options: -O3 -lrt -lm

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: mobilenetV3cba0.84871.69742.54613.39484.24353.7723.1573.111MIN: 2.45 / MAX: 17.15MIN: 2.44 / MAX: 19.36MIN: 2.56 / MAX: 10.641. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

XNNPACK

Model: QU8MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Smallbac50010001500200025002264213218961. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV2cab90018002700360045004238373735541. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Smallbac50010001500200025002523238522771. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Largeabc110022003300440055005247481747841. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Smallbac50010001500200025002362224621601. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV2cab90018002700360045004397438840771. (CXX) g++ options: -O3 -lrt -lm

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmarkbac2M4M6M8M10M1053032910688485113053231. Stockfish 16 by the Stockfish developers (see AUTHORS file)

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigenbac6121824302425251. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLASacb6121824302626271. (CXX) g++ options: -flto -pthread

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080pbca122436486049.5150.6151.061. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: SqueezeNetV1.0cab36912159.9009.8149.669MIN: 7.61 / MAX: 30.63MIN: 7.23 / MAX: 25.89MIN: 7.59 / MAX: 16.031. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: mobilenet-v1-1.0bac2468106.2536.2166.152MIN: 3.61 / MAX: 31.97MIN: 3.56 / MAX: 26.49MIN: 3.54 / MAX: 24.171. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostyaabc0.95851.9172.87553.8344.79254.204.214.261. (CXX) g++ options: -O3 -lrt

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC2acb50100150200250220.52222.07223.281. (CXX) g++ options: -flto -pthread

Y-Cruncher

Pi Digits To Calculate: 500M

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500Mbac71421283531.8731.8531.50

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Timebca132639526558.4057.9557.761. POV-Ray 3.7.0.10.unofficial

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: squeezenetv1.1cba1.33222.66443.99665.32886.6615.9215.8815.861MIN: 4.44 / MAX: 25.13MIN: 4.62 / MAX: 25.58MIN: 4.69 / MAX: 21.361. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: MobileNetV2_224cba2468106.7576.7016.692MIN: 5.05 / MAX: 29.03MIN: 5.19 / MAX: 26.48MIN: 5.01 / MAX: 16.161. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: nasnet

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: nasnetcab61218243023.8423.7323.63MIN: 19.99 / MAX: 46.4MIN: 20 / MAX: 52.87MIN: 19.46 / MAX: 56.991. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Y-Cruncher

Pi Digits To Calculate: 1B

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1Bcba2040608010075.6375.4074.99

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compilecba70140210280350299.95298.58297.76

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandomacb0.33080.66160.99241.32321.6541.461.461.471. (CXX) g++ options: -O3 -lrt

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: resnet-v2-50cba112233445548.8048.7548.55MIN: 38.05 / MAX: 133.88MIN: 37.7 / MAX: 70.01MIN: 36.42 / MAX: 73.251. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: inception-v3acb153045607569.5569.4169.20MIN: 58.98 / MAX: 120.56MIN: 55.7 / MAX: 110.33MIN: 57.21 / MAX: 117.211. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4Kbac369121512.0812.1012.131. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bareabc0.13070.26140.39210.52280.65350.5790.5810.5811. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweetcab2468106.796.816.811. (CXX) g++ options: -O3 -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: DistinctUserIDabc2468106.816.836.831. (CXX) g++ options: -O3 -lrt

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rateabc122436486052.0352.0752.091. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: PartialTweetsabc2468106.626.626.621. (CXX) g++ options: -O3 -lrt


Phoronix Test Suite v10.8.5