new mtl framework

Intel Core Ultra 7 155H testing with a Framework Laptop 13 (Intel Core Ultra 1) FRANMECP05 (03.01 BIOS) and Intel Arc MTL 8GB on Ubuntu 24.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2408139-NE-NEWMTLFRA01&grs&rdt.

new mtl frameworkProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen ResolutionabcIntel Core Ultra 7 155H @ 4.50GHz (16 Cores / 22 Threads)Framework Laptop 13 (Intel Core Ultra 1) FRANMECP05 (03.01 BIOS)Intel Device 7e7f2 x 8GB DDR5-5600MT/s A-DATA AD5S56008G-SFW512GB Western Digital WD PC SN740 SDDPNQD-512GIntel Arc MTL 8GBRealtek ALC285MEDIATEK MT7922 802.11ax PCIUbuntu 24.046.10.0-061000rc4daily20240621-generic (x86_64)GNOME Shell 46.0X Server + Wayland4.6 Mesa 24.2~git2406250600.5cb15a~oibaf~n (git-5cb15a6 2024-06-25 noble-oibaf-ppa)GCC 13.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-uJ7kn6/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x1e - Thermald 2.5.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; RSB filling; PBRSB-eIBRS: Not affected; BHI: BHI_DIS_S + srbds: Not affected + tsx_async_abort: Not affected

new mtl frameworkxnnpack: FP16MobileNetV2xnnpack: FP32MobileNetV3Largexnnpack: QU8MobileNetV3Largemnn: mobilenetV3xnnpack: QU8MobileNetV3Smallxnnpack: QU8MobileNetV2xnnpack: FP16MobileNetV3Smallxnnpack: FP16MobileNetV3Largexnnpack: FP32MobileNetV3Smallxnnpack: FP32MobileNetV2stockfish: Chess Benchmarklczero: Eigenlczero: BLASx265: Bosphorus 1080pmnn: SqueezeNetV1.0mnn: mobilenet-v1-1.0simdjson: Kostyaetcpak: Multi-Threaded - ETC2y-cruncher: 500Mpovray: Trace Timemnn: squeezenetv1.1mnn: MobileNetV2_224mnn: nasnety-cruncher: 1Bbuild2: Time To Compilesimdjson: LargeRandmnn: resnet-v2-50mnn: inception-v3x265: Bosphorus 4Kgromacs: water_GMX50_baresimdjson: TopTweetsimdjson: DistinctUserIDmt-dgemm: Sustained Floating-Point Ratesimdjson: PartialTweetsabc4251662736053.11121323737238552472246438810688485252651.069.8146.2164.2220.52431.85157.7615.8616.69223.72674.994297.7571.4648.54669.54712.10.5796.816.8152.0274916.624377569737473.15722643554252348172362407710530329242749.519.6696.2534.21223.2831.87258.3985.8816.70123.63375.403298.5841.4748.74669.19612.080.5816.816.8352.0675466.625409535644393.77218964238227747842160439711305323252650.619.96.1524.26222.06831.50257.9515.9216.75723.84175.633299.9531.4648.79669.41212.130.5816.796.8352.0940876.62OpenBenchmarking.org

XNNPACK

Model: FP16MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV2abc120024003600480060004251437754091. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Largeabc140028004200560070006627569753561. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Largeabc100020003000400050003605374744391. (CXX) g++ options: -O3 -lrt -lm

Mobile Neural Network

Model: mobilenetV3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: mobilenetV3abc0.84871.69742.54613.39484.24353.1113.1573.772MIN: 2.56 / MAX: 10.64MIN: 2.44 / MAX: 19.36MIN: 2.45 / MAX: 17.151. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

XNNPACK

Model: QU8MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV3Smallabc50010001500200025002132226418961. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: QU8MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: QU8MobileNetV2abc90018002700360045003737355442381. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Smallabc50010001500200025002385252322771. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP16MobileNetV3Large

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP16MobileNetV3Largeabc110022003300440055005247481747841. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV3Small

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV3Smallabc50010001500200025002246236221601. (CXX) g++ options: -O3 -lrt -lm

XNNPACK

Model: FP32MobileNetV2

OpenBenchmarking.orgus, Fewer Is BetterXNNPACK 2cd86bModel: FP32MobileNetV2abc90018002700360045004388407743971. (CXX) g++ options: -O3 -lrt -lm

Stockfish

Chess Benchmark

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfishChess Benchmarkabc2M4M6M8M10M1068848510530329113053231. Stockfish 16 by the Stockfish developers (see AUTHORS file)

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: Eigenabc6121824302524251. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.31.1Backend: BLASabc6121824302627261. (CXX) g++ options: -flto -pthread

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 1080pabc122436486051.0649.5150.611. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: SqueezeNetV1.0abc36912159.8149.6699.900MIN: 7.23 / MAX: 25.89MIN: 7.59 / MAX: 16.03MIN: 7.61 / MAX: 30.631. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: mobilenet-v1-1.0abc2468106.2166.2536.152MIN: 3.56 / MAX: 26.49MIN: 3.61 / MAX: 31.97MIN: 3.54 / MAX: 24.171. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: Kostyaabc0.95851.9172.87553.8344.79254.204.214.261. (CXX) g++ options: -O3 -lrt

Etcpak

Benchmark: Multi-Threaded - Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 2.0Benchmark: Multi-Threaded - Configuration: ETC2abc50100150200250220.52223.28222.071. (CXX) g++ options: -flto -pthread

Y-Cruncher

Pi Digits To Calculate: 500M

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 500Mabc71421283531.8531.8731.50

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-RayTrace Timeabc132639526557.7658.4057.951. POV-Ray 3.7.0.10.unofficial

Mobile Neural Network

Model: squeezenetv1.1

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: squeezenetv1.1abc1.33222.66443.99665.32886.6615.8615.8815.921MIN: 4.69 / MAX: 21.36MIN: 4.62 / MAX: 25.58MIN: 4.44 / MAX: 25.131. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: MobileNetV2_224abc2468106.6926.7016.757MIN: 5.01 / MAX: 16.16MIN: 5.19 / MAX: 26.48MIN: 5.05 / MAX: 29.031. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: nasnet

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: nasnetabc61218243023.7323.6323.84MIN: 20 / MAX: 52.87MIN: 19.46 / MAX: 56.99MIN: 19.99 / MAX: 46.41. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Y-Cruncher

Pi Digits To Calculate: 1B

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.5Pi Digits To Calculate: 1Babc2040608010074.9975.4075.63

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.17Time To Compileabc70140210280350297.76298.58299.95

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: LargeRandomabc0.33080.66160.99241.32321.6541.461.471.461. (CXX) g++ options: -O3 -lrt

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: resnet-v2-50abc112233445548.5548.7548.80MIN: 36.42 / MAX: 73.25MIN: 37.7 / MAX: 70.01MIN: 38.05 / MAX: 133.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.9.b11b7037dModel: inception-v3abc153045607569.5569.2069.41MIN: 58.98 / MAX: 120.56MIN: 57.21 / MAX: 117.21MIN: 55.7 / MAX: 110.331. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -pthread -ldl

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265Video Input: Bosphorus 4Kabc369121512.1012.0812.131. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

GROMACS

Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACSInput: water_GMX50_bareabc0.13070.26140.39210.52280.65350.5790.5810.5811. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: TopTweetabc2468106.816.816.791. (CXX) g++ options: -O3 -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: DistinctUserIDabc2468106.816.836.831. (CXX) g++ options: -O3 -lrt

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rateabc122436486052.0352.0752.091. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 3.10Throughput Test: PartialTweetsabc2468106.626.626.621. (CXX) g++ options: -O3 -lrt


Phoronix Test Suite v10.8.5