clang-load-hoisting-benchmark

Intel Core i7-10750H testing with a CML Azalea_FMS (V1.03 BIOS) and NVIDIA GeForce GTX 1650 Ti 4GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2112028-TJ-CLANGLOAD40&gru&sro.

clang-load-hoisting-benchmarkProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolutionclang-without-load-hoistingclang-with-load-hoistingIntel Core i7-10750H @ 5.00GHz (6 Cores / 12 Threads)CML Azalea_FMS (V1.03 BIOS)Intel Comet Lake PCH16GB1024GB Micron_2210_MTFDHBA1T0QFDNVIDIA GeForce GTX 1650 Ti 4GBRealtek ALC255Realtek RTL8111/8168/8411 + Intel Wi-Fi 6 AX201Ubuntu 20.045.11.0-41-generic (x86_64)GNOME Shell 3.36.9X Server 1.20.11NVIDIA 460.91.034.6.01.2.145Clang 14.0.0 + GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: skylake Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xea - Thermald 1.9.1Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

clang-load-hoisting-benchmarkdav1d: Summer Nature 1080psvt-hevc: 7 - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pvpxenc: Speed 5 - Bosphorus 1080px264: H.264 Video Encodingx265: Bosphorus 1080pgraphics-magick: HWB Color Spacecoremark: CoreMark Size 666 - Iterations Per Secondcompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedtjbench: Decompression Throughputscimark2: Compositehimeno: Poisson Pressure Solvertscp: AI Chess Performanceasmfish: 1024 Hash Memory, 26 Depthjohn-the-ripper: MD5ebizzy: pgbench: 1 - 1 - Read Writepgbench: 1 - 1 - Read Write - Average Latencymrbayes: Primate Phylogeny Analysisbuild-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixeltungsten: Hairaobench: 2048 x 2048 - Total Timecompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9minion: Gracefulcpp-perf-bench: Stepanov Abstractionclang-without-load-hoistingclang-with-load-hoisting391.9270.1591.5325.8059.0641.42188194017.7299471179.12768.5180.0044652681.293502.6325681334249172758788918229981012990.775134.71764.900111.49643.489932.59854.60954.14766332.727398.9270.4391.2626.0060.4741.77191187025.8547251193.42770.8195.2253572866.493504.4762841334897178502309116331464614550.687125.19361.759104.55641.480131.38851.37948.90829129.949OpenBenchmarking.org

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.2Video Input: Summer Nature 1080pclang-with-load-hoistingclang-without-load-hoisting90180270360450SE +/- 0.98, N = 3SE +/- 2.82, N = 3398.92391.92MIN: 364.8 / MAX: 431.63MIN: 350.25 / MAX: 427.871. (CC) clang options: -Qunused-arguments -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -lasound -lSDL2 -lsndio -lXv -lX11 -lXext -pthread -lbz2 -O3 -march=native -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -mstack-alignment=16 -MMD -MF -MT

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting1632486480SE +/- 0.22, N = 3SE +/- 0.17, N = 370.4370.15

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting20406080100SE +/- 1.49, N = 3SE +/- 1.14, N = 391.2691.53

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting612182430SE +/- 0.02, N = 3SE +/- 0.04, N = 326.0025.801. (CC) clang options: -m64 -lpthread

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video Encodingclang-with-load-hoistingclang-without-load-hoisting1428425670SE +/- 0.68, N = 3SE +/- 0.85, N = 360.4759.06

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting1020304050SE +/- 0.17, N = 3SE +/- 0.45, N = 341.7741.42

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spaceclang-with-load-hoistingclang-without-load-hoisting4080120160200SE +/- 0.58, N = 3191188

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondclang-with-load-hoistingclang-without-load-hoisting40K80K120K160K200KSE +/- 4680.12, N = 3SE +/- 576.77, N = 3187025.85194017.731. (CC) clang options: -O2 -O3 -march=native -lrt" -lrt

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speedclang-with-load-hoistingclang-without-load-hoisting30060090012001500SE +/- 10.58, N = 3SE +/- 12.07, N = 31193.41179.1

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speedclang-with-load-hoistingclang-without-load-hoisting6001200180024003000SE +/- 2.67, N = 3SE +/- 2.43, N = 32770.82768.5

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughputclang-with-load-hoistingclang-without-load-hoisting4080120160200SE +/- 0.35, N = 3SE +/- 0.41, N = 3195.23180.00

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositeclang-with-load-hoistingclang-without-load-hoisting6001200180024003000SE +/- 136.09, N = 3SE +/- 14.91, N = 32866.492681.291. (CC) clang options: -O3 -march=native -lm

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solverclang-with-load-hoistingclang-without-load-hoisting8001600240032004000SE +/- 1.69, N = 3SE +/- 5.98, N = 33504.483502.631. (CC) clang options: -O3 -march=native -mavx2

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performanceclang-with-load-hoistingclang-without-load-hoisting300K600K900K1200K1500KSE +/- 792.90, N = 5SE +/- 647.40, N = 5133489713342491. (CC) clang options: -O3 -march=native

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthclang-with-load-hoistingclang-without-load-hoisting4M8M12M16M20MSE +/- 27684.37, N = 3SE +/- 292729.63, N = 31785023017275878

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5clang-with-load-hoistingclang-without-load-hoisting20K40K60K80K100KSE +/- 652.06, N = 3SE +/- 712.43, N = 39116389182

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3clang-with-load-hoistingclang-without-load-hoisting70K140K210K280K350KSE +/- 8892.96, N = 3SE +/- 8335.76, N = 33146462998101. (CC) clang options: -pthread -lpthread -O3 -march=native

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 1 - Clients: 1 - Mode: Read Writeclang-with-load-hoistingclang-without-load-hoisting30060090012001500SE +/- 14.48, N = 3SE +/- 74.13, N = 314551299

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latencyclang-with-load-hoistingclang-without-load-hoisting0.17440.34880.52320.69760.872SE +/- 0.007, N = 3SE +/- 0.042, N = 30.6870.775

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysisclang-with-load-hoistingclang-without-load-hoisting306090120150SE +/- 0.41, N = 3SE +/- 1.63, N = 3125.19134.72

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compileclang-with-load-hoistingclang-without-load-hoisting1428425670SE +/- 0.08, N = 3SE +/- 0.84, N = 361.7664.90

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelclang-with-load-hoistingclang-without-load-hoisting20406080100SE +/- 0.35, N = 3SE +/- 0.57, N = 3104.56111.501. (CC) clang options: -lm -lpthread -O3 -march=native

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Hairclang-with-load-hoistingclang-without-load-hoisting1020304050SE +/- 0.02, N = 3SE +/- 0.31, N = 341.4843.49

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timeclang-with-load-hoistingclang-without-load-hoisting816243240SE +/- 1.87, N = 3SE +/- 1.59, N = 331.3932.601. (CC) clang options: -lm -O3 -march=native

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9clang-with-load-hoistingclang-without-load-hoisting1224364860SE +/- 0.04, N = 3SE +/- 0.91, N = 351.3854.61

Minion

Benchmark: Graceful

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 1.8Benchmark: Gracefulclang-with-load-hoistingclang-without-load-hoisting1224364860SE +/- 1.42, N = 3SE +/- 0.53, N = 348.9154.15

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov Abstractionclang-with-load-hoistingclang-without-load-hoisting816243240SE +/- 0.36, N = 3SE +/- 0.45, N = 329.9532.73


Phoronix Test Suite v10.8.5