clang-load-hoisting-benchmark

Intel Core i7-10750H testing with a CML Azalea_FMS (V1.03 BIOS) and NVIDIA GeForce GTX 1650 Ti 4GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2112028-TJ-CLANGLOAD40&grw.

clang-load-hoisting-benchmarkProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolutionclang-without-load-hoistingclang-with-load-hoistingIntel Core i7-10750H @ 5.00GHz (6 Cores / 12 Threads)CML Azalea_FMS (V1.03 BIOS)Intel Comet Lake PCH16GB1024GB Micron_2210_MTFDHBA1T0QFDNVIDIA GeForce GTX 1650 Ti 4GBRealtek ALC255Realtek RTL8111/8168/8411 + Intel Wi-Fi 6 AX201Ubuntu 20.045.11.0-41-generic (x86_64)GNOME Shell 3.36.9X Server 1.20.11NVIDIA 460.91.034.6.01.2.145Clang 14.0.0 + GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: skylake Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xea - Thermald 1.9.1Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

clang-load-hoisting-benchmarktscp: AI Chess Performancescimark2: Compositecompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9cpp-perf-bench: Stepanov Abstractionminion: Gracefultjbench: Decompression Throughputmrbayes: Primate Phylogeny Analysishimeno: Poisson Pressure Solvercoremark: CoreMark Size 666 - Iterations Per Secondjohn-the-ripper: MD5build-php: Time To Compilecompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedasmfish: 1024 Hash Memory, 26 Depthaobench: 2048 x 2048 - Total Timetungsten: Hairvpxenc: Speed 5 - Bosphorus 1080pgraphics-magick: HWB Color Spacesvt-vp9: Visual Quality Optimized - Bosphorus 1080px264: H.264 Video Encodingdav1d: Summer Nature 1080px265: Bosphorus 1080pc-ray: Total Time - 4K, 16 Rays Per Pixelsvt-hevc: 7 - Bosphorus 1080ppgbench: 1 - 1 - Read Writepgbench: 1 - 1 - Read Write - Average Latencyebizzy: clang-without-load-hoistingclang-with-load-hoisting13342492681.2954.60932.72754.147663180.004465134.7173502.632568194017.7299478918264.9001179.12768.51727587832.59843.489925.8018891.5359.06391.9241.42111.49670.1512990.77529981013348972866.4951.37929.94948.908291195.225357125.1933504.476284187025.8547259116361.7591193.42770.81785023031.38841.480126.0019191.2660.47398.9241.77104.55670.4314550.687314646OpenBenchmarking.org

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performanceclang-without-load-hoistingclang-with-load-hoisting300K600K900K1200K1500KSE +/- 647.40, N = 5SE +/- 792.90, N = 5133424913348971. (CC) clang options: -O3 -march=native

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositeclang-without-load-hoistingclang-with-load-hoisting6001200180024003000SE +/- 14.91, N = 3SE +/- 136.09, N = 32681.292866.491. (CC) clang options: -O3 -march=native -lm

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9clang-without-load-hoistingclang-with-load-hoisting1224364860SE +/- 0.91, N = 3SE +/- 0.04, N = 354.6151.38

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov Abstractionclang-without-load-hoistingclang-with-load-hoisting816243240SE +/- 0.45, N = 3SE +/- 0.36, N = 332.7329.95

Minion

Benchmark: Graceful

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 1.8Benchmark: Gracefulclang-without-load-hoistingclang-with-load-hoisting1224364860SE +/- 0.53, N = 3SE +/- 1.42, N = 354.1548.91

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughputclang-without-load-hoistingclang-with-load-hoisting4080120160200SE +/- 0.41, N = 3SE +/- 0.35, N = 3180.00195.23

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysisclang-without-load-hoistingclang-with-load-hoisting306090120150SE +/- 1.63, N = 3SE +/- 0.41, N = 3134.72125.19

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solverclang-without-load-hoistingclang-with-load-hoisting8001600240032004000SE +/- 5.98, N = 3SE +/- 1.69, N = 33502.633504.481. (CC) clang options: -O3 -march=native -mavx2

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondclang-without-load-hoistingclang-with-load-hoisting40K80K120K160K200KSE +/- 576.77, N = 3SE +/- 4680.12, N = 3194017.73187025.851. (CC) clang options: -O2 -O3 -march=native -lrt" -lrt

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5clang-without-load-hoistingclang-with-load-hoisting20K40K60K80K100KSE +/- 712.43, N = 3SE +/- 652.06, N = 38918291163

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compileclang-without-load-hoistingclang-with-load-hoisting1428425670SE +/- 0.84, N = 3SE +/- 0.08, N = 364.9061.76

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speedclang-without-load-hoistingclang-with-load-hoisting30060090012001500SE +/- 12.07, N = 3SE +/- 10.58, N = 31179.11193.4

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speedclang-without-load-hoistingclang-with-load-hoisting6001200180024003000SE +/- 2.43, N = 3SE +/- 2.67, N = 32768.52770.8

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthclang-without-load-hoistingclang-with-load-hoisting4M8M12M16M20MSE +/- 292729.63, N = 3SE +/- 27684.37, N = 31727587817850230

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timeclang-without-load-hoistingclang-with-load-hoisting816243240SE +/- 1.59, N = 3SE +/- 1.87, N = 332.6031.391. (CC) clang options: -lm -O3 -march=native

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Hairclang-without-load-hoistingclang-with-load-hoisting1020304050SE +/- 0.31, N = 3SE +/- 0.02, N = 343.4941.48

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 1080pclang-without-load-hoistingclang-with-load-hoisting612182430SE +/- 0.04, N = 3SE +/- 0.02, N = 325.8026.001. (CC) clang options: -m64 -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spaceclang-without-load-hoistingclang-with-load-hoisting4080120160200SE +/- 0.58, N = 3188191

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pclang-without-load-hoistingclang-with-load-hoisting20406080100SE +/- 1.14, N = 3SE +/- 1.49, N = 391.5391.26

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video Encodingclang-without-load-hoistingclang-with-load-hoisting1428425670SE +/- 0.85, N = 3SE +/- 0.68, N = 359.0660.47

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.2Video Input: Summer Nature 1080pclang-without-load-hoistingclang-with-load-hoisting90180270360450SE +/- 2.82, N = 3SE +/- 0.98, N = 3391.92398.92MIN: 350.25 / MAX: 427.87MIN: 364.8 / MAX: 431.631. (CC) clang options: -Qunused-arguments -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -lasound -lSDL2 -lsndio -lXv -lX11 -lXext -pthread -lbz2 -O3 -march=native -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -mstack-alignment=16 -MMD -MF -MT

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pclang-without-load-hoistingclang-with-load-hoisting1020304050SE +/- 0.45, N = 3SE +/- 0.17, N = 341.4241.77

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelclang-without-load-hoistingclang-with-load-hoisting20406080100SE +/- 0.57, N = 3SE +/- 0.35, N = 3111.50104.561. (CC) clang options: -lm -lpthread -O3 -march=native

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pclang-without-load-hoistingclang-with-load-hoisting1632486480SE +/- 0.17, N = 3SE +/- 0.22, N = 370.1570.43

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 1 - Clients: 1 - Mode: Read Writeclang-without-load-hoistingclang-with-load-hoisting30060090012001500SE +/- 74.13, N = 3SE +/- 14.48, N = 312991455

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latencyclang-without-load-hoistingclang-with-load-hoisting0.17440.34880.52320.69760.872SE +/- 0.042, N = 3SE +/- 0.007, N = 30.7750.687

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3clang-without-load-hoistingclang-with-load-hoisting70K140K210K280K350KSE +/- 8335.76, N = 3SE +/- 8892.96, N = 32998103146461. (CC) clang options: -pthread -lpthread -O3 -march=native


Phoronix Test Suite v10.8.4