clang-load-hoisting-benchmark

Intel Core i7-10750H testing with a CML Azalea_FMS (V1.03 BIOS) and NVIDIA GeForce GTX 1650 Ti 4GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2112028-TJ-CLANGLOAD40&sor.

clang-load-hoisting-benchmarkProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolutionclang-without-load-hoistingclang-with-load-hoistingIntel Core i7-10750H @ 5.00GHz (6 Cores / 12 Threads)CML Azalea_FMS (V1.03 BIOS)Intel Comet Lake PCH16GB1024GB Micron_2210_MTFDHBA1T0QFDNVIDIA GeForce GTX 1650 Ti 4GBRealtek ALC255Realtek RTL8111/8168/8411 + Intel Wi-Fi 6 AX201Ubuntu 20.045.11.0-41-generic (x86_64)GNOME Shell 3.36.9X Server 1.20.11NVIDIA 460.91.034.6.01.2.145Clang 14.0.0 + GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Compiler Details- Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: skylake Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xea - Thermald 1.9.1Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

clang-load-hoisting-benchmarkmrbayes: Primate Phylogeny Analysiscompress-zstd: 3 - Compression Speedcompress-zstd: 3 - Decompression Speedscimark2: Compositetscp: AI Chess Performancejohn-the-ripper: MD5graphics-magick: HWB Color Spacedav1d: Summer Nature 1080psvt-hevc: 7 - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pvpxenc: Speed 5 - Bosphorus 1080px264: H.264 Video Encodingx265: Bosphorus 1080pcoremark: CoreMark Size 666 - Iterations Per Secondhimeno: Poisson Pressure Solverasmfish: 1024 Hash Memory, 26 Depthebizzy: build-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixeltungsten: Hairaobench: 2048 x 2048 - Total Timecompress-xz: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9minion: Gracefultjbench: Decompression Throughputpgbench: 1 - 1 - Read Writepgbench: 1 - 1 - Read Write - Average Latencycpp-perf-bench: Stepanov Abstractionclang-without-load-hoistingclang-with-load-hoisting134.7171179.12768.52681.29133424989182188391.9270.1591.5325.8059.0641.42194017.7299473502.6325681727587829981064.900111.49643.489932.59854.60954.147663180.00446512990.77532.727125.1931193.42770.82866.49133489791163191398.9270.4391.2626.0060.4741.77187025.8547253504.4762841785023031464661.759104.55641.480131.38851.37948.908291195.22535714550.68729.949OpenBenchmarking.org

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysisclang-with-load-hoistingclang-without-load-hoisting306090120150SE +/- 0.41, N = 3SE +/- 1.63, N = 3125.19134.72

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speedclang-with-load-hoistingclang-without-load-hoisting30060090012001500SE +/- 10.58, N = 3SE +/- 12.07, N = 31193.41179.1

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speedclang-with-load-hoistingclang-without-load-hoisting6001200180024003000SE +/- 2.67, N = 3SE +/- 2.43, N = 32770.82768.5

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Compositeclang-with-load-hoistingclang-without-load-hoisting6001200180024003000SE +/- 136.09, N = 3SE +/- 14.91, N = 32866.492681.291. (CC) clang options: -O3 -march=native -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performanceclang-with-load-hoistingclang-without-load-hoisting300K600K900K1200K1500KSE +/- 792.90, N = 5SE +/- 647.40, N = 5133489713342491. (CC) clang options: -O3 -march=native

John The Ripper

Test: MD5

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: MD5clang-with-load-hoistingclang-without-load-hoisting20K40K60K80K100KSE +/- 652.06, N = 3SE +/- 712.43, N = 39116389182

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Spaceclang-with-load-hoistingclang-without-load-hoisting4080120160200SE +/- 0.58, N = 3191188

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.2Video Input: Summer Nature 1080pclang-with-load-hoistingclang-without-load-hoisting90180270360450SE +/- 0.98, N = 3SE +/- 2.82, N = 3398.92391.92MIN: 364.8 / MAX: 431.63MIN: 350.25 / MAX: 427.871. (CC) clang options: -Qunused-arguments -lavdevice -lavfilter -lavformat -lavcodec -lswresample -lswscale -lavutil -lm -lxcb -lasound -lSDL2 -lsndio -lXv -lX11 -lXext -pthread -lbz2 -O3 -march=native -std=c11 -fomit-frame-pointer -fno-math-errno -fno-signed-zeros -mstack-alignment=16 -MMD -MF -MT

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting1632486480SE +/- 0.22, N = 3SE +/- 0.17, N = 370.4370.15

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pclang-without-load-hoistingclang-with-load-hoisting20406080100SE +/- 1.14, N = 3SE +/- 1.49, N = 391.5391.26

VP9 libvpx Encoding

Speed: Speed 5 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterVP9 libvpx Encoding 1.10.0Speed: Speed 5 - Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting612182430SE +/- 0.02, N = 3SE +/- 0.04, N = 326.0025.801. (CC) clang options: -m64 -lpthread

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video Encodingclang-with-load-hoistingclang-without-load-hoisting1428425670SE +/- 0.68, N = 3SE +/- 0.85, N = 360.4759.06

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pclang-with-load-hoistingclang-without-load-hoisting1020304050SE +/- 0.17, N = 3SE +/- 0.45, N = 341.7741.42

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Secondclang-without-load-hoistingclang-with-load-hoisting40K80K120K160K200KSE +/- 576.77, N = 3SE +/- 4680.12, N = 3194017.73187025.851. (CC) clang options: -O2 -O3 -march=native -lrt" -lrt

Himeno Benchmark

Poisson Pressure Solver

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure Solverclang-with-load-hoistingclang-without-load-hoisting8001600240032004000SE +/- 1.69, N = 3SE +/- 5.98, N = 33504.483502.631. (CC) clang options: -O3 -march=native -mavx2

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depthclang-with-load-hoistingclang-without-load-hoisting4M8M12M16M20MSE +/- 27684.37, N = 3SE +/- 292729.63, N = 31785023017275878

ebizzy

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3clang-with-load-hoistingclang-without-load-hoisting70K140K210K280K350KSE +/- 8892.96, N = 3SE +/- 8335.76, N = 33146462998101. (CC) clang options: -pthread -lpthread -O3 -march=native

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.4.2Time To Compileclang-with-load-hoistingclang-without-load-hoisting1428425670SE +/- 0.08, N = 3SE +/- 0.84, N = 361.7664.90

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixelclang-with-load-hoistingclang-without-load-hoisting20406080100SE +/- 0.35, N = 3SE +/- 0.57, N = 3104.56111.501. (CC) clang options: -lm -lpthread -O3 -march=native

Tungsten Renderer

Scene: Hair

OpenBenchmarking.orgSeconds, Fewer Is BetterTungsten Renderer 0.2.2Scene: Hairclang-with-load-hoistingclang-without-load-hoisting1020304050SE +/- 0.02, N = 3SE +/- 0.31, N = 341.4843.49

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Timeclang-with-load-hoistingclang-without-load-hoisting816243240SE +/- 1.87, N = 3SE +/- 1.59, N = 331.3932.601. (CC) clang options: -lm -O3 -march=native

XZ Compression

Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9

OpenBenchmarking.orgSeconds, Fewer Is BetterXZ Compression 5.2.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 9clang-with-load-hoistingclang-without-load-hoisting1224364860SE +/- 0.04, N = 3SE +/- 0.91, N = 351.3854.61

Minion

Benchmark: Graceful

OpenBenchmarking.orgSeconds, Fewer Is BetterMinion 1.8Benchmark: Gracefulclang-with-load-hoistingclang-without-load-hoisting1224364860SE +/- 1.42, N = 3SE +/- 0.53, N = 348.9154.15

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughputclang-with-load-hoistingclang-without-load-hoisting4080120160200SE +/- 0.35, N = 3SE +/- 0.41, N = 3195.23180.00

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 14.0Scaling Factor: 1 - Clients: 1 - Mode: Read Writeclang-with-load-hoistingclang-without-load-hoisting30060090012001500SE +/- 14.48, N = 3SE +/- 74.13, N = 314551299

PostgreSQL pgbench

Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 14.0Scaling Factor: 1 - Clients: 1 - Mode: Read Write - Average Latencyclang-with-load-hoistingclang-without-load-hoisting0.17440.34880.52320.69760.872SE +/- 0.007, N = 3SE +/- 0.042, N = 30.6870.775

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov Abstractionclang-with-load-hoistingclang-without-load-hoisting816243240SE +/- 0.36, N = 3SE +/- 0.45, N = 329.9532.73


Phoronix Test Suite v10.8.4