AMD AOCC 2.2 vs. GCC vs. Clang - EPYC 7742 2P

AMD AOCC 2.2 compiler benchmarked against GCC 10 and LLVM Clang 10. Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2008278-FI-AMDAOCC2233&sor.

Processor: 2 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)
Motherboard: AMD DAYTONA_X (RDY1006G BIOS)
Chipset: AMD Starship/Matisse
Memory: 504GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Monitor: VE228
Network: 2 x Mellanox MT27710
OS: Ubuntu 20.10
Kernel: 5.4.0-42-generic (x86_64)
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Display Driver: modesetting 1.20.8
Compiler: Clang 10.0.0 (AOCC 2.2 run), GCC 10.2.0 (GCC 10.2 run), Clang 10.0.1-1 (Clang 10.1 run)
File-System: ext4
Screen Resolution: 1920x1080

Environment Details
- CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Compiler Details
- AOCC 2.2: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver2
- GCC 10.2: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-Fb4d6e/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-Fb4d6e/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: acpi-cpufreq performance
- CPU Microcode: 0x8301034

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

    AOCC 2.2    GCC 10.2  Clang 10.1  Test
  317.379307  310.143205  316.539334  cryptopp: Unkeyed Algorithms
      50.140      52.074      52.708  rodinia: OpenMP Leukocyte
      10.017      10.005      10.713  rodinia: OpenMP Streamcluster
     106.584     109.589     111.352  mrbayes: Primate Phylogeny Analysis
       5.608       6.101       4.888  hmmer: Pfam Database Search
       128.3       125.7       120.7  compress-zstd: 19
      620.89      611.72      620.64  scimark2: Monte Carlo
     3450.97     2811.30     3462.03  scimark2: Sparse Matrix Multiply
     1163950     1014328     1178898  tscp: AI Chess Performance
      176970      149154      177644  john-the-ripper: Blowfish
         529         497         527  graphics-magick: Rotate
        1108        1439        1327  graphics-magick: Enhanced
         351         107         162  graphics-magick: Resizing
    0.812279     1.98380    0.925616  onednn: IP Batch 1D - f32 - CPU
     12.6765     18.8696     12.9123  onednn: IP Batch All - f32 - CPU
    0.506847    0.744088    0.537037  onednn: Convolution Batch Shapes Auto - f32 - CPU
    0.989625     2.82688     1.05769  onednn: Deconvolution Batch deconv_1d - f32 - CPU
     2.57847     2.98436     2.62134  onednn: Deconvolution Batch deconv_3d - f32 - CPU
     162.417     870.720     219.266  onednn: Recurrent Neural Network Training - f32 - CPU
     61.8650     351.228     87.5976  onednn: Recurrent Neural Network Inference - f32 - CPU
    0.228572    0.737722    0.272380  onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU
      392.64      376.61      366.54  svt-vp9: VMAF Optimized - Bosphorus 1080p
      412.51      382.50      388.72  svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p
      336.19      311.85      321.84  svt-vp9: Visual Quality Optimized - Bosphorus 1080p
      204.60      204.52      195.51  x264: H.264 Video Encoding
   254824742   242098165   257008114  stockfish: Total Time
      41.247      23.211      21.739  build-apache: Time To Compile
      39.871      16.444      21.817  build-ffmpeg: Time To Compile
      35.877      10.703      18.086  build-mplayer: Time To Compile
    4.310990    4.313341    4.387945  bullet: 3000 Fall
    4.557857    4.873698    4.558771  bullet: 1000 Convex
     1.00944    1.039175    1.021595  bullet: Prim Trimesh
    1.183395     1.23393    1.176618  bullet: Convex Trimesh
     18574.8     24437.6     18618.0  openssl: RSA 4096-bit Performance
     286.165     289.700     293.718  leveldb: Hot Read
        4.99        5.61        5.17  astcenc: Fast
        5.67        6.25        5.73  astcenc: Medium
        8.04        9.09        8.11  astcenc: Thorough
       19.95       22.00       19.71  astcenc: Exhaustive
      13.735      13.756      13.569  basis: UASTC Level 3
      40.209      43.269      40.336  cpp-perf-bench: Ctype
     334.752     339.961     331.619  cpp-perf-bench: Math Library
      90.495      99.191      86.121  cpp-perf-bench: Stepanov Vector
      34.007      36.971      33.550  cpp-perf-bench: Stepanov Abstraction
    27488.36    26062.36    26903.49  apache: Static Web Page Serving
     2693129     2951330     2779511  brl-cad: VGR Performance Metric
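One common way to condense rows like those in the overview into a single figure is a geometric mean of per-test ratios against a baseline, inverting the ratio for tests where fewer is better. The Python sketch below is illustrative only: the function name relative_scores and the choice of four rows are assumptions made for this example and are not part of the Phoronix Test Suite export. The subset was picked to exercise both directions and is too small to say anything about the full suite.

```python
from math import prod

# (test, higher_is_better, {compiler: value}); values copied from the overview rows.
# Illustrative subset only, not a summary of the whole result file.
RESULTS = [
    ("cryptopp: Unkeyed Algorithms", True,
     {"AOCC 2.2": 317.379307, "GCC 10.2": 310.143205, "Clang 10.1": 316.539334}),
    ("hmmer: Pfam Database Search", False,
     {"AOCC 2.2": 5.608, "GCC 10.2": 6.101, "Clang 10.1": 4.888}),
    ("openssl: RSA 4096-bit Performance", True,
     {"AOCC 2.2": 18574.8, "GCC 10.2": 24437.6, "Clang 10.1": 18618.0}),
    ("build-ffmpeg: Time To Compile", False,
     {"AOCC 2.2": 39.871, "GCC 10.2": 16.444, "Clang 10.1": 21.817}),
]


def relative_scores(results, baseline):
    """Geometric mean of per-test ratios against `baseline` (above 1 means better)."""
    ratios = {}
    for _name, higher_is_better, values in results:
        base = values[baseline]
        for compiler, value in values.items():
            # Invert the ratio for time-like metrics so that larger is always better.
            ratio = value / base if higher_is_better else base / value
            ratios.setdefault(compiler, []).append(ratio)
    return {c: prod(r) ** (1 / len(r)) for c, r in ratios.items()}


scores = relative_scores(RESULTS, "GCC 10.2")
for compiler, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{compiler}: {score:.3f}x relative to GCC 10.2")
```

The geometric mean is used rather than the arithmetic mean so that one test with very large raw numbers (Stockfish nodes per second, for instance) cannot dominate the summary.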

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.2 (MiB/second, more is better)
  AOCC 2.2:   317.38 (SE +/- 0.13, N = 3)
  Clang 10.1: 316.54 (SE +/- 0.74, N = 3)
  GCC 10.2:   310.14 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -march=native -fPIC -pthread -pipe
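The SE figures help judge whether a gap between two compilers is larger than run-to-run noise. A minimal Python sketch, assuming that SE is the standard error of the mean and that the runs of different compilers are independent (neither is stated in the export), divides the gap by the two standard errors combined in quadrature. The helper name gap_in_standard_errors is hypothetical.

```python
from math import sqrt


def gap_in_standard_errors(a, se_a, b, se_b):
    """Return (a - b) divided by the combined standard error of the two means."""
    return (a - b) / sqrt(se_a ** 2 + se_b ** 2)


# Crypto++ Unkeyed Algorithms, MiB/second: (mean, SE) as reported for this test.
aocc = (317.38, 0.13)
clang = (316.54, 0.74)
gcc = (310.14, 0.12)

aocc_vs_clang = gap_in_standard_errors(*aocc, *clang)
aocc_vs_gcc = gap_in_standard_errors(*aocc, *gcc)

print(f"AOCC vs Clang: {aocc_vs_clang:.1f} combined standard errors")
print(f"AOCC vs GCC:   {aocc_vs_gcc:.1f} combined standard errors")
```

Under those assumptions the AOCC and Clang results sit roughly one combined standard error apart, so their ordering on this test is not clearly meaningful, while the gap to GCC is many times larger than the noise.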

Rodinia

Test: OpenMP Leukocyte

Rodinia 3.1 (Seconds, fewer is better)
  AOCC 2.2:   50.14 (SE +/- 0.25, N = 3; -O3 -fopenmp)
  GCC 10.2:   52.07 (SE +/- 0.62, N = 6; -O2 -lOpenCL)
  Clang 10.1: 52.71 (SE +/- 0.59, N = 3; -O3 -fopenmp)

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1 (Seconds, fewer is better)
  GCC 10.2:   10.01 (SE +/- 0.18, N = 12; -O2 -lOpenCL)
  AOCC 2.2:   10.02 (SE +/- 0.11, N = 15; -O3 -fopenmp)
  Clang 10.1: 10.71 (SE +/- 0.13, N = 5; -O3 -fopenmp)

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 (Seconds, fewer is better)
  AOCC 2.2:   106.58 (SE +/- 0.46, N = 3)
  GCC 10.2:   109.59 (SE +/- 0.39, N = 3)
  Clang 10.1: 111.35 (SE +/- 0.14, N = 3)
Extra option on one of the three configurations: -mabm
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 2.3.2 (Seconds, fewer is better)
  Clang 10.1: 4.888 (SE +/- 0.073, N = 3)
  AOCC 2.2:   5.608 (SE +/- 0.078, N = 3)
  GCC 10.2:   6.101 (SE +/- 0.081, N = 3)
1. (CC) gcc options: -O3 -march=native -pthread -lhmmer -lsquid -lm

Zstd Compression

Compression Level: 19

Zstd Compression 1.4.5 (MB/s, more is better)
  AOCC 2.2:   128.3 (SE +/- 1.67, N = 3)
  GCC 10.2:   125.7 (SE +/- 1.67, N = 3)
  Clang 10.1: 120.7 (SE +/- 4.40, N = 12)
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

SciMark

Computational Test: Monte Carlo

SciMark 2.0 (Mflops, more is better)
  AOCC 2.2:   620.89 (SE +/- 0.09, N = 3)
  Clang 10.1: 620.64 (SE +/- 0.22, N = 3)
  GCC 10.2:   611.72 (SE +/- 0.39, N = 3)
1. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0 (Mflops, more is better)
  Clang 10.1: 3462.03 (SE +/- 15.20, N = 3)
  AOCC 2.2:   3450.97 (SE +/- 7.98, N = 3)
  GCC 10.2:   2811.30 (SE +/- 28.45, N = 3)
1. (CC) gcc options: -O3 -march=native -lm

TSCP

AI Chess Performance

TSCP 1.81 (Nodes Per Second, more is better)
  Clang 10.1: 1178898 (SE +/- 1470.12, N = 5)
  AOCC 2.2:   1163950 (SE +/- 1433.03, N = 5)
  GCC 10.2:   1014328 (SE +/- 1024.97, N = 5)
1. (CC) gcc options: -O3 -march=native

John The Ripper

Test: Blowfish

John The Ripper 1.9.0-jumbo-1 (Real C/S, more is better)
  Clang 10.1: 177644 (SE +/- 1042.76, N = 3)
  AOCC 2.2:   176970 (SE +/- 1313.25, N = 3)
  GCC 10.2:   149154 (SE +/- 326.24, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, more is better)
  AOCC 2.2:   529 (SE +/- 5.25, N = 15)
  Clang 10.1: 527 (SE +/- 7.36, N = 4)
  GCC 10.2:   497 (SE +/- 4.04, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, more is better)
  GCC 10.2:   1439 (SE +/- 6.43, N = 3)
  Clang 10.1: 1327 (SE +/- 5.69, N = 3)
  AOCC 2.2:   1108 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, more is better)
  AOCC 2.2:   351 (SE +/- 24.14, N = 15)
  Clang 10.1: 162 (SE +/- 7.14, N = 12)
  GCC 10.2:   107 (SE +/- 1.33, N = 3)
1. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

oneDNN

Harness: IP Batch 1D - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   0.812279 (SE +/- 0.008809, N = 3; -fopenmp=libomp; MIN: 0.71)
  Clang 10.1: 0.925616 (SE +/- 0.006045, N = 3; -fopenmp=libomp; MIN: 0.83)
  GCC 10.2:   1.983800 (SE +/- 0.003268, N = 3; -fopenmp; MIN: 1.79)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Batch All - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   12.68 (SE +/- 0.06, N = 3; -fopenmp=libomp; MIN: 11.46)
  Clang 10.1: 12.91 (SE +/- 0.09, N = 3; -fopenmp=libomp; MIN: 11.83)
  GCC 10.2:   18.87 (SE +/- 0.21, N = 7; -fopenmp; MIN: 15.39)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   0.506847 (SE +/- 0.003081, N = 3; -fopenmp=libomp; MIN: 0.44)
  Clang 10.1: 0.537037 (SE +/- 0.002740, N = 3; -fopenmp=libomp; MIN: 0.47)
  GCC 10.2:   0.744088 (SE +/- 0.002043, N = 3; -fopenmp; MIN: 0.66)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   0.989625 (SE +/- 0.010667, N = 3; -fopenmp=libomp; MIN: 0.73)
  Clang 10.1: 1.057690 (SE +/- 0.004959, N = 3; -fopenmp=libomp; MIN: 0.91)
  GCC 10.2:   2.826880 (SE +/- 0.012408, N = 3; -fopenmp; MIN: 2.5)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   2.57847 (SE +/- 0.00531, N = 3; -fopenmp=libomp; MIN: 2.34)
  Clang 10.1: 2.62134 (SE +/- 0.00761, N = 3; -fopenmp=libomp; MIN: 2.42)
  GCC 10.2:   2.98436 (SE +/- 0.07819, N = 12; -fopenmp; MIN: 2.36)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   162.42 (SE +/- 2.17, N = 4; -fopenmp=libomp; MIN: 145.25)
  Clang 10.1: 219.27 (SE +/- 1.11, N = 3; -fopenmp=libomp; MIN: 201.2)
  GCC 10.2:   870.72 (SE +/- 4.49, N = 3; -fopenmp; MIN: 830.55)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   61.87 (SE +/- 1.82, N = 15; -fopenmp=libomp; MIN: 34.47)
  Clang 10.1: 87.60 (SE +/- 1.96, N = 15; -fopenmp=libomp; MIN: 61.41)
  GCC 10.2:   351.23 (SE +/- 2.11, N = 3; -fopenmp; MIN: 333.05)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 1.5 (ms, fewer is better)
  AOCC 2.2:   0.228572 (SE +/- 0.003245, N = 4; -fopenmp=libomp; MIN: 0.18)
  Clang 10.1: 0.272380 (SE +/- 0.002288, N = 3; -fopenmp=libomp; MIN: 0.22)
  GCC 10.2:   0.737722 (SE +/- 0.002686, N = 3; -fopenmp; MIN: 0.66)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 (Frames Per Second, more is better)
  AOCC 2.2:   392.64 (SE +/- 3.17, N = 3)
  GCC 10.2:   376.61 (SE +/- 5.95, N = 12)
  Clang 10.1: 366.54 (SE +/- 4.02, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 (Frames Per Second, more is better)
  AOCC 2.2:   412.51 (SE +/- 5.71, N = 15)
  Clang 10.1: 388.72 (SE +/- 4.72, N = 3)
  GCC 10.2:   382.50 (SE +/- 4.16, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.1 (Frames Per Second, more is better)
  AOCC 2.2:   336.19 (SE +/- 3.05, N = 3)
  Clang 10.1: 321.84 (SE +/- 3.32, N = 3)
  GCC 10.2:   311.85 (SE +/- 5.02, N = 3)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x264

H.264 Video Encoding

x264 2019-12-17 (Frames Per Second, more is better)
  AOCC 2.2:   204.60 (SE +/- 1.48, N = 3)
  GCC 10.2:   204.52 (SE +/- 1.50, N = 3)
  Clang 10.1: 195.51 (SE +/- 1.46, N = 3)
Extra option on two of the three configurations: -mstack-alignment=64
1. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=native -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Stockfish

Total Time

Stockfish 9 (Nodes Per Second, more is better)
  Clang 10.1: 257008114 (SE +/- 1092575.32, N = 3)
  AOCC 2.2:   254824742 (SE +/- 254505.98, N = 3)
  GCC 10.2:   242098165 (SE +/- 1656059.78, N = 3)
1. (CXX) g++ options: -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++11 -pedantic -msse -msse3 -mpopcnt -flto

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.41 (Seconds, fewer is better)
  Clang 10.1: 21.74 (SE +/- 0.02, N = 3)
  GCC 10.2:   23.21 (SE +/- 0.03, N = 3)
  AOCC 2.2:   41.25 (SE +/- 0.02, N = 3)

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.2.2 (Seconds, fewer is better)
  GCC 10.2:   16.44 (SE +/- 0.03, N = 3)
  Clang 10.1: 21.82 (SE +/- 0.05, N = 3)
  AOCC 2.2:   39.87 (SE +/- 0.19, N = 3)

Timed MPlayer Compilation

Time To Compile

Timed MPlayer Compilation 1.4 (Seconds, fewer is better)
  GCC 10.2:   10.70 (SE +/- 0.06, N = 3)
  Clang 10.1: 18.09 (SE +/- 0.03, N = 3)
  AOCC 2.2:   35.88 (SE +/- 0.20, N = 3)

Bullet Physics Engine

Test: 3000 Fall

Bullet Physics Engine 2.81 (Seconds, fewer is better)
  AOCC 2.2:   4.310990 (SE +/- 0.013109, N = 3)
  GCC 10.2:   4.313341 (SE +/- 0.009658, N = 3)
  Clang 10.1: 4.387945 (SE +/- 0.006058, N = 3)
Extra options on one of the three configurations: -lglut -lGL -lGLU
1. (CXX) g++ options: -O3 -march=native -rdynamic

Bullet Physics Engine

Test: 1000 Convex

Bullet Physics Engine 2.81 (Seconds, fewer is better)
  AOCC 2.2:   4.557857 (SE +/- 0.004048, N = 3)
  Clang 10.1: 4.558771 (SE +/- 0.007811, N = 3)
  GCC 10.2:   4.873698 (SE +/- 0.010129, N = 3)
Extra options on one of the three configurations: -lglut -lGL -lGLU
1. (CXX) g++ options: -O3 -march=native -rdynamic

Bullet Physics Engine

Test: Prim Trimesh

Bullet Physics Engine 2.81 (Seconds, fewer is better)
  AOCC 2.2:   1.009440 (SE +/- 0.005065, N = 3)
  Clang 10.1: 1.021595 (SE +/- 0.000517, N = 3)
  GCC 10.2:   1.039175 (SE +/- 0.001394, N = 3)
Extra options on one of the three configurations: -lglut -lGL -lGLU
1. (CXX) g++ options: -O3 -march=native -rdynamic

Bullet Physics Engine

Test: Convex Trimesh

Bullet Physics Engine 2.81 (Seconds, fewer is better)
  Clang 10.1: 1.176618 (SE +/- 0.000573, N = 3)
  AOCC 2.2:   1.183395 (SE +/- 0.006371, N = 3)
  GCC 10.2:   1.233930 (SE +/- 0.001754, N = 3)
Extra options on one of the three configurations: -lglut -lGL -lGLU
1. (CXX) g++ options: -O3 -march=native -rdynamic

OpenSSL

RSA 4096-bit Performance

OpenSSL 1.1.1 (Signs Per Second, more is better)
  GCC 10.2:   24437.6 (SE +/- 258.78, N = 3)
  Clang 10.1: 18618.0 (SE +/- 111.56, N = 3)
  AOCC 2.2:   18574.8 (SE +/- 169.89, N = 3)
Extra option on two of the three configurations: -Qunused-arguments
1. (CC) gcc options: -pthread -m64 -O3 -march=native -lssl -lcrypto -ldl

LevelDB

Benchmark: Hot Read

LevelDB 1.22 (Microseconds Per Op, fewer is better)
  AOCC 2.2:   286.17 (SE +/- 2.66, N = 3)
  GCC 10.2:   289.70 (SE +/- 3.91, N = 3)
  Clang 10.1: 293.72 (SE +/- 1.28, N = 3)
1. (CXX) g++ options: -O3 -march=native -lsnappy -lpthread

ASTC Encoder

Preset: Fast

ASTC Encoder 2.0 (Seconds, fewer is better)
  AOCC 2.2:   4.99 (SE +/- 0.02, N = 3)
  Clang 10.1: 5.17 (SE +/- 0.04, N = 3)
  GCC 10.2:   5.61 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Medium

ASTC Encoder 2.0 (Seconds, fewer is better)
  AOCC 2.2:   5.67 (SE +/- 0.01, N = 3)
  Clang 10.1: 5.73 (SE +/- 0.05, N = 3)
  GCC 10.2:   6.25 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.0 (Seconds, fewer is better)
  AOCC 2.2:   8.04 (SE +/- 0.04, N = 3)
  Clang 10.1: 8.11 (SE +/- 0.02, N = 3)
  GCC 10.2:   9.09 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.0 (Seconds, fewer is better)
  Clang 10.1: 19.71 (SE +/- 0.03, N = 3)
  AOCC 2.2:   19.95 (SE +/- 0.21, N = 3)
  GCC 10.2:   22.00 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.12 (Seconds, fewer is better)
  Clang 10.1: 13.57 (SE +/- 0.09, N = 3)
  AOCC 2.2:   13.74 (SE +/- 0.05, N = 3)
  GCC 10.2:   13.76 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

CppPerformanceBenchmarks

Test: Ctype

CppPerformanceBenchmarks 9 (Seconds, fewer is better)
  AOCC 2.2:   40.21 (SE +/- 0.01, N = 3)
  Clang 10.1: 40.34 (SE +/- 0.00, N = 3)
  GCC 10.2:   43.27 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11

CppPerformanceBenchmarks

Test: Math Library

CppPerformanceBenchmarks 9 (Seconds, fewer is better)
  Clang 10.1: 331.62 (SE +/- 0.29, N = 3)
  AOCC 2.2:   334.75 (SE +/- 0.19, N = 3)
  GCC 10.2:   339.96 (SE +/- 0.95, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Vector

CppPerformanceBenchmarks 9 (Seconds, fewer is better)
  Clang 10.1: 86.12 (SE +/- 0.14, N = 3)
  AOCC 2.2:   90.50 (SE +/- 0.12, N = 3)
  GCC 10.2:   99.19 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Abstraction

CppPerformanceBenchmarks 9 (Seconds, fewer is better)
  Clang 10.1: 33.55 (SE +/- 0.01, N = 3)
  AOCC 2.2:   34.01 (SE +/- 0.13, N = 3)
  GCC 10.2:   36.97 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11

Apache Benchmark

Static Web Page Serving

Apache Benchmark 2.4.29 (Requests Per Second, more is better)
  AOCC 2.2:   27488.36 (SE +/- 575.36, N = 12)
  Clang 10.1: 26903.49 (SE +/- 534.75, N = 15)
  GCC 10.2:   26062.36 (SE +/- 608.38, N = 15)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -march=native

BRL-CAD

VGR Performance Metric

BRL-CAD 7.30.8 (VGR Performance Metric, more is better)
  GCC 10.2:   2951330
  Clang 10.1: 2779511
  AOCC 2.2:   2693129
Extra options on one of the three configurations: -flto -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender
1. (CXX) g++ options: -O3 -march=native -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -Qunused-arguments -finline-functions -pedantic -rdynamic -lpthread -ldl -luuid -lm


Phoronix Test Suite v10.8.4