Compiler Optimization Levels

Intel Core i9-11900K testing with an ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2106106-IB-COMPILERO67&gru&sro.

Compiler Optimization Levels: system configuration (-O3 -march=native, -O1)

Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)
Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS)
Chipset: Intel Tiger Lake-H
Memory: 32GB
Disk: 2000GB Corsair Force MP600 + 257GB Flash Drive
Graphics: AMD Radeon VII 16GB (1801/1000MHz)
Audio: Intel Tiger Lake-H HD Audio
Monitor: ASUS MG28U
Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Fedora 34
Kernel: 5.12.9-300.fc34.x86_64 (x86_64)
Desktop: GNOME Shell 40.1
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 21.1.1 (LLVM 12.0.0)
Compiler: GCC 11.1.1 20210531
File-System: btrfs
Screen Resolution: 3840x2160

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- -O1: CXXFLAGS=-O1 CFLAGS=-O1

Compiler Details
- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver

Disk Details
- NONE / compress=zstd:1,relatime,rw,seclabel,space_cache,ssd,subvol=/home,subvolid=256 / Block Size: 4096

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0x3c
- Thermald 2.4.4

Python Details
- Python 3.9.5

Security Details
- SELinux
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Not affected
- tsx_async_abort: Not affected

Compiler Optimization Levels: result overview

Test | -O3 -march=native | -O1
sysbench: CPU | 34770.14 | 34882.14
dav1d: Summer Nature 4K | 195.94 | 185.95
svt-hevc: 1 - Bosphorus 1080p | 9.48 | 9.20
svt-hevc: 7 - Bosphorus 1080p | 140.40 | 137.23
svt-hevc: 10 - Bosphorus 1080p | 279.12 | 271.99
svt-vp9: VMAF Optimized - Bosphorus 1080p | 198.73 | 191.41
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 204.96 | 198.18
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 166.43 | 160.73
x265: Bosphorus 4K | 16.02 | 15.72
x265: Bosphorus 1080p | 67.85 | 67.85
mt-dgemm: Sustained Floating-Point Rate | 3.604641 | 3.922224
gmpbench: Total Time | 6171.8 |
chia-vdf: Square Plain C++ | 208400 | 209233
chia-vdf: Square Assembly Optimized | 250633 | 247933
graphics-magick: Swirl | 689 | 592
graphics-magick: Rotate | 1094 | 1078
graphics-magick: Sharpen | 195 | 162
graphics-magick: Enhanced | 270 | 218
graphics-magick: Resizing | 1222 | 1021
graphics-magick: Noise-Gaussian | 310 | 306
graphics-magick: HWB Color Space | 1285 | 1207
coremark: CoreMark Size 666 - Iterations Per Second | 434724.849744 | 366951.484290
compress-zstd: 3 - Compression Speed | 2731.5 | 2568.0
compress-zstd: 3 - Decompression Speed | 4997.8 | 4847.5
compress-zstd: 8 - Compression Speed | 192.6 | 189.2
compress-zstd: 8 - Decompression Speed | 5189.9 | 5075.8
compress-zstd: 19 - Compression Speed | 35.4 | 35.4
compress-zstd: 19 - Decompression Speed | 4506.5 | 4406.4
compress-zstd: 3, Long Mode - Compression Speed | 1451.0 | 1542.8
compress-zstd: 3, Long Mode - Decompression Speed | 5346.0 | 5215.3
compress-zstd: 8, Long Mode - Compression Speed | 285.9 | 281.5
compress-zstd: 8, Long Mode - Decompression Speed | 5542.9 | 5385.7
compress-zstd: 19, Long Mode - Compression Speed | 32.8 | 32.9
compress-zstd: 19, Long Mode - Decompression Speed | 4540.6 | 4506.0
tjbench: Decompression Throughput | 271.676664 | 260.256611
botan: KASUMI | 115.816 | 108.276
botan: KASUMI - Decrypt | 112.027 | 106.478
botan: AES-256 | 8401.852 | 8879.330
botan: AES-256 - Decrypt | 8412.961 | 8885.129
botan: Twofish | 464.472 | 430.951
botan: Twofish - Decrypt | 451.660 | 427.255
botan: Blowfish | 552.463 | 533.956
botan: Blowfish - Decrypt | 553.519 | 532.560
botan: CAST-256 | 168.756 | 149.439
botan: CAST-256 - Decrypt | 168.851 | 149.807
botan: ChaCha20Poly1305 | 1012.732 | 1019.913
botan: ChaCha20Poly1305 - Decrypt | 1010.787 | 1004.647
cryptopp: All Algorithms | 2346.359074 | 2114.624613
cryptopp: Keyed Algorithms | 924.212911 | 751.481521
cryptopp: Unkeyed Algorithms | 491.454981 | 472.947089
cryptopp: Integer + Elliptic Curve Public Key Algorithms | 7194.857104 | 6862.786620
stockfish: Total Time | 29443112 | 29448017
lammps: 20k Atoms | 8.737 | 8.345
lammps: Rhodopsin Protein | 8.513 | 8.184
redis: GET | 4049394.67 | 3982525.83
redis: SET | 2956462.00 | 2962660.83
pjsip: INVITE | 5060 | 4993
pjsip: OPTIONS, Stateful | 9375 | 9333
pjsip: OPTIONS, Stateless | 254610 | 247106
liquid-dsp: 1 - 256 - 57 | 99844333 | 88411000
liquid-dsp: 2 - 256 - 57 | 188003333 | 162046667
liquid-dsp: 4 - 256 - 57 | 363760000 | 316710000
liquid-dsp: 8 - 256 - 57 | 687846667 | 595816667
liquid-dsp: 16 - 256 - 57 | 722756667 | 672296667
clomp: Static OMP Speedup | 4.8 | 5.1
kripke | 33544357 | 33790753
postmark: Disk Transaction Performance | 9496 | 9259
caffe: AlexNet - CPU - 100 | 36558 | 36622
caffe: GoogleNet - CPU - 100 | 83625 | 84729
onednn: IP Shapes 1D - f32 - CPU | 4.03781 | 4.04828
onednn: IP Shapes 3D - f32 - CPU | 11.2002 | 11.0289
onednn: Convolution Batch Shapes Auto - f32 - CPU | 14.2754 | 14.1700
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 4.98281 | 4.97288
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 4.28984 | 4.28224
onednn: Recurrent Neural Network Training - f32 - CPU | 3165.60 | 3133.28
onednn: Recurrent Neural Network Inference - f32 - CPU | 1876.42 | 1854.40
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 3.52485 | 3.52499
mnn: SqueezeNetV1.0 | 3.748 | 3.848
mnn: resnet-v2-50 | 19.224 | 19.507
mnn: MobileNetV2_224 | 1.916 | 1.982
mnn: mobilenet-v1-1.0 | 1.883 | 1.921
mnn: inception-v3 | 22.513 | 22.942
ncnn: CPU - mobilenet | 11.76 | 15.02
ncnn: CPU-v2-v2 - mobilenet-v2 | 3.21 | 4.19
ncnn: CPU-v3-v3 - mobilenet-v3 | 2.49 | 3.19
ncnn: CPU - shufflenet-v2 | 3.26 | 3.45
ncnn: CPU - mnasnet | 2.22 | 3.17
ncnn: CPU - efficientnet-b0 | 4.24 | 5.24
ncnn: CPU - blazeface | 1.15 | 1.24
ncnn: CPU - googlenet | 10.09 | 11.40
ncnn: CPU - vgg16 | 54.36 | 54.91
ncnn: CPU - resnet18 | 11.08 | 11.47
ncnn: CPU - alexnet | 9.64 | 9.62
ncnn: CPU - resnet50 | 18.23 | 22.29
ncnn: CPU - yolov4-tiny | 20.21 | 21.26
ncnn: CPU - squeezenet_ssd | 15.29 | 16.18
ncnn: CPU - regnety_400m | 8.57 | 9.73
tnn: CPU - MobileNet v2 | 230.113 | 243.162
tnn: CPU - SqueezeNet v1.1 | 227.455 | 235.963
mrbayes: Primate Phylogeny Analysis | 83.430 | 88.533
hmmer: Pfam Database Search | 99.484 | 103.742
qe: AUSURF112 | 2609.02 | 2525.86
c-ray: Total Time - 4K, 16 Rays Per Pixel | 47.335 | 128.907
smallpt: Global Illumination Renderer; 128 Samples | 8.401 | 9.133
aobench: 2048 x 2048 - Total Time | 21.556 | 24.605
encode-flac: WAV To FLAC | 5.937 | 6.590
encode-mp3: WAV To MP3 | 5.473 | 7.675
encode-opus: WAV To Opus Encode | 5.595 | 6.828
espeak: Text-To-Speech Synthesis | 21.765 | 24.001
astcenc: Medium | 4.2153 | 4.3606
astcenc: Thorough | 9.3601 | 9.7734
astcenc: Exhaustive | 51.4853 | 53.2528
basis: ETC1S | 20.808 | 20.845
basis: UASTC Level 0 | 6.106 | 6.114
basis: UASTC Level 2 | 29.138 | 29.108
basis: UASTC Level 3 | 54.586 | 54.557
sqlite-speedtest: Timed Time - Size 1,000 | 46.087 | 49.011
encode-wavpack: WAV To WavPack | 11.098 | 11.132
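The two result columns are easier to compare as ratios. The following is a minimal Python sketch, not part of the exported report, that turns a few of the reported results into a speedup of -O3 -march=native over -O1 and averages them with a geometric mean. The three tests chosen, the `RESULTS` list, and the helper names `speedup` and `geometric_mean` are illustrative assumptions; the numbers are copied from the Sysbench, Coremark, and NCNN mobilenet results in this report.

```python
import math

# (test name, -O1 result, -O3 -march=native result, higher_is_better)
# Values copied from this report; the choice of tests is arbitrary.
RESULTS = [
    ("sysbench: CPU", 34882.14, 34770.14, True),
    ("coremark", 366951.48, 434724.85, True),
    ("ncnn: CPU - mobilenet", 15.02, 11.76, False),  # ms, fewer is better
]


def speedup(o1, o3, higher_is_better):
    """How many times faster the -O3 -march=native build is than -O1.

    For "more is better" metrics this is o3 / o1; for "fewer is better"
    metrics (times) the ratio is inverted so that > 1 always means faster.
    """
    return o3 / o1 if higher_is_better else o1 / o3


def geometric_mean(values):
    """Geometric mean, a common way to average ratios."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


ratios = [speedup(o1, o3, hib) for _, o1, o3, hib in RESULTS]
for row, ratio in zip(RESULTS, ratios):
    print(f"{row[0]}: {ratio:.3f}x")
print(f"geometric mean: {geometric_mean(ratios):.3f}x")
```

A ratio above 1 means the -O3 -march=native build was faster for that test, whichever direction the underlying metric runs.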

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
-O1: 34882.14 (SE +/- 6.87, N = 3)
-O3 -march=native: 34770.14 (SE +/- 2.38, N = 3)
1. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm
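Each result carries an "SE +/- x, N = n" annotation: a standard error over n runs. As a rough sketch of how such a figure is conventionally derived, the Python below computes the standard error of the mean as the sample standard deviation divided by the square root of n. The per-run values are hypothetical, since the report publishes only the mean and the SE, and it is an assumption that the Phoronix Test Suite uses exactly this estimator.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate the error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical events-per-second figures for three runs (illustration only).
runs = [34870.0, 34882.0, 34894.0]
mean = statistics.fmean(runs)
se = standard_error(runs)
print(f"{mean:.2f} (SE +/- {se:.2f}, N = {len(runs)})")
```

When the gap between two configurations is of the same order as their standard errors, the difference is better read as noise than as an effect of the compiler flags.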

dav1d

Video Input: Summer Nature 4K

dav1d 0.9.0 (FPS, More Is Better)
-O1: 185.95 (SE +/- 0.05, N = 3; MIN: 169.98 / MAX: 195.75)
-O3 -march=native: 195.94 (SE +/- 0.19, N = 3; MIN: 181.35 / MAX: 208.71)
1. (CC) gcc options: -pthread -lm

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O1: 9.20 (SE +/- 0.01, N = 3)
-O3 -march=native: 9.48 (SE +/- 0.01, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O1: 137.23 (SE +/- 0.28, N = 3)
-O3 -march=native: 140.40 (SE +/- 0.11, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O1: 271.99 (SE +/- 0.19, N = 3)
-O3 -march=native: 279.12 (SE +/- 0.60, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -fPIE -fPIC -O2 -O3 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O1: 191.41 (SE +/- 1.54, N = 9)
-O3 -march=native: 198.73 (SE +/- 1.49, N = 10)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O1: 198.18 (SE +/- 0.07, N = 3)
-O3 -march=native: 204.96 (SE +/- 0.17, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O1: 160.73 (SE +/- 0.29, N = 3)
-O3 -march=native: 166.43 (SE +/- 0.27, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
-O1: 15.72 (SE +/- 0.17, N = 4)
-O3 -march=native: 16.02 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

x265

Video Input: Bosphorus 1080p

x265 3.4 (Frames Per Second, More Is Better)
-O1: 67.85 (SE +/- 0.60, N = 3)
-O3 -march=native: 67.85 (SE +/- 0.32, N = 3)
1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
-O1: 3.922224 (SE +/- 0.023378, N = 3)
-O3 -march=native: 3.604641 (SE +/- 0.018800, N = 3)
Per-result flags shown: -O1
1. (CC) gcc options: -O3 -march=native -fopenmp

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 (GMPbench Score, More Is Better)
-O3 -march=native: 6171.8
1. (CC) gcc options: -O3 -march=native -lm

Chia Blockchain VDF

Test: Square Plain C++

Chia Blockchain VDF 1.0.1 (IPS, More Is Better)
-O1: 209233 (SE +/- 120.19, N = 3)
-O3 -march=native: 208400 (SE +/- 57.74, N = 3)
1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

Chia Blockchain VDF

Test: Square Assembly Optimized

Chia Blockchain VDF 1.0.1 (IPS, More Is Better)
-O1: 247933 (SE +/- 1471.21, N = 3)
-O3 -march=native: 250633 (SE +/- 1105.04, N = 3)
1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 592 (SE +/- 1.00, N = 3)
-O3 -march=native: 689 (SE +/- 2.67, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 1078 (SE +/- 1.20, N = 3)
-O3 -march=native: 1094 (SE +/- 2.03, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 162 (SE +/- 0.58, N = 3)
-O3 -march=native: 195 (SE +/- 0.58, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 218
-O3 -march=native: 270
SE +/- 0.33, N = 3 (reported for one result only)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 1021 (SE +/- 1.00, N = 3)
-O3 -march=native: 1222 (SE +/- 2.33, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 306
-O3 -march=native: 310
SE +/- 0.88, N = 3 (reported for one result only)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O1: 1207 (SE +/- 1.33, N = 3)
-O3 -march=native: 1285 (SE +/- 1.20, N = 3)
1. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
-O1: 366951.48 (SE +/- 661.73, N = 3)
-O3 -march=native: 434724.85 (SE +/- 533.17, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 2568.0 (SE +/- 8.18, N = 3)
-O3 -march=native: 2731.5 (SE +/- 14.92, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 4847.5 (SE +/- 8.75, N = 3)
-O3 -march=native: 4997.8 (SE +/- 19.31, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 189.2 (SE +/- 0.57, N = 3)
-O3 -march=native: 192.6 (SE +/- 0.90, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 5075.8 (SE +/- 13.17, N = 3)
-O3 -march=native: 5189.9 (SE +/- 15.26, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 35.4 (SE +/- 0.43, N = 4)
-O3 -march=native: 35.4 (SE +/- 0.48, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 4406.4 (SE +/- 6.02, N = 4)
-O3 -march=native: 4506.5 (SE +/- 18.10, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 1542.8 (SE +/- 12.97, N = 3)
-O3 -march=native: 1451.0 (SE +/- 22.75, N = 15)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 5215.3 (SE +/- 8.30, N = 3)
-O3 -march=native: 5346.0 (SE +/- 2.50, N = 15)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 281.5 (SE +/- 2.78, N = 3)
-O3 -march=native: 285.9 (SE +/- 2.25, N = 15)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 5385.7 (SE +/- 9.52, N = 3)
-O3 -march=native: 5542.9 (SE +/- 6.10, N = 15)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 32.9 (SE +/- 0.15, N = 3)
-O3 -march=native: 32.8 (SE +/- 0.19, N = 3)
1. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O1: 4506.0 (SE +/- 3.19, N = 3)
-O3 -march=native: 4540.6 (SE +/- 15.31, N = 3)
1. (CC) gcc options: -pthread -lz

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 2.1.0 (Megapixels/sec, More Is Better)
-O1: 260.26 (SE +/- 0.31, N = 3)
-O3 -march=native: 271.68 (SE +/- 0.45, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -rdynamic

Botan

Test: KASUMI

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 108.28 (SE +/- 0.03, N = 3)
-O3 -march=native: 115.82 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 106.48 (SE +/- 0.06, N = 3)
-O3 -march=native: 112.03 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 8879.33 (SE +/- 0.64, N = 3)
-O3 -march=native: 8401.85 (SE +/- 5.18, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 8885.13 (SE +/- 2.06, N = 3)
-O3 -march=native: 8412.96 (SE +/- 5.34, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 430.95 (SE +/- 0.19, N = 3)
-O3 -march=native: 464.47 (SE +/- 0.31, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 427.26 (SE +/- 0.13, N = 3)
-O3 -march=native: 451.66 (SE +/- 0.62, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 533.96 (SE +/- 0.93, N = 3)
-O3 -march=native: 552.46 (SE +/- 0.20, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 532.56 (SE +/- 1.04, N = 3)
-O3 -march=native: 553.52 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 149.44 (SE +/- 1.37, N = 15)
-O3 -march=native: 168.76 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 149.81 (SE +/- 1.14, N = 15)
-O3 -march=native: 168.85 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 1019.91 (SE +/- 1.88, N = 3)
-O3 -march=native: 1012.73 (SE +/- 0.46, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O1: 1004.65 (SE +/- 1.73, N = 3)
-O3 -march=native: 1010.79 (SE +/- 0.23, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Crypto++

Test: All Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O1: 2114.62 (SE +/- 0.38, N = 3)
-O3 -march=native: 2346.36 (SE +/- 1.51, N = 3)
1. (CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Keyed Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O1: 751.48 (SE +/- 0.51, N = 3)
-O3 -march=native: 924.21 (SE +/- 0.64, N = 3)
1. (CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O1: 472.95 (SE +/- 0.06, N = 3)
-O3 -march=native: 491.45 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O1: 6862.79 (SE +/- 4.50, N = 3)
-O3 -march=native: 7194.86 (SE +/- 1.75, N = 3)
1. (CXX) g++ options: -fPIC -pthread -pipe

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
-O1: 29448017 (SE +/- 371064.83, N = 3)
-O3 -march=native: 29443112 (SE +/- 193823.90, N = 3)
Per-result flags shown: -O1; -march=native
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
-O1: 8.345 (SE +/- 0.035, N = 3)
-O3 -march=native: 8.737 (SE +/- 0.020, N = 3)
1. (CXX) g++ options: -O2 -pthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
-O1: 8.184 (SE +/- 0.028, N = 3)
-O3 -march=native: 8.513 (SE +/- 0.026, N = 3)
1. (CXX) g++ options: -O2 -pthread -lm

Redis

Test: GET

Redis 6.0.9 (Requests Per Second, More Is Better)
-O1: 3982525.83 (SE +/- 33158.80, N = 3)
-O3 -march=native: 4049394.67 (SE +/- 18099.88, N = 3)
Per-result flags shown: -O1; -march=native
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

Redis 6.0.9 (Requests Per Second, More Is Better)
-O1: 2962660.83 (SE +/- 19439.73, N = 3)
-O3 -march=native: 2956462.00 (SE +/- 33577.98, N = 3)
Per-result flags shown: -O1; -march=native
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

PJSIP

Method: INVITE

PJSIP 2.11 (Responses Per Second, More Is Better)
-O1: 4993 (SE +/- 45.51, N = 3)
-O3 -march=native: 5060 (SE +/- 15.24, N = 3)
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateful

PJSIP 2.11 (Responses Per Second, More Is Better)
-O1: 9333 (SE +/- 4.41, N = 3)
-O3 -march=native: 9375 (SE +/- 7.69, N = 3)
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

PJSIP

Method: OPTIONS, Stateless

PJSIP 2.11 (Responses Per Second, More Is Better)
-O1: 247106 (SE +/- 520.47, N = 3)
-O3 -march=native: 254610 (SE +/- 711.03, N = 3)
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O1: 88411000 (SE +/- 6806.86, N = 3)
-O3 -march=native: 99844333 (SE +/- 14836.14, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O1: 162046667 (SE +/- 601728.99, N = 3)
-O3 -march=native: 188003333 (SE +/- 66416.20, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O1: 316710000 (SE +/- 132035.35, N = 3)
-O3 -march=native: 363760000 (SE +/- 1410968.93, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O1: 595816667 (SE +/- 736168.76, N = 3)
-O3 -march=native: 687846667 (SE +/- 689597.31, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O1: 672296667 (SE +/- 328295.26, N = 3)
-O3 -march=native: 722756667 (SE +/- 134824.99, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

CLOMP

Static OMP Speedup

CLOMP 1.2 (Speedup, More Is Better)
-O1: 5.1 (SE +/- 0.06, N = 3)
-O3 -march=native: 4.8 (SE +/- 0.07, N = 3)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -fopenmp -O3 -lm

Kripke

Kripke 1.2.4 (Throughput FoM, More Is Better)
-O1: 33790753 (SE +/- 73388.54, N = 3)
-O3 -march=native: 33544357 (SE +/- 88809.55, N = 3)
1. (CXX) g++ options: -O2 -fopenmp

PostMark

Disk Transaction Performance

PostMark 1.51 (TPS, More Is Better)
-O1: 9259
-O3 -march=native: 9496
SE +/- 118.67, N = 3 (reported for one result only)
Per-result flags shown: -O1; -march=native
1. (CC) gcc options: -O3

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
-O1: 36622 (SE +/- 14.01, N = 3)
-O3 -march=native: 36558 (SE +/- 51.83, N = 3)
1. (CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
-O1: 84729 (SE +/- 10.17, N = 3)
-O3 -march=native: 83625 (SE +/- 43.97, N = 3)
1. (CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 4.04828 (SE +/- 0.00076, N = 3; MIN: 3.91)
-O3 -march=native: 4.03781 (SE +/- 0.00473, N = 3; MIN: 3.92)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 11.03 (SE +/- 0.01, N = 3; MIN: 10.93)
-O3 -march=native: 11.20 (SE +/- 0.00, N = 3; MIN: 11.11)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 14.17 (SE +/- 0.02, N = 3; MIN: 14.04)
-O3 -march=native: 14.28 (SE +/- 0.01, N = 3; MIN: 14.18)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 4.97288 (SE +/- 0.01654, N = 3; MIN: 3.81)
-O3 -march=native: 4.98281 (SE +/- 0.01117, N = 3; MIN: 3.81)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 4.28224 (SE +/- 0.00335, N = 3; MIN: 4.17)
-O3 -march=native: 4.28984 (SE +/- 0.00621, N = 3; MIN: 4.17)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 3133.28 (SE +/- 2.80, N = 3; MIN: 3120.48)
-O3 -march=native: 3165.60 (SE +/- 1.32, N = 3; MIN: 3154.25)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 1854.40 (SE +/- 4.14, N = 3; MIN: 1837.76)
-O3 -march=native: 1876.42 (SE +/- 1.46, N = 3; MIN: 1865.18)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O1: 3.52499 (SE +/- 0.00163, N = 3; MIN: 3.45)
-O3 -march=native: 3.52485 (SE +/- 0.00042, N = 3; MIN: 3.46)
Per-result flags shown: -O1
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O1: 3.848 (SE +/- 0.019, N = 3; MIN: 3.75 / MAX: 8.08)
-O3 -march=native: 3.748 (SE +/- 0.024, N = 3; MIN: 3.64 / MAX: 10.5)
Per-result flags shown: -O1; -march=native
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O1: 19.51 (SE +/- 0.02, N = 3; MIN: 19.33 / MAX: 23.75)
-O3 -march=native: 19.22 (SE +/- 0.02, N = 3; MIN: 19.06 / MAX: 24.92)
Per-result flags shown: -O1; -march=native
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3 - Model: MobileNetV2_224
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 1.982 (SE +/- 0.011, N = 3) [-O1 - MIN: 1.93 / MAX: 7.73]
-O3 -march=native: 1.916 (SE +/- 0.008, N = 3) [-march=native - MIN: 1.87 / MAX: 6.22]
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3 - Model: mobilenet-v1-1.0
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 1.921 (SE +/- 0.004, N = 3) [-O1 - MIN: 1.89 / MAX: 9.19]
-O3 -march=native: 1.883 (SE +/- 0.001, N = 3) [-march=native - MIN: 1.85 / MAX: 7.81]
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3 - Model: inception-v3
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 22.94 (SE +/- 0.01, N = 3) [-O1 - MIN: 22.65 / MAX: 29.53]
-O3 -march=native: 22.51 (SE +/- 0.02, N = 3) [-march=native - MIN: 22.19 / MAX: 27.64]
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218 - Target: CPU - Model: mobilenet
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 15.02 (SE +/- 0.00, N = 3) [-O1 - MIN: 14.88 / MAX: 18.66]
-O3 -march=native: 11.76 (SE +/- 0.06, N = 3) [-O3 -march=native - MIN: 11.54 / MAX: 15.41]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218 - Target: CPU-v2-v2 - Model: mobilenet-v2
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 4.19 (SE +/- 0.01, N = 3) [-O1 - MIN: 4.06 / MAX: 7.81]
-O3 -march=native: 3.21 (SE +/- 0.01, N = 3) [-O3 -march=native - MIN: 3.08 / MAX: 4.11]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 - Target: CPU-v3-v3 - Model: mobilenet-v3
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 3.19 (SE +/- 0.01, N = 3) [-O1 - MIN: 3.16 / MAX: 4.05]
-O3 -march=native: 2.49 (SE +/- 0.00, N = 3) [-O3 -march=native - MIN: 2.44 / MAX: 6.14]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218 - Target: CPU - Model: shufflenet-v2
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 3.45 (SE +/- 0.01, N = 3) [-O1 - MIN: 3.39 / MAX: 7.07]
-O3 -march=native: 3.26 (SE +/- 0.01, N = 3) [-O3 -march=native - MIN: 3.18 / MAX: 6.94]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218 - Target: CPU - Model: mnasnet
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 3.17 (SE +/- 0.01, N = 3) [-O1 - MIN: 3.14 / MAX: 6.8]
-O3 -march=native: 2.22 (SE +/- 0.02, N = 3) [-O3 -march=native - MIN: 2.17 / MAX: 2.35]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218 - Target: CPU - Model: efficientnet-b0
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 5.24 (SE +/- 0.01, N = 3) [-O1 - MIN: 5.17 / MAX: 8.84]
-O3 -march=native: 4.24 (SE +/- 0.01, N = 3) [-O3 -march=native - MIN: 4.19 / MAX: 7.9]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218 - Target: CPU - Model: blazeface
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 1.24 (SE +/- 0.01, N = 3) [-O1 - MIN: 1.21 / MAX: 5.59]
-O3 -march=native: 1.15 (SE +/- 0.03, N = 3) [-O3 -march=native - MIN: 1.08 / MAX: 2]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218 - Target: CPU - Model: googlenet
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 11.40 (SE +/- 0.02, N = 3) [-O1 - MIN: 11.29 / MAX: 14.99]
-O3 -march=native: 10.09 (SE +/- 0.17, N = 3) [-O3 -march=native - MIN: 9.67 / MAX: 13.94]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218 - Target: CPU - Model: vgg16
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 54.91 (SE +/- 0.09, N = 3) [-O1 - MIN: 54.36 / MAX: 58.94]
-O3 -march=native: 54.36 (SE +/- 0.11, N = 3) [-O3 -march=native - MIN: 53.85 / MAX: 59.24]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218 - Target: CPU - Model: resnet18
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 11.47 (SE +/- 0.01, N = 3) [-O1 - MIN: 11.34 / MAX: 15.37]
-O3 -march=native: 11.08 (SE +/- 0.14, N = 3) [-O3 -march=native - MIN: 10.69 / MAX: 16.91]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20201218 - Target: CPU - Model: alexnet
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 9.62 (SE +/- 0.01, N = 3) [-O1 - MIN: 9.5 / MAX: 13.21]
-O3 -march=native: 9.64 (SE +/- 0.02, N = 3) [-O3 -march=native - MIN: 9.53 / MAX: 13.24]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218 - Target: CPU - Model: resnet50
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 22.29 (SE +/- 0.03, N = 3) [-O1 - MIN: 22.02 / MAX: 27]
-O3 -march=native: 18.23 (SE +/- 0.15, N = 3) [-O3 -march=native - MIN: 17.79 / MAX: 22.11]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218 - Target: CPU - Model: yolov4-tiny
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 21.26 (SE +/- 0.05, N = 3) [-O1 - MIN: 20.97 / MAX: 27.08]
-O3 -march=native: 20.21 (SE +/- 0.03, N = 3) [-O3 -march=native - MIN: 20.03 / MAX: 23.86]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218 - Target: CPU - Model: squeezenet_ssd
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 16.18 (SE +/- 0.03, N = 3) [-O1 - MIN: 16.02 / MAX: 19.89]
-O3 -march=native: 15.29 (SE +/- 0.01, N = 3) [-O3 -march=native - MIN: 15.14 / MAX: 19]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218 - Target: CPU - Model: regnety_400m
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 9.73 (SE +/- 0.05, N = 3) [-O1 - MIN: 9.55 / MAX: 14.41]
-O3 -march=native: 8.57 (SE +/- 0.02, N = 3) [-O3 -march=native - MIN: 8.47 / MAX: 12.35]
1. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 - Target: CPU - Model: MobileNet v2
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 243.16 (SE +/- 0.20, N = 3) [-O1 - MIN: 241.63 / MAX: 246.21]
-O3 -march=native: 230.11 (SE +/- 0.06, N = 3) [-O3 -march=native - MIN: 229.52 / MAX: 232.81]
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 - Target: CPU - Model: SqueezeNet v1.1
ms, Fewer Is Better (OpenBenchmarking.org)
-O1: 235.96 (SE +/- 0.15, N = 3) [-O1 - MIN: 234.76 / MAX: 237.84]
-O3 -march=native: 227.46 (SE +/- 0.04, N = 3) [-O3 -march=native - MIN: 226.88 / MAX: 228.23]
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 - Primate Phylogeny Analysis
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 88.53 (SE +/- 0.17, N = 3) [-O1]
-O3 -march=native: 83.43 (SE +/- 0.09, N = 3) [-march=native]
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.2 - Pfam Database Search
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 103.74 (SE +/- 0.08, N = 3) [-O1]
-O3 -march=native: 99.48 (SE +/- 0.04, N = 3) [-O3 -march=native]
1. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Quantum ESPRESSO

Input: AUSURF112

Quantum ESPRESSO 6.7 - Input: AUSURF112
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 2525.86 (SE +/- 27.81, N = 5)
-O3 -march=native: 2609.02 (SE +/- 5.73, N = 3)
1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 - Total Time - 4K, 16 Rays Per Pixel
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 128.91 (SE +/- 0.06, N = 3) [-O1]
-O3 -march=native: 47.34 (SE +/- 0.15, N = 3) [-march=native]
1. (CC) gcc options: -lm -lpthread -O3

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 - Global Illumination Renderer; 128 Samples
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 9.133 (SE +/- 0.002, N = 3) [-O1]
-O3 -march=native: 8.401 (SE +/- 0.009, N = 3) [-march=native]
1. (CXX) g++ options: -fopenmp -O3

AOBench

Size: 2048 x 2048 - Total Time

AOBench - Size: 2048 x 2048 - Total Time
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 24.61 (SE +/- 0.04, N = 3) [-O1]
-O3 -march=native: 21.56 (SE +/- 0.01, N = 3) [-march=native]
1. (CC) gcc options: -lm -O3

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2 - WAV To FLAC
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 6.590 (SE +/- 0.004, N = 5) [-O1]
-O3 -march=native: 5.937 (SE +/- 0.002, N = 5) [-O3 -march=native]
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 - WAV To MP3
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 7.675 (SE +/- 0.092, N = 4) [-O1]
-O3 -march=native: 5.473 (SE +/- 0.008, N = 3) [-march=native]
1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1 - WAV To Opus Encode
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 6.828 (SE +/- 0.004, N = 5) [-O1]
-O3 -march=native: 5.595 (SE +/- 0.010, N = 5) [-O3 -march=native]
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 24.00 (SE +/- 0.07, N = 4) [-O1]
-O3 -march=native: 21.77 (SE +/- 0.06, N = 4) [-O3 -march=native]
1. (CC) gcc options: -std=c99 -lpthread -lm

ASTC Encoder

Preset: Medium

ASTC Encoder 3.0 - Preset: Medium
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 4.3606 (SE +/- 0.0112, N = 3) [-O1]
-O3 -march=native: 4.2153 (SE +/- 0.0026, N = 3) [-O3 -march=native]
1. (CXX) g++ options: -O2 -flto -pthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 3.0 - Preset: Thorough
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 9.7734 (SE +/- 0.0228, N = 3) [-O1]
-O3 -march=native: 9.3601 (SE +/- 0.0151, N = 3) [-O3 -march=native]
1. (CXX) g++ options: -O2 -flto -pthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 3.0 - Preset: Exhaustive
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 53.25 (SE +/- 0.02, N = 3) [-O1]
-O3 -march=native: 51.49 (SE +/- 0.04, N = 3) [-O3 -march=native]
1. (CXX) g++ options: -O2 -flto -pthread

Basis Universal

Settings: ETC1S

Basis Universal 1.13 - Settings: ETC1S
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 20.85 (SE +/- 0.03, N = 3)
-O3 -march=native: 20.81 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 - Settings: UASTC Level 0
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 6.114 (SE +/- 0.005, N = 3)
-O3 -march=native: 6.106 (SE +/- 0.002, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 - Settings: UASTC Level 2
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 29.11 (SE +/- 0.08, N = 3)
-O3 -march=native: 29.14 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 - Settings: UASTC Level 3
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 54.56 (SE +/- 0.02, N = 3)
-O3 -march=native: 54.59 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 - Timed Time - Size 1,000
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 49.01 (SE +/- 0.26, N = 3) [-O1]
-O3 -march=native: 46.09 (SE +/- 0.15, N = 3) [-O3 -march=native]
1. (CC) gcc options: -ldl -lz -lpthread

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3 - WAV To WavPack
Seconds, Fewer Is Better (OpenBenchmarking.org)
-O1: 11.13 (SE +/- 0.01, N = 5) [-O1]
-O3 -march=native: 11.10 (SE +/- 0.00, N = 5) [-O3 -march=native]
1. (CXX) g++ options: -rdynamic


Phoronix Test Suite v10.8.4