Compiler Optimization Levels

Intel Core i9-11900K testing with a ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2106106-IB-COMPILERO67&rdt&grs.

Compiler Optimization LevelsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution-O3 -march=native-O1Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)ASUS ROG MAXIMUS XIII HERO (0707 BIOS)Intel Tiger Lake-H32GB2000GB Corsair Force MP600 + 257GB Flash DriveAMD Radeon VII 16GB (1801/1000MHz)Intel Tiger Lake-H HD AudioASUS MG28U2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Fedora 345.12.9-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.1.1 (LLVM 12.0.0)GCC 11.1.1 20210531btrfs3840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- -O1: CXXFLAGS=-O1 CFLAGS=-O1Compiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Disk Details- NONE / compress=zstd:1,relatime,rw,seclabel,space_cache,ssd,subvol=/home,subvolid=256 / Block Size: 4096 Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.4 Python Details- Python 3.9.5Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Compiler Optimization Levelsc-ray: Total Time - 4K, 16 Rays Per Pixelncnn: CPU - mnasnetencode-mp3: WAV To MP3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - mobilenetgraphics-magick: Enhancedncnn: CPU - efficientnet-b0cryptopp: Keyed Algorithmsncnn: CPU - resnet50encode-opus: WAV To Opus Encodegraphics-magick: Sharpengraphics-magick: Resizingcoremark: CoreMark Size 666 - Iterations Per Secondgraphics-magick: Swirlliquid-dsp: 2 - 256 - 57liquid-dsp: 8 - 256 - 57liquid-dsp: 4 - 256 - 57aobench: 2048 x 2048 - Total Timencnn: CPU - regnety_400mncnn: CPU - googlenetliquid-dsp: 1 - 256 - 57botan: CAST-256botan: CAST-256 - Decryptencode-flac: WAV To FLACcryptopp: All Algorithmsespeak: Text-To-Speech Synthesismt-dgemm: Sustained Floating-Point Ratesmallpt: Global Illumination Renderer; 128 Samplesncnn: CPU - blazefacebotan: Twofishliquid-dsp: 16 - 256 - 57botan: KASUMIgraphics-magick: HWB Color Spacecompress-zstd: 3 - Compression Speedsqlite-speedtest: Timed Time - Size 1,000clomp: Static OMP Speedupmrbayes: Primate Phylogeny Analysisncnn: CPU - shufflenet-v2ncnn: CPU - squeezenet_ssdbotan: Twofish - Decryptbotan: AES-256tnn: CPU - MobileNet v2botan: AES-256 - Decryptdav1d: Summer Nature 4Kbotan: KASUMI - Decryptncnn: CPU - yolov4-tinycryptopp: Integer + Elliptic Curve Public Key Algorithmslammps: 20k Atomsastcenc: Thoroughtjbench: Decompression Throughputhmmer: Pfam Database Searchlammps: Rhodopsin Proteinbotan: Blowfish - Decryptcryptopp: Unkeyed Algorithmssvt-vp9: VMAF Optimized - Bosphorus 1080ptnn: CPU - SqueezeNet v1.1svt-vp9: Visual Quality Optimized - Bosphorus 1080pncnn: CPU - resnet18botan: Blowfishastcenc: Mediummnn: MobileNetV2_224astcenc: Exhaustivesvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080pqe: AUSURF112compress-zstd: 3 - Decompression Speedsvt-hevc: 1 - Bosphorus 1080ppjsip: OPTIONS, Statelesscompress-zstd: 8, Long Mode - Decompression Speedmnn: SqueezeNetV1.0svt-hevc: 10 - Bosphorus 1080ppostmark: Disk Transaction Performancecompress-zstd: 3, Long Mode - Decompression Speedsvt-hevc: 7 - Bosphorus 1080pcompress-zstd: 19 - Decompression Speedcompress-zstd: 8 - Decompression Speedmnn: mobilenet-v1-1.0x265: Bosphorus 4Kmnn: inception-v3compress-zstd: 8 - Compression Speedredis: GETcompress-zstd: 8, Long Mode - Compression Speedonednn: IP Shapes 3D - f32 - CPUgraphics-magick: Rotatemnn: resnet-v2-50pjsip: INVITEcaffe: GoogleNet - CPU - 100graphics-magick: Noise-Gaussianonednn: Recurrent Neural Network Inference - f32 - CPUchia-vdf: Square Assembly Optimizedonednn: Recurrent Neural Network Training - f32 - CPUncnn: CPU - vgg16compress-zstd: 19, Long Mode - Decompression Speedonednn: Convolution Batch Shapes Auto - f32 - CPUkripke: botan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptpjsip: OPTIONS, Statefulchia-vdf: Square Plain C++sysbench: CPUencode-wavpack: WAV To WavPackcompress-zstd: 19, Long Mode - Compression Speedonednn: IP Shapes 1D - f32 - CPUredis: SETncnn: CPU - alexnetonednn: Deconvolution Batch shapes_1d - f32 - CPUbasis: ETC1Sonednn: Deconvolution Batch shapes_3d - f32 - CPUcaffe: AlexNet - CPU - 100basis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3stockfish: Total Timeonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUx265: Bosphorus 1080pcompress-zstd: 19 - Compression Speedgmpbench: Total Timecompress-zstd: 3, Long Mode - Compression Speed-O3 -march=native-O147.3352.225.4733.212.4911.762704.24924.21291118.235.5951951222434724.84974468918800333368784666736376000021.5568.5710.0999844333168.756168.8515.9372346.35907421.7653.6046418.4011.15464.472722756667115.81612852731.546.0874.883.4303.2615.29451.6608401.852230.1138412.961195.94112.02720.217194.8571048.7379.3601271.67666499.4848.513553.519491.454981198.73227.455166.4311.08552.4634.21531.91651.4853204.962609.024997.89.482546105542.93.748279.1294965346.0140.404506.55189.91.88316.0222.513192.64049394.67285.911.2002109419.2245060836253101876.422506333165.6054.364540.614.2754335443571012.7321010.787937520840034770.1411.09832.84.037812956462.009.644.9828120.8084.28984365586.10629.13854.586294431123.5248567.8535.46171.81451.0128.9073.177.6754.193.1915.022185.24751.48152122.296.8281621021366951.48429059216204666759581666731671000024.6059.7311.4088411000149.439149.8076.5902114.62461324.0013.9222249.1331.24430.951672296667108.27612072568.049.0115.188.5333.4516.18427.2558879.330243.1628885.129185.95106.47821.266862.7866208.3459.7734260.256611103.7428.184532.560472.947089191.41235.963160.7311.47533.9564.36061.98253.2528198.182525.864847.59.202471065385.73.848271.9992595215.3137.234406.45075.81.92115.7222.942189.23982525.83281.511.0289107819.5074993847293061854.402479333133.2854.914506.014.1700337907531019.9131004.647933320923334882.1411.13232.94.048282962660.839.624.9728820.8454.28224366226.11429.10854.557294480173.5249967.8535.41542.8OpenBenchmarking.org

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-O3 -march=native-O1306090120150SE +/- 0.15, N = 3SE +/- 0.06, N = 347.34128.91-march=native-O11. (CC) gcc options: -lm -lpthread -O3

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet-O3 -march=native-O10.71331.42662.13992.85323.5665SE +/- 0.02, N = 3SE +/- 0.01, N = 32.223.17-O3 -march=native - MIN: 2.17 / MAX: 2.35-O1 - MIN: 3.14 / MAX: 6.81. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=native-O1246810SE +/- 0.008, N = 3SE +/- 0.092, N = 45.4737.675-march=native-O11. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2-O3 -march=native-O10.94281.88562.82843.77124.714SE +/- 0.01, N = 3SE +/- 0.01, N = 33.214.19-O3 -march=native - MIN: 3.08 / MAX: 4.11-O1 - MIN: 4.06 / MAX: 7.811. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3-O3 -march=native-O10.71781.43562.15342.87123.589SE +/- 0.00, N = 3SE +/- 0.01, N = 32.493.19-O3 -march=native - MIN: 2.44 / MAX: 6.14-O1 - MIN: 3.16 / MAX: 4.051. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet-O3 -march=native-O148121620SE +/- 0.06, N = 3SE +/- 0.00, N = 311.7615.02-O3 -march=native - MIN: 11.54 / MAX: 15.41-O1 - MIN: 14.88 / MAX: 18.661. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-O3 -march=native-O160120180240300SE +/- 0.33, N = 3270218-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0-O3 -march=native-O11.1792.3583.5374.7165.895SE +/- 0.01, N = 3SE +/- 0.01, N = 34.245.24-O3 -march=native - MIN: 4.19 / MAX: 7.9-O1 - MIN: 5.17 / MAX: 8.841. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed Algorithms-O3 -march=native-O12004006008001000SE +/- 0.64, N = 3SE +/- 0.51, N = 3924.21751.48-O3 -march=native-O11. (CXX) g++ options: -fPIC -pthread -pipe

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50-O3 -march=native-O1510152025SE +/- 0.15, N = 3SE +/- 0.03, N = 318.2322.29-O3 -march=native - MIN: 17.79 / MAX: 22.11-O1 - MIN: 22.02 / MAX: 271. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-O3 -march=native-O1246810SE +/- 0.010, N = 5SE +/- 0.004, N = 55.5956.828-O3 -march=native-O11. (CXX) g++ options: -fvisibility=hidden -logg -lm

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-O3 -march=native-O14080120160200SE +/- 0.58, N = 3SE +/- 0.58, N = 3195162-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O3 -march=native-O130060090012001500SE +/- 2.33, N = 3SE +/- 1.00, N = 312221021-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-O3 -march=native-O190K180K270K360K450KSE +/- 533.17, N = 3SE +/- 661.73, N = 3434724.85366951.48-O3 -march=native-O11. (CC) gcc options: -O2 -lrt" -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirl-O3 -march=native-O1150300450600750SE +/- 2.67, N = 3SE +/- 1.00, N = 3689592-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O140M80M120M160M200MSE +/- 66416.20, N = 3SE +/- 601728.99, N = 3188003333162046667-march=native-O11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O1150M300M450M600M750MSE +/- 689597.31, N = 3SE +/- 736168.76, N = 3687846667595816667-march=native-O11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O180M160M240M320M400MSE +/- 1410968.93, N = 3SE +/- 132035.35, N = 3363760000316710000-march=native-O11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-O3 -march=native-O1612182430SE +/- 0.01, N = 3SE +/- 0.04, N = 321.5624.61-march=native-O11. (CC) gcc options: -lm -O3

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m-O3 -march=native-O13691215SE +/- 0.02, N = 3SE +/- 0.05, N = 38.579.73-O3 -march=native - MIN: 8.47 / MAX: 12.35-O1 - MIN: 9.55 / MAX: 14.411. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet-O3 -march=native-O13691215SE +/- 0.17, N = 3SE +/- 0.02, N = 310.0911.40-O3 -march=native - MIN: 9.67 / MAX: 13.94-O1 - MIN: 11.29 / MAX: 14.991. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O120M40M60M80M100MSE +/- 14836.14, N = 3SE +/- 6806.86, N = 39984433388411000-march=native-O11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256-O3 -march=native-O14080120160200SE +/- 0.06, N = 3SE +/- 1.37, N = 15168.76149.441. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt-O3 -march=native-O14080120160200SE +/- 0.01, N = 3SE +/- 1.14, N = 15168.85149.811. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-O3 -march=native-O1246810SE +/- 0.002, N = 5SE +/- 0.004, N = 55.9376.590-O3 -march=native-O11. (CXX) g++ options: -fvisibility=hidden -logg -lm

Crypto++

Test: All Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: All Algorithms-O3 -march=native-O15001000150020002500SE +/- 1.51, N = 3SE +/- 0.38, N = 32346.362114.62-O3 -march=native-O11. (CXX) g++ options: -fPIC -pthread -pipe

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis-O3 -march=native-O1612182430SE +/- 0.06, N = 4SE +/- 0.07, N = 421.7724.00-O3 -march=native-O11. (CC) gcc options: -std=c99 -lpthread -lm

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate-O3 -march=native-O10.88251.7652.64753.534.4125SE +/- 0.018800, N = 3SE +/- 0.023378, N = 33.6046413.922224-O11. (CC) gcc options: -O3 -march=native -fopenmp

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-O3 -march=native-O13691215SE +/- 0.009, N = 3SE +/- 0.002, N = 38.4019.133-march=native-O11. (CXX) g++ options: -fopenmp -O3

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface-O3 -march=native-O10.2790.5580.8371.1161.395SE +/- 0.03, N = 3SE +/- 0.01, N = 31.151.24-O3 -march=native - MIN: 1.08 / MAX: 2-O1 - MIN: 1.21 / MAX: 5.591. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish-O3 -march=native-O1100200300400500SE +/- 0.31, N = 3SE +/- 0.19, N = 3464.47430.951. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O1150M300M450M600M750MSE +/- 134824.99, N = 3SE +/- 328295.26, N = 3722756667672296667-march=native-O11. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI-O3 -march=native-O1306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3115.82108.281. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Space-O3 -march=native-O130060090012001500SE +/- 1.20, N = 3SE +/- 1.33, N = 312851207-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speed-O3 -march=native-O16001200180024003000SE +/- 14.92, N = 3SE +/- 8.18, N = 32731.52568.0-O3 -march=native-O11. (CC) gcc options: -pthread -lz

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O3 -march=native-O11122334455SE +/- 0.15, N = 3SE +/- 0.26, N = 346.0949.01-O3 -march=native-O11. (CC) gcc options: -ldl -lz -lpthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup-O3 -march=native-O11.14752.2953.44254.595.7375SE +/- 0.07, N = 3SE +/- 0.06, N = 34.85.1-march=native-O11. (CC) gcc options: -fopenmp -O3 -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-O3 -march=native-O120406080100SE +/- 0.09, N = 3SE +/- 0.17, N = 383.4388.53-march=native-O11. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2-O3 -march=native-O10.77631.55262.32893.10523.8815SE +/- 0.01, N = 3SE +/- 0.01, N = 33.263.45-O3 -march=native - MIN: 3.18 / MAX: 6.94-O1 - MIN: 3.39 / MAX: 7.071. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd-O3 -march=native-O148121620SE +/- 0.01, N = 3SE +/- 0.03, N = 315.2916.18-O3 -march=native - MIN: 15.14 / MAX: 19-O1 - MIN: 16.02 / MAX: 19.891. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt-O3 -march=native-O1100200300400500SE +/- 0.62, N = 3SE +/- 0.13, N = 3451.66427.261. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256-O3 -march=native-O12K4K6K8K10KSE +/- 5.18, N = 3SE +/- 0.64, N = 38401.858879.331. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2-O3 -march=native-O150100150200250SE +/- 0.06, N = 3SE +/- 0.20, N = 3230.11243.16-O3 -march=native - MIN: 229.52 / MAX: 232.81-O1 - MIN: 241.63 / MAX: 246.211. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt-O3 -march=native-O12K4K6K8K10KSE +/- 5.34, N = 3SE +/- 2.06, N = 38412.968885.131. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.0Video Input: Summer Nature 4K-O3 -march=native-O14080120160200SE +/- 0.19, N = 3SE +/- 0.05, N = 3195.94185.95-O3 -march=native - MIN: 181.35 / MAX: 208.71-O1 - MIN: 169.98 / MAX: 195.751. (CC) gcc options: -pthread -lm

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt-O3 -march=native-O1306090120150SE +/- 0.05, N = 3SE +/- 0.06, N = 3112.03106.481. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny-O3 -march=native-O1510152025SE +/- 0.03, N = 3SE +/- 0.05, N = 320.2121.26-O3 -march=native - MIN: 20.03 / MAX: 23.86-O1 - MIN: 20.97 / MAX: 27.081. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key Algorithms-O3 -march=native-O115003000450060007500SE +/- 1.75, N = 3SE +/- 4.50, N = 37194.866862.79-O3 -march=native-O11. (CXX) g++ options: -fPIC -pthread -pipe

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms-O3 -march=native-O1246810SE +/- 0.020, N = 3SE +/- 0.035, N = 38.7378.345-O3 -march=native-O11. (CXX) g++ options: -O2 -pthread -lm

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.0Preset: Thorough-O3 -march=native-O13691215SE +/- 0.0151, N = 3SE +/- 0.0228, N = 39.36019.7734-O3 -march=native-O11. (CXX) g++ options: -O2 -flto -pthread

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughput-O3 -march=native-O160120180240300SE +/- 0.45, N = 3SE +/- 0.31, N = 3271.68260.26-march=native-O11. (CC) gcc options: -O3 -rdynamic

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database Search-O3 -march=native-O120406080100SE +/- 0.04, N = 3SE +/- 0.08, N = 399.48103.74-O3 -march=native-O11. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-O3 -march=native-O1246810SE +/- 0.026, N = 3SE +/- 0.028, N = 38.5138.184-O3 -march=native-O11. (CXX) g++ options: -O2 -pthread -lm

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt-O3 -march=native-O1120240360480600SE +/- 0.26, N = 3SE +/- 1.04, N = 3553.52532.561. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed Algorithms-O3 -march=native-O1110220330440550SE +/- 0.05, N = 3SE +/- 0.06, N = 3491.45472.95-O3 -march=native-O11. (CXX) g++ options: -fPIC -pthread -pipe

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p-O3 -march=native-O14080120160200SE +/- 1.49, N = 10SE +/- 1.54, N = 9198.73191.41-march=native-O11. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1-O3 -march=native-O150100150200250SE +/- 0.04, N = 3SE +/- 0.15, N = 3227.46235.96-O3 -march=native - MIN: 226.88 / MAX: 228.23-O1 - MIN: 234.76 / MAX: 237.841. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O3 -march=native-O14080120160200SE +/- 0.27, N = 3SE +/- 0.29, N = 3166.43160.73-march=native-O11. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18-O3 -march=native-O13691215SE +/- 0.14, N = 3SE +/- 0.01, N = 311.0811.47-O3 -march=native - MIN: 10.69 / MAX: 16.91-O1 - MIN: 11.34 / MAX: 15.371. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish-O3 -march=native-O1120240360480600SE +/- 0.20, N = 3SE +/- 0.93, N = 3552.46533.961. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.0Preset: Medium-O3 -march=native-O10.98111.96222.94333.92444.9055SE +/- 0.0026, N = 3SE +/- 0.0112, N = 34.21534.3606-O3 -march=native-O11. (CXX) g++ options: -O2 -flto -pthread

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224-O3 -march=native-O10.4460.8921.3381.7842.23SE +/- 0.008, N = 3SE +/- 0.011, N = 31.9161.982-march=native - MIN: 1.87 / MAX: 6.22-O1 - MIN: 1.93 / MAX: 7.731. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.0Preset: Exhaustive-O3 -march=native-O11224364860SE +/- 0.04, N = 3SE +/- 0.02, N = 351.4953.25-O3 -march=native-O11. (CXX) g++ options: -O2 -flto -pthread

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O3 -march=native-O14080120160200SE +/- 0.17, N = 3SE +/- 0.07, N = 3204.96198.18-march=native-O11. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112-O3 -march=native-O16001200180024003000SE +/- 5.73, N = 3SE +/- 27.81, N = 52609.022525.861. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speed-O3 -march=native-O111002200330044005500SE +/- 19.31, N = 3SE +/- 8.75, N = 34997.84847.5-O3 -march=native-O11. (CC) gcc options: -pthread -lz

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p-O3 -march=native-O13691215SE +/- 0.01, N = 3SE +/- 0.01, N = 39.489.20-march=native-O11. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateless-O3 -march=native-O150K100K150K200K250KSE +/- 711.03, N = 3SE +/- 520.47, N = 3254610247106-O3 -march=native-O11. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression Speed-O3 -march=native-O112002400360048006000SE +/- 6.10, N = 15SE +/- 9.52, N = 35542.95385.7-O3 -march=native-O11. (CC) gcc options: -pthread -lz

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0-O3 -march=native-O10.86581.73162.59743.46324.329SE +/- 0.024, N = 3SE +/- 0.019, N = 33.7483.848-march=native - MIN: 3.64 / MAX: 10.5-O1 - MIN: 3.75 / MAX: 8.081. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p-O3 -march=native-O160120180240300SE +/- 0.60, N = 3SE +/- 0.19, N = 3279.12271.99-march=native-O11. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

PostMark

Disk Transaction Performance

OpenBenchmarking.orgTPS, More Is BetterPostMark 1.51Disk Transaction Performance-O3 -march=native-O12K4K6K8K10KSE +/- 118.67, N = 394969259-march=native-O11. (CC) gcc options: -O3

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Decompression Speed-O3 -march=native-O111002200330044005500SE +/- 2.50, N = 15SE +/- 8.30, N = 35346.05215.3-O3 -march=native-O11. (CC) gcc options: -pthread -lz

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p-O3 -march=native-O1306090120150SE +/- 0.11, N = 3SE +/- 0.28, N = 3140.40137.23-march=native-O11. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Decompression Speed-O3 -march=native-O110002000300040005000SE +/- 18.10, N = 3SE +/- 6.02, N = 44506.54406.4-O3 -march=native-O11. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Decompression Speed-O3 -march=native-O111002200330044005500SE +/- 15.26, N = 3SE +/- 13.17, N = 35189.95075.8-O3 -march=native-O11. (CC) gcc options: -pthread -lz

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.0-O3 -march=native-O10.43220.86441.29661.72882.161SE +/- 0.001, N = 3SE +/- 0.004, N = 31.8831.921-march=native - MIN: 1.85 / MAX: 7.81-O1 - MIN: 1.89 / MAX: 9.191. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K-O3 -march=native-O148121620SE +/- 0.12, N = 3SE +/- 0.17, N = 416.0215.72-O3 -march=native-O11. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3-O3 -march=native-O1510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 322.5122.94-march=native - MIN: 22.19 / MAX: 27.64-O1 - MIN: 22.65 / MAX: 29.531. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression Speed-O3 -march=native-O14080120160200SE +/- 0.90, N = 3SE +/- 0.57, N = 3192.6189.2-O3 -march=native-O11. (CC) gcc options: -pthread -lz

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET-O3 -march=native-O1900K1800K2700K3600K4500KSE +/- 18099.88, N = 3SE +/- 33158.80, N = 34049394.673982525.83-march=native-O11. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression Speed-O3 -march=native-O160120180240300SE +/- 2.25, N = 15SE +/- 2.78, N = 3285.9281.5-O3 -march=native-O11. (CC) gcc options: -pthread -lz

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU-O3 -march=native-O13691215SE +/- 0.00, N = 3SE +/- 0.01, N = 311.2011.03MIN: 11.11-O1 - MIN: 10.931. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-O3 -march=native-O12004006008001000SE +/- 2.03, N = 3SE +/- 1.20, N = 310941078-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50-O3 -march=native-O1510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 319.2219.51-march=native - MIN: 19.06 / MAX: 24.92-O1 - MIN: 19.33 / MAX: 23.751. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITE-O3 -march=native-O111002200330044005500SE +/- 15.24, N = 3SE +/- 45.51, N = 350604993-O3 -march=native-O11. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 100-O3 -march=native-O120K40K60K80K100KSE +/- 43.97, N = 3SE +/- 10.17, N = 38362584729-O3 -march=native-O11. (CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussian-O3 -march=native-O170140210280350SE +/- 0.88, N = 3310306-O3 -march=native-O11. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU-O3 -march=native-O1400800120016002000SE +/- 1.46, N = 3SE +/- 4.14, N = 31876.421854.40MIN: 1865.18-O1 - MIN: 1837.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Chia Blockchain VDF

Test: Square Assembly Optimized

OpenBenchmarking.orgIPS, More Is BetterChia Blockchain VDF 1.0.1Test: Square Assembly Optimized-O3 -march=native-O150K100K150K200K250KSE +/- 1105.04, N = 3SE +/- 1471.21, N = 32506332479331. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU-O3 -march=native-O17001400210028003500SE +/- 1.32, N = 3SE +/- 2.80, N = 33165.603133.28MIN: 3154.25-O1 - MIN: 3120.481. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16-O3 -march=native-O11224364860SE +/- 0.11, N = 3SE +/- 0.09, N = 354.3654.91-O3 -march=native - MIN: 53.85 / MAX: 59.24-O1 - MIN: 54.36 / MAX: 58.941. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speed-O3 -march=native-O110002000300040005000SE +/- 15.31, N = 3SE +/- 3.19, N = 34540.64506.0-O3 -march=native-O11. (CC) gcc options: -pthread -lz

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU-O3 -march=native-O148121620SE +/- 0.01, N = 3SE +/- 0.02, N = 314.2814.17MIN: 14.18-O1 - MIN: 14.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.4-O3 -march=native-O17M14M21M28M35MSE +/- 88809.55, N = 3SE +/- 73388.54, N = 33354435733790753-O3 -march=native-O11. (CXX) g++ options: -O2 -fopenmp

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305-O3 -march=native-O12004006008001000SE +/- 0.46, N = 3SE +/- 1.88, N = 31012.731019.911. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt-O3 -march=native-O12004006008001000SE +/- 0.23, N = 3SE +/- 1.73, N = 31010.791004.651. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateful-O3 -march=native-O12K4K6K8K10KSE +/- 7.69, N = 3SE +/- 4.41, N = 393759333-O3 -march=native-O11. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

Chia Blockchain VDF

Test: Square Plain C++

OpenBenchmarking.orgIPS, More Is BetterChia Blockchain VDF 1.0.1Test: Square Plain C++-O3 -march=native-O140K80K120K160K200KSE +/- 57.74, N = 3SE +/- 120.19, N = 32084002092331. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU-O3 -march=native-O17K14K21K28K35KSE +/- 2.38, N = 3SE +/- 6.87, N = 334770.1434882.14-O3 -march=native-O11. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack-O3 -march=native-O13691215SE +/- 0.00, N = 5SE +/- 0.01, N = 511.1011.13-O3 -march=native-O11. (CXX) g++ options: -rdynamic

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speed-O3 -march=native-O1816243240SE +/- 0.19, N = 3SE +/- 0.15, N = 332.832.9-O3 -march=native-O11. (CC) gcc options: -pthread -lz

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU-O3 -march=native-O10.91091.82182.73273.64364.5545SE +/- 0.00473, N = 3SE +/- 0.00076, N = 34.037814.04828MIN: 3.92-O1 - MIN: 3.911. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET-O3 -march=native-O1600K1200K1800K2400K3000KSE +/- 33577.98, N = 3SE +/- 19439.73, N = 32956462.002962660.83-march=native-O11. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet-O3 -march=native-O13691215SE +/- 0.02, N = 3SE +/- 0.01, N = 39.649.62-O3 -march=native - MIN: 9.53 / MAX: 13.24-O1 - MIN: 9.5 / MAX: 13.211. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU-O3 -march=native-O11.12112.24223.36334.48445.6055SE +/- 0.01117, N = 3SE +/- 0.01654, N = 34.982814.97288MIN: 3.81-O1 - MIN: 3.811. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1S-O3 -march=native-O1510152025SE +/- 0.02, N = 3SE +/- 0.03, N = 320.8120.851. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU-O3 -march=native-O10.96521.93042.89563.86084.826SE +/- 0.00621, N = 3SE +/- 0.00335, N = 34.289844.28224MIN: 4.17-O1 - MIN: 4.171. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: AlexNet - Acceleration: CPU - Iterations: 100-O3 -march=native-O18K16K24K32K40KSE +/- 51.83, N = 3SE +/- 14.01, N = 33655836622-O3 -march=native-O11. (CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0-O3 -march=native-O1246810SE +/- 0.002, N = 3SE +/- 0.005, N = 36.1066.1141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2-O3 -march=native-O1714212835SE +/- 0.08, N = 3SE +/- 0.08, N = 329.1429.111. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3-O3 -march=native-O11224364860SE +/- 0.00, N = 3SE +/- 0.02, N = 354.5954.561. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time-O3 -march=native-O16M12M18M24M30MSE +/- 193823.90, N = 3SE +/- 371064.83, N = 32944311229448017-march=native-O11. (CXX) g++ options: -lgcov -m64 -lpthread -O3 -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU-O3 -march=native-O10.79311.58622.37933.17243.9655SE +/- 0.00042, N = 3SE +/- 0.00163, N = 33.524853.52499MIN: 3.46-O1 - MIN: 3.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p-O3 -march=native-O11530456075SE +/- 0.32, N = 3SE +/- 0.60, N = 367.8567.85-O3 -march=native-O11. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19 - Compression Speed-O3 -march=native-O1816243240SE +/- 0.48, N = 3SE +/- 0.43, N = 435.435.4-O3 -march=native-O11. (CC) gcc options: -pthread -lz

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time-O3 -march=native130026003900520065006171.81. (CC) gcc options: -O3 -march=native -lm

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3, Long Mode - Compression Speed-O3 -march=native-O130060090012001500SE +/- 22.75, N = 15SE +/- 12.97, N = 31451.01542.8-O3 -march=native-O11. (CC) gcc options: -pthread -lz


Phoronix Test Suite v10.8.5