Compiler Optimization Levels

Tests for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2106132-IB-COMPILERO15&sor&grr.

Compiler Optimization LevelsProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLCompilerFile-SystemScreen Resolution-O3 -march=native-O1-O0-Og-O2-O2 -march=native-O2 -flto-O3-O3 -flto-O3 -march=native -flto-Ofast -march=native-OfastIntel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)ASUS ROG MAXIMUS XIII HERO (0707 BIOS)Intel Tiger Lake-H32GB2000GB Corsair Force MP600 + 257GB Flash DriveAMD Radeon VII 16GB (1801/1000MHz)Intel Tiger Lake-H HD AudioASUS MG28U2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411Fedora 345.12.9-300.fc34.x86_64 (x86_64)GNOME Shell 40.1X Server + Wayland4.6 Mesa 21.1.1 (LLVM 12.0.0)GCC 11.1.1 20210531btrfs3840x2160OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- -O1: CXXFLAGS=-O1 CFLAGS=-O1- -O0: CXXFLAGS=-O0 CFLAGS=-O0- -Og: CXXFLAGS=-Og CFLAGS=-Og- -O2: CXXFLAGS=-O2 CFLAGS=-O2- -O2 -march=native: CXXFLAGS="-O2 -march=native" CFLAGS="-O2 -march=native"- -O2 -flto: CXXFLAGS="-O2 -flto" CFLAGS="-O2 -flto"- -O3: CXXFLAGS=-O3 CFLAGS=-O3- -O3 -flto: CXXFLAGS="-O3 -flto" CFLAGS="-O3 -flto"- -O3 -march=native -flto: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- -Ofast -march=native: CXXFLAGS="-Ofast -march=native" CFLAGS="-Ofast -march=native"- -Ofast: CXXFLAGS=-Ofast CFLAGS=-OfastCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver Disk Details- NONE / compress=zstd:1,relatime,rw,seclabel,space_cache,ssd,subvol=/home,subvolid=256 / Block Size: 4096Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.4Python Details- Python 3.9.5Security Details- SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Compiler Optimization Levelslammps: 20k Atomscryptopp: All Algorithmscryptopp: Keyed Algorithmscryptopp: Integer + Elliptic Curve Public Key Algorithmshmmer: Pfam Database Searchmrbayes: Primate Phylogeny Analysisc-ray: Total Time - 4K, 16 Rays Per Pixelcaffe: GoogleNet - CPU - 100onednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUcryptopp: Unkeyed Algorithmspjsip: INVITEx265: Bosphorus 4Ksvt-hevc: 1 - Bosphorus 1080ppjsip: OPTIONS, Statefulcompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedgraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Noise-Gaussiangraphics-magick: Resizinggraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: HWB Color Spacecompress-zstd: 19, Long Mode - Compression Speedastcenc: Exhaustivecompress-zstd: 19, Long Mode - Decompression Speedmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0ncnn: CPU - alexnetncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetsqlite-speedtest: Timed Time - Size 1,000botan: CAST-256 - Decryptbotan: CAST-256compress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedcompress-zstd: 3 - Decompression Speedcompress-zstd: 3 - Compression Speedespeak: Text-To-Speech Synthesisbotan: ChaCha20Poly1305 - Decryptbotan: Blowfish - Decryptbotan: Blowfishbotan: Twofish - Decryptbotan: Twofishbotan: KASUMI - Decryptbotan: KASUMIpostmark: Disk Transaction Performanceaobench: 2048 x 2048 - Total Timecoremark: CoreMark Size 666 - Iterations Per Secondtjbench: Decompression Throughputdav1d: Summer Nature 4Kchia-vdf: Square Assembly Optimizedliquid-dsp: 1 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 8 - 256 - 57tnn: CPU - MobileNet v2tnn: CPU - SqueezeNet v1.1encode-flac: WAV To FLACencode-opus: WAV To Opus Encoderedis: GETsmallpt: Global Illumination Renderer; 128 Samplespjsip: OPTIONS, Statelessastcenc: Thoroughonednn: IP Shapes 3D - f32 - CPUencode-mp3: WAV To MP3lammps: Rhodopsin Proteinsvt-hevc: 7 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080pastcenc: Mediumsvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080p-O3 -march=native-O1-O0-Og-O2-O2 -march=native-O2 -flto-O3-O3 -flto-O3 -march=native -flto-Ofast -march=native-Ofast8.7372346.359074924.2129117194.85710499.48483.43047.335836253165.601876.42491.454981506016.029.4893755542.9285.919527031012226891094128551.48534540.622.5131.8831.91619.2243.7488.5715.2920.2118.2311.0854.3610.091.154.242.223.262.493.2111.7646.087168.851168.7565189.9192.64997.82731.521.7651010.787553.519552.463451.660464.472112.027115.816949621.556434724.849744271.676664195.9425063399844333363760000188003333722756667687846667230.113227.4555.9375.5954049394.678.4012546109.360111.20025.4738.513140.40198.734.2153166.43204.96279.128.3452114.624613751.4815216862.786620103.74288.533128.907847293133.281854.40472.947089499315.729.2093335385.7281.516221830610215921078120753.25284506.022.9421.9211.98219.5073.8489.7316.1821.2622.2911.4754.9111.401.245.243.173.453.194.1915.0249.011149.807149.4395075.8189.24847.52568.024.0011004.647532.560533.956427.255430.951106.478108.276925924.605366951.484290260.256611185.9524793388411000316710000162046667672296667595816667243.162235.9636.5906.8283982525.839.1332471069.773411.02897.6758.184137.23191.414.3606160.73198.18271.998.368542.232639135.9564082145.953632532.066265.306191.788848313143.031860.28150.668726481615.659.2091891206.4244.15080188422378106388120.253.27611066.823.0761.9191.96619.5653.8259.639.5916.1821.3722.3611.3055.3111.071.195.233.093.463.174.1815.0680.88936.33836.8871087.0180.91030.11005.159.00976.49146.59247.15343.79343.44632.59634.456914940.88290409.832369259.485533146.062495331102400041181333212996679039766778172667243.096236.46741.97125.4963975306.3329.0942151829.827011.070219.0128.093137.47193.964.3715160.72198.38273.728.3561975.626588676.0307996555.531693114.92293.338130.016848093127.131836.67450.925032488415.389.2193114236.9316.6872132817635811115120731.953.26953476.622.9261.9311.97319.5383.8609.649.6516.1620.9622.1211.3355.3411.041.185.233.113.453.174.2014.9252.668161.424160.8813939.4190.83791.42376.326.892969.304441.082443.742387.639384.85998.826100.677925934.358261573.755828260.633407184.0124793366759667242253333126890000517440000462596667243.334235.7746.7407.6994019272.0010.6902474549.812410.59418.4808.121137.30193.574.3511161.02197.85271.788.3142245.551578825.8165847128.981138102.35984.363106.489848953168.451876.16492.238479496215.749.2193665749.7298.916421930410926221072123532.853.28014791.023.0691.9201.95819.6203.8249.659.6416.2521.2422.3411.4855.1711.511.285.363.213.483.254.3615.0146.312150.169154.2925394.8191.85191.92707.621.411901.568549.171562.854449.213448.567115.399119.474925924.426428580.321691261.219497187.1024670092384333335913333173196667711676667637103333243.409236.3316.1086.4364081087.338.7702534669.815811.23597.3578.012137.13192.874.3470160.47197.89271.668.7552261.012041844.6821607173.762512100.39988.44384.725836383171.691880.88484.447237500915.899.4793805649.1277.019726531212306821075124532.951.48954676.522.7861.8841.92119.4513.7539.698.7515.4620.3218.5111.2954.5910.391.14.272.233.252.503.2211.8846.090158.889153.4425324.1190.55093.52729.422.2281080.314533.320554.069470.026470.938111.714112.449925922.645423569.010151272.808635185.3625060097389667355393333184670000721436667674180000230.217227.5905.9415.9064054268.337.8862524959.366211.23696.9658.499139.61195.034.2307164.42198.90274.648.2672224.098469794.9741237257.052944101.84885.582106.595837223182.481883.34474.998433511415.489.5194065628.6299.116321830310896171079134532.553.27654693.222.8261.9171.94419.3323.7679.8310.2716.6324.5823.4211.6755.5112.011.815.543.395.973.444.4616.9646.231158.251155.1585207.4194.05087.02723.221.656907.735550.133563.749445.665448.404114.976119.378925924.274429492.027757260.29750724803392353667335793333173536667707806667637113333251.613244.6326.0636.6324021727.258.7032551149.821111.22547.1068.001140.32193.704.3697162.81198.86274.398.2952316.405581910.8630367052.849054103.12581.37266.471848453177.981880.75496.417145495315.559.1993755676.9308.416421330110786251077123933.253.28614720.023.0711.9241.96919.6703.8389.689.7116.3321.4122.4111.5155.7711.521.245.303.183.463.244.2515.1246.350178.197178.2115332.9192.55151.92660.522.926960.082540.306553.525451.783449.439118.834124.035914923.692440872.955313257.853485181.3124816795463333341970000175400000711086667646743333243.311236.4276.1066.1524056377.089.0292538879.807111.23685.8848.135136.89191.354.3603159.86196.43270.198.3352269.036592828.8895357270.918001102.31780.56166.353837133177.881877.44490.790520504915.389.5194055474.8306.716421430510866121074134332.753.29684602.922.9281.9181.96219.3773.8159.8210.3416.6624.5023.3911.6555.7911.831.805.543.406.083.414.4616.9246.542172.969170.5785121.7194.34972.92712.721.602969.699538.668551.742451.243448.616119.055124.146914923.368427503.802053260.03135724786795045333345733333178046667713980000650283333251.486244.5886.0736.1754018545.179.0882568599.801011.21955.8188.109140.48193.534.3686162.78198.37273.858.7572278.379843842.1034437252.64035499.81083.29747.456842713177.571878.45490.390372506115.279.7493965478.3287.719527031412316931076134932.951.54104591.022.5311.8781.90719.2943.7069.748.9215.8923.8218.7511.4555.8210.641.804.312.285.682.543.2613.6646.916168.408169.1255105.5192.74935.52719.622.6611016.886555.056553.173454.149463.010112.043116.005925921.539436079.646495271.74826324956799749333363303333189120000722200000687796667248.410242.6865.9385.5504042208.588.4572553819.399111.22525.3868.619142.04195.124.2359165.01199.57275.578.7392330.254345922.6371797097.80800389.95350.81346.138836933180.011880.54490.612594508915.419.4693815529.2276.521729935212287351077127533.151.51144589.422.5291.8811.92219.3213.7419.688.6015.6520.6618.3911.1655.1510.151.164.332.253.252.513.2411.9246.586168.833168.8755209.0189.84991.42711.421.7461008.847553.967554.569449.759463.943112.365116.046925921.537435243.249092273.245785179.9524796796017000349460000182180000684946667671553333230.494227.4055.9234038893.758.2902528699.394811.22155.4568.510138.97193.794.2296163.33197.26271.628.3482320.940908910.4329007091.95212093.79146.56263.746848433177.341880.71495.403012503515.679.2093845734.4300.716222834810786591075126733.153.28684725.422.9141.9211.97619.5413.8459.729.7216.3421.2722.2911.3755.7711.431.225.323.143.473.214.2015.0847.004178.283178.3265383.8188.65174.72658.322.772962.283538.430552.868453.840449.995118.794124.158914922.990450629.164534260.997644180.0524696794333667341200000178193333671610000648636667243.294235.9886.0824026735.58.5902526479.837711.23605.8618.032136.73190.494.3786159.74195.23267.46OpenBenchmarking.org

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O3 -march=native-O0-Og-Ofast-O1-O3 -flto-O2-O3-O2 -flto246810SE +/- 0.006, N = 3SE +/- 0.014, N = 3SE +/- 0.011, N = 3SE +/- 0.020, N = 3SE +/- 0.005, N = 3SE +/- 0.022, N = 3SE +/- 0.010, N = 3SE +/- 0.035, N = 3SE +/- 0.026, N = 3SE +/- 0.010, N = 3SE +/- 0.016, N = 3SE +/- 0.077, N = 98.7578.7558.7398.7378.3688.3568.3488.3458.3358.3148.2958.267-O3 -march=native -flto-march=native-Ofast -march=native-O3 -march=native-O0-Og-Ofast-O1-O3 -flto-O3-flto1. (CXX) g++ options: -O2 -pthread -lm

Crypto++

Test: All Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: All Algorithms-O3 -march=native-Ofast -march=native-Ofast-O3-O3 -march=native -flto-O3 -flto-O2 -march=native-O2-O2 -flto-O1-Og-O05001000150020002500SE +/- 1.51, N = 3SE +/- 2.30, N = 3SE +/- 0.43, N = 3SE +/- 0.85, N = 3SE +/- 0.90, N = 3SE +/- 1.95, N = 3SE +/- 0.95, N = 3SE +/- 1.37, N = 3SE +/- 1.13, N = 3SE +/- 0.38, N = 3SE +/- 1.15, N = 3SE +/- 0.18, N = 32346.362330.252320.942316.412278.382269.042261.012245.552224.102114.621975.63542.23-O3 -march=native-Ofast -march=native-Ofast-O3-O3 -march=native -flto-O3 -flto-O2 -march=native-O2-O2 -flto-O1-Og-O01. (CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Keyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Keyed Algorithms-O3 -march=native-Ofast -march=native-O3-Ofast-O2 -march=native-O3 -march=native -flto-O3 -flto-O2-O2 -flto-O1-Og-O02004006008001000SE +/- 0.64, N = 3SE +/- 0.32, N = 3SE +/- 0.39, N = 3SE +/- 1.09, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 1.27, N = 3SE +/- 0.99, N = 3SE +/- 0.80, N = 3SE +/- 0.51, N = 3SE +/- 0.17, N = 3SE +/- 0.19, N = 3924.21922.64910.86910.43844.68842.10828.89825.82794.97751.48676.03135.96-O3 -march=native-Ofast -march=native-O3-Ofast-O2 -march=native-O3 -march=native -flto-O3 -flto-O2-O2 -flto-O1-Og-O01. (CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Integer + Elliptic Curve Public Key Algorithms-O3 -flto-O2 -flto-O3 -march=native -flto-O3 -march=native-O2 -march=native-O2-Ofast -march=native-Ofast-O3-O1-Og-O016003200480064008000SE +/- 3.89, N = 3SE +/- 4.94, N = 3SE +/- 7.22, N = 3SE +/- 1.75, N = 3SE +/- 6.19, N = 3SE +/- 5.85, N = 3SE +/- 0.23, N = 3SE +/- 3.58, N = 3SE +/- 7.04, N = 3SE +/- 4.50, N = 3SE +/- 7.34, N = 3SE +/- 0.91, N = 37270.927257.057252.647194.867173.767128.987097.817091.957052.856862.796555.532145.95-O3 -flto-O2 -flto-O3 -march=native -flto-O3 -march=native-O2 -march=native-O2-Ofast -march=native-Ofast-O3-O1-Og-O01. (CXX) g++ options: -fPIC -pthread -pipe

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.2Pfam Database Search-Ofast -march=native-Ofast-O3 -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O3 -flto-O2-O3-O1-Og-O0120240360480600SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.22, N = 3SE +/- 0.15, N = 389.9593.7999.4899.81100.40101.85102.32102.36103.13103.74114.92532.07-Ofast -march=native-Ofast-O3 -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O3 -flto-O2-O3-O1-Og-O01. (CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny Analysis-Ofast-Ofast -march=native-O3 -flto-O3-O3 -march=native -flto-O3 -march=native-O2-O2 -flto-O2 -march=native-O1-Og-O060120180240300SE +/- 0.20, N = 3SE +/- 0.27, N = 3SE +/- 0.43, N = 3SE +/- 0.35, N = 3SE +/- 0.79, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 3SE +/- 0.17, N = 3SE +/- 0.06, N = 3SE +/- 0.38, N = 346.5650.8180.5681.3783.3083.4384.3685.5888.4488.5393.34265.31-Ofast-Ofast -march=native-flto-march=native -flto-march=native-O2-O2 -flto-O2 -march=native-O1-Og-O01. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per Pixel-Ofast -march=native-O3 -march=native-O3 -march=native -flto-Ofast-O3 -flto-O3-O2 -march=native-O2-O2 -flto-O1-Og-O04080120160200SE +/- 0.17, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 3SE +/- 0.16, N = 3SE +/- 0.19, N = 3SE +/- 0.19, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.12, N = 346.1447.3447.4663.7566.3566.4784.73106.49106.60128.91130.02191.79-Ofast -march=native-march=native-march=native -flto-Ofast-flto-O2 -march=native-O2-O2 -flto-O1-Og-O01. (CC) gcc options: -lm -lpthread -O3

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

OpenBenchmarking.orgMilli-Seconds, Fewer Is BetterCaffe 2020-02-13Model: GoogleNet - Acceleration: CPU - Iterations: 100-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -flto-O2 -flto-O3 -march=native -flto-O1-Og-O0-Ofast-O3-O220K40K60K80K100KSE +/- 43.97, N = 3SE +/- 35.23, N = 3SE +/- 22.39, N = 3SE +/- 47.42, N = 3SE +/- 32.64, N = 3SE +/- 61.19, N = 3SE +/- 10.17, N = 3SE +/- 73.11, N = 3SE +/- 84.95, N = 3SE +/- 30.02, N = 3SE +/- 30.68, N = 3SE +/- 133.67, N = 3836258363883693837138372284271847298480984831848438484584895-O3 -march=native-march=native-Ofast -march=native-O3 -flto-flto-O3 -march=native -flto-O1-Og-O0-Ofast-O31. (CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU-Og-O1-O0-O3 -march=native-O2-O2 -march=native-Ofast-O3 -march=native -flto-O3 -flto-O3-Ofast -march=native-O2 -flto7001400210028003500SE +/- 1.46, N = 3SE +/- 2.80, N = 3SE +/- 0.67, N = 3SE +/- 1.32, N = 3SE +/- 1.81, N = 3SE +/- 0.97, N = 3SE +/- 1.46, N = 3SE +/- 1.52, N = 3SE +/- 1.80, N = 3SE +/- 0.66, N = 3SE +/- 0.15, N = 3SE +/- 2.58, N = 33127.133133.283143.033165.603168.453171.693177.343177.573177.883177.983180.013182.48-Og - MIN: 3113.66-O1 - MIN: 3120.48-O0 - MIN: 3131.71MIN: 3154.25MIN: 3155.33MIN: 3161.55-Ofast - MIN: 3164.15-flto - MIN: 3166.18-flto - MIN: 3165.88MIN: 3167.31-Ofast - MIN: 3168.64-flto - MIN: 3168.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU-Og-O1-O0-O2-O3 -march=native-O3 -flto-O3 -march=native -flto-Ofast -march=native-Ofast-O3-O2 -march=native-O2 -flto400800120016002000SE +/- 1.91, N = 3SE +/- 4.14, N = 3SE +/- 1.24, N = 3SE +/- 1.27, N = 3SE +/- 1.46, N = 3SE +/- 1.95, N = 3SE +/- 0.62, N = 3SE +/- 1.60, N = 3SE +/- 0.97, N = 3SE +/- 1.53, N = 3SE +/- 0.46, N = 3SE +/- 0.58, N = 31836.671854.401860.281876.161876.421877.441878.451880.541880.711880.751880.881883.34-Og - MIN: 1823.46-O1 - MIN: 1837.76-O0 - MIN: 1849.08MIN: 1864.8MIN: 1865.18-flto - MIN: 1864.71-flto - MIN: 1869.12-Ofast - MIN: 1868.92-Ofast - MIN: 1870.91MIN: 1870.28MIN: 1870.79-flto - MIN: 1872.751. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Crypto++

Test: Unkeyed Algorithms

OpenBenchmarking.orgMiB/second, More Is BetterCrypto++ 8.2Test: Unkeyed Algorithms-O3-Ofast-O2-O3 -march=native-O3 -flto-Ofast -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O1-Og-O0110220330440550SE +/- 0.66, N = 3SE +/- 0.54, N = 3SE +/- 0.40, N = 3SE +/- 0.05, N = 3SE +/- 0.32, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 3SE +/- 0.67, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3496.42495.40492.24491.45490.79490.61490.39484.45475.00472.95450.93150.67-O3-Ofast-O2-O3 -march=native-O3 -flto-Ofast -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O1-Og-O01. (CXX) g++ options: -fPIC -pthread -pipe

PJSIP

Method: INVITE

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: INVITE-O2 -flto-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O3 -flto-Ofast-O2 -march=native-O1-O2-O3-Og-O011002200330044005500SE +/- 33.42, N = 3SE +/- 27.45, N = 3SE +/- 22.84, N = 3SE +/- 15.24, N = 3SE +/- 5.04, N = 3SE +/- 52.72, N = 3SE +/- 43.32, N = 3SE +/- 45.51, N = 3SE +/- 68.29, N = 3SE +/- 43.88, N = 3SE +/- 65.58, N = 3SE +/- 51.86, N = 4511450895061506050495035500949934962495348844816-O2 -flto-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O3 -flto-Ofast-O2 -march=native-O1-O2-O3-Og-O01. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K-O3 -march=native-O2 -march=native-O2-O1-Ofast-O0-O3-O2 -flto-Ofast -march=native-O3 -flto-Og-O3 -march=native -flto48121620SE +/- 0.12, N = 3SE +/- 0.16, N = 5SE +/- 0.18, N = 4SE +/- 0.17, N = 4SE +/- 0.21, N = 3SE +/- 0.10, N = 3SE +/- 0.17, N = 4SE +/- 0.13, N = 8SE +/- 0.15, N = 3SE +/- 0.17, N = 5SE +/- 0.15, N = 3SE +/- 0.10, N = 1516.0215.8915.7415.7215.6715.6515.5515.4815.4115.3815.3815.27-O3 -march=native-march=native-O1-Ofast-O0-O3-flto-Ofast -march=native-O3 -flto-Og-O3 -march=native -flto1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p-O3 -march=native -flto-O3 -flto-O2 -flto-O3 -march=native-O2 -march=native-Ofast -march=native-O2-Og-Ofast-O0-O1-O33691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 39.749.519.519.489.479.469.219.219.209.209.209.19-march=native -flto-flto-flto-march=native-march=native-Ofast -march=native-Og-Ofast-O0-O11. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

PJSIP

Method: OPTIONS, Stateful

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateful-O2 -flto-O3 -flto-O3 -march=native -flto-Ofast-Ofast -march=native-O2 -march=native-O3-O3 -march=native-O2-O1-Og-O02K4K6K8K10KSE +/- 3.28, N = 3SE +/- 1.20, N = 3SE +/- 6.08, N = 3SE +/- 5.55, N = 3SE +/- 6.43, N = 3SE +/- 3.79, N = 3SE +/- 3.06, N = 3SE +/- 7.69, N = 3SE +/- 4.36, N = 3SE +/- 4.41, N = 3SE +/- 7.86, N = 3SE +/- 2.73, N = 3940694059396938493819380937593759366933393119189-O2 -flto-O3 -flto-O3 -march=native -flto-Ofast-Ofast -march=native-O2 -march=native-O3-O3 -march=native-O2-O1-Og-O01. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Decompression Speed-O2-Ofast-O3-O2 -march=native-O2 -flto-O3 -march=native-Ofast -march=native-O3 -march=native -flto-O3 -flto-O1-Og-O012002400360048006000SE +/- 5.12, N = 3SE +/- 2.92, N = 3SE +/- 10.95, N = 3SE +/- 7.83, N = 3SE +/- 11.17, N = 3SE +/- 6.10, N = 15SE +/- 10.09, N = 3SE +/- 6.94, N = 12SE +/- 13.35, N = 11SE +/- 9.52, N = 3SE +/- 6.26, N = 3SE +/- 0.28, N = 35749.75734.45676.95649.15628.65542.95529.25478.35474.85385.74236.91206.4-O2-Ofast-O3-O2 -march=native-O2 -flto-O3 -march=native-Ofast -march=native-O3 -march=native -flto-O3 -flto-O1-Og-O01. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8, Long Mode - Compression Speed-Og-O3-O3 -flto-Ofast-O2 -flto-O2-O3 -march=native -flto-O3 -march=native-O1-O2 -march=native-Ofast -march=native-O070140210280350SE +/- 1.92, N = 3SE +/- 3.28, N = 3SE +/- 2.23, N = 11SE +/- 2.55, N = 3SE +/- 2.74, N = 3SE +/- 1.34, N = 3SE +/- 2.00, N = 12SE +/- 2.25, N = 15SE +/- 2.78, N = 3SE +/- 2.77, N = 3SE +/- 3.28, N = 3SE +/- 1.88, N = 3316.6308.4306.7300.7299.1298.9287.7285.9281.5277.0276.5244.1-Og-O3-O3 -flto-Ofast-O2 -flto-O2-O3 -march=native -flto-O3 -march=native-O1-O2 -march=native-Ofast -march=native-O01. (CC) gcc options: -pthread -lz

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Sharpen-Ofast -march=native-O2 -march=native-O3 -march=native -flto-O3 -march=native-O3 -flto-O3-O2-O2 -flto-Ofast-O1-Og-O050100150200250SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 32171971951951641641641631621628750-Ofast -march=native-O2 -march=native-O3 -march=native -flto-O3 -march=native-O3 -flto-O3-O2-O2 -flto-Ofast-O1-Og-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Enhanced-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast-O2-O2 -flto-O1-O3 -flto-O3-Og-O070140210280350SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 329927027026522821921821821421321380-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast-O2-O2 -flto-O1-O3 -flto-O3-Og-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-Gaussian-Ofast -march=native-Ofast-O3 -march=native -flto-O2 -march=native-O3 -march=native-O1-O3 -flto-O2-O2 -flto-O3-Og-O080160240320400SE +/- 0.33, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 0.33, N = 3SE +/- 0.33, N = 3352348314312310306305304303301281188-Ofast -march=native-Ofast-O3 -march=native -flto-O2 -march=native-O3 -march=native-O1-O3 -flto-O2-O2 -flto-O3-Og-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Resizing-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O3 -march=native-O2-O2 -flto-O3 -flto-Ofast-O3-O1-Og-O030060090012001500SE +/- 1.86, N = 3SE +/- 6.17, N = 3SE +/- 1.00, N = 3SE +/- 2.33, N = 3SE +/- 1.86, N = 3SE +/- 3.38, N = 3SE +/- 1.45, N = 3SE +/- 3.06, N = 3SE +/- 1.76, N = 3SE +/- 1.00, N = 3SE +/- 1.86, N = 31231123012281222109210891086107810781021763422-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O3 -march=native-O2-O2 -flto-O3 -flto-Ofast-O3-O1-Og-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Swirl-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast-O3-O2-O2 -flto-O3 -flto-O1-Og-O0160320480640800SE +/- 3.28, N = 3SE +/- 1.73, N = 3SE +/- 2.67, N = 3SE +/- 2.00, N = 3SE +/- 1.76, N = 3SE +/- 2.08, N = 3SE +/- 1.67, N = 3SE +/- 1.15, N = 3SE +/- 0.58, N = 3SE +/- 1.00, N = 3SE +/- 0.58, N = 3SE +/- 0.88, N = 3735693689682659625622617612592581378-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast-O3-O2-O2 -flto-O3 -flto-O1-Og-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Rotate-Og-O3 -march=native-O2 -flto-O1-Ofast -march=native-O3-O3 -march=native -flto-Ofast-O2 -march=native-O3 -flto-O2-O02004006008001000SE +/- 2.08, N = 3SE +/- 2.03, N = 3SE +/- 1.76, N = 3SE +/- 1.20, N = 3SE +/- 1.00, N = 3SE +/- 0.67, N = 3SE +/- 1.86, N = 3SE +/- 2.40, N = 3SE +/- 2.40, N = 3SE +/- 2.00, N = 3SE +/- 1.15, N = 3111510941079107810771077107610751075107410721063-Og-O3 -march=native-O2 -flto-O1-Ofast -march=native-O3-O3 -march=native -flto-Ofast-O2 -march=native-O3 -flto-O2-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color Space-O3 -march=native -flto-O2 -flto-O3 -flto-O3 -march=native-Ofast -march=native-Ofast-O2 -march=native-O3-O2-Og-O1-O030060090012001500SE +/- 1.53, N = 3SE +/- 0.67, N = 3SE +/- 1.20, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 1.53, N = 3SE +/- 0.33, N = 3SE +/- 1.33, N = 3SE +/- 0.33, N = 313491345134312851275126712451239123512071207881-O3 -march=native -flto-O2 -flto-O3 -flto-O3 -march=native-Ofast -march=native-Ofast-O2 -march=native-O3-O2-Og-O1-O01. (CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Compression Speed-O3-Ofast-Ofast -march=native-O3 -march=native -flto-O2 -march=native-O2-O3 -flto-O2 -flto-Og-O0816243240SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.17, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.21, N = 533.233.133.132.932.932.832.732.531.920.2-O3-Ofast-Ofast -march=native-O3 -march=native -flto-O2 -march=native-O2-O3 -flto-O2 -flto-Og-O01. (CC) gcc options: -pthread -lz

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.0Preset: Exhaustive-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-O1-Og-O0-O2 -flto-O2-O3-Ofast-O3 -flto1224364860SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 351.4951.4951.5151.5453.2553.2753.2853.2853.2853.2953.2953.30-O3 -march=native-march=native-Ofast -march=native-O3 -march=native-O1-Og-O0-O3-Ofast-O31. (CXX) g++ options: -O2 -flto -pthread

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 19, Long Mode - Decompression Speed-O2-Ofast-O3-O2 -flto-O2 -march=native-O3 -flto-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O1-Og-O010002000300040005000SE +/- 13.81, N = 3SE +/- 8.36, N = 3SE +/- 2.68, N = 3SE +/- 5.20, N = 3SE +/- 6.86, N = 3SE +/- 16.05, N = 3SE +/- 22.61, N = 3SE +/- 11.52, N = 3SE +/- 15.31, N = 3SE +/- 3.19, N = 3SE +/- 8.30, N = 3SE +/- 0.44, N = 54791.04725.44720.04693.24676.54602.94591.04589.44540.64506.03476.61066.8-O2-Ofast-O3-O2 -flto-O2 -march=native-O3 -flto-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O1-Og-O01. (CC) gcc options: -pthread -lz

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3-O3 -march=native-Ofast -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-Ofast-Og-O3 -flto-O1-O2-O3-O0612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.20, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 3SE +/- 0.20, N = 3SE +/- 0.16, N = 322.5122.5322.5322.7922.8322.9122.9322.9322.9423.0723.0723.08-march=native - MIN: 22.19 / MAX: 27.64-Ofast -march=native - MIN: 22.28 / MAX: 29.02-march=native -flto - MIN: 22.28 / MAX: 27.89-march=native - MIN: 22.14 / MAX: 27.48-flto - MIN: 22.56 / MAX: 32.63-Ofast - MIN: 22.68 / MAX: 28.85-Og - MIN: 22.69 / MAX: 27.2-flto - MIN: 22.56 / MAX: 29.28-O1 - MIN: 22.65 / MAX: 29.53MIN: 22.73 / MAX: 29.81MIN: 22.71 / MAX: 30.08-O0 - MIN: 22.77 / MAX: 27.581. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.0-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O2 -march=native-O2 -flto-O3 -flto-O0-O2-O1-Ofast-O3-Og0.43450.8691.30351.7382.1725SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.005, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.008, N = 3SE +/- 0.000, N = 31.8781.8811.8831.8841.9171.9181.9191.9201.9211.9211.9241.931-march=native -flto - MIN: 1.85 / MAX: 6.19-Ofast -march=native - MIN: 1.85 / MAX: 6.17-march=native - MIN: 1.85 / MAX: 7.81-march=native - MIN: 1.85 / MAX: 6.17-flto - MIN: 1.89 / MAX: 7.02-flto - MIN: 1.89 / MAX: 6.19-O0 - MIN: 1.89 / MAX: 6.19MIN: 1.89 / MAX: 6.2-O1 - MIN: 1.89 / MAX: 9.19-Ofast - MIN: 1.89 / MAX: 6.22MIN: 1.89 / MAX: 8.6-Og - MIN: 1.9 / MAX: 8.691. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast -march=native-O2 -flto-O2-O3 -flto-O0-O3-Og-Ofast-O10.4460.8921.3381.7842.23SE +/- 0.011, N = 3SE +/- 0.008, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 3SE +/- 0.003, N = 3SE +/- 0.008, N = 3SE +/- 0.015, N = 3SE +/- 0.014, N = 3SE +/- 0.007, N = 3SE +/- 0.010, N = 3SE +/- 0.006, N = 3SE +/- 0.011, N = 31.9071.9161.9211.9221.9441.9581.9621.9661.9691.9731.9761.982-march=native -flto - MIN: 1.84 / MAX: 6.27-march=native - MIN: 1.87 / MAX: 6.22-march=native - MIN: 1.88 / MAX: 6.22-Ofast -march=native - MIN: 1.88 / MAX: 6.23-flto - MIN: 1.9 / MAX: 6.25MIN: 1.91 / MAX: 6.28-flto - MIN: 1.9 / MAX: 6.25-O0 - MIN: 1.91 / MAX: 6.27MIN: 1.93 / MAX: 6.3-Og - MIN: 1.93 / MAX: 6.26-Ofast - MIN: 1.93 / MAX: 6.27-O1 - MIN: 1.93 / MAX: 7.731. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50-O3 -march=native-O3 -march=native -flto-Ofast -march=native-O2 -flto-O3 -flto-O2 -march=native-O1-Og-Ofast-O0-O2-O3510152025SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.14, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.16, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 319.2219.2919.3219.3319.3819.4519.5119.5419.5419.5719.6219.67-march=native - MIN: 19.06 / MAX: 24.92-march=native -flto - MIN: 19.06 / MAX: 25.79-Ofast -march=native - MIN: 19.15 / MAX: 28.31-flto - MIN: 19.14 / MAX: 23.64-flto - MIN: 19.15 / MAX: 25.49-march=native - MIN: 19.15 / MAX: 26.45-O1 - MIN: 19.33 / MAX: 23.75-Og - MIN: 19.39 / MAX: 23.78-Ofast - MIN: 19.37 / MAX: 25.55-O0 - MIN: 19.17 / MAX: 25.8MIN: 19.25 / MAX: 24.23MIN: 19.37 / MAX: 24.171. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O2 -march=native-O2 -flto-O3 -flto-O2-O0-O3-Ofast-O1-Og0.86851.7372.60553.4744.3425SE +/- 0.025, N = 3SE +/- 0.019, N = 3SE +/- 0.024, N = 3SE +/- 0.026, N = 3SE +/- 0.029, N = 3SE +/- 0.041, N = 3SE +/- 0.035, N = 3SE +/- 0.043, N = 3SE +/- 0.008, N = 3SE +/- 0.005, N = 3SE +/- 0.019, N = 3SE +/- 0.004, N = 33.7063.7413.7483.7533.7673.8153.8243.8253.8383.8453.8483.860-march=native -flto - MIN: 3.61 / MAX: 7.92-Ofast -march=native - MIN: 3.65 / MAX: 8.19-march=native - MIN: 3.64 / MAX: 10.5-march=native - MIN: 3.66 / MAX: 8.01-flto - MIN: 3.69 / MAX: 8.26-flto - MIN: 3.69 / MAX: 8.62MIN: 3.71 / MAX: 8.07-O0 - MIN: 3.69 / MAX: 8.15MIN: 3.78 / MAX: 8.07-Ofast - MIN: 3.78 / MAX: 8.38-O1 - MIN: 3.75 / MAX: 8.08-Og - MIN: 3.81 / MAX: 8.21. (CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet-O0-Og-O2-O3-Ofast -march=native-O2 -march=native-Ofast-O3 -march=native -flto-O3 -flto-O2 -flto3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.639.649.659.689.689.699.729.749.829.83-O0 - MIN: 9.54 / MAX: 13.14-Og - MIN: 9.54 / MAX: 13.27MIN: 9.55 / MAX: 14.36-O3 - MIN: 9.59 / MAX: 13.26-Ofast -march=native - MIN: 9.6 / MAX: 13.26-march=native - MIN: 9.58 / MAX: 14.24-Ofast - MIN: 9.62 / MAX: 13.28-O3 -march=native -flto - MIN: 9.64 / MAX: 13.24-O3 -flto - MIN: 9.75 / MAX: 13.41-flto - MIN: 9.75 / MAX: 13.441. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m-O3 -march=native-Ofast -march=native-O2 -march=native-O3 -march=native -flto-O0-O2-Og-O3-Ofast-O1-O2 -flto-O3 -flto3691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 2SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 38.578.608.758.929.599.649.659.719.729.7310.2710.34-O3 -march=native - MIN: 8.47 / MAX: 12.35-Ofast -march=native - MIN: 8.43 / MAX: 12.94-march=native - MIN: 8.52 / MAX: 15-O3 -march=native -flto - MIN: 8.75 / MAX: 12.62-O0 - MIN: 9.46 / MAX: 13.26MIN: 9.46 / MAX: 13.35-Og - MIN: 9.51 / MAX: 15.51-O3 - MIN: 9.49 / MAX: 13.42-Ofast - MIN: 9.38 / MAX: 13.46-O1 - MIN: 9.55 / MAX: 14.41-flto - MIN: 10.13 / MAX: 13.94-O3 -flto - MIN: 10.1 / MAX: 17.971. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-Og-O1-O0-O2-O3-Ofast-O2 -flto-O3 -flto48121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 315.2915.4615.6515.8916.1616.1816.1816.2516.3316.3416.6316.66-O3 -march=native - MIN: 15.14 / MAX: 19-march=native - MIN: 15.31 / MAX: 19.14-Ofast -march=native - MIN: 15.35 / MAX: 19.25-O3 -march=native -flto - MIN: 15.72 / MAX: 21.7-Og - MIN: 15.99 / MAX: 20.07-O1 - MIN: 16.02 / MAX: 19.89-O0 - MIN: 16.05 / MAX: 19.86MIN: 16.07 / MAX: 24.06-O3 - MIN: 16.15 / MAX: 21.16-Ofast - MIN: 16.12 / MAX: 20.09-flto - MIN: 16.51 / MAX: 20.21-O3 -flto - MIN: 16.52 / MAX: 20.451. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny-O3 -march=native-O2 -march=native-Ofast -march=native-Og-O2-O1-Ofast-O0-O3-O3 -march=native -flto-O3 -flto-O2 -flto612182430SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 3SE +/- 0.28, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.36, N = 3SE +/- 0.14, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 320.2120.3220.6620.9621.2421.2621.2721.3721.4123.8224.5024.58-O3 -march=native - MIN: 20.03 / MAX: 23.86-march=native - MIN: 20.15 / MAX: 24.06-Ofast -march=native - MIN: 20.3 / MAX: 24.56-Og - MIN: 20.29 / MAX: 25.35MIN: 21.01 / MAX: 26.9-O1 - MIN: 20.97 / MAX: 27.08-Ofast - MIN: 20.47 / MAX: 26.5-O0 - MIN: 21.06 / MAX: 25.18-O3 - MIN: 21.1 / MAX: 25.22-O3 -march=native -flto - MIN: 23.43 / MAX: 28.5-O3 -flto - MIN: 24.27 / MAX: 28.19-flto - MIN: 24.33 / MAX: 28.271. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50-O3 -march=native-Ofast -march=native-O2 -march=native-O3 -march=native -flto-Og-O1-Ofast-O2-O0-O3-O3 -flto-O2 -flto612182430SE +/- 0.15, N = 3SE +/- 0.19, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.23, N = 3SE +/- 0.03, N = 3SE +/- 0.24, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 318.2318.3918.5118.7522.1222.2922.2922.3422.3622.4123.3923.42-O3 -march=native - MIN: 17.79 / MAX: 22.11-Ofast -march=native - MIN: 17.9 / MAX: 22.23-march=native - MIN: 18.3 / MAX: 24.03-O3 -march=native -flto - MIN: 18.58 / MAX: 22.39-Og - MIN: 21.48 / MAX: 26.11-O1 - MIN: 22.02 / MAX: 27-Ofast - MIN: 21.67 / MAX: 26.11MIN: 22.12 / MAX: 26.1-O0 - MIN: 22.1 / MAX: 27.7-O3 - MIN: 22.18 / MAX: 26.09-O3 -flto - MIN: 23.2 / MAX: 27.18-flto - MIN: 23.16 / MAX: 28.061. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18-O3 -march=native-Ofast -march=native-O2 -march=native-O0-Og-Ofast-O3 -march=native -flto-O1-O2-O3-O3 -flto-O2 -flto3691215SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3SE +/- 0.19, N = 3SE +/- 0.20, N = 3SE +/- 0.18, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 2SE +/- 0.01, N = 3SE +/- 0.01, N = 311.0811.1611.2911.3011.3311.3711.4511.4711.4811.5111.6511.67-O3 -march=native - MIN: 10.69 / MAX: 16.91-Ofast -march=native - MIN: 10.75 / MAX: 14.91-march=native - MIN: 11.17 / MAX: 14.94-O0 - MIN: 10.85 / MAX: 15.09-Og - MIN: 10.84 / MAX: 17.14-Ofast - MIN: 10.94 / MAX: 15.16-O3 -march=native -flto - MIN: 11.35 / MAX: 15.04-O1 - MIN: 11.34 / MAX: 15.37MIN: 11.37 / MAX: 15.06-O3 - MIN: 11.4 / MAX: 16.95-O3 -flto - MIN: 11.53 / MAX: 15.24-flto - MIN: 11.56 / MAX: 15.31. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16-O3 -march=native-O2 -march=native-O1-Ofast -march=native-O2-O0-Og-O2 -flto-O3-Ofast-O3 -flto-O3 -march=native -flto1326395265SE +/- 0.11, N = 3SE +/- 0.23, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.16, N = 3SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 0.04, N = 354.3654.5954.9155.1555.1755.3155.3455.5155.7755.7755.7955.82-O3 -march=native - MIN: 53.85 / MAX: 59.24-march=native - MIN: 53.82 / MAX: 59.28-O1 - MIN: 54.36 / MAX: 58.94-Ofast -march=native - MIN: 54.82 / MAX: 60.18MIN: 54.55 / MAX: 59.77-O0 - MIN: 54.78 / MAX: 60.73-Og - MIN: 54.84 / MAX: 62.65-flto - MIN: 55.03 / MAX: 59.08-O3 - MIN: 55.15 / MAX: 66.36-Ofast - MIN: 55.26 / MAX: 59.78-O3 -flto - MIN: 55.29 / MAX: 59.45-O3 -march=native -flto - MIN: 55.41 / MAX: 61.051. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet-O3 -march=native-Ofast -march=native-O2 -march=native-O3 -march=native -flto-Og-O0-O1-Ofast-O2-O3-O3 -flto-O2 -flto3691215SE +/- 0.17, N = 3SE +/- 0.19, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 3SE +/- 0.02, N = 3SE +/- 0.23, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.25, N = 3SE +/- 0.07, N = 310.0910.1510.3910.6411.0411.0711.4011.4311.5111.5211.8312.01-O3 -march=native - MIN: 9.67 / MAX: 13.94-Ofast -march=native - MIN: 9.84 / MAX: 15.83-march=native - MIN: 10.3 / MAX: 14.02-O3 -march=native -flto - MIN: 10.48 / MAX: 15.45-Og - MIN: 10.78 / MAX: 14.99-O0 - MIN: 10.74 / MAX: 15.65-O1 - MIN: 11.29 / MAX: 14.99-Ofast - MIN: 10.92 / MAX: 17.47MIN: 11.34 / MAX: 15.19-O3 - MIN: 11.38 / MAX: 15.27-O3 -flto - MIN: 11.25 / MAX: 15.72-flto - MIN: 11.84 / MAX: 15.721. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface-O2 -march=native-O3 -march=native-Ofast -march=native-Og-O0-Ofast-O1-O3-O2-O3 -flto-O3 -march=native -flto-O2 -flto0.40730.81461.22191.62922.0365SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 31.101.151.161.181.191.221.241.241.281.801.801.81-march=native - MIN: 1.08 / MAX: 4.83-O3 -march=native - MIN: 1.08 / MAX: 2-Ofast -march=native - MIN: 1.08 / MAX: 4.68-Og - MIN: 1.14 / MAX: 1.94-O0 - MIN: 1.14 / MAX: 4.76-Ofast - MIN: 1.15 / MAX: 1.3-O1 - MIN: 1.21 / MAX: 5.59-O3 - MIN: 1.21 / MAX: 4.79MIN: 1.22 / MAX: 4.92-O3 -flto - MIN: 1.74 / MAX: 1.88-O3 -march=native -flto - MIN: 1.72 / MAX: 2.56-flto - MIN: 1.77 / MAX: 1.941. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0-O3 -march=native-O2 -march=native-O3 -march=native -flto-Ofast -march=native-O0-Og-O1-O3-Ofast-O2-O2 -flto-O3 -flto1.24652.4933.73954.9866.2325SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 34.244.274.314.335.235.235.245.305.325.365.545.54-O3 -march=native - MIN: 4.19 / MAX: 7.9-march=native - MIN: 4.2 / MAX: 7.94-O3 -march=native -flto - MIN: 4.25 / MAX: 7.95-Ofast -march=native - MIN: 4.21 / MAX: 7.88-O0 - MIN: 5.15 / MAX: 8.89-Og - MIN: 5.17 / MAX: 8.95-O1 - MIN: 5.17 / MAX: 8.84-O3 - MIN: 5.18 / MAX: 9.09-Ofast - MIN: 5.19 / MAX: 11.06MIN: 5.18 / MAX: 9.93-flto - MIN: 5.45 / MAX: 9.2-O3 -flto - MIN: 5.41 / MAX: 9.331. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-O0-Og-Ofast-O1-O3-O2-O2 -flto-O3 -flto0.7651.532.2953.063.825SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 32.222.232.252.283.093.113.143.173.183.213.393.40-O3 -march=native - MIN: 2.17 / MAX: 2.35-march=native - MIN: 2.19 / MAX: 5.93-Ofast -march=native - MIN: 2.17 / MAX: 2.77-O3 -march=native -flto - MIN: 2.22 / MAX: 5.98-O0 - MIN: 3.05 / MAX: 6.68-Og - MIN: 3.06 / MAX: 6.78-Ofast - MIN: 3.06 / MAX: 6.77-O1 - MIN: 3.14 / MAX: 6.8-O3 - MIN: 3.09 / MAX: 6.86MIN: 3.08 / MAX: 6.93-flto - MIN: 3.29 / MAX: 6.96-O3 -flto - MIN: 3.24 / MAX: 7.11. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2-O2 -march=native-Ofast -march=native-O3 -march=native-O1-Og-O0-O3-Ofast-O2-O3 -march=native -flto-O2 -flto-O3 -flto246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 33.253.253.263.453.453.463.463.473.485.685.976.08-march=native - MIN: 3.19 / MAX: 6.89-Ofast -march=native - MIN: 3.17 / MAX: 6.87-O3 -march=native - MIN: 3.18 / MAX: 6.94-O1 - MIN: 3.39 / MAX: 7.07-Og - MIN: 3.39 / MAX: 7.06-O0 - MIN: 3.4 / MAX: 7.21-O3 - MIN: 3.42 / MAX: 6.99-Ofast - MIN: 3.39 / MAX: 7.12MIN: 3.38 / MAX: 7.1-O3 -march=native -flto - MIN: 5.56 / MAX: 9.36-flto - MIN: 5.83 / MAX: 9.65-O3 -flto - MIN: 5.91 / MAX: 9.831. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-O0-Og-O1-Ofast-O3-O2-O3 -flto-O2 -flto0.7741.5482.3223.0963.87SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 32.492.502.512.543.173.173.193.213.243.253.413.44-O3 -march=native - MIN: 2.44 / MAX: 6.14-march=native - MIN: 2.46 / MAX: 6.05-Ofast -march=native - MIN: 2.44 / MAX: 6.19-O3 -march=native -flto - MIN: 2.47 / MAX: 6.15-O0 - MIN: 3.11 / MAX: 6.83-Og - MIN: 3.12 / MAX: 5.46-O1 - MIN: 3.16 / MAX: 4.05-Ofast - MIN: 3.14 / MAX: 4.03-O3 - MIN: 3.18 / MAX: 6.79MIN: 3.16 / MAX: 6.87-O3 -flto - MIN: 3.33 / MAX: 4.14-flto - MIN: 3.38 / MAX: 7.081. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-O0-O1-Og-Ofast-O3-O2-O2 -flto-O3 -flto1.00352.0073.01054.0145.0175SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 33.213.223.243.264.184.194.204.204.254.364.464.46-O3 -march=native - MIN: 3.08 / MAX: 4.11-march=native - MIN: 3.1 / MAX: 6.96-Ofast -march=native - MIN: 3.13 / MAX: 4.21-O3 -march=native -flto - MIN: 3.14 / MAX: 6.82-O0 - MIN: 4.03 / MAX: 7.87-O1 - MIN: 4.06 / MAX: 7.81-Og - MIN: 4.06 / MAX: 10.02-Ofast - MIN: 4.07 / MAX: 7.83-O3 - MIN: 4.06 / MAX: 7.77MIN: 4.12 / MAX: 8.14-flto - MIN: 4.28 / MAX: 8.19-O3 -flto - MIN: 4.27 / MAX: 8.141. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-Og-O2-O1-O0-Ofast-O3-O3 -flto-O2 -flto48121620SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.7611.8811.9213.6614.9215.0115.0215.0615.0815.1216.9216.96-O3 -march=native - MIN: 11.54 / MAX: 15.41-march=native - MIN: 11.77 / MAX: 15.49-Ofast -march=native - MIN: 11.81 / MAX: 15.57-O3 -march=native -flto - MIN: 13.25 / MAX: 17.52-Og - MIN: 14.6 / MAX: 18.67MIN: 14.87 / MAX: 18.64-O1 - MIN: 14.88 / MAX: 18.66-O0 - MIN: 14.84 / MAX: 18.7-Ofast - MIN: 14.77 / MAX: 18.87-O3 - MIN: 14.93 / MAX: 18.84-O3 -flto - MIN: 16.64 / MAX: 24.79-flto - MIN: 16.74 / MAX: 20.551. (CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000-O3 -march=native-O2 -march=native-O2 -flto-O2-O3-O3 -flto-Ofast -march=native-O3 -march=native -flto-Ofast-O1-Og-O020406080100SE +/- 0.15, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.24, N = 3SE +/- 0.10, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.26, N = 3SE +/- 0.11, N = 3SE +/- 0.24, N = 346.0946.0946.2346.3146.3546.5446.5946.9247.0049.0152.6780.89-O3 -march=native-O2 -march=native-O2 -flto-O2-O3-O3 -flto-Ofast -march=native-O3 -march=native -flto-Ofast-O1-Og-O01. (CC) gcc options: -ldl -lz -lpthread

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt-Ofast-O3-O3 -flto-O3 -march=native-Ofast -march=native-O3 -march=native -flto-Og-O2 -march=native-O2 -flto-O2-O1-O04080120160200SE +/- 0.11, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.25, N = 3SE +/- 0.67, N = 3SE +/- 1.87, N = 4SE +/- 2.02, N = 6SE +/- 3.81, N = 3SE +/- 1.14, N = 15SE +/- 0.05, N = 3178.28178.20172.97168.85168.83168.41161.42158.89158.25150.17149.8136.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256-Ofast-O3-O3 -flto-O3 -march=native -flto-Ofast -march=native-O3 -march=native-Og-O2 -flto-O2-O2 -march=native-O1-O04080120160200SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3SE +/- 0.95, N = 3SE +/- 1.50, N = 6SE +/- 1.17, N = 3SE +/- 1.79, N = 4SE +/- 1.37, N = 15SE +/- 0.03, N = 3178.33178.21170.58169.13168.88168.76160.88155.16154.29153.44149.4436.891. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Decompression Speed-O2-Ofast-O3-O2 -march=native-Ofast -march=native-O2 -flto-O3 -march=native-O3 -flto-O3 -march=native -flto-O1-Og-O012002400360048006000SE +/- 6.15, N = 3SE +/- 9.23, N = 3SE +/- 3.81, N = 3SE +/- 6.35, N = 3SE +/- 12.33, N = 3SE +/- 91.74, N = 3SE +/- 15.26, N = 3SE +/- 27.37, N = 3SE +/- 19.39, N = 3SE +/- 13.17, N = 3SE +/- 9.27, N = 3SE +/- 0.86, N = 35394.85383.85332.95324.15209.05207.45189.95121.75105.55075.83939.41087.0-O2-Ofast-O3-O2 -march=native-Ofast -march=native-O2 -flto-O3 -march=native-O3 -flto-O3 -march=native -flto-O1-Og-O01. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 8 - Compression Speed-O3 -flto-O2 -flto-O3 -march=native -flto-O3 -march=native-O3-O2-Og-O2 -march=native-Ofast -march=native-O1-Ofast-O04080120160200SE +/- 0.38, N = 3SE +/- 0.38, N = 3SE +/- 0.30, N = 3SE +/- 0.90, N = 3SE +/- 0.84, N = 3SE +/- 0.75, N = 3SE +/- 0.31, N = 3SE +/- 0.82, N = 3SE +/- 0.89, N = 3SE +/- 0.57, N = 3SE +/- 0.78, N = 3SE +/- 0.79, N = 3194.3194.0192.7192.6192.5191.8190.8190.5189.8189.2188.6180.9-O3 -flto-O2 -flto-O3 -march=native -flto-O3 -march=native-O3-O2-Og-O2 -march=native-Ofast -march=native-O1-Ofast-O01. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Decompression Speed-O2-Ofast-O3-O2 -march=native-O2 -flto-O3 -march=native-Ofast -march=native-O3 -flto-O3 -march=native -flto-O1-Og-O011002200330044005500SE +/- 3.43, N = 3SE +/- 20.70, N = 3SE +/- 7.36, N = 3SE +/- 4.97, N = 3SE +/- 2.77, N = 3SE +/- 19.31, N = 3SE +/- 9.83, N = 3SE +/- 20.13, N = 3SE +/- 18.92, N = 3SE +/- 8.75, N = 3SE +/- 14.02, N = 3SE +/- 1.51, N = 35191.95174.75151.95093.55087.04997.84991.44972.94935.54847.53791.41030.1-O2-Ofast-O3-O2 -march=native-O2 -flto-O3 -march=native-Ofast -march=native-O3 -flto-O3 -march=native -flto-O1-Og-O01. (CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.0Compression Level: 3 - Compression Speed-O3 -march=native-O2 -march=native-O2 -flto-O3 -march=native -flto-O3 -flto-Ofast -march=native-O2-O3-Ofast-O1-Og-O06001200180024003000SE +/- 14.92, N = 3SE +/- 5.47, N = 3SE +/- 18.81, N = 3SE +/- 3.46, N = 3SE +/- 18.53, N = 3SE +/- 6.82, N = 3SE +/- 14.51, N = 3SE +/- 8.13, N = 3SE +/- 6.54, N = 3SE +/- 8.18, N = 3SE +/- 7.00, N = 3SE +/- 2.56, N = 32731.52729.42723.22719.62712.72711.42707.62660.52658.32568.02376.31005.1-O3 -march=native-O2 -march=native-O2 -flto-O3 -march=native -flto-O3 -flto-Ofast -march=native-O2-O3-Ofast-O1-Og-O01. (CC) gcc options: -pthread -lz

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis-O2-O3 -flto-O2 -flto-Ofast -march=native-O3 -march=native-O2 -march=native-O3 -march=native -flto-Ofast-O3-O1-Og-O01326395265SE +/- 0.03, N = 4SE +/- 0.05, N = 4SE +/- 0.06, N = 4SE +/- 0.06, N = 4SE +/- 0.06, N = 4SE +/- 0.08, N = 4SE +/- 0.07, N = 4SE +/- 0.06, N = 4SE +/- 0.07, N = 4SE +/- 0.07, N = 4SE +/- 0.08, N = 4SE +/- 0.06, N = 421.4121.6021.6621.7521.7722.2322.6622.7722.9324.0026.8959.01-O2-O3 -flto-O2 -flto-Ofast -march=native-O3 -march=native-O2 -march=native-O3 -march=native -flto-Ofast-O3-O1-Og-O01. (CC) gcc options: -std=c99 -lpthread -lm

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt-O2 -march=native-O3 -march=native -flto-O3 -march=native-Ofast -march=native-O1-O3 -flto-Og-Ofast-O3-O2 -flto-O2-O02004006008001000SE +/- 1.76, N = 6SE +/- 0.91, N = 6SE +/- 0.23, N = 3SE +/- 1.51, N = 6SE +/- 1.73, N = 3SE +/- 3.37, N = 6SE +/- 1.20, N = 6SE +/- 2.42, N = 6SE +/- 3.00, N = 6SE +/- 3.44, N = 6SE +/- 3.49, N = 6SE +/- 0.13, N = 61080.311016.891010.791008.851004.65969.70969.30962.28960.08907.74901.5776.491. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O2 -flto-O2-O3-O3 -flto-Ofast-O2 -march=native-O1-Og-O0120240360480600SE +/- 0.47, N = 3SE +/- 0.18, N = 3SE +/- 0.26, N = 3SE +/- 0.21, N = 3SE +/- 0.27, N = 3SE +/- 0.20, N = 3SE +/- 2.20, N = 3SE +/- 0.54, N = 3SE +/- 2.45, N = 3SE +/- 1.04, N = 3SE +/- 2.60, N = 3SE +/- 0.14, N = 3555.06553.97553.52550.13549.17540.31538.67538.43533.32532.56441.0846.591. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish-O2 -flto-O2-Ofast -march=native-O2 -march=native-O3-O3 -march=native -flto-Ofast-O3 -march=native-O3 -flto-O1-Og-O0120240360480600SE +/- 0.11, N = 3SE +/- 0.19, N = 3SE +/- 0.24, N = 3SE +/- 0.23, N = 3SE +/- 0.23, N = 3SE +/- 0.13, N = 3SE +/- 0.35, N = 3SE +/- 0.20, N = 3SE +/- 2.26, N = 3SE +/- 0.93, N = 3SE +/- 0.55, N = 3SE +/- 0.13, N = 3563.75562.85554.57554.07553.53553.17552.87552.46551.74533.96443.7447.151. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt-O2 -march=native-O3 -march=native -flto-Ofast-O3-O3 -march=native-O3 -flto-Ofast -march=native-O2-O2 -flto-O1-Og-O0100200300400500SE +/- 0.96, N = 3SE +/- 0.11, N = 3SE +/- 0.65, N = 3SE +/- 0.22, N = 3SE +/- 0.62, N = 3SE +/- 0.25, N = 3SE +/- 1.74, N = 3SE +/- 0.35, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 3SE +/- 0.43, N = 3SE +/- 0.10, N = 3470.03454.15453.84451.78451.66451.24449.76449.21445.67427.26387.6443.791. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish-O2 -march=native-O3 -march=native-Ofast -march=native-O3 -march=native -flto-Ofast-O3-O3 -flto-O2-O2 -flto-O1-Og-O0100200300400500SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 1.26, N = 3SE +/- 1.36, N = 3SE +/- 0.29, N = 3SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.75, N = 3SE +/- 0.06, N = 3SE +/- 0.19, N = 3SE +/- 1.58, N = 3SE +/- 0.26, N = 3470.94464.47463.94463.01450.00449.44448.62448.57448.40430.95384.8643.451. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt-O3 -flto-O3-Ofast-O2-O2 -flto-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-O1-Og-O0306090120150SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.59, N = 3SE +/- 0.36, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.27, N = 3SE +/- 0.06, N = 3SE +/- 0.80, N = 3SE +/- 0.07, N = 3119.06118.83118.79115.40114.98112.37112.04112.03111.71106.4898.8332.601. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI-Ofast-O3 -flto-O3-O2-O2 -flto-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-O1-Og-O0306090120150SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.22, N = 3SE +/- 0.03, N = 3SE +/- 0.58, N = 3SE +/- 0.02, N = 3124.16124.15124.04119.47119.38116.05116.01115.82112.45108.28100.6834.461. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

PostMark

Disk Transaction Performance

OpenBenchmarking.orgTPS, More Is BetterPostMark 1.51Disk Transaction Performance-O3 -march=native-Ofast -march=native-O3 -march=native -flto-O2 -flto-O2 -march=native-O2-Og-O1-Ofast-O3 -flto-O3-O02K4K6K8K10KSE +/- 118.67, N = 3SE +/- 110.33, N = 3SE +/- 110.33, N = 3SE +/- 110.33, N = 3SE +/- 110.33, N = 3949692599259925992599259925992599149914991499149-march=native-Ofast -march=native-march=native -flto-O2 -flto-O2 -march=native-O2-Og-O1-Ofast-flto-O01. (CC) gcc options: -O3

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total Time-Ofast -march=native-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast-O3 -flto-O3-O2 -flto-O2-O1-Og-O0918273645SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.17, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.27, N = 4SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.39, N = 321.5421.5421.5622.6522.9923.3723.6924.2724.4324.6134.3640.88-Ofast -march=native-march=native -flto-march=native-O2 -march=native-Ofast-flto-O2 -flto-O2-O1-Og-O01. (CC) gcc options: -lm -O3

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second-Ofast-O3-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O2 -flto-O2-O3 -flto-O2 -march=native-O1-Og-O0100K200K300K400K500KSE +/- 751.74, N = 3SE +/- 295.16, N = 3SE +/- 163.45, N = 3SE +/- 387.90, N = 3SE +/- 533.17, N = 3SE +/- 236.18, N = 3SE +/- 734.61, N = 3SE +/- 491.58, N = 3SE +/- 686.94, N = 3SE +/- 661.73, N = 3SE +/- 256.15, N = 3SE +/- 138.09, N = 3450629.16440872.96436079.65435243.25434724.85429492.03428580.32427503.80423569.01366951.48261573.7690409.83-Ofast-O3-O3 -march=native -flto-Ofast -march=native-O3 -march=native-flto-O3 -flto-march=native-O1-Og-O01. (CC) gcc options: -O2 -lrt" -lrt

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughput-Ofast -march=native-O2 -march=native-O3 -march=native -flto-O3 -march=native-O2-Ofast-Og-O2 -flto-O1-O3 -flto-O0-O360120180240300SE +/- 0.14, N = 3SE +/- 0.41, N = 3SE +/- 1.04, N = 3SE +/- 0.45, N = 3SE +/- 1.18, N = 3SE +/- 0.59, N = 3SE +/- 0.43, N = 3SE +/- 0.51, N = 3SE +/- 0.31, N = 3SE +/- 0.65, N = 3SE +/- 0.67, N = 3SE +/- 1.16, N = 3273.25272.81271.75271.68261.22261.00260.63260.30260.26260.03259.49257.85-Ofast -march=native-O2 -march=native-march=native -flto -lm-march=native-O2-Ofast-Og-O2 -flto -lm-O1-flto -lm-O01. (CC) gcc options: -O3 -rdynamic

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.9.0Video Input: Summer Nature 4K-O3 -march=native-O2-O1-O2 -march=native-Og-O3-Ofast-Ofast -march=native-O04080120160200SE +/- 0.19, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 3SE +/- 0.11, N = 3SE +/- 0.18, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3195.94187.10185.95185.36184.01181.31180.05179.95146.06-O3 -march=native - MIN: 181.35 / MAX: 208.71-O2 - MIN: 172.85 / MAX: 197.02-O1 - MIN: 169.98 / MAX: 195.75-O2 -march=native - MIN: 170.59 / MAX: 194.09-Og - MIN: 171.24 / MAX: 196.19-O3 - MIN: 165.68 / MAX: 190.05-Ofast - MIN: 163.86 / MAX: 187.94-Ofast -march=native - MIN: 165.19 / MAX: 188.36-O0 - MIN: 135.09 / MAX: 168.721. (CC) gcc options: -pthread -lm

Chia Blockchain VDF

Test: Square Assembly Optimized

OpenBenchmarking.orgIPS, More Is BetterChia Blockchain VDF 1.0.1Test: Square Assembly Optimized-O3 -march=native-O2 -march=native-O3 -march=native -flto-O0-O3-O2 -flto-Ofast -march=native-Og-O1-O3 -flto-Ofast-O250K100K150K200K250KSE +/- 1105.04, N = 3SE +/- 200.00, N = 3SE +/- 961.48, N = 3SE +/- 520.68, N = 3SE +/- 674.12, N = 3SE +/- 484.19, N = 3SE +/- 1020.35, N = 3SE +/- 1816.90, N = 3SE +/- 1471.21, N = 3SE +/- 1809.54, N = 3SE +/- 1922.09, N = 3SE +/- 472.58, N = 32506332506002495672495332481672480332479672479332479332478672469672467001. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O3-O3 -flto-Ofast-O2-O2 -flto-O1-Og-O020M40M60M80M100MSE +/- 14836.14, N = 3SE +/- 3711.84, N = 3SE +/- 3282.95, N = 3SE +/- 26576.93, N = 3SE +/- 42880.97, N = 3SE +/- 1089169.77, N = 3SE +/- 15213.30, N = 3SE +/- 996670.85, N = 3SE +/- 1080865.75, N = 3SE +/- 6806.86, N = 3SE +/- 16556.30, N = 3SE +/- 10115.99, N = 3998443339974933397389667960170009546333395045333943336679238433392353667884110006675966711024000-march=native-march=native -flto-O2 -march=native-Ofast -march=native-flto-Ofast-O2-O2 -flto-O1-Og1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O3 -flto-O3-Ofast-O2-O2 -flto-O1-Og-O080M160M240M320M400MSE +/- 1410968.93, N = 3SE +/- 495120.64, N = 3SE +/- 772492.36, N = 3SE +/- 763566.19, N = 3SE +/- 1498358.36, N = 3SE +/- 1167318.87, N = 3SE +/- 946379.06, N = 3SE +/- 1022518.68, N = 3SE +/- 372394.65, N = 3SE +/- 132035.35, N = 3SE +/- 636273.88, N = 3SE +/- 97119.06, N = 336376000036330333335539333334946000034573333334197000034120000033591333333579333331671000024225333341181333-march=native-march=native -flto-O2 -march=native-Ofast -march=native-flto-Ofast-O2-O2 -flto-O1-Og1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 57-O3 -march=native -flto-O3 -march=native-O2 -march=native-Ofast -march=native-Ofast-O3 -flto-O3-O2 -flto-O2-O1-Og-O040M80M120M160M200MSE +/- 612943.72, N = 3SE +/- 66416.20, N = 3SE +/- 691182.56, N = 3SE +/- 30550.50, N = 3SE +/- 417026.51, N = 3SE +/- 29627.31, N = 3SE +/- 2010530.61, N = 3SE +/- 1464631.16, N = 3SE +/- 23333.33, N = 3SE +/- 601728.99, N = 3SE +/- 10000.00, N = 3SE +/- 107734.76, N = 318912000018800333318467000018218000017819333317804666717540000017353666717319666716204666712689000021299667-march=native -flto-march=native-O2 -march=native-Ofast -march=native-Ofast-flto-O2 -flto-O2-O1-Og1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O3 -march=native -flto-O2 -march=native-O3 -flto-O2-O3-O2 -flto-Ofast -march=native-O1-Ofast-Og-O0150M300M450M600M750MSE +/- 134824.99, N = 3SE +/- 265015.72, N = 3SE +/- 403993.95, N = 3SE +/- 330201.96, N = 3SE +/- 63857.48, N = 3SE +/- 49103.07, N = 3SE +/- 171302.20, N = 3SE +/- 331679.23, N = 3SE +/- 328295.26, N = 3SE +/- 58594.65, N = 3SE +/- 130766.97, N = 3SE +/- 172619.36, N = 372275666772220000072143666771398000071167666771108666770780666768494666767229666767161000051744000090397667-march=native-march=native -flto-O2 -march=native-flto-O2-O2 -flto-Ofast -march=native-O1-Ofast-Og1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57-O3 -march=native-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O3 -flto-Ofast-O3-O2 -flto-O2-O1-Og-O0150M300M450M600M750MSE +/- 689597.31, N = 3SE +/- 898319.42, N = 3SE +/- 1217880.67, N = 3SE +/- 1059124.38, N = 3SE +/- 597559.85, N = 3SE +/- 261809.94, N = 3SE +/- 632200.74, N = 3SE +/- 355074.33, N = 3SE +/- 463692.91, N = 3SE +/- 736168.76, N = 3SE +/- 764293.43, N = 3SE +/- 320728.51, N = 368784666768779666767418000067155333365028333364863666764674333363711333363710333359581666746259666778172667-march=native-march=native -flto-O2 -march=native-Ofast -march=native-flto-Ofast-O2 -flto-O2-O1-Og1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v2-O3 -march=native-O2 -march=native-Ofast -march=native-O0-O1-Ofast-O3-Og-O2-O3 -march=native -flto-O3 -flto-O2 -flto50100150200250SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 3SE +/- 0.13, N = 3SE +/- 0.20, N = 3SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3230.11230.22230.49243.10243.16243.29243.31243.33243.41248.41251.49251.61-O3 -march=native - MIN: 229.52 / MAX: 232.81-march=native - MIN: 229.36 / MAX: 233.08-Ofast -march=native - MIN: 229.69 / MAX: 233.56-O0 - MIN: 241.4 / MAX: 244.76-O1 - MIN: 241.63 / MAX: 246.21-Ofast - MIN: 241.45 / MAX: 246.15-O3 - MIN: 241.88 / MAX: 245.9-Og - MIN: 241.41 / MAX: 252.51MIN: 241.93 / MAX: 245.01-O3 -march=native -flto - MIN: 247.34 / MAX: 252.91-O3 -flto - MIN: 250.87 / MAX: 254.61-flto - MIN: 251.13 / MAX: 254.921. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.1-Ofast -march=native-O3 -march=native-O2 -march=native-Og-O1-Ofast-O2-O3-O0-O3 -march=native -flto-O3 -flto-O2 -flto50100150200250SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.22, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.37, N = 3SE +/- 0.12, N = 3SE +/- 0.19, N = 3SE +/- 0.16, N = 3227.41227.46227.59235.77235.96235.99236.33236.43236.47242.69244.59244.63-Ofast -march=native - MIN: 226.48 / MAX: 228.67-O3 -march=native - MIN: 226.88 / MAX: 228.23-march=native - MIN: 226.72 / MAX: 229.36-Og - MIN: 234.57 / MAX: 237.7-O1 - MIN: 234.76 / MAX: 237.84-Ofast - MIN: 234.48 / MAX: 237.48MIN: 234.45 / MAX: 237.44-O3 - MIN: 234.65 / MAX: 237.59-O0 - MIN: 234.7 / MAX: 238.18-O3 -march=native -flto - MIN: 242.07 / MAX: 243.46-O3 -flto - MIN: 243.8 / MAX: 245.52-flto - MIN: 243.9 / MAX: 245.591. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLAC-Ofast -march=native-O3 -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O3 -flto-Ofast-O3-O2-O1-Og-O01020304050SE +/- 0.004, N = 5SE +/- 0.002, N = 5SE +/- 0.005, N = 5SE +/- 0.005, N = 5SE +/- 0.003, N = 5SE +/- 0.004, N = 5SE +/- 0.005, N = 5SE +/- 0.012, N = 5SE +/- 0.010, N = 5SE +/- 0.004, N = 5SE +/- 0.004, N = 5SE +/- 0.013, N = 55.9235.9375.9385.9416.0636.0736.0826.1066.1086.5906.74041.971-Ofast -march=native-O3 -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O3 -flto-Ofast-O3-O2-O1-Og-O01. (CXX) g++ options: -fvisibility=hidden -logg -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode-O3 -march=native -flto-O3 -march=native-O2 -march=native-O3-O3 -flto-O2-O2 -flto-O1-Og-O0612182430SE +/- 0.003, N = 5SE +/- 0.010, N = 5SE +/- 0.002, N = 5SE +/- 0.006, N = 5SE +/- 0.006, N = 5SE +/- 0.004, N = 5SE +/- 0.011, N = 5SE +/- 0.004, N = 5SE +/- 0.002, N = 5SE +/- 0.015, N = 55.5505.5955.9066.1526.1756.4366.6326.8287.69925.496-O3 -march=native -flto-O3 -march=native-O2 -march=native-O3-O3 -flto-O2-O2 -flto-O1-Og-O01. (CXX) g++ options: -fvisibility=hidden -logg -lm

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET-O2-O3-O2 -march=native-O3 -march=native-O3 -march=native -flto-Ofast -march=native-Ofast-O2 -flto-Og-O3 -flto-O1-O0900K1800K2700K3600K4500KSE +/- 3379.85, N = 3SE +/- 13278.68, N = 3SE +/- 19952.38, N = 3SE +/- 18099.88, N = 3SE +/- 15517.78, N = 3SE +/- 13892.93, N = 3SE +/- 28152.86, N = 3SE +/- 50798.23, N = 3SE +/- 29703.89, N = 3SE +/- 39966.63, N = 3SE +/- 33158.80, N = 3SE +/- 48889.70, N = 34081087.334056377.084054268.334049394.674042208.584038893.754026735.504021727.254019272.004018545.173982525.833975306.33-O2-O2 -march=native-march=native-march=native -flto-Ofast -march=native-Ofast-O2 -flto-Og-flto-O1-O01. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Smallpt

Global Illumination Renderer; 128 Samples

OpenBenchmarking.orgSeconds, Fewer Is BetterSmallpt 1.0Global Illumination Renderer; 128 Samples-O2 -march=native-Ofast -march=native-O3 -march=native-O3 -march=native -flto-Ofast-O2 -flto-O2-O3-O3 -flto-O1-Og-O0714212835SE +/- 0.019, N = 3SE +/- 0.006, N = 3SE +/- 0.009, N = 3SE +/- 0.008, N = 3SE +/- 0.006, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.025, N = 3SE +/- 0.015, N = 3SE +/- 0.002, N = 3SE +/- 0.020, N = 3SE +/- 0.037, N = 37.8868.2908.4018.4578.5908.7038.7709.0299.0889.13310.69029.094-O2 -march=native-Ofast -march=native-march=native-march=native -flto-Ofast-O2 -flto-O2-flto-O1-Og-O01. (CXX) g++ options: -fopenmp -O3

PJSIP

Method: OPTIONS, Stateless

OpenBenchmarking.orgResponses Per Second, More Is BetterPJSIP 2.11Method: OPTIONS, Stateless-O3 -flto-O3 -march=native -flto-O2 -flto-O3 -march=native-O3-O2-Ofast -march=native-Ofast-O2 -march=native-Og-O1-O060K120K180K240K300KSE +/- 392.40, N = 3SE +/- 381.59, N = 3SE +/- 608.82, N = 3SE +/- 711.03, N = 3SE +/- 231.12, N = 3SE +/- 97.41, N = 3SE +/- 730.96, N = 3SE +/- 590.54, N = 3SE +/- 574.52, N = 3SE +/- 1082.91, N = 3SE +/- 520.47, N = 3SE +/- 540.11, N = 3256859255381255114254610253887253466252869252647252495247454247106215182-O3 -flto-O3 -march=native -flto-O2 -flto-O3 -march=native-O3-O2-Ofast -march=native-Ofast-O2 -march=native-Og-O1-O01. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.0Preset: Thorough-O3 -march=native-O2 -march=native-Ofast -march=native-O3 -march=native -flto-O1-O3 -flto-O3-Og-O2-O2 -flto-O0-Ofast3691215SE +/- 0.0151, N = 3SE +/- 0.0180, N = 3SE +/- 0.0113, N = 3SE +/- 0.0219, N = 3SE +/- 0.0228, N = 3SE +/- 0.0190, N = 3SE +/- 0.0093, N = 3SE +/- 0.0166, N = 3SE +/- 0.0276, N = 3SE +/- 0.0141, N = 3SE +/- 0.0078, N = 3SE +/- 0.0209, N = 39.36019.36629.39489.39919.77349.80109.80719.81249.81589.82119.82709.8377-O3 -march=native-march=native-Ofast -march=native-O3 -march=native-O1-O3-O3-Og-O0-Ofast1. (CXX) g++ options: -O2 -flto -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU-Og-O1-O0-O3 -march=native-O3 -flto-Ofast -march=native-O3 -march=native -flto-O2 -flto-O2-Ofast-O3-O2 -march=native3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.5911.0311.0711.2011.2211.2211.2311.2311.2411.2411.2411.24-Og - MIN: 10.49-O1 - MIN: 10.93-O0 - MIN: 10.97MIN: 11.11-flto - MIN: 11.13-Ofast - MIN: 11.13-flto - MIN: 11.13-flto - MIN: 11.13MIN: 11.13-Ofast - MIN: 11.14MIN: 11.14MIN: 11.131. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3-O3 -march=native -flto-Ofast -march=native-O3 -march=native-O3 -flto-Ofast-O3-O2 -march=native-O2 -flto-O2-O1-Og-O0510152025SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.008, N = 3SE +/- 0.008, N = 3SE +/- 0.015, N = 3SE +/- 0.019, N = 3SE +/- 0.005, N = 3SE +/- 0.008, N = 3SE +/- 0.074, N = 3SE +/- 0.092, N = 4SE +/- 0.068, N = 3SE +/- 0.039, N = 35.3865.4565.4735.8185.8615.8846.9657.1067.3577.6758.48019.012-march=native -flto-Ofast -march=native-march=native-flto-Ofast-O2 -march=native-O2 -flto-O2-O1-Og-O01. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein-O3 -march=native -flto-O3 -march=native-Ofast -march=native-O2 -march=native-O1-O3-Og-O3 -flto-O0-Ofast-O2-O2 -flto246810SE +/- 0.030, N = 3SE +/- 0.026, N = 3SE +/- 0.063, N = 3SE +/- 0.053, N = 3SE +/- 0.028, N = 3SE +/- 0.048, N = 3SE +/- 0.034, N = 3SE +/- 0.034, N = 3SE +/- 0.112, N = 3SE +/- 0.103, N = 12SE +/- 0.124, N = 12SE +/- 0.071, N = 38.6198.5138.5108.4998.1848.1358.1218.1098.0938.0328.0128.001-O3 -march=native -flto-O3 -march=native-Ofast -march=native-march=native-O1-O3-Og-O3 -flto-O0-Ofast-flto1. (CXX) g++ options: -O2 -pthread -lm

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p-O3 -march=native -flto-O3 -flto-O3 -march=native-O2 -flto-O2 -march=native-Ofast -march=native-O0-Og-O1-O2-O3-Ofast306090120150SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.24, N = 3SE +/- 0.23, N = 3SE +/- 0.28, N = 3SE +/- 0.20, N = 3SE +/- 0.15, N = 3SE +/- 0.10, N = 3142.04140.48140.40140.32139.61138.97137.47137.30137.23137.13136.89136.73-march=native -flto-flto-march=native-flto-march=native-Ofast -march=native-O0-Og-O1-Ofast1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O2 -march=native-O0-Ofast -march=native-O2 -flto-Og-O3 -flto-O2-O1-O3-Ofast4080120160200SE +/- 1.49, N = 10SE +/- 0.22, N = 3SE +/- 0.61, N = 3SE +/- 0.42, N = 3SE +/- 0.20, N = 3SE +/- 0.36, N = 3SE +/- 0.10, N = 3SE +/- 0.30, N = 3SE +/- 0.33, N = 3SE +/- 1.54, N = 9SE +/- 0.14, N = 3SE +/- 0.39, N = 3198.73195.12195.03193.96193.79193.70193.57193.53192.87191.41191.35190.49-march=native-march=native -flto-march=native-O0-Ofast -march=native-flto-Og-flto-O1-Ofast1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 3.0Preset: Medium-O3 -march=native-Ofast -march=native-O2 -march=native-O3 -march=native -flto-O2-Og-O3-O1-O3 -flto-O2 -flto-O0-Ofast0.98521.97042.95563.94084.926SE +/- 0.0026, N = 3SE +/- 0.0057, N = 3SE +/- 0.0054, N = 3SE +/- 0.0030, N = 3SE +/- 0.0027, N = 3SE +/- 0.0062, N = 3SE +/- 0.0036, N = 3SE +/- 0.0112, N = 3SE +/- 0.0088, N = 3SE +/- 0.0076, N = 3SE +/- 0.0088, N = 3SE +/- 0.0107, N = 34.21534.22964.23074.23594.34704.35114.36034.36064.36864.36974.37154.3786-O3 -march=native-Ofast -march=native-march=native-O3 -march=native-Og-O3-O1-O3-O0-Ofast1. (CXX) g++ options: -O2 -flto -pthread

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O2 -march=native-Ofast -march=native-O2 -flto-O3 -flto-Og-O1-O0-O2-O3-Ofast4080120160200SE +/- 0.27, N = 3SE +/- 0.27, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.22, N = 3SE +/- 0.21, N = 3SE +/- 0.11, N = 3SE +/- 0.29, N = 3SE +/- 0.16, N = 3SE +/- 0.11, N = 3SE +/- 0.36, N = 3SE +/- 0.18, N = 3166.43165.01164.42163.33162.81162.78161.02160.73160.72160.47159.86159.74-march=native-march=native -flto-march=native-Ofast -march=native-flto-flto-Og-O1-O0-Ofast1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O0-O3 -flto-O1-O2-Og-Ofast -march=native-O3-Ofast4080120160200SE +/- 0.17, N = 3SE +/- 0.10, N = 3SE +/- 0.48, N = 3SE +/- 0.74, N = 3SE +/- 0.43, N = 3SE +/- 0.21, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.06, N = 3SE +/- 0.26, N = 3SE +/- 0.10, N = 3SE +/- 0.36, N = 3204.96199.57198.90198.86198.38198.37198.18197.89197.85197.26196.43195.23-march=native-march=native -flto-march=native-flto-O0-flto-O1-Og-Ofast -march=native-Ofast1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p-O3 -march=native-O3 -march=native -flto-O2 -march=native-O2 -flto-O3 -flto-O0-O1-Og-O2-Ofast -march=native-O3-Ofast60120180240300SE +/- 0.60, N = 3SE +/- 0.37, N = 3SE +/- 0.18, N = 3SE +/- 0.29, N = 3SE +/- 0.32, N = 3SE +/- 0.45, N = 3SE +/- 0.19, N = 3SE +/- 0.29, N = 3SE +/- 0.15, N = 3SE +/- 0.39, N = 3SE +/- 0.11, N = 3SE +/- 0.41, N = 3279.12275.57274.64274.39273.85273.72271.99271.78271.66271.62270.19267.46-march=native-march=native -flto-march=native-flto-flto-O0-O1-Og-Ofast -march=native-Ofast1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.5