Compiler Optimization Levels

Intel Core i9-11900K testing with an ASUS ROG MAXIMUS XIII HERO (0707 BIOS) and AMD Radeon VII 16GB on Fedora 34 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2106106-IB-COMPILERO67
Result Identifier | Date Run | Test Duration
-O3 -march=native | June 09 2021 | 8 Hours, 5 Minutes
-O1 | June 10 2021 | 9 Hours, 19 Minutes
Average test duration: 8 Hours, 42 Minutes


HTML result view exported from: https://openbenchmarking.org/result/2106106-IB-COMPILERO67&rdt&grr.

System Details (identical for the -O3 -march=native and -O1 runs)

Processor: Intel Core i9-11900K @ 5.10GHz (8 Cores / 16 Threads)
Motherboard: ASUS ROG MAXIMUS XIII HERO (0707 BIOS)
Chipset: Intel Tiger Lake-H
Memory: 32GB
Disk: 2000GB Corsair Force MP600 + 257GB Flash Drive
Graphics: AMD Radeon VII 16GB (1801/1000MHz)
Audio: Intel Tiger Lake-H HD Audio
Monitor: ASUS MG28U
Network: 2 x Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
OS: Fedora 34
Kernel: 5.12.9-300.fc34.x86_64 (x86_64)
Desktop: GNOME Shell 40.1
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 21.1.1 (LLVM 12.0.0)
Compiler: GCC 11.1.1 20210531
File-System: btrfs
Screen Resolution: 3840x2160

Kernel Details: Transparent Huge Pages: madvise

Environment Details:
- -O3 -march=native: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- -O1: CXXFLAGS=-O1 CFLAGS=-O1

Compiler Details: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,lto --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver

Disk Details: NONE / compress=zstd:1,relatime,rw,seclabel,space_cache,ssd,subvol=/home,subvolid=256 / Block Size: 4096

Processor Details: Scaling Governor: intel_pstate powersave - CPU Microcode: 0x3c - Thermald 2.4.4

Python Details: Python 3.9.5

Security Details: SELinux + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected
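The Environment Details entry records that the two runs differ only in the CFLAGS and CXXFLAGS values exported before the tests were built. A minimal Python sketch of preparing those two environments follows. The helper name `build_environment` and the idea of driving the suite from Python are assumptions made for illustration; only the flag values and the `phoronix-test-suite benchmark` command come from this result file.

```python
import os

# Flag sets copied from the Environment Details entry of this result file.
CONFIGS = {
    "-O3 -march=native": "-O3 -march=native",
    "-O1": "-O1",
}

# Command quoted near the top of this result file.
PTS_COMMAND = ["phoronix-test-suite", "benchmark", "2106106-IB-COMPILERO67"]


def build_environment(flags, base_env=None):
    """Return a copy of the environment with CFLAGS and CXXFLAGS set to `flags`."""
    env = dict(os.environ if base_env is None else base_env)
    env["CFLAGS"] = flags
    env["CXXFLAGS"] = flags
    return env


if __name__ == "__main__":
    for label, flags in CONFIGS.items():
        env = build_environment(flags)
        # Hypothetical next step (not executed here):
        #   subprocess.run(PTS_COMMAND, env=env)
        print(f"{label}: CFLAGS={env['CFLAGS']!r} CXXFLAGS={env['CXXFLAGS']!r}")
```

Several tests report their own fixed options (for example `-O2` or `-O3` in the per-test compiler option notes), so the exported flags are not necessarily the only optimization flags a given test is built with.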

Result Overview

Test | -O3 -march=native | -O1
qe: AUSURF112 | 2609.02 | 2525.86
lammps: 20k Atoms | 8.737 | 8.345
cryptopp: All Algorithms | 2346.359074 | 2114.624613
cryptopp: Keyed Algorithms | 924.212911 | 751.481521
cryptopp: Integer + Elliptic Curve Public Key Algorithms | 7194.857104 | 6862.786620
gmpbench: Total Time | 6171.8 |
compress-zstd: 3, Long Mode - Decompression Speed | 5346.0 | 5215.3
compress-zstd: 3, Long Mode - Compression Speed | 1451.0 | 1542.8
compress-zstd: 8, Long Mode - Decompression Speed | 5542.9 | 5385.7
compress-zstd: 8, Long Mode - Compression Speed | 285.9 | 281.5
hmmer: Pfam Database Search | 99.484 | 103.742
kripke | 33544357 | 33790753
botan: CAST-256 - Decrypt | 168.851 | 149.807
botan: CAST-256 | 168.756 | 149.439
sysbench: CPU | 34770.14 | 34882.14
c-ray: Total Time - 4K, 16 Rays Per Pixel | 47.335 | 128.907
mrbayes: Primate Phylogeny Analysis | 83.430 | 88.533
caffe: GoogleNet - CPU - 100 | 83625 | 84729
onednn: Recurrent Neural Network Training - f32 - CPU | 3165.60 | 3133.28
onednn: Recurrent Neural Network Inference - f32 - CPU | 1876.42 | 1854.40
cryptopp: Unkeyed Algorithms | 491.454981 | 472.947089
pjsip: INVITE | 5060 | 4993
svt-hevc: 1 - Bosphorus 1080p | 9.48 | 9.20
pjsip: OPTIONS, Stateful | 9375 | 9333
mt-dgemm: Sustained Floating-Point Rate | 3.604641 | 3.922224
graphics-magick: Sharpen | 195 | 162
graphics-magick: Noise-Gaussian | 310 | 306
graphics-magick: Enhanced | 270 | 218
graphics-magick: Resizing | 1222 | 1021
graphics-magick: Swirl | 689 | 592
graphics-magick: HWB Color Space | 1285 | 1207
graphics-magick: Rotate | 1094 | 1078
compress-zstd: 19 - Decompression Speed | 4506.5 | 4406.4
compress-zstd: 19 - Compression Speed | 35.4 | 35.4
astcenc: Exhaustive | 51.4853 | 53.2528
mnn: inception-v3 | 22.513 | 22.942
mnn: mobilenet-v1-1.0 | 1.883 | 1.921
mnn: MobileNetV2_224 | 1.916 | 1.982
mnn: resnet-v2-50 | 19.224 | 19.507
mnn: SqueezeNetV1.0 | 3.748 | 3.848
basis: UASTC Level 3 | 54.586 | 54.557
compress-zstd: 19, Long Mode - Decompression Speed | 4540.6 | 4506.0
compress-zstd: 19, Long Mode - Compression Speed | 32.8 | 32.9
ncnn: CPU - regnety_400m | 8.57 | 9.73
ncnn: CPU - squeezenet_ssd | 15.29 | 16.18
ncnn: CPU - yolov4-tiny | 20.21 | 21.26
ncnn: CPU - resnet50 | 18.23 | 22.29
ncnn: CPU - alexnet | 9.64 | 9.62
ncnn: CPU - resnet18 | 11.08 | 11.47
ncnn: CPU - vgg16 | 54.36 | 54.91
ncnn: CPU - googlenet | 10.09 | 11.40
ncnn: CPU - blazeface | 1.15 | 1.24
ncnn: CPU - efficientnet-b0 | 4.24 | 5.24
ncnn: CPU - mnasnet | 2.22 | 3.17
ncnn: CPU - shufflenet-v2 | 3.26 | 3.45
ncnn: CPU-v3-v3 - mobilenet-v3 | 2.49 | 3.19
ncnn: CPU-v2-v2 - mobilenet-v2 | 3.21 | 4.19
ncnn: CPU - mobilenet | 11.76 | 15.02
sqlite-speedtest: Timed Time - Size 1,000 | 46.087 | 49.011
x265: Bosphorus 4K | 16.02 | 15.72
caffe: AlexNet - CPU - 100 | 36558 | 36622
compress-zstd: 8 - Decompression Speed | 5189.9 | 5075.8
compress-zstd: 8 - Compression Speed | 192.6 | 189.2
botan: AES-256 - Decrypt | 8412.961 | 8885.129
botan: AES-256 | 8401.852 | 8879.330
compress-zstd: 3 - Decompression Speed | 4997.8 | 4847.5
compress-zstd: 3 - Compression Speed | 2731.5 | 2568.0
clomp: Static OMP Speedup | 4.8 | 5.1
stockfish: Total Time | 29443112 | 29448017
espeak: Text-To-Speech Synthesis | 21.765 | 24.001
botan: ChaCha20Poly1305 - Decrypt | 1010.787 | 1004.647
botan: ChaCha20Poly1305 | 1012.732 | 1019.913
botan: Blowfish - Decrypt | 553.519 | 532.560
botan: Blowfish | 552.463 | 533.956
botan: Twofish - Decrypt | 451.660 | 427.255
botan: Twofish | 464.472 | 430.951
botan: KASUMI - Decrypt | 112.027 | 106.478
botan: KASUMI | 115.816 | 108.276
basis: UASTC Level 2 | 29.138 | 29.108
coremark: CoreMark Size 666 - Iterations Per Second | 434724.849744 | 366951.484290
postmark: Disk Transaction Performance | 9496 | 9259
tjbench: Decompression Throughput | 271.676664 | 260.256611
chia-vdf: Square Plain C++ | 208400 | 209233
aobench: 2048 x 2048 - Total Time | 21.556 | 24.605
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 4.98281 | 4.97288
basis: ETC1S | 20.808 | 20.845
chia-vdf: Square Assembly Optimized | 250633 | 247933
liquid-dsp: 1 - 256 - 57 | 99844333 | 88411000
liquid-dsp: 4 - 256 - 57 | 363760000 | 316710000
liquid-dsp: 2 - 256 - 57 | 188003333 | 162046667
liquid-dsp: 16 - 256 - 57 | 722756667 | 672296667
liquid-dsp: 8 - 256 - 57 | 687846667 | 595816667
dav1d: Summer Nature 4K | 195.94 | 185.95
encode-wavpack: WAV To WavPack | 11.098 | 11.132
tnn: CPU - MobileNet v2 | 230.113 | 243.162
tnn: CPU - SqueezeNet v1.1 | 227.455 | 235.963
onednn: IP Shapes 1D - f32 - CPU | 4.03781 | 4.04828
redis: SET | 2956462.00 | 3982525.83
redis: GET | 4049394.67 | 3982525.83
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 3.52485 | 3.52499
svt-vp9: VMAF Optimized - Bosphorus 1080p | 198.73 | 191.41
encode-flac: WAV To FLAC | 5.937 | 6.590
encode-opus: WAV To Opus Encode | 5.595 | 6.828
pjsip: OPTIONS, Stateless | 254610 | 247106
astcenc: Thorough | 9.3601 | 9.7734
onednn: IP Shapes 3D - f32 - CPU | 11.2002 | 11.0289
x265: Bosphorus 1080p | 67.85 | 67.85
smallpt: Global Illumination Renderer; 128 Samples | 8.401 | 9.133
encode-mp3: WAV To MP3 | 5.473 | 7.675
onednn: Convolution Batch Shapes Auto - f32 - CPU | 14.2754 | 14.1700
basis: UASTC Level 0 | 6.106 | 6.114
svt-hevc: 7 - Bosphorus 1080p | 140.40 | 137.23
astcenc: Medium | 4.2153 | 4.3606
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 166.43 | 160.73
lammps: Rhodopsin Protein | 8.513 | 8.184
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 204.96 | 198.18
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 4.28984 | 4.28224
svt-hevc: 10 - Bosphorus 1080p | 279.12 | 271.99
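Because the results mix "More Is Better" and "Fewer Is Better" units, comparing the two builds means normalizing each result to a ratio first. The Python sketch below does that for a handful of values copied from this result file and summarizes them with a geometric mean. The selection of tests is arbitrary and the helper names are illustrative assumptions, so the printed mean describes only these five rows, not the whole result file.

```python
from math import exp, log

# (test, higher_is_better, -O3 -march=native result, -O1 result),
# values copied from this result file; the selection is arbitrary.
RESULTS = [
    ("C-Ray 4K, 16 rays per pixel (s)", False, 47.335, 128.907),
    ("Crypto++ Keyed Algorithms (MiB/s)", True, 924.212911, 751.481521),
    ("NCNN mobilenet (ms)", False, 11.76, 15.02),
    ("Botan AES-256 (MiB/s)", True, 8401.852, 8879.330),
    ("ACES DGEMM (GFLOP/s)", True, 3.604641, 3.922224),
]


def speedup(higher_is_better, o3, o1):
    """Return how many times faster the -O3 -march=native build is than -O1."""
    return o3 / o1 if higher_is_better else o1 / o3


def geometric_mean(values):
    """Geometric mean of positive numbers, computed via logarithms."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


ratios = {name: speedup(hib, o3, o1) for name, hib, o3, o1 in RESULTS}
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.3f}x")
print(f"geometric mean over {len(ratios)} tests: {geometric_mean(ratios.values()):.3f}x")
```

A ratio above 1 means the -O3 -march=native build was faster, and a ratio below 1 means the -O1 build was faster, as with Botan AES-256 and ACES DGEMM here.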

Quantum ESPRESSO

Input: AUSURF112

Quantum ESPRESSO 6.7 (Seconds, Fewer Is Better)
-O3 -march=native: 2609.02 (SE +/- 5.73, N = 3)
-O1: 2525.86 (SE +/- 27.81, N = 5)
(F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
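Each result carries a standard error (SE) and a run count (N), which gives a rough way to judge whether a gap between the two builds is larger than run-to-run noise. The Python sketch below expresses the Quantum ESPRESSO difference in units of the combined standard error. This is an informal heuristic chosen for illustration, not something the Phoronix Test Suite reports, and with N = 3 and N = 5 it should not be read as a rigorous significance test.

```python
from math import sqrt


def difference_in_standard_errors(mean_a, se_a, mean_b, se_b):
    """Return |mean_a - mean_b| divided by the combined standard error."""
    combined_se = sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# Quantum ESPRESSO AUSURF112, values from this result file (seconds).
z = difference_in_standard_errors(2609.02, 5.73, 2525.86, 27.81)
print(f"difference = {z:.2f} combined standard errors")
```

The same function can be applied to any other result pair in this file that reports an SE for both runs.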

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
-O3 -march=native: 8.737 (SE +/- 0.020, N = 3)
-O1: 8.345 (SE +/- 0.035, N = 3)
(CXX) g++ options: -O2 -pthread -lm

Crypto++

Test: All Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O3 -march=native: 2346.36 (SE +/- 1.51, N = 3)
-O1: 2114.62 (SE +/- 0.38, N = 3)
(CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Keyed Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O3 -march=native: 924.21 (SE +/- 0.64, N = 3)
-O1: 751.48 (SE +/- 0.51, N = 3)
(CXX) g++ options: -fPIC -pthread -pipe

Crypto++

Test: Integer + Elliptic Curve Public Key Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O3 -march=native: 7194.86 (SE +/- 1.75, N = 3)
-O1: 6862.79 (SE +/- 4.50, N = 3)
(CXX) g++ options: -fPIC -pthread -pipe

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 (GMPbench Score, More Is Better)
-O3 -march=native: 6171.8
(No -O1 result is listed for this test.)
(CC) gcc options: -O3 -march=native -lm

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 5346.0 (SE +/- 2.50, N = 15)
-O1: 5215.3 (SE +/- 8.30, N = 3)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 1451.0 (SE +/- 22.75, N = 15)
-O1: 1542.8 (SE +/- 12.97, N = 3)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 5542.9 (SE +/- 6.10, N = 15)
-O1: 5385.7 (SE +/- 9.52, N = 3)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 285.9 (SE +/- 2.25, N = 15)
-O1: 281.5 (SE +/- 2.78, N = 3)
(CC) gcc options: -pthread -lz

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.2 (Seconds, Fewer Is Better)
-O3 -march=native: 99.48 (SE +/- 0.04, N = 3)
-O1: 103.74 (SE +/- 0.08, N = 3)
(CC) gcc options: -pthread -lhmmer -leasel -lm -lmpi

Kripke

Kripke 1.2.4 (Throughput FoM, More Is Better)
-O3 -march=native: 33544357 (SE +/- 88809.55, N = 3)
-O1: 33790753 (SE +/- 73388.54, N = 3)
(CXX) g++ options: -O2 -fopenmp

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 168.85 (SE +/- 0.01, N = 3)
-O1: 149.81 (SE +/- 1.14, N = 15)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 168.76 (SE +/- 0.06, N = 3)
-O1: 149.44 (SE +/- 1.37, N = 15)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Sysbench

Test: CPU

Sysbench 1.0.20 (Events Per Second, More Is Better)
-O3 -march=native: 34770.14 (SE +/- 2.38, N = 3)
-O1: 34882.14 (SE +/- 6.87, N = 3)
(CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

C-Ray 1.1 (Seconds, Fewer Is Better)
-O3 -march=native: 47.34 (SE +/- 0.15, N = 3)
-O1: 128.91 (SE +/- 0.06, N = 3)
(CC) gcc options: -lm -lpthread -O3

Timed MrBayes Analysis

Primate Phylogeny Analysis

Timed MrBayes Analysis 3.2.7 (Seconds, Fewer Is Better)
-O3 -march=native: 83.43 (SE +/- 0.09, N = 3)
-O1: 88.53 (SE +/- 0.17, N = 3)
(CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msha -maes -mavx -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mrdrnd -mbmi -mbmi2 -madx -mmpx -mabm -O3 -std=c99 -pedantic -lm

Caffe

Model: GoogleNet - Acceleration: CPU - Iterations: 100

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
-O3 -march=native: 83625 (SE +/- 43.97, N = 3)
-O1: 84729 (SE +/- 10.17, N = 3)
(CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 3165.60 (SE +/- 1.32, N = 3; MIN: 3154.25)
-O1: 3133.28 (SE +/- 2.80, N = 3; MIN: 3120.48)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 1876.42 (SE +/- 1.46, N = 3; MIN: 1865.18)
-O1: 1854.40 (SE +/- 4.14, N = 3; MIN: 1837.76)
(CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Crypto++

Test: Unkeyed Algorithms

Crypto++ 8.2 (MiB/second, More Is Better)
-O3 -march=native: 491.45 (SE +/- 0.05, N = 3)
-O1: 472.95 (SE +/- 0.06, N = 3)
(CXX) g++ options: -fPIC -pthread -pipe

PJSIP

Method: INVITE

PJSIP 2.11 (Responses Per Second, More Is Better)
-O3 -march=native: 5060 (SE +/- 15.24, N = 3)
-O1: 4993 (SE +/- 45.51, N = 3)
(CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O3 -march=native: 9.48 (SE +/- 0.01, N = 3)
-O1: 9.20 (SE +/- 0.01, N = 3)
(CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

PJSIP

Method: OPTIONS, Stateful

PJSIP 2.11 (Responses Per Second, More Is Better)
-O3 -march=native: 9375 (SE +/- 7.69, N = 3)
-O1: 9333 (SE +/- 4.41, N = 3)
(CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
-O3 -march=native: 3.604641 (SE +/- 0.018800, N = 3)
-O1: 3.922224 (SE +/- 0.023378, N = 3)
(CC) gcc options: -O3 -march=native -fopenmp

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 195 (SE +/- 0.58, N = 3)
-O1: 162 (SE +/- 0.58, N = 3)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 310
-O1: 306
SE +/- 0.88, N = 3 (listed for one run only; the source does not indicate which)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 270
-O1: 218
SE +/- 0.33, N = 3 (listed for one run only; the source does not indicate which)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 1222 (SE +/- 2.33, N = 3)
-O1: 1021 (SE +/- 1.00, N = 3)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 689 (SE +/- 2.67, N = 3)
-O1: 592 (SE +/- 1.00, N = 3)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 1285 (SE +/- 1.20, N = 3)
-O1: 1207 (SE +/- 1.33, N = 3)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33 (Iterations Per Minute, More Is Better)
-O3 -march=native: 1094 (SE +/- 2.03, N = 3)
-O1: 1078 (SE +/- 1.20, N = 3)
(CC) gcc options: -fopenmp -pthread -ljpeg -lz -lm -lpthread

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 4506.5 (SE +/- 18.10, N = 3)
-O1: 4406.4 (SE +/- 6.02, N = 4)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 35.4 (SE +/- 0.48, N = 3)
-O1: 35.4 (SE +/- 0.43, N = 4)
(CC) gcc options: -pthread -lz

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 3.0 (Seconds, Fewer Is Better)
-O3 -march=native: 51.49 (SE +/- 0.04, N = 3)
-O1: 53.25 (SE +/- 0.02, N = 3)
(CXX) g++ options: -O2 -flto -pthread

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O3 -march=native: 22.51 (SE +/- 0.02, N = 3; MIN: 22.19 / MAX: 27.64)
-O1: 22.94 (SE +/- 0.01, N = 3; MIN: 22.65 / MAX: 29.53)
(CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O3 -march=native: 1.883 (SE +/- 0.001, N = 3; MIN: 1.85 / MAX: 7.81)
-O1: 1.921 (SE +/- 0.004, N = 3; MIN: 1.89 / MAX: 9.19)
(CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O3 -march=native: 1.916 (SE +/- 0.008, N = 3; MIN: 1.87 / MAX: 6.22)
-O1: 1.982 (SE +/- 0.011, N = 3; MIN: 1.93 / MAX: 7.73)
(CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O3 -march=native: 19.22 (SE +/- 0.02, N = 3; MIN: 19.06 / MAX: 24.92)
-O1: 19.51 (SE +/- 0.02, N = 3; MIN: 19.33 / MAX: 23.75)
(CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.3 (ms, Fewer Is Better)
-O3 -march=native: 3.748 (SE +/- 0.024, N = 3; MIN: 3.64 / MAX: 10.5)
-O1: 3.848 (SE +/- 0.019, N = 3; MIN: 3.75 / MAX: 8.08)
(CXX) g++ options: -O3 -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -O2 -rdynamic -pthread -ldl

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.13 (Seconds, Fewer Is Better)
-O3 -march=native: 54.59 (SE +/- 0.00, N = 3)
-O1: 54.56 (SE +/- 0.02, N = 3)
(CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 4540.6 (SE +/- 15.31, N = 3)
-O1: 4506.0 (SE +/- 3.19, N = 3)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 32.8 (SE +/- 0.19, N = 3)
-O1: 32.9 (SE +/- 0.15, N = 3)
(CC) gcc options: -pthread -lz

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 8.57 (SE +/- 0.02, N = 3; MIN: 8.47 / MAX: 12.35)
-O1: 9.73 (SE +/- 0.05, N = 3; MIN: 9.55 / MAX: 14.41)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 15.29 (SE +/- 0.01, N = 3; MIN: 15.14 / MAX: 19)
-O1: 16.18 (SE +/- 0.03, N = 3; MIN: 16.02 / MAX: 19.89)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 20.21 (SE +/- 0.03, N = 3; MIN: 20.03 / MAX: 23.86)
-O1: 21.26 (SE +/- 0.05, N = 3; MIN: 20.97 / MAX: 27.08)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 18.23 (SE +/- 0.15, N = 3; MIN: 17.79 / MAX: 22.11)
-O1: 22.29 (SE +/- 0.03, N = 3; MIN: 22.02 / MAX: 27)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 9.64 (SE +/- 0.02, N = 3; MIN: 9.53 / MAX: 13.24)
-O1: 9.62 (SE +/- 0.01, N = 3; MIN: 9.5 / MAX: 13.21)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 11.08 (SE +/- 0.14, N = 3; MIN: 10.69 / MAX: 16.91)
-O1: 11.47 (SE +/- 0.01, N = 3; MIN: 11.34 / MAX: 15.37)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 54.36 (SE +/- 0.11, N = 3; MIN: 53.85 / MAX: 59.24)
-O1: 54.91 (SE +/- 0.09, N = 3; MIN: 54.36 / MAX: 58.94)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 10.09 (SE +/- 0.17, N = 3; MIN: 9.67 / MAX: 13.94)
-O1: 11.40 (SE +/- 0.02, N = 3; MIN: 11.29 / MAX: 14.99)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 1.15 (SE +/- 0.03, N = 3; MIN: 1.08 / MAX: 2)
-O1: 1.24 (SE +/- 0.01, N = 3; MIN: 1.21 / MAX: 5.59)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 4.24 (SE +/- 0.01, N = 3; MIN: 4.19 / MAX: 7.9)
-O1: 5.24 (SE +/- 0.01, N = 3; MIN: 5.17 / MAX: 8.84)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 2.22 (SE +/- 0.02, N = 3; MIN: 2.17 / MAX: 2.35)
-O1: 3.17 (SE +/- 0.01, N = 3; MIN: 3.14 / MAX: 6.8)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 3.26 (SE +/- 0.01, N = 3; MIN: 3.18 / MAX: 6.94)
-O1: 3.45 (SE +/- 0.01, N = 3; MIN: 3.39 / MAX: 7.07)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 2.49 (SE +/- 0.00, N = 3; MIN: 2.44 / MAX: 6.14)
-O1: 3.19 (SE +/- 0.01, N = 3; MIN: 3.16 / MAX: 4.05)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 3.21 (SE +/- 0.01, N = 3; MIN: 3.08 / MAX: 4.11)
-O1: 4.19 (SE +/- 0.01, N = 3; MIN: 4.06 / MAX: 7.81)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218 (ms, Fewer Is Better)
-O3 -march=native: 11.76 (SE +/- 0.06, N = 3; MIN: 11.54 / MAX: 15.41)
-O1: 15.02 (SE +/- 0.00, N = 3; MIN: 14.88 / MAX: 18.66)
(CXX) g++ options: -O2 -rdynamic -lgomp -lpthread

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 (Seconds, Fewer Is Better)
-O3 -march=native: 46.09 (SE +/- 0.15, N = 3)
-O1: 49.01 (SE +/- 0.26, N = 3)
(CC) gcc options: -ldl -lz -lpthread

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
-O3 -march=native: 16.02 (SE +/- 0.12, N = 3)
-O1: 15.72 (SE +/- 0.17, N = 4)
(CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Caffe

Model: AlexNet - Acceleration: CPU - Iterations: 100

Caffe 2020-02-13 (Milli-Seconds, Fewer Is Better)
-O3 -march=native: 36558 (SE +/- 51.83, N = 3)
-O1: 36622 (SE +/- 14.01, N = 3)
(CXX) g++ options: -fPIC -O2 -rdynamic -lboost_system -lboost_thread -lboost_filesystem -lboost_chrono -lboost_date_time -lboost_atomic -lglog -lgflags -lprotobuf -lpthread -lhdf5_cpp -lhdf5 -lhdf5_hl_cpp -lhdf5_hl -llmdb -lopenblas

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 5189.9 (SE +/- 15.26, N = 3)
-O1: 5075.8 (SE +/- 13.17, N = 3)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 192.6 (SE +/- 0.90, N = 3)
-O1: 189.2 (SE +/- 0.57, N = 3)
(CC) gcc options: -pthread -lz

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 8412.96 (SE +/- 5.34, N = 3)
-O1: 8885.13 (SE +/- 2.06, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 8401.85 (SE +/- 5.18, N = 3)
-O1: 8879.33 (SE +/- 0.64, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Zstd Compression

Compression Level: 3 - Decompression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 4997.8 (SE +/- 19.31, N = 3)
-O1: 4847.5 (SE +/- 8.75, N = 3)
(CC) gcc options: -pthread -lz

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.5.0 (MB/s, More Is Better)
-O3 -march=native: 2731.5 (SE +/- 14.92, N = 3)
-O1: 2568.0 (SE +/- 8.18, N = 3)
(CC) gcc options: -pthread -lz

CLOMP

Static OMP Speedup

CLOMP 1.2 (Speedup, More Is Better)
-O3 -march=native: 4.8 (SE +/- 0.07, N = 3)
-O1: 5.1 (SE +/- 0.06, N = 3)
(CC) gcc options: -fopenmp -O3 -lm

Stockfish

Total Time

Stockfish 13 (Nodes Per Second, More Is Better)
-O3 -march=native: 29443112 (SE +/- 193823.90, N = 3)
-O1: 29448017 (SE +/- 371064.83, N = 3)
(CXX) g++ options: -lgcov -m64 -lpthread -O3 -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -fprofile-use -fno-peel-loops -fno-tracer -flto=jobserver

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

eSpeak-NG Speech Engine 20200907 (Seconds, Fewer Is Better)
-O3 -march=native: 21.77 (SE +/- 0.06, N = 4; flags: -O3 -march=native)
-O1: 24.00 (SE +/- 0.07, N = 4; flags: -O1)
1. (CC) gcc options: -std=c99 -lpthread -lm

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 1010.79 (SE +/- 0.23, N = 3)
-O1: 1004.65 (SE +/- 1.73, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 1012.73 (SE +/- 0.46, N = 3)
-O1: 1019.91 (SE +/- 1.88, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 553.52 (SE +/- 0.26, N = 3)
-O1: 532.56 (SE +/- 1.04, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 552.46 (SE +/- 0.20, N = 3)
-O1: 533.96 (SE +/- 0.93, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 451.66 (SE +/- 0.62, N = 3)
-O1: 427.26 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 464.47 (SE +/- 0.31, N = 3)
-O1: 430.95 (SE +/- 0.19, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 112.03 (SE +/- 0.05, N = 3)
-O1: 106.48 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

Botan 2.17.3 (MiB/s, More Is Better)
-O3 -march=native: 115.82 (SE +/- 0.01, N = 3)
-O1: 108.28 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.13 (Seconds, Fewer Is Better)
-O3 -march=native: 29.14 (SE +/- 0.08, N = 3)
-O1: 29.11 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0 (Iterations/Sec, More Is Better)
-O3 -march=native: 434724.85 (SE +/- 533.17, N = 3; flags: -O3 -march=native)
-O1: 366951.48 (SE +/- 661.73, N = 3; flags: -O1)
1. (CC) gcc options: -O2 -lrt" -lrt

PostMark

Disk Transaction Performance

PostMark 1.51 (TPS, More Is Better)
-O3 -march=native: 9496 (flags: -march=native)
-O1: 9259 (flags: -O1)
SE +/- 118.67, N = 3 (the chart reports a single SE value)
1. (CC) gcc options: -O3

libjpeg-turbo tjbench

Test: Decompression Throughput

libjpeg-turbo tjbench 2.1.0 (Megapixels/sec, More Is Better)
-O3 -march=native: 271.68 (SE +/- 0.45, N = 3; flags: -march=native)
-O1: 260.26 (SE +/- 0.31, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -rdynamic

Chia Blockchain VDF

Test: Square Plain C++

Chia Blockchain VDF 1.0.1 (IPS, More Is Better)
-O3 -march=native: 208400 (SE +/- 57.74, N = 3)
-O1: 209233 (SE +/- 120.19, N = 3)
1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

AOBench

Size: 2048 x 2048 - Total Time

AOBench (Seconds, Fewer Is Better)
-O3 -march=native: 21.56 (SE +/- 0.01, N = 3; flags: -march=native)
-O1: 24.61 (SE +/- 0.04, N = 3; flags: -O1)
1. (CC) gcc options: -lm -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 4.98281 (SE +/- 0.01117, N = 3; MIN: 3.81)
-O1: 4.97288 (SE +/- 0.01654, N = 3; flags: -O1; MIN: 3.81)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Basis Universal

Settings: ETC1S

Basis Universal 1.13 (Seconds, Fewer Is Better)
-O3 -march=native: 20.81 (SE +/- 0.02, N = 3)
-O1: 20.85 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

Chia Blockchain VDF

Test: Square Assembly Optimized

Chia Blockchain VDF 1.0.1 (IPS, More Is Better)
-O3 -march=native: 250633 (SE +/- 1105.04, N = 3)
-O1: 247933 (SE +/- 1471.21, N = 3)
1. (CXX) g++ options: -flto -no-pie -lgmpxx -lgmp -lboost_system -pthread

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 99844333 (SE +/- 14836.14, N = 3; flags: -march=native)
-O1: 88411000 (SE +/- 6806.86, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 363760000 (SE +/- 1410968.93, N = 3; flags: -march=native)
-O1: 316710000 (SE +/- 132035.35, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 188003333 (SE +/- 66416.20, N = 3; flags: -march=native)
-O1: 162046667 (SE +/- 601728.99, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 722756667 (SE +/- 134824.99, N = 3; flags: -march=native)
-O1: 672296667 (SE +/- 328295.26, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
-O3 -march=native: 687846667 (SE +/- 689597.31, N = 3; flags: -march=native)
-O1: 595816667 (SE +/- 736168.76, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

dav1d

Video Input: Summer Nature 4K

dav1d 0.9.0 (FPS, More Is Better)
-O3 -march=native: 195.94 (SE +/- 0.19, N = 3; flags: -O3 -march=native; MIN: 181.35 / MAX: 208.71)
-O1: 185.95 (SE +/- 0.05, N = 3; flags: -O1; MIN: 169.98 / MAX: 195.75)
1. (CC) gcc options: -pthread -lm

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3 (Seconds, Fewer Is Better)
-O3 -march=native: 11.10 (SE +/- 0.00, N = 5; flags: -O3 -march=native)
-O1: 11.13 (SE +/- 0.01, N = 5; flags: -O1)
1. (CXX) g++ options: -rdynamic

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 (ms, Fewer Is Better)
-O3 -march=native: 230.11 (SE +/- 0.06, N = 3; flags: -O3 -march=native; MIN: 229.52 / MAX: 232.81)
-O1: 243.16 (SE +/- 0.20, N = 3; flags: -O1; MIN: 241.63 / MAX: 246.21)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3 (ms, Fewer Is Better)
-O3 -march=native: 227.46 (SE +/- 0.04, N = 3; flags: -O3 -march=native; MIN: 226.88 / MAX: 228.23)
-O1: 235.96 (SE +/- 0.15, N = 3; flags: -O1; MIN: 234.76 / MAX: 237.84)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O2 -rdynamic -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 4.03781 (SE +/- 0.00473, N = 3; MIN: 3.92)
-O1: 4.04828 (SE +/- 0.00076, N = 3; flags: -O1; MIN: 3.91)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Redis

Test: SET

Redis 6.0.9 (Requests Per Second, More Is Better)
-O3 -march=native: 2956462.00 (SE +/- 33577.98, N = 3; flags: -march=native)
-O1: 2962660.83 (SE +/- 19439.73, N = 3; flags: -O1)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

Redis 6.0.9 (Requests Per Second, More Is Better)
-O3 -march=native: 4049394.67 (SE +/- 18099.88, N = 3; flags: -march=native)
-O1: 3982525.83 (SE +/- 33158.80, N = 3; flags: -O1)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 3.52485 (SE +/- 0.00042, N = 3; MIN: 3.46)
-O1: 3.52499 (SE +/- 0.00163, N = 3; flags: -O1; MIN: 3.45)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O3 -march=native: 198.73 (SE +/- 1.49, N = 10; flags: -march=native)
-O1: 191.41 (SE +/- 1.54, N = 9; flags: -O1)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

FLAC Audio Encoding

WAV To FLAC

FLAC Audio Encoding 1.3.2 (Seconds, Fewer Is Better)
-O3 -march=native: 5.937 (SE +/- 0.002, N = 5; flags: -O3 -march=native)
-O1: 6.590 (SE +/- 0.004, N = 5; flags: -O1)
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1 (Seconds, Fewer Is Better)
-O3 -march=native: 5.595 (SE +/- 0.010, N = 5; flags: -O3 -march=native)
-O1: 6.828 (SE +/- 0.004, N = 5; flags: -O1)
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

PJSIP

Method: OPTIONS, Stateless

PJSIP 2.11 (Responses Per Second, More Is Better)
-O3 -march=native: 254610 (SE +/- 711.03, N = 3; flags: -O3 -march=native)
-O1: 247106 (SE +/- 520.47, N = 3; flags: -O1)
1. (CC) gcc options: -lstdc++ -lssl -lcrypto -lm -lrt -lpthread

ASTC Encoder

Preset: Thorough

ASTC Encoder 3.0 (Seconds, Fewer Is Better)
-O3 -march=native: 9.3601 (SE +/- 0.0151, N = 3; flags: -O3 -march=native)
-O1: 9.7734 (SE +/- 0.0228, N = 3; flags: -O1)
1. (CXX) g++ options: -O2 -flto -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 11.20 (SE +/- 0.00, N = 3; MIN: 11.11)
-O1: 11.03 (SE +/- 0.01, N = 3; flags: -O1; MIN: 10.93)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

x265

Video Input: Bosphorus 1080p

x265 3.4 (Frames Per Second, More Is Better)
-O3 -march=native: 67.85 (SE +/- 0.32, N = 3; flags: -O3 -march=native)
-O1: 67.85 (SE +/- 0.60, N = 3; flags: -O1)
1. (CXX) g++ options: -O2 -rdynamic -lpthread -lrt -ldl

Smallpt

Global Illumination Renderer; 128 Samples

Smallpt 1.0 (Seconds, Fewer Is Better)
-O3 -march=native: 8.401 (SE +/- 0.009, N = 3; flags: -march=native)
-O1: 9.133 (SE +/- 0.002, N = 3; flags: -O1)
1. (CXX) g++ options: -fopenmp -O3

LAME MP3 Encoding

WAV To MP3

LAME MP3 Encoding 3.100 (Seconds, Fewer Is Better)
-O3 -march=native: 5.473 (SE +/- 0.008, N = 3; flags: -march=native)
-O1: 7.675 (SE +/- 0.092, N = 4; flags: -O1)
1. (CC) gcc options: -O3 -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr -pipe -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 14.28 (SE +/- 0.01, N = 3; MIN: 14.18)
-O1: 14.17 (SE +/- 0.02, N = 3; flags: -O1; MIN: 14.04)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.13 (Seconds, Fewer Is Better)
-O3 -march=native: 6.106 (SE +/- 0.002, N = 3)
-O1: 6.114 (SE +/- 0.005, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O2 -rdynamic -lm -lpthread

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O3 -march=native: 140.40 (SE +/- 0.11, N = 3; flags: -march=native)
-O1: 137.23 (SE +/- 0.28, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

ASTC Encoder

Preset: Medium

ASTC Encoder 3.0 (Seconds, Fewer Is Better)
-O3 -march=native: 4.2153 (SE +/- 0.0026, N = 3; flags: -O3 -march=native)
-O1: 4.3606 (SE +/- 0.0112, N = 3; flags: -O1)
1. (CXX) g++ options: -O2 -flto -pthread

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O3 -march=native: 166.43 (SE +/- 0.27, N = 3; flags: -march=native)
-O1: 160.73 (SE +/- 0.29, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)
-O3 -march=native: 8.513 (SE +/- 0.026, N = 3; flags: -O3 -march=native)
-O1: 8.184 (SE +/- 0.028, N = 3; flags: -O1)
1. (CXX) g++ options: -O2 -pthread -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
-O3 -march=native: 204.96 (SE +/- 0.17, N = 3; flags: -march=native)
-O1: 198.18 (SE +/- 0.07, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -O2 -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 (ms, Fewer Is Better)
-O3 -march=native: 4.28984 (SE +/- 0.00621, N = 3; MIN: 4.17)
-O1: 4.28224 (SE +/- 0.00335, N = 3; flags: -O1; MIN: 4.17)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -O2 -pie -lpthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
-O3 -march=native: 279.12 (SE +/- 0.60, N = 3; flags: -march=native)
-O1: 271.99 (SE +/- 0.19, N = 3; flags: -O1)
1. (CC) gcc options: -O3 -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt
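
As a reading aid, the gap between the two runs in any result above can be expressed as a percentage. The sketch below is not part of the Phoronix Test Suite: the function name relative_gain and its higher_is_better parameter are hypothetical, and the sample values are copied from the Coremark and LAME MP3 results in this file. It ignores the reported standard errors, so small percentages may well be noise.

```python
def relative_gain(o3_native: float, o1: float, higher_is_better: bool) -> float:
    """Return how much better the -O3 -march=native run is than -O1, in percent.

    Positive means -O3 -march=native wins; negative means -O1 wins.
    (Hypothetical helper for illustration only.)
    """
    if higher_is_better:
        # Throughput-style metrics: a larger value is the better one.
        return (o3_native / o1 - 1.0) * 100.0
    # Time-style metrics: a smaller value is better, so invert the ratio.
    return (o1 / o3_native - 1.0) * 100.0


# Values taken from the result file above.
coremark = relative_gain(434724.85, 366951.48, higher_is_better=True)
lame = relative_gain(5.473, 7.675, higher_is_better=False)

print(f"Coremark (Iterations/Sec): {coremark:+.1f}%")
print(f"LAME MP3 (Seconds):        {lame:+.1f}%")
```

The direction flag matters because the file mixes "More Is Better" and "Fewer Is Better" metrics; dividing the wrong way round would flip the sign of every time-based result.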


Phoronix Test Suite v10.8.4