Core i7 4770K Xmas

Intel Core i7-4770K testing with a Gigabyte Z97-HD3 (F10c BIOS) and Gigabyte Intel HD 4600 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012256-HA-COREI747702&gru&sro&rro.

Core i7 4770K XmasProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i7-4770K @ 3.90GHz (4 Cores / 8 Threads)Gigabyte Z97-HD3 (F10c BIOS)Intel 4th Gen Core DRAM8GB120GB ADATA SU700Gigabyte Intel HD 4600 2GB (1250MHz)Intel Xeon E3-1200 v3/4thDELL S2409WRealtek RTL8111/8168/8411Ubuntu 20.105.8.0-31-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.5 Mesa 20.2.11.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 2.3 Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Core i7 4770K Xmassimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDcoremark: CoreMark Size 666 - Iterations Per Secondnode-web-tooling: clomp: Static OMP Speedupbrl-cad: VGR Performance Metricvkmark: 1920 x 1080onednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mhmmer: Pfam Database Searchbuild-ffmpeg: Time To Compilebuild2: Time To Compilebuild-eigen: Time To Compileencode-ape: WAV To APEencode-opus: WAV To Opus Encodesqlite-speedtest: Timed Time - Size 1,000encode-wavpack: WAV To WavPack1230.600.40.660.68144430.9675179.300.94237329211.173415.91425.977934.7261332.119813.549718.926531.026514.419112.956110419.25784.0410639.45730.728.1043510641.55907.307.3295139.3010.428.7411.418.3714.263.2929.65127.4630.4825.1061.8551.5840.6120.6939.2410.568.7711.518.5314.403.2529.81126.4830.4725.0963.2151.8340.2720.70137.774145.942388.39396.65614.0599.13280.18615.5480.600.40.660.68144054.7592079.291.14236529211.361715.65135.941464.6935532.126113.355718.953530.925014.830712.954210472.45922.9910420.85954.528.0976610647.05822.137.3356739.2910.668.6611.428.4214.323.1730.43127.5531.4225.1663.2752.9840.3020.5539.5510.508.7211.428.4114.283.1830.53128.3430.6425.6263.5553.0541.2720.75137.939146.295385.58497.28913.8739.13381.52415.5440.600.40.660.68144381.8916419.241.14216929011.593318.89285.998724.9978632.578013.292219.073730.947414.567212.901910683.55971.4410853.56041.908.8544310925.15981.807.2363139.7711.048.9111.448.6514.433.2830.19128.0230.7525.1764.1054.2341.5720.9039.6410.898.8411.498.4314.263.2430.37127.9730.6925.1163.3453.4542.1020.88137.903146.978388.95397.29913.9539.11380.10015.562OpenBenchmarking.org

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.600.600.601. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.090.180.270.360.45SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.40.40.41. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3210.14850.2970.44550.5940.7425SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.660.660.661. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.1530.3060.4590.6120.765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.680.680.681. (CXX) g++ options: -O3 -pthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second32130K60K90K120K150KSE +/- 511.87, N = 3SE +/- 324.47, N = 3SE +/- 201.95, N = 3144381.89144054.76144430.971. (CC) gcc options: -O2 -lrt" -lrt

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark3213691215SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 39.249.299.301. Nodejs v12.18.2

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup3210.24750.4950.74250.991.2375SE +/- 0.02, N = 12SE +/- 0.04, N = 9SE +/- 0.04, N = 111.11.10.91. (CC) gcc options: -fopenmp -O3 -lm

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric3219K18K27K36K45K4216942365423731. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 108032160120180240300SE +/- 0.67, N = 3SE +/- 0.33, N = 32902922921. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU3213691215SE +/- 0.02, N = 3SE +/- 0.12, N = 3SE +/- 0.05, N = 311.5911.3611.17MIN: 9.81MIN: 9.64MIN: 9.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU321510152025SE +/- 0.18, N = 9SE +/- 0.23, N = 3SE +/- 0.19, N = 318.8915.6515.91MIN: 17.56MIN: 14.56MIN: 14.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3211.34972.69944.04915.39886.7485SE +/- 0.02064, N = 3SE +/- 0.01650, N = 3SE +/- 0.00573, N = 35.998725.941465.97793MIN: 5.39MIN: 5.35MIN: 5.371. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3211.12452.2493.37354.4985.6225SE +/- 0.00347, N = 3SE +/- 0.01254, N = 3SE +/- 0.00568, N = 34.997864.693554.72613MIN: 4.39MIN: 4.15MIN: 4.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU321816243240SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 332.5832.1332.12MIN: 31.18MIN: 30.76MIN: 30.771. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU3213691215SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 313.2913.3613.55MIN: 11.34MIN: 11.47MIN: 11.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU321510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 319.0718.9518.93MIN: 17.71MIN: 17.8MIN: 17.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU321714212835SE +/- 0.25, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 330.9530.9331.03MIN: 29.08MIN: 29.31MIN: 29.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU32148121620SE +/- 0.19, N = 3SE +/- 0.16, N = 7SE +/- 0.08, N = 314.5714.8314.42MIN: 12.66MIN: 12.64MIN: 12.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3213691215SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 312.9012.9512.96MIN: 11.73MIN: 11.96MIN: 11.961. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU3212K4K6K8K10KSE +/- 103.29, N = 3SE +/- 143.78, N = 3SE +/- 107.95, N = 310683.510472.410419.2MIN: 10271.5MIN: 10130.6MIN: 10130.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU32113002600390052006500SE +/- 76.96, N = 3SE +/- 39.11, N = 3SE +/- 67.54, N = 35971.445922.995784.04MIN: 5746.72MIN: 5689.07MIN: 5574.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3212K4K6K8K10KSE +/- 89.46, N = 3SE +/- 55.37, N = 3SE +/- 126.69, N = 310853.510420.810639.4MIN: 10230.8MIN: 10122.1MIN: 10142.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU32113002600390052006500SE +/- 103.03, N = 3SE +/- 28.84, N = 3SE +/- 34.96, N = 36041.905954.525730.72MIN: 5795.41MIN: 5716.33MIN: 5610.111. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU321246810SE +/- 0.08022, N = 10SE +/- 0.00568, N = 3SE +/- 0.02583, N = 38.854438.097668.10435MIN: 7.41MIN: 7.36MIN: 7.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU3212K4K6K8K10KSE +/- 134.60, N = 3SE +/- 79.82, N = 3SE +/- 84.33, N = 310925.110647.010641.5MIN: 10351.4MIN: 10136.8MIN: 10074.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU32113002600390052006500SE +/- 45.12, N = 3SE +/- 36.61, N = 3SE +/- 36.51, N = 35981.805822.135907.30MIN: 5790.74MIN: 5676.41MIN: 5685.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU321246810SE +/- 0.02346, N = 3SE +/- 0.02785, N = 3SE +/- 0.01240, N = 37.236317.335677.32951MIN: 6.08MIN: 6.13MIN: 6.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet321918273645SE +/- 0.05, N = 3SE +/- 0.17, N = 3SE +/- 0.17, N = 339.7739.2939.30MIN: 38.23 / MAX: 54.99MIN: 37.69 / MAX: 53.53MIN: 37.67 / MAX: 52.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v23213691215SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.01, N = 311.0410.6610.42MIN: 9.49 / MAX: 23.36MIN: 9.15 / MAX: 22.02MIN: 9 / MAX: 26.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3321246810SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 38.918.668.74MIN: 7.68 / MAX: 24.14MIN: 7.66 / MAX: 11.93MIN: 7.43 / MAX: 21.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v23213691215SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 311.4411.4211.41MIN: 10.2 / MAX: 14.3MIN: 9.78 / MAX: 21.73MIN: 9.61 / MAX: 24.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet321246810SE +/- 0.14, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 38.658.428.37MIN: 7.52 / MAX: 22.41MIN: 7.43 / MAX: 10.7MIN: 7.43 / MAX: 11.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b032148121620SE +/- 0.07, N = 3SE +/- 0.17, N = 3SE +/- 0.06, N = 314.4314.3214.26MIN: 12.96 / MAX: 26.1MIN: 12.71 / MAX: 21.22MIN: 12.32 / MAX: 58.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface3210.74031.48062.22092.96123.7015SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 33.283.173.29MIN: 2.87 / MAX: 6.08MIN: 2.84 / MAX: 5.58MIN: 2.93 / MAX: 5.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet321714212835SE +/- 0.38, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 330.1930.4329.65MIN: 27.95 / MAX: 43.65MIN: 27.97 / MAX: 44.24MIN: 27.35 / MAX: 44.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16321306090120150SE +/- 0.28, N = 3SE +/- 0.12, N = 3SE +/- 0.47, N = 3128.02127.55127.46MIN: 124.75 / MAX: 141.83MIN: 124.34 / MAX: 142.98MIN: 123.62 / MAX: 155.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18321714212835SE +/- 0.49, N = 3SE +/- 0.35, N = 3SE +/- 0.48, N = 330.7531.4230.48MIN: 28.65 / MAX: 43.26MIN: 29.38 / MAX: 44.46MIN: 28.51 / MAX: 48.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet321612182430SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 325.1725.1625.10MIN: 23.93 / MAX: 36.81MIN: 23.92 / MAX: 36.39MIN: 23.66 / MAX: 35.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet503211428425670SE +/- 0.63, N = 3SE +/- 0.36, N = 3SE +/- 0.32, N = 364.1063.2761.85MIN: 60.8 / MAX: 78.22MIN: 60.14 / MAX: 79.04MIN: 59.67 / MAX: 77.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny3211224364860SE +/- 0.49, N = 3SE +/- 0.61, N = 3SE +/- 0.60, N = 354.2352.9851.58MIN: 51.36 / MAX: 69.51MIN: 49.76 / MAX: 62.27MIN: 49.32 / MAX: 73.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd321918273645SE +/- 0.08, N = 3SE +/- 0.31, N = 3SE +/- 0.38, N = 341.5740.3040.61MIN: 39.79 / MAX: 51.01MIN: 38.88 / MAX: 50.48MIN: 38.69 / MAX: 57.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m321510152025SE +/- 0.26, N = 3SE +/- 0.11, N = 3SE +/- 0.10, N = 320.9020.5520.69MIN: 19.89 / MAX: 33.22MIN: 19.68 / MAX: 41.87MIN: 19.76 / MAX: 32.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet321918273645SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 339.6439.5539.24MIN: 38.24 / MAX: 53.68MIN: 37.98 / MAX: 52.81MIN: 37.61 / MAX: 75.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v23213691215SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.07, N = 310.8910.5010.56MIN: 9.36 / MAX: 27.33MIN: 9.06 / MAX: 19.24MIN: 9.17 / MAX: 22.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3321246810SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.13, N = 38.848.728.77MIN: 7.72 / MAX: 20.26MIN: 7.36 / MAX: 21.89MIN: 7.65 / MAX: 16.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v23213691215SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 311.4911.4211.51MIN: 10.27 / MAX: 24.45MIN: 10.18 / MAX: 14.51MIN: 9.89 / MAX: 25.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet321246810SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 38.438.418.53MIN: 7.18 / MAX: 23.58MIN: 7.37 / MAX: 18.13MIN: 7.67 / MAX: 11.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b032148121620SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 314.2614.2814.40MIN: 12.68 / MAX: 28.42MIN: 12.77 / MAX: 26.63MIN: 12.89 / MAX: 32.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface3210.73131.46262.19392.92523.6565SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 33.243.183.25MIN: 2.74 / MAX: 13.7MIN: 2.85 / MAX: 7.23MIN: 2.92 / MAX: 5.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet321714212835SE +/- 0.30, N = 3SE +/- 0.15, N = 3SE +/- 0.08, N = 330.3730.5329.81MIN: 27.85 / MAX: 41.19MIN: 27.72 / MAX: 49.16MIN: 27.66 / MAX: 43.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16321306090120150SE +/- 0.22, N = 3SE +/- 0.19, N = 3SE +/- 0.12, N = 3127.97128.34126.48MIN: 124.99 / MAX: 146.71MIN: 124.8 / MAX: 144.28MIN: 123.52 / MAX: 147.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18321714212835SE +/- 0.32, N = 3SE +/- 0.25, N = 3SE +/- 0.34, N = 330.6930.6430.47MIN: 29.02 / MAX: 42.89MIN: 29 / MAX: 45.72MIN: 28.73 / MAX: 39.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet321612182430SE +/- 0.03, N = 3SE +/- 0.30, N = 3SE +/- 0.03, N = 325.1125.6225.09MIN: 23.93 / MAX: 31.49MIN: 24.03 / MAX: 37.26MIN: 24.07 / MAX: 34.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet503211428425670SE +/- 0.70, N = 3SE +/- 0.50, N = 3SE +/- 1.40, N = 363.3463.5563.21MIN: 59.91 / MAX: 77.11MIN: 59.8 / MAX: 83.82MIN: 59.61 / MAX: 81.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny3211224364860SE +/- 0.37, N = 3SE +/- 0.51, N = 3SE +/- 0.34, N = 353.4553.0551.83MIN: 50.95 / MAX: 68.33MIN: 49.64 / MAX: 66.59MIN: 49.56 / MAX: 65.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd3211020304050SE +/- 0.42, N = 3SE +/- 0.71, N = 3SE +/- 0.38, N = 342.1041.2740.27MIN: 39.99 / MAX: 60.7MIN: 38.91 / MAX: 54.13MIN: 38.86 / MAX: 55.41. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m321510152025SE +/- 0.11, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 320.8820.7520.70MIN: 19.92 / MAX: 33.66MIN: 19.66 / MAX: 33.43MIN: 19.92 / MAX: 33.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search321306090120150SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3137.90137.94137.771. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile321306090120150SE +/- 1.80, N = 3SE +/- 2.25, N = 3SE +/- 2.20, N = 3146.98146.30145.94

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile32180160240320400SE +/- 1.04, N = 3SE +/- 1.77, N = 3SE +/- 2.56, N = 3388.95385.58388.39

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile32120406080100SE +/- 0.28, N = 3SE +/- 0.33, N = 3SE +/- 0.15, N = 397.3097.2996.66

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE32148121620SE +/- 0.04, N = 5SE +/- 0.01, N = 5SE +/- 0.09, N = 513.9513.8714.061. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode3213691215SE +/- 0.008, N = 5SE +/- 0.008, N = 5SE +/- 0.019, N = 59.1139.1339.1321. (CXX) g++ options: -fvisibility=hidden -logg -lm

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00032120406080100SE +/- 0.67, N = 3SE +/- 0.42, N = 3SE +/- 0.66, N = 380.1081.5280.191. (CC) gcc options: -O2 -ldl -lz -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack32148121620SE +/- 0.04, N = 5SE +/- 0.03, N = 5SE +/- 0.03, N = 515.5615.5415.551. (CXX) g++ options: -rdynamic


Phoronix Test Suite v10.8.5