Core i7 4770K Xmas

Intel Core i7-4770K testing with a Gigabyte Z97-HD3 (F10c BIOS) and Gigabyte Intel HD 4600 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012256-HA-COREI747702.

Core i7 4770K XmasProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i7-4770K @ 3.90GHz (4 Cores / 8 Threads)Gigabyte Z97-HD3 (F10c BIOS)Intel 4th Gen Core DRAM8GB120GB ADATA SU700Gigabyte Intel HD 4600 2GB (1250MHz)Intel Xeon E3-1200 v3/4thDELL S2409WRealtek RTL8111/8168/8411Ubuntu 20.105.8.0-31-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.5 Mesa 20.2.11.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 2.3 Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Core i7 4770K Xmasvkmark: 1920 x 1080clomp: Static OMP Speeduphmmer: Pfam Database Searchsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondbuild-ffmpeg: Time To Compilebuild2: Time To Compilebuild-eigen: Time To Compileencode-ape: WAV To APEencode-opus: WAV To Opus Encodenode-web-tooling: sqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mencode-wavpack: WAV To WavPackbrl-cad: VGR Performance Metric1232920.9137.7740.600.40.660.6811.173415.91425.977934.7261332.119813.549718.926531.026514.419112.956110419.25784.0410639.45730.728.1043510641.55907.307.32951144430.967517145.942388.39396.65614.0599.1329.3080.18639.3010.428.7411.418.3714.263.2929.65127.4630.4825.1061.8551.5840.6120.6939.2410.568.7711.518.5314.403.2529.81126.4830.4725.0963.2151.8340.2720.7015.548423732921.1137.9390.600.40.660.6811.361715.65135.941464.6935532.126113.355718.953530.925014.830712.954210472.45922.9910420.85954.528.0976610647.05822.137.33567144054.759207146.295385.58497.28913.8739.1339.2981.52439.2910.668.6611.428.4214.323.1730.43127.5531.4225.1663.2752.9840.3020.5539.5510.508.7211.428.4114.283.1830.53128.3430.6425.6263.5553.0541.2720.7515.544423652901.1137.9030.600.40.660.6811.593318.89285.998724.9978632.578013.292219.073730.947414.567212.901910683.55971.4410853.56041.908.8544310925.15981.807.23631144381.891641146.978388.95397.29913.9539.1139.2480.10039.7711.048.9111.448.6514.433.2830.19128.0230.7525.1764.1054.2341.5720.9039.6410.898.8411.498.4314.263.2430.37127.9730.6925.1163.3453.4542.1020.8815.56242169OpenBenchmarking.org

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 108012360120180240300SE +/- 0.33, N = 3SE +/- 0.67, N = 32922922901. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1230.24750.4950.74250.991.2375SE +/- 0.04, N = 11SE +/- 0.04, N = 9SE +/- 0.02, N = 120.91.11.11. (CC) gcc options: -fopenmp -O3 -lm

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 3137.77137.94137.901. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.600.600.601. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.090.180.270.360.45SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.40.40.41. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.14850.2970.44550.5940.7425SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.660.660.661. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.1530.3060.4590.6120.765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.680.680.681. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 311.1711.3611.59MIN: 9.59MIN: 9.64MIN: 9.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU123510152025SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.18, N = 915.9115.6518.89MIN: 14.84MIN: 14.56MIN: 17.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1231.34972.69944.04915.39886.7485SE +/- 0.00573, N = 3SE +/- 0.01650, N = 3SE +/- 0.02064, N = 35.977935.941465.99872MIN: 5.37MIN: 5.35MIN: 5.391. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1231.12452.2493.37354.4985.6225SE +/- 0.00568, N = 3SE +/- 0.01254, N = 3SE +/- 0.00347, N = 34.726134.693554.99786MIN: 4.17MIN: 4.15MIN: 4.391. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123816243240SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 332.1232.1332.58MIN: 30.77MIN: 30.76MIN: 31.181. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 313.5513.3613.29MIN: 11.76MIN: 11.47MIN: 11.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU123510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 318.9318.9519.07MIN: 17.94MIN: 17.8MIN: 17.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123714212835SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.25, N = 331.0330.9330.95MIN: 29.17MIN: 29.31MIN: 29.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU12348121620SE +/- 0.08, N = 3SE +/- 0.16, N = 7SE +/- 0.19, N = 314.4214.8314.57MIN: 12.59MIN: 12.64MIN: 12.661. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 312.9612.9512.90MIN: 11.96MIN: 11.96MIN: 11.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 107.95, N = 3SE +/- 143.78, N = 3SE +/- 103.29, N = 310419.210472.410683.5MIN: 10130.6MIN: 10130.6MIN: 10271.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12313002600390052006500SE +/- 67.54, N = 3SE +/- 39.11, N = 3SE +/- 76.96, N = 35784.045922.995971.44MIN: 5574.2MIN: 5689.07MIN: 5746.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 126.69, N = 3SE +/- 55.37, N = 3SE +/- 89.46, N = 310639.410420.810853.5MIN: 10142.8MIN: 10122.1MIN: 10230.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12313002600390052006500SE +/- 34.96, N = 3SE +/- 28.84, N = 3SE +/- 103.03, N = 35730.725954.526041.90MIN: 5610.11MIN: 5716.33MIN: 5795.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810SE +/- 0.02583, N = 3SE +/- 0.00568, N = 3SE +/- 0.08022, N = 108.104358.097668.85443MIN: 7.33MIN: 7.36MIN: 7.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 84.33, N = 3SE +/- 79.82, N = 3SE +/- 134.60, N = 310641.510647.010925.1MIN: 10074.8MIN: 10136.8MIN: 10351.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12313002600390052006500SE +/- 36.51, N = 3SE +/- 36.61, N = 3SE +/- 45.12, N = 35907.305822.135981.80MIN: 5685.76MIN: 5676.41MIN: 5790.741. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01240, N = 3SE +/- 0.02785, N = 3SE +/- 0.02346, N = 37.329517.335677.23631MIN: 6.09MIN: 6.13MIN: 6.081. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 201.95, N = 3SE +/- 324.47, N = 3SE +/- 511.87, N = 3144430.97144054.76144381.891. (CC) gcc options: -O2 -lrt" -lrt

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150SE +/- 2.20, N = 3SE +/- 2.25, N = 3SE +/- 1.80, N = 3145.94146.30146.98

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12380160240320400SE +/- 2.56, N = 3SE +/- 1.77, N = 3SE +/- 1.04, N = 3388.39385.58388.95

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.15, N = 3SE +/- 0.33, N = 3SE +/- 0.28, N = 396.6697.2997.30

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE12348121620SE +/- 0.09, N = 5SE +/- 0.01, N = 5SE +/- 0.04, N = 514.0613.8713.951. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.019, N = 5SE +/- 0.008, N = 5SE +/- 0.008, N = 59.1329.1339.1131. (CXX) g++ options: -fvisibility=hidden -logg -lm

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 39.309.299.241. Nodejs v12.18.2

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.66, N = 3SE +/- 0.42, N = 3SE +/- 0.67, N = 380.1981.5280.101. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123918273645SE +/- 0.17, N = 3SE +/- 0.17, N = 3SE +/- 0.05, N = 339.3039.2939.77MIN: 37.67 / MAX: 52.04MIN: 37.69 / MAX: 53.53MIN: 38.23 / MAX: 54.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.01, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 310.4210.6611.04MIN: 9 / MAX: 26.06MIN: 9.15 / MAX: 22.02MIN: 9.49 / MAX: 23.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 38.748.668.91MIN: 7.43 / MAX: 21.86MIN: 7.66 / MAX: 11.93MIN: 7.68 / MAX: 24.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21233691215SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 311.4111.4211.44MIN: 9.61 / MAX: 24.27MIN: 9.78 / MAX: 21.73MIN: 10.2 / MAX: 14.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet123246810SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.14, N = 38.378.428.65MIN: 7.43 / MAX: 11.53MIN: 7.43 / MAX: 10.7MIN: 7.52 / MAX: 22.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b012348121620SE +/- 0.06, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 314.2614.3214.43MIN: 12.32 / MAX: 58.63MIN: 12.71 / MAX: 21.22MIN: 12.96 / MAX: 26.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.74031.48062.22092.96123.7015SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 33.293.173.28MIN: 2.93 / MAX: 5.59MIN: 2.84 / MAX: 5.58MIN: 2.87 / MAX: 6.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123714212835SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.38, N = 329.6530.4330.19MIN: 27.35 / MAX: 44.36MIN: 27.97 / MAX: 44.24MIN: 27.95 / MAX: 43.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16123306090120150SE +/- 0.47, N = 3SE +/- 0.12, N = 3SE +/- 0.28, N = 3127.46127.55128.02MIN: 123.62 / MAX: 155.82MIN: 124.34 / MAX: 142.98MIN: 124.75 / MAX: 141.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18123714212835SE +/- 0.48, N = 3SE +/- 0.35, N = 3SE +/- 0.49, N = 330.4831.4230.75MIN: 28.51 / MAX: 48.91MIN: 29.38 / MAX: 44.46MIN: 28.65 / MAX: 43.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123612182430SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 325.1025.1625.17MIN: 23.66 / MAX: 35.12MIN: 23.92 / MAX: 36.39MIN: 23.93 / MAX: 36.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501231428425670SE +/- 0.32, N = 3SE +/- 0.36, N = 3SE +/- 0.63, N = 361.8563.2764.10MIN: 59.67 / MAX: 77.54MIN: 60.14 / MAX: 79.04MIN: 60.8 / MAX: 78.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231224364860SE +/- 0.60, N = 3SE +/- 0.61, N = 3SE +/- 0.49, N = 351.5852.9854.23MIN: 49.32 / MAX: 73.38MIN: 49.76 / MAX: 62.27MIN: 51.36 / MAX: 69.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123918273645SE +/- 0.38, N = 3SE +/- 0.31, N = 3SE +/- 0.08, N = 340.6140.3041.57MIN: 38.69 / MAX: 57.91MIN: 38.88 / MAX: 50.48MIN: 39.79 / MAX: 51.011. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m123510152025SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.26, N = 320.6920.5520.90MIN: 19.76 / MAX: 32.47MIN: 19.68 / MAX: 41.87MIN: 19.89 / MAX: 33.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet123918273645SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 339.2439.5539.64MIN: 37.61 / MAX: 75.08MIN: 37.98 / MAX: 52.81MIN: 38.24 / MAX: 53.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.09, N = 310.5610.5010.89MIN: 9.17 / MAX: 22.12MIN: 9.06 / MAX: 19.24MIN: 9.36 / MAX: 27.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.13, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 38.778.728.84MIN: 7.65 / MAX: 16.44MIN: 7.36 / MAX: 21.89MIN: 7.72 / MAX: 20.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21233691215SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 311.5111.4211.49MIN: 9.89 / MAX: 25.37MIN: 10.18 / MAX: 14.51MIN: 10.27 / MAX: 24.451. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet123246810SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 38.538.418.43MIN: 7.67 / MAX: 11.27MIN: 7.37 / MAX: 18.13MIN: 7.18 / MAX: 23.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b012348121620SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 314.4014.2814.26MIN: 12.89 / MAX: 32.02MIN: 12.77 / MAX: 26.63MIN: 12.68 / MAX: 28.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1230.73131.46262.19392.92523.6565SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 33.253.183.24MIN: 2.92 / MAX: 5.82MIN: 2.85 / MAX: 7.23MIN: 2.74 / MAX: 13.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123714212835SE +/- 0.08, N = 3SE +/- 0.15, N = 3SE +/- 0.30, N = 329.8130.5330.37MIN: 27.66 / MAX: 43.24MIN: 27.72 / MAX: 49.16MIN: 27.85 / MAX: 41.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg16123306090120150SE +/- 0.12, N = 3SE +/- 0.19, N = 3SE +/- 0.22, N = 3126.48128.34127.97MIN: 123.52 / MAX: 147.25MIN: 124.8 / MAX: 144.28MIN: 124.99 / MAX: 146.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18123714212835SE +/- 0.34, N = 3SE +/- 0.25, N = 3SE +/- 0.32, N = 330.4730.6430.69MIN: 28.73 / MAX: 39.99MIN: 29 / MAX: 45.72MIN: 29.02 / MAX: 42.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet123612182430SE +/- 0.03, N = 3SE +/- 0.30, N = 3SE +/- 0.03, N = 325.0925.6225.11MIN: 24.07 / MAX: 34.98MIN: 24.03 / MAX: 37.26MIN: 23.93 / MAX: 31.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501231428425670SE +/- 1.40, N = 3SE +/- 0.50, N = 3SE +/- 0.70, N = 363.2163.5563.34MIN: 59.61 / MAX: 81.53MIN: 59.8 / MAX: 83.82MIN: 59.91 / MAX: 77.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231224364860SE +/- 0.34, N = 3SE +/- 0.51, N = 3SE +/- 0.37, N = 351.8353.0553.45MIN: 49.56 / MAX: 65.5MIN: 49.64 / MAX: 66.59MIN: 50.95 / MAX: 68.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd1231020304050SE +/- 0.38, N = 3SE +/- 0.71, N = 3SE +/- 0.42, N = 340.2741.2742.10MIN: 38.86 / MAX: 55.4MIN: 38.91 / MAX: 54.13MIN: 39.99 / MAX: 60.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m123510152025SE +/- 0.08, N = 3SE +/- 0.10, N = 3SE +/- 0.11, N = 320.7020.7520.88MIN: 19.92 / MAX: 33.36MIN: 19.66 / MAX: 33.43MIN: 19.92 / MAX: 33.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.03, N = 5SE +/- 0.03, N = 5SE +/- 0.04, N = 515.5515.5415.561. (CXX) g++ options: -rdynamic

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1239K18K27K36K45K4237342365421691. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm


Phoronix Test Suite v10.8.4