Core i9 9900K Xmas

Intel Core i9-9900K testing with a ASRock Z390M Pro4 (P4.20 BIOS) and Intel UHD 630 3GB on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012229-HA-COREI999049&sor.

Core i9 9900K XmasProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLOpenCLCompilerFile-SystemScreen Resolution123Intel Core i9-9900K @ 5.00GHz (8 Cores / 16 Threads)ASRock Z390M Pro4 (P4.20 BIOS)Intel Cannon Lake PCH16GB240GB Corsair Force MP510Intel UHD 630 3GB (1200MHz)Realtek ALC892G237HLIntel I219-VUbuntu 20.045.9.0-050900rc1daily20200819-generic (x86_64) 20200818GNOME Shell 3.36.4X Server 1.20.8modesetting 1.20.84.6 Mesa 20.0.4OpenCL 2.1GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd6 - Thermald 1.9.1 Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Vulnerable; SMT vulnerable + meltdown: Not affected + spec_store_bypass: Vulnerable + spectre_v1: Vulnerable: __user pointer sanitization and usercopy barriers only; no swapgs barriers + spectre_v2: Vulnerable IBPB: disabled STIBP: disabled + srbds: Vulnerable + tsx_async_abort: Vulnerable

Core i9 9900K Xmasvkfft: vkresample: 2x - Doublevkresample: 2x - Singleclomp: Static OMP Speedupsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUbuild2: Time To Compilebuild-eigen: Time To Compileencode-ape: WAV To APEencode-ogg: WAV To Oggencode-opus: WAV To Opus Encodenode-web-tooling: ncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mncnn: Vulkan GPU - mobilenetncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - googlenetncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - regnety_400mencode-wavpack: WAV To WavPack1231501903.652388.5783.60.80.490.730.753.8429710.90801.756772.0287919.49685.238656.2478317.26235.767293.113163758.922073.483765.372066.093.877543763.982072.873.19467130.42066.6789.86918.1057.70914.2017.844.903.794.753.706.151.7813.5159.1013.1711.2725.0825.5618.8812.7817.904.893.744.763.726.211.7513.4759.1613.2611.2925.4625.6318.9312.6913.0521504918.533387.3243.60.80.490.730.753.8612411.09751.743222.0041919.58205.260206.2584717.15045.736493.129733760.832070.813767.162074.973.845473763.032074.813.19403130.21466.8359.86718.1147.73413.7217.824.913.844.773.726.131.7613.4559.0113.1711.2525.0525.5718.9312.4917.824.923.814.843.846.241.813.5258.9513.1611.2525.0425.5918.6213.4013.0261500903.689388.4333.30.80.490.730.753.8318111.20721.728852.0294019.62155.239026.2326517.26385.719313.118313765.102077.323767.182081.023.854133771.302077.343.19453131.29867.2759.90318.1567.69914.0517.824.903.774.763.696.141.7613.5359.0113.1911.3025.1725.6218.8812.3317.874.923.794.803.726.171.7613.4858.9513.3311.2725.1926.6918.7212.4113.046OpenBenchmarking.org

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.121330060090012001500SE +/- 0.67, N = 3SE +/- 1.76, N = 3SE +/- 1.00, N = 31504150115001. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Double1322004006008001000SE +/- 1.21, N = 3SE +/- 0.41, N = 3SE +/- 0.93, N = 3903.65903.69918.531. (CXX) g++ options: -O3 -pthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single23180160240320400SE +/- 0.23, N = 3SE +/- 0.46, N = 3SE +/- 0.48, N = 3387.32388.43388.581. (CXX) g++ options: -O3 -pthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup2130.811.622.433.244.05SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 153.63.63.31. (CC) gcc options: -fopenmp -O3 -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.180.360.540.720.9SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.80.80.81. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.11030.22060.33090.44120.5515SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.490.490.491. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3210.16430.32860.49290.65720.8215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.730.730.731. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.16880.33760.50640.67520.844SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.750.750.751. (CXX) g++ options: -O3 -pthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU3120.86881.73762.60643.47524.344SE +/- 0.04260, N = 7SE +/- 0.04278, N = 7SE +/- 0.03519, N = 103.831813.842973.86124MIN: 3.42MIN: 3.47MIN: 3.491. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 310.9111.1011.21MIN: 10.73MIN: 10.87MIN: 10.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3210.39530.79061.18591.58121.9765SE +/- 0.01630, N = 10SE +/- 0.01201, N = 15SE +/- 0.01447, N = 131.728851.743221.75677MIN: 1.55MIN: 1.53MIN: 1.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU2130.45660.91321.36981.82642.283SE +/- 0.00370, N = 3SE +/- 0.00340, N = 3SE +/- 0.00254, N = 32.004192.028792.02940MIN: 1.94MIN: 1.97MIN: 1.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 319.5019.5819.62MIN: 19.39MIN: 19.45MIN: 19.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1321.18352.3673.55054.7345.9175SE +/- 0.06499, N = 12SE +/- 0.06401, N = 12SE +/- 0.05219, N = 145.238655.239025.26020MIN: 4.34MIN: 4.35MIN: 4.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU312246810SE +/- 0.01139, N = 3SE +/- 0.00100, N = 3SE +/- 0.01417, N = 36.232656.247836.25847MIN: 5.94MIN: 5.97MIN: 5.961. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU21348121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 317.1517.2617.26MIN: 16.71MIN: 16.85MIN: 16.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU3211.29762.59523.89285.19046.488SE +/- 0.04354, N = 15SE +/- 0.05212, N = 13SE +/- 0.04711, N = 155.719315.736495.76729MIN: 5.03MIN: 5.03MIN: 5.031. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1320.70421.40842.11262.81683.521SE +/- 0.00605, N = 3SE +/- 0.00835, N = 3SE +/- 0.00115, N = 33.113163.118313.12973MIN: 3.04MIN: 3.05MIN: 3.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1238001600240032004000SE +/- 1.66, N = 3SE +/- 0.87, N = 3SE +/- 1.70, N = 33758.923760.833765.10MIN: 3744.47MIN: 3750.11MIN: 3745.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU213400800120016002000SE +/- 2.61, N = 3SE +/- 4.23, N = 3SE +/- 2.04, N = 32070.812073.482077.32MIN: 2057.1MIN: 2044.29MIN: 2063.571. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1238001600240032004000SE +/- 0.87, N = 3SE +/- 3.77, N = 3SE +/- 1.19, N = 33765.373767.163767.18MIN: 3754.23MIN: 3750.11MIN: 3748.261. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU123400800120016002000SE +/- 5.12, N = 3SE +/- 1.26, N = 3SE +/- 1.64, N = 32066.092074.972081.02MIN: 2046.99MIN: 2058.74MIN: 2063.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU2310.87241.74482.61723.48964.362SE +/- 0.01019, N = 3SE +/- 0.00211, N = 3SE +/- 0.00398, N = 33.845473.854133.87754MIN: 3.77MIN: 3.78MIN: 3.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU2138001600240032004000SE +/- 2.08, N = 3SE +/- 1.57, N = 3SE +/- 5.09, N = 33763.033763.983771.30MIN: 3749.98MIN: 3751.93MIN: 3754.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 5.42, N = 3SE +/- 4.58, N = 3SE +/- 1.94, N = 32072.872074.812077.34MIN: 2051.09MIN: 2055.39MIN: 2060.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU2310.71881.43762.15642.87523.594SE +/- 0.03293, N = 8SE +/- 0.03391, N = 7SE +/- 0.04563, N = 33.194033.194533.19467MIN: 2.72MIN: 2.7MIN: 2.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile213306090120150SE +/- 0.30, N = 3SE +/- 0.20, N = 3SE +/- 0.13, N = 3130.21130.42131.30

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1231530456075SE +/- 0.12, N = 3SE +/- 0.20, N = 3SE +/- 0.28, N = 366.6866.8467.28

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE2133691215SE +/- 0.013, N = 5SE +/- 0.009, N = 5SE +/- 0.024, N = 59.8679.8699.9031. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Ogg Audio Encoding

WAV To Ogg

OpenBenchmarking.orgSeconds, Fewer Is BetterOgg Audio Encoding 1.3.4WAV To Ogg12348121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 318.1118.1118.161. (CC) gcc options: -O2 -ffast-math -fsigned-char

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode312246810SE +/- 0.009, N = 5SE +/- 0.005, N = 5SE +/- 0.025, N = 57.6997.7097.7341. (CXX) g++ options: -fvisibility=hidden -logg -lm

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark13248121620SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 314.2014.0513.721. Nodejs v10.19.0

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet23148121620SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 317.8217.8217.84MIN: 17.61 / MAX: 19.74MIN: 17.52 / MAX: 19.19MIN: 17.55 / MAX: 18.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21321.10482.20963.31444.41925.524SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 34.904.904.91MIN: 4.76 / MAX: 6.22MIN: 4.74 / MAX: 6.25MIN: 4.74 / MAX: 5.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v33120.8641.7282.5923.4564.32SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 33.773.793.84MIN: 3.69 / MAX: 4.6MIN: 3.72 / MAX: 4.73MIN: 3.71 / MAX: 4.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21321.07332.14663.21994.29325.3665SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.754.764.77MIN: 4.72 / MAX: 5.82MIN: 4.73 / MAX: 5.74MIN: 4.71 / MAX: 5.821. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet3120.8371.6742.5113.3484.185SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 33.693.703.72MIN: 3.6 / MAX: 4.75MIN: 3.62 / MAX: 5.08MIN: 3.6 / MAX: 4.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0231246810SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 36.136.146.15MIN: 5.97 / MAX: 7.36MIN: 5.98 / MAX: 7.25MIN: 6.05 / MAX: 8.811. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface2310.40050.8011.20151.6022.0025SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 31.761.761.78MIN: 1.67 / MAX: 2.05MIN: 1.63 / MAX: 2.05MIN: 1.63 / MAX: 15.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet2133691215SE +/- 0.25, N = 3SE +/- 0.28, N = 3SE +/- 0.28, N = 313.4513.5113.53MIN: 12.78 / MAX: 14.09MIN: 12.77 / MAX: 14.18MIN: 12.81 / MAX: 14.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg162311326395265SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 359.0159.0159.10MIN: 58.7 / MAX: 59.8MIN: 58.68 / MAX: 62.22MIN: 58.72 / MAX: 60.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet181233691215SE +/- 0.13, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 313.1713.1713.19MIN: 12.77 / MAX: 15.6MIN: 12.75 / MAX: 14.11MIN: 12.73 / MAX: 14.21. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet2133691215SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.10, N = 311.2511.2711.30MIN: 11.02 / MAX: 11.94MIN: 11.03 / MAX: 13.48MIN: 11.05 / MAX: 11.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50213612182430SE +/- 0.37, N = 3SE +/- 0.37, N = 3SE +/- 0.36, N = 325.0525.0825.17MIN: 24.09 / MAX: 37.48MIN: 24.17 / MAX: 26.32MIN: 24.27 / MAX: 35.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny123612182430SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 325.5625.5725.62MIN: 24.68 / MAX: 26.45MIN: 24.68 / MAX: 26.46MIN: 25.42 / MAX: 26.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd132510152025SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 318.8818.8818.93MIN: 18.15 / MAX: 19.57MIN: 18.25 / MAX: 19.4MIN: 18.65 / MAX: 29.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m3213691215SE +/- 0.19, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 312.3312.4912.78MIN: 11.9 / MAX: 13.18MIN: 12.09 / MAX: 13.69MIN: 12.1 / MAX: 21.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet23148121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 317.8217.8717.90MIN: 17.55 / MAX: 18.33MIN: 17.62 / MAX: 19.75MIN: 17.81 / MAX: 19.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21231.1072.2143.3214.4285.535SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 34.894.924.92MIN: 4.76 / MAX: 14.13MIN: 4.75 / MAX: 6.03MIN: 4.76 / MAX: 6.221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31320.85731.71462.57193.42924.2865SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 33.743.793.81MIN: 3.7 / MAX: 4.76MIN: 3.7 / MAX: 4.79MIN: 3.7 / MAX: 4.851. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21321.0892.1783.2674.3565.445SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 34.764.804.84MIN: 4.73 / MAX: 6.69MIN: 4.67 / MAX: 5.75MIN: 4.71 / MAX: 6.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet1320.8641.7282.5923.4564.32SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 23.723.723.84MIN: 3.69 / MAX: 4.57MIN: 3.66 / MAX: 5.03MIN: 3.68 / MAX: 12.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0312246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 36.176.216.24MIN: 6.03 / MAX: 7.42MIN: 6.11 / MAX: 7.67MIN: 6.1 / MAX: 8.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1320.4050.811.2151.622.025SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 31.751.761.80MIN: 1.63 / MAX: 2.06MIN: 1.67 / MAX: 2.01MIN: 1.63 / MAX: 2.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet1323691215SE +/- 0.23, N = 3SE +/- 0.30, N = 3SE +/- 0.36, N = 313.4713.4813.52MIN: 12.89 / MAX: 16MIN: 12.7 / MAX: 14.54MIN: 12.59 / MAX: 14.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg162311326395265SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.01, N = 358.9558.9559.16MIN: 58.61 / MAX: 59.89MIN: 58.53 / MAX: 67.6MIN: 58.94 / MAX: 60.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet182133691215SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 213.1613.2613.33MIN: 12.64 / MAX: 14.54MIN: 12.86 / MAX: 14.3MIN: 12.89 / MAX: 14.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet2313691215SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 311.2511.2711.29MIN: 11.02 / MAX: 12.47MIN: 11.05 / MAX: 11.64MIN: 11.05 / MAX: 12.621. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet50231612182430SE +/- 0.37, N = 3SE +/- 0.37, N = 3SE +/- 0.02, N = 325.0425.1925.46MIN: 24.07 / MAX: 26.3MIN: 24.27 / MAX: 35.77MIN: 24.03 / MAX: 26.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny213612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 1.12, N = 225.5925.6326.69MIN: 25.42 / MAX: 35.3MIN: 25.45 / MAX: 26.86MIN: 25.44 / MAX: 170.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd231510152025SE +/- 0.23, N = 3SE +/- 0.22, N = 3SE +/- 0.03, N = 318.6218.7218.93MIN: 18.08 / MAX: 19.05MIN: 18.2 / MAX: 30.1MIN: 18.69 / MAX: 21.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m3123691215SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.63, N = 312.4112.6913.40MIN: 11.89 / MAX: 13.64MIN: 12.4 / MAX: 14.98MIN: 12 / MAX: 16.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack2313691215SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.02, N = 513.0313.0513.051. (CXX) g++ options: -rdynamic


Phoronix Test Suite v10.8.4