Core i3 7100 Xmas Eve

Intel Core i3-7100 testing with a Gigabyte B250M-DS3H-CF (F9 BIOS) and Gigabyte Intel HD 630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012250-HA-COREI371034&sor&grr.

Core i3 7100 Xmas EveProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i3-7100 @ 3.90GHz (2 Cores / 4 Threads)Gigabyte B250M-DS3H-CF (F9 BIOS)Intel Xeon E3-1200 v6/7th + B2508GB250GB Western Digital WDS250G1B0A-Gigabyte Intel HD 630 3GB (1100MHz)Realtek ALC887-VDVA2431Realtek RTL8111/8168/8411Ubuntu 20.105.8.0-28-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 20.2.11.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- MQ-DEADLINE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3 Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Core i3 7100 Xmas Eveastcenc: Exhaustivevkfft: hpcc: G-HPLbuild2: Time To Compileasmfish: 1024 Hash Memory, 26 Depthbuild-ffmpeg: Time To Compileastcenc: Thoroughncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUvkmark: 1280 x 1024vkmark: 1920 x 1080clomp: Static OMP Speedupvkresample: 2x - Doublehmmer: Pfam Database Searchstockfish: Total Timeonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUbuild-eigen: Time To Compilenode-web-tooling: rav1e: 1rav1e: 5simdjson: Kostyasqlite: 1vkresample: 2x - Singlesimdjson: LargeRandrav1e: 6simdjson: PartialTweetssimdjson: DistinctUserIDrav1e: 10phpbench: PHP Benchmark Suiteencode-wavpack: WAV To WavPackcrafty: Elapsed Timeastcenc: Mediumcoremark: CoreMark Size 666 - Iterations Per Secondencode-ape: WAV To APEonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-opus: WAV To Opus Encodeonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUmafft: Multiple Sequence Alignment - LSU RNAastcenc: Fastonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUhpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Ffte1231087.58133286.83917644.7896144009232.085135.2918.6047.1157.1768.524.9530.82111.3731.513.8815.449.8615.319.1810.6343.1218.6047.1957.1568.4324.9330.75111.2931.553.8615.329.8415.339.1910.6243.0713022.312989.912994.09056141.41011.320114.82642329537192.687184.047188.4992.8479.440.2530.7790.4930.315523.0340.371.0350.60.622.40465788716.693733220720.4277661.55799612.78120.064220.04299.59315.79956.9773014.0007.976.0388912.35204.6941115.536124.019920.753713.364124.57578711.3525.147010.210350.012709.874791.3112447.154431.755231104.70133585.29487648.3806257439232.002135.4218.6247.2857.2368.4224.9630.75111.6331.513.8715.419.9115.339.1810.6543.2418.6747.2457.1868.3624.9330.76111.3231.533.8715.369.8715.329.1510.6443.2013494.413484.913403.59066141.41010.740114.83440438717437.087447.307424.1092.7309.340.2480.7790.4931.916532.9570.371.0380.60.622.40665812616.696736977620.4277631.82691912.75220.024719.96849.59715.65806.9703013.8677.976.0311212.35254.8169514.836523.943920.633713.344724.62368813.0975.184620.209960.012679.888191.4607647.504031.762101088.46133286.81380645.3496240412231.806135.3118.6047.1557.2668.4224.9430.74111.5931.523.8815.369.8615.319.1910.6743.2418.5847.2657.1968.4224.9030.72111.0931.483.8715.379.8615.329.1510.6643.2613017.513005.413012.28816061.41006.890114.82641019807207.997201.517201.0992.9209.340.2510.7760.4931.702538.2860.371.0360.600.622.40665629516.709734702120.4278242.22611612.76819.928919.86129.59915.74347.0155914.0047.966.0527112.34014.7098116.181724.053120.744413.323924.60318644.7934.902550.209080.012779.820521.4607947.694601.75386OpenBenchmarking.org

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive1322004006008001000SE +/- 0.37, N = 3SE +/- 0.17, N = 3SE +/- 8.12, N = 31087.581088.461104.701. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.123130060090012001500SE +/- 1.67, N = 3SE +/- 1.20, N = 31335133213321. (CXX) g++ options: -O3 -pthread

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL13220406080100SE +/- 0.04, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 386.8486.8185.291. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile132140280420560700SE +/- 0.27, N = 3SE +/- 0.41, N = 3SE +/- 0.56, N = 3644.79645.35648.38

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth2311.3M2.6M3.9M5.2M6.5MSE +/- 82994.07, N = 3SE +/- 17217.84, N = 3SE +/- 29250.15, N = 3625743962404126144009

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile32150100150200250SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.36, N = 3231.81232.00232.09

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough132306090120150SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3135.29135.31135.421. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m132510152025SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 318.6018.6018.62MIN: 18.49 / MAX: 28.74MIN: 18.41 / MAX: 29MIN: 18.42 / MAX: 26.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1321122334455SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 347.1147.1547.28MIN: 46.79 / MAX: 57.46MIN: 46.78 / MAX: 57.9MIN: 46.79 / MAX: 58.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231326395265SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 357.1757.2357.26MIN: 56.85 / MAX: 67.25MIN: 56.88 / MAX: 68MIN: 56.9 / MAX: 67.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet502311530456075SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 368.4268.4268.50MIN: 68.09 / MAX: 79.66MIN: 68.07 / MAX: 79.07MIN: 68.08 / MAX: 122.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet312612182430SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 324.9424.9524.96MIN: 24.72 / MAX: 35.4MIN: 24.75 / MAX: 34.42MIN: 24.78 / MAX: 34.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18321714212835SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 330.7430.7530.82MIN: 30.51 / MAX: 41.31MIN: 30.52 / MAX: 41.61MIN: 30.55 / MAX: 41.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1613220406080100SE +/- 0.11, N = 3SE +/- 0.22, N = 3SE +/- 0.25, N = 3111.37111.59111.63MIN: 110.74 / MAX: 121.81MIN: 110.86 / MAX: 122.85MIN: 110.94 / MAX: 123.31. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123714212835SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 331.5131.5131.52MIN: 31.3 / MAX: 41.74MIN: 31.31 / MAX: 41.35MIN: 31.29 / MAX: 41.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface2130.8731.7462.6193.4924.365SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.873.883.88MIN: 3.81 / MAX: 4.48MIN: 3.82 / MAX: 4.53MIN: 3.82 / MAX: 4.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b032148121620SE +/- 0.01, N = 3SE +/- 0.05, N = 2SE +/- 0.04, N = 215.3615.4115.44MIN: 15.2 / MAX: 16.15MIN: 15.24 / MAX: 25.3MIN: 15.2 / MAX: 25.721. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1323691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 39.869.869.91MIN: 9.73 / MAX: 20.02MIN: 9.74 / MAX: 10.66MIN: 9.73 / MAX: 221. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v213248121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 315.3115.3115.33MIN: 15.18 / MAX: 15.94MIN: 15.14 / MAX: 25.64MIN: 15.14 / MAX: 26.251. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31233691215SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 39.189.189.19MIN: 9.02 / MAX: 10.67MIN: 9.03 / MAX: 10.57MIN: 9.02 / MAX: 19.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 310.6310.6510.67MIN: 10.46 / MAX: 12.77MIN: 10.45 / MAX: 20.53MIN: 10.49 / MAX: 25.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet1231020304050SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 343.1243.2443.24MIN: 42.84 / MAX: 53.15MIN: 42.94 / MAX: 54.26MIN: 42.96 / MAX: 53.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m312510152025SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 318.5818.6018.67MIN: 18.48 / MAX: 28.89MIN: 18.4 / MAX: 28.6MIN: 18.47 / MAX: 29.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd1231122334455SE +/- 0.11, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 347.1947.2447.26MIN: 46.77 / MAX: 98.44MIN: 46.79 / MAX: 55.85MIN: 46.76 / MAX: 57.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231326395265SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 357.1557.1857.19MIN: 56.82 / MAX: 67.66MIN: 56.79 / MAX: 67.52MIN: 56.86 / MAX: 67.491. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet502311530456075SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 368.3668.4268.43MIN: 68.07 / MAX: 79.76MIN: 68.07 / MAX: 79.4MIN: 68.13 / MAX: 78.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet312612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 324.9024.9324.93MIN: 24.72 / MAX: 35.18MIN: 24.74 / MAX: 35MIN: 24.67 / MAX: 34.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18312714212835SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 330.7230.7530.76MIN: 30.5 / MAX: 40.97MIN: 30.52 / MAX: 40.83MIN: 30.56 / MAX: 41.081. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1631220406080100SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3111.09111.29111.32MIN: 110.52 / MAX: 121.85MIN: 110.65 / MAX: 122.03MIN: 110.69 / MAX: 121.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet321714212835SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 331.4831.5331.55MIN: 31.29 / MAX: 41.94MIN: 31.34 / MAX: 41.26MIN: 31.32 / MAX: 41.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1230.87081.74162.61243.48324.354SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 33.863.873.87MIN: 3.81 / MAX: 4.17MIN: 3.82 / MAX: 4.01MIN: 3.81 / MAX: 4.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b012348121620SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 215.3215.3615.37MIN: 15.19 / MAX: 16MIN: 15.2 / MAX: 25.21MIN: 15.21 / MAX: 25.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet1323691215SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 39.849.869.87MIN: 9.75 / MAX: 10.52MIN: 9.72 / MAX: 20.18MIN: 9.75 / MAX: 19.761. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v223148121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 315.3215.3215.33MIN: 15.18 / MAX: 27.16MIN: 15.15 / MAX: 25.61MIN: 15.18 / MAX: 25.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v32313691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 39.159.159.19MIN: 9.02 / MAX: 10.69MIN: 9.01 / MAX: 10.75MIN: 9.01 / MAX: 19.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 310.6210.6410.66MIN: 10.45 / MAX: 12.13MIN: 10.46 / MAX: 20.48MIN: 10.45 / MAX: 21.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet1231020304050SE +/- 0.01, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 343.0743.2043.26MIN: 42.83 / MAX: 53.49MIN: 42.87 / MAX: 53.52MIN: 42.86 / MAX: 53.831. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU3123K6K9K12K15KSE +/- 6.69, N = 3SE +/- 34.32, N = 3SE +/- 5.06, N = 313017.513022.313494.4MIN: 12988.3MIN: 12970.1MIN: 13309.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1323K6K9K12K15KSE +/- 2.60, N = 3SE +/- 5.12, N = 3SE +/- 28.75, N = 312989.913005.413484.9MIN: 12963.9MIN: 12979.2MIN: 13078.81. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1323K6K9K12K15KSE +/- 4.10, N = 3SE +/- 7.57, N = 3SE +/- 76.51, N = 312994.013012.213403.5MIN: 12966.6MIN: 12979.3MIN: 13048.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VKMark

Resolution: 1280 x 1024

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1280 x 10242132004006008001000SE +/- 0.58, N = 3SE +/- 1.00, N = 3SE +/- 3.28, N = 39069058811. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 1080213130260390520650SE +/- 0.58, N = 3SE +/- 1.20, N = 3SE +/- 7.51, N = 36146146061. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup3210.3150.630.9451.261.575SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.41.41.41. (CC) gcc options: -fopenmp -O3 -lm

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Double3212004006008001000SE +/- 2.44, N = 3SE +/- 4.98, N = 3SE +/- 5.17, N = 31006.891010.741011.321. (CXX) g++ options: -O3 -pthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search132306090120150SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3114.83114.83114.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time132900K1800K2700K3600K4500KSE +/- 40004.50, N = 3SE +/- 14449.66, N = 3SE +/- 24874.87, N = 34232953410198040438711. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU13216003200480064008000SE +/- 5.80, N = 3SE +/- 7.86, N = 3SE +/- 4.96, N = 37192.687207.997437.08MIN: 7171.23MIN: 7180.31MIN: 7270.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU13216003200480064008000SE +/- 4.85, N = 3SE +/- 5.32, N = 3SE +/- 7.22, N = 37184.047201.517447.30MIN: 7162.14MIN: 7179.7MIN: 7277.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU13216003200480064008000SE +/- 4.08, N = 3SE +/- 2.75, N = 3SE +/- 4.88, N = 37188.497201.097424.10MIN: 7171.82MIN: 7180.72MIN: 7267.221. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile21320406080100SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 392.7392.8592.92

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1323691215SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 39.449.349.341. Nodejs v12.18.2

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11320.05690.11380.17070.22760.2845SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.2530.2510.248

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 52130.17530.35060.52590.70120.8765SE +/- 0.003, N = 3SE +/- 0.003, N = 3SE +/- 0.001, N = 30.7790.7790.776

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.11030.22060.33090.44120.5515SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.490.490.491. (CXX) g++ options: -O3 -pthread

SQLite

Threads / Copies: 1

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite 3.30.1Threads / Copies: 1132714212835SE +/- 0.36, N = 15SE +/- 0.36, N = 3SE +/- 0.04, N = 330.3231.7031.921. (CC) gcc options: -O2 -lz -lm -ldl -lpthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single123120240360480600SE +/- 3.39, N = 3SE +/- 4.10, N = 3SE +/- 4.21, N = 3523.03532.96538.291. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 62310.23360.46720.70080.93441.168SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 31.0381.0361.035

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3210.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.600.600.601. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.620.621. (CXX) g++ options: -O3 -pthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 103210.54141.08281.62422.16562.707SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.008, N = 32.4062.4062.404

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite213140K280K420K560K700KSE +/- 1397.02, N = 3SE +/- 1373.01, N = 3SE +/- 1397.20, N = 3658126657887656295

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.02, N = 516.6916.7016.711. (CXX) g++ options: -rdynamic

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time2311.6M3.2M4.8M6.4M8MSE +/- 11879.59, N = 3SE +/- 3975.85, N = 3SE +/- 19203.96, N = 37369776734702173322071. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium123510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 320.4220.4220.421. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second31220K40K60K80K100KSE +/- 631.76, N = 3SE +/- 672.82, N = 3SE +/- 652.47, N = 378242.2377661.5677631.831. (CC) gcc options: -O2 -lrt" -lrt

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE2313691215SE +/- 0.07, N = 5SE +/- 0.07, N = 5SE +/- 0.04, N = 512.7512.7712.781. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU321510152025SE +/- 0.04, N = 3SE +/- 0.26, N = 3SE +/- 0.24, N = 319.9320.0220.06MIN: 19.62MIN: 19.56MIN: 19.521. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU321510152025SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 319.8619.9720.04MIN: 19.64MIN: 19.66MIN: 19.641. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode1233691215SE +/- 0.021, N = 5SE +/- 0.022, N = 5SE +/- 0.022, N = 59.5939.5979.5991. (CXX) g++ options: -fvisibility=hidden -logg -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU23148121620SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 315.6615.7415.80MIN: 14.75MIN: 14.72MIN: 14.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU213246810SE +/- 0.01316, N = 3SE +/- 0.01584, N = 3SE +/- 0.03839, N = 36.970306.977307.01559MIN: 6.87MIN: 6.86MIN: 6.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA21348121620SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.11, N = 313.8714.0014.001. (CC) gcc options: -std=c99 -O3 -lm -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast312246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 37.967.977.971. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU213246810SE +/- 0.00495, N = 3SE +/- 0.00365, N = 3SE +/- 0.00775, N = 36.031126.038896.05271MIN: 5.89MIN: 5.92MIN: 5.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU3123691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.3412.3512.35MIN: 12.1MIN: 12.13MIN: 12.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1321.08382.16763.25144.33525.419SE +/- 0.05317, N = 3SE +/- 0.00674, N = 3SE +/- 0.00803, N = 34.694114.709814.81695MIN: 4.48MIN: 4.56MIN: 4.391. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU21348121620SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 314.8415.5416.18MIN: 14.45MIN: 15.01MIN: 15.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU213612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 323.9424.0224.05MIN: 23.7MIN: 23.78MIN: 23.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU231510152025SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 320.6320.7420.75MIN: 20.39MIN: 20.47MIN: 20.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3213691215SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 313.3213.3413.36MIN: 13.18MIN: 13.14MIN: 13.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU132612182430SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 324.5824.6024.62MIN: 24.35MIN: 24.43MIN: 24.371. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth2132K4K6K8K10KSE +/- 66.91, N = 3SE +/- 98.36, N = 3SE +/- 80.85, N = 38813.108711.358644.791. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth2131.16652.3333.49954.6665.8325SE +/- 0.08185, N = 3SE +/- 0.09024, N = 3SE +/- 0.08913, N = 35.184625.147014.902551. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency3210.04730.09460.14190.18920.2365SE +/- 0.00069, N = 3SE +/- 0.00200, N = 3SE +/- 0.00110, N = 30.209080.209960.210351. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access3120.00290.00580.00870.01160.0145SE +/- 0.00004, N = 3SE +/- 0.00008, N = 3SE +/- 0.00011, N = 30.012770.012700.012671. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad2133691215SE +/- 0.00967, N = 3SE +/- 0.01832, N = 3SE +/- 0.01214, N = 39.888199.874799.820521. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans3210.32870.65740.98611.31481.6435SE +/- 0.00604, N = 3SE +/- 0.00835, N = 3SE +/- 0.15709, N = 31.460791.460761.311241. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM3211122334455SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.52, N = 347.6947.5047.151. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte2130.39650.7931.18951.5861.9825SE +/- 0.00193, N = 3SE +/- 0.00362, N = 3SE +/- 0.00359, N = 31.762101.755231.753861. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4