TR 2990WX 2020

AMD Ryzen Threadripper 2990WX 32-Core testing with a ASUS ROG ZENITH EXTREME (1701 BIOS) and Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012260-HA-TR2990WX254&grr.

TR 2990WX 2020ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123AMD Ryzen Threadripper 2990WX 32-Core @ 3.00GHz (32 Cores / 64 Threads)ASUS ROG ZENITH EXTREME (1701 BIOS)AMD 17h32GBSamsung SSD 970 EVO 500GB + 250GB Western Digital WDS250G2X0C-00L350Gigabyte AMD Radeon RX 470/480/570/570X/580/580X/590 4GB (1244/1750MHz)Realtek ALC1220LG Ultra HDIntel I211 + Qualcomm Atheros QCA6174 802.11ac + Wilocity Wil6200 802.11adUbuntu 20.105.8.0-33-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.94.6 Mesa 20.2.1 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x800820dGraphics Details- GLAMORJava Details- OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional STIBP: disabled RSB filling + srbds: Not affected + tsx_async_abort: Not affected

TR 2990WX 2020hpcc: G-HPLbuild-clash: Time To Compilebasis: UASTC Level 2 + RDO Post-Processingncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetlammps: 20k Atomsncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - squeezenet_ssdncnn: Vulkan GPU - yolov4-tinyncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - resnet18ncnn: Vulkan GPU - vgg16ncnn: Vulkan GPU - googlenetncnn: Vulkan GPU - blazefacencnn: Vulkan GPU - efficientnet-b0ncnn: Vulkan GPU - mnasnetncnn: Vulkan GPU - shufflenet-v2ncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU - mobilenetvkfft: ai-benchmark: Device AI Scoreai-benchmark: Device Training Scoreai-benchmark: Device Inference Scoreonednn: Recurrent Neural Network Training - u8s8f32 - CPUhmmer: Pfam Database Searchnumpy: onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUbrl-cad: VGR Performance Metricembree: Pathtracer - Asian Dragon Objembree: Pathtracer ISPC - Asian Dragon Objcompress-zstd: 19compress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedasmfish: 1024 Hash Memory, 26 Depthcompress-zstd: 3embree: Pathtracer - Asian Dragononednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUvkmark: 1280 x 1024vkmark: 1920 x 1080indigobench: CPU - Bedroomnode-web-tooling: gromacs: Water Benchmarkbuild2: Time To Compileembree: Pathtracer ISPC - Asian Dragonbuild-eigen: Time To Compileonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUastcenc: Exhaustiveonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUsqlite-speedtest: Timed Time - Size 1,000compress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedrav1e: 5rav1e: 1indigobench: CPU - Supercaronednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUembree: Pathtracer - Crownkvazaar: Bosphorus 4K - Slowsimdjson: LargeRandkvazaar: Bosphorus 4K - Mediumsimdjson: PartialTweetssimdjson: DistinctUserIDredis: GETclomp: Static OMP Speedupsimdjson: Kostyabasis: ETC1Srav1e: 6stockfish: Total Timeredis: LPOPespeak: Text-To-Speech Synthesisredis: SADDx265: Bosphorus 4Kbuild-ffmpeg: Time To Compilelibplacebo: av1_grain_laplibplacebo: hdr_peakdetectlibplacebo: polar_nocomputelibplacebo: deband_heavyphpbench: PHP Benchmark Suitecompress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedrav1e: 10onednn: IP Shapes 3D - f32 - CPUbetsy: ETC2 RGB - Highestonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUembree: Pathtracer ISPC - Crowncrafty: Elapsed Timecoremark: CoreMark Size 666 - Iterations Per Secondkvazaar: Bosphorus 4K - Very Fastbasis: UASTC Level 3betsy: ETC1 - Highestencode-ape: WAV To APEsunflow: Global Illumination + Image Synthesiskvazaar: Bosphorus 1080p - Slowencode-wavpack: WAV To WavPackkvazaar: Bosphorus 1080p - Mediumonednn: Deconvolution Batch shapes_1d - f32 - CPUvkresample: 2x - Singlemafft: Multiple Sequence Alignment - LSU RNAbasis: UASTC Level 2kvazaar: Bosphorus 4K - Ultra Fastonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUx265: Bosphorus 1080pastcenc: Thoroughencode-opus: WAV To Opus Encoderedis: LPUSHredis: SETwaifu2x-ncnn: 2x - 3 - Yeskvazaar: Bosphorus 1080p - Very Fastonednn: IP Shapes 3D - u8s8f32 - CPUbasis: UASTC Level 0lammps: Rhodopsin Proteinyquake2: Software CPU - 1920 x 1080astcenc: Mediumonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUastcenc: Fastkvazaar: Bosphorus 1080p - Ultra Fastyquake2: OpenGL 1.x - 1920 x 1080waifu2x-ncnn: 2x - 3 - Noonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUyquake2: OpenGL 3.x - 1920 x 1080hpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Ffte12354.59837463.220651.213102.9341.8950.1979.5337.7259.91102.3847.876.8320.0113.6015.2814.4415.6737.2415.501101.0141.2850.2572.0633.5655.89103.4141.089.8219.1415.6414.9214.5815.5937.3094122270966130413991.1163.935296.8013826.228684718.241817.679133.99106.946.60738040013606.721.956413861.23802.52585043845.0368.471.83496.20021.315693.2523808.573749.1873.363.6965767.2079128.247.460.9540.31511.0351.6576724.019310.290.3910.470.510.522201882.0351.70.4447.6021.270498212542431710.3530.8471892562.2116.6246.084448.241762.98268.72192.545772299506.28518.042.85911.469412.6971.4692822.725973295311146059.29469123.5425.22910.75414.0290.78926.9913.19927.692.9955949.01212.15315.78539.686.277152.1240841.029.547.7531368828.881675529.3810.03860.693.453707.59513.399106.66.3425.164319.97345.22116.80683.82.0523.437275.92604949.710862.0410.571201.536810.028191.309664.0016412.6172710.0466952.77753477.084647.924101.4740.8049.6975.0334.3454.2992.9842.046.6720.9214.5014.4215.7015.6034.5615.418100.2640.0150.5773.8632.3953.19100.7338.907.4019.2816.2614.6514.2216.3033.7894512317998131913295.0162.561298.3113783.329101319.299217.235742.09217.247.55748951873798.922.496613529.03821.24564442674.9948.251.85896.34021.667492.9813780.473782.3673.143.7616467.4589191.947.120.9510.32010.9632.0956724.615510.380.3910.560.500.522099666.5251.60.4447.6021.273554047722058658.4131.1531826296.8515.2630.918441.371762.78268.36192.435793609647.98619.842.83011.208512.5431.4854922.236573713741148000.04493223.5725.21710.51914.0100.73127.1313.21527.843.0172148.99612.26415.79039.676.250112.1089340.369.497.7951340079.291654163.6010.05560.693.490647.62612.913107.16.3225.102620.15385.22116.09619.93.449805.93856972.210831.9740.583001.534530.027981.307714.341068.803039.7408354.30507485.485647.089103.8939.1349.2573.6535.6964.4995.6142.957.6518.9714.1715.3015.2115.7834.9115.187101.2739.1448.2370.9235.9356.35100.1738.327.0920.4913.7715.7515.9514.9734.39941713669.2161.632295.7913668.419.401417.399936.29130.545.94726307373781.222.265513753.93711.96564042685.0578.211.83721.562293.0583777.653783.4073.103.4169967.2479188.346.870.9530.31811.0362.0098424.363610.320.4010.530.510.522091130.6751.80.4448.1641.274549840121998586.1631.0281974193.4616.5532.699446.381762.55267.90192.229540.08508.642.85310.928512.5531.4895722.645373667011146667.40611423.6125.14410.52914.04126.9227.783.0164549.02112.15715.75839.396.216232.1261141.069.457.7741366164.671577289.7910.07460.793.510877.60012.551106.26.3525.104120.25435.20116.68623.83.469095.92300972.510972.4940.593851.535090.028661.338204.1742813.034179.67646OpenBenchmarking.org

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1231224364860SE +/- 0.14, N = 3SE +/- 0.10, N = 3SE +/- 0.24, N = 354.6052.7854.311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Timed Clash Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Clash CompilationTime To Compile123110220330440550SE +/- 13.67, N = 9SE +/- 4.96, N = 9SE +/- 0.58, N = 3463.22477.08485.49

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing123140280420560700SE +/- 1.61, N = 3SE +/- 0.39, N = 3SE +/- 0.07, N = 3651.21647.92647.091. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m12320406080100SE +/- 1.12, N = 12SE +/- 1.29, N = 12SE +/- 3.17, N = 12102.93101.47103.89MIN: 90.68 / MAX: 1519.43MIN: 90.53 / MAX: 2458.71MIN: 90.75 / MAX: 3380.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1231020304050SE +/- 1.53, N = 12SE +/- 1.74, N = 12SE +/- 1.90, N = 1241.8940.8039.13MIN: 31.38 / MAX: 429.4MIN: 31.26 / MAX: 438.66MIN: 32.02 / MAX: 459.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231122334455SE +/- 0.68, N = 12SE +/- 1.04, N = 12SE +/- 0.86, N = 1250.1949.6949.25MIN: 39.36 / MAX: 224.81MIN: 39.24 / MAX: 224.21MIN: 39.85 / MAX: 267.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet5012320406080100SE +/- 5.46, N = 12SE +/- 2.78, N = 12SE +/- 4.79, N = 1279.5375.0373.65MIN: 38.42 / MAX: 565.15MIN: 38.58 / MAX: 559.97MIN: 39.25 / MAX: 638.311. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123918273645SE +/- 1.89, N = 12SE +/- 2.63, N = 12SE +/- 1.62, N = 1237.7234.3435.69MIN: 15.66 / MAX: 106.73MIN: 15.18 / MAX: 103.77MIN: 16.27 / MAX: 106.471. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet181231428425670SE +/- 4.97, N = 12SE +/- 5.28, N = 12SE +/- 4.57, N = 1259.9154.2964.49MIN: 23.15 / MAX: 227.75MIN: 22 / MAX: 230.1MIN: 21.25 / MAX: 228.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1612320406080100SE +/- 3.95, N = 12SE +/- 1.61, N = 12SE +/- 2.05, N = 12102.3892.9895.61MIN: 62.04 / MAX: 220.23MIN: 61.58 / MAX: 223.7MIN: 64.1 / MAX: 227.911. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet1231122334455SE +/- 2.92, N = 12SE +/- 2.39, N = 12SE +/- 3.13, N = 1247.8742.0442.95MIN: 27.73 / MAX: 542.74MIN: 28.93 / MAX: 517.65MIN: 27.93 / MAX: 530.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface123246810SE +/- 0.16, N = 12SE +/- 0.10, N = 12SE +/- 0.60, N = 126.836.677.65MIN: 6.11 / MAX: 191.61MIN: 6.12 / MAX: 175.66MIN: 6.15 / MAX: 211.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0123510152025SE +/- 0.67, N = 12SE +/- 1.79, N = 12SE +/- 0.37, N = 1220.0120.9218.97MIN: 17.38 / MAX: 456.25MIN: 17.12 / MAX: 465.23MIN: 17.3 / MAX: 414.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet12348121620SE +/- 0.16, N = 12SE +/- 0.64, N = 12SE +/- 0.31, N = 1213.6014.5014.17MIN: 12.42 / MAX: 189.74MIN: 12.55 / MAX: 348.4MIN: 12.78 / MAX: 347.151. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v212348121620SE +/- 0.85, N = 12SE +/- 0.15, N = 12SE +/- 0.64, N = 1215.2814.4215.30MIN: 13.6 / MAX: 309.56MIN: 13.2 / MAX: 104.86MIN: 12.9 / MAX: 306.71. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v312348121620SE +/- 0.50, N = 12SE +/- 1.39, N = 12SE +/- 1.31, N = 1214.4415.7015.21MIN: 12.49 / MAX: 358.11MIN: 12.6 / MAX: 382.98MIN: 12.52 / MAX: 378.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v212348121620SE +/- 0.44, N = 12SE +/- 0.34, N = 12SE +/- 0.73, N = 1215.6715.6015.78MIN: 13.15 / MAX: 357.88MIN: 13.33 / MAX: 383.7MIN: 13.2 / MAX: 388.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123918273645SE +/- 1.43, N = 12SE +/- 0.55, N = 12SE +/- 0.99, N = 1237.2434.5634.91MIN: 29.37 / MAX: 419.38MIN: 30.05 / MAX: 404.5MIN: 29.38 / MAX: 419.391. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms12348121620SE +/- 0.21, N = 3SE +/- 0.02, N = 3SE +/- 0.21, N = 415.5015.4215.191. (CXX) g++ options: -O3 -pthread -lm

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m12320406080100SE +/- 1.78, N = 12SE +/- 2.13, N = 12SE +/- 2.24, N = 9101.01100.26101.27MIN: 90.72 / MAX: 1587.31MIN: 90.99 / MAX: 1833.25MIN: 89.99 / MAX: 2458.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd123918273645SE +/- 1.34, N = 12SE +/- 1.39, N = 12SE +/- 1.29, N = 941.2840.0139.14MIN: 31.56 / MAX: 448.33MIN: 31.52 / MAX: 514.49MIN: 31.19 / MAX: 435.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231122334455SE +/- 1.15, N = 12SE +/- 1.28, N = 12SE +/- 1.42, N = 950.2550.5748.23MIN: 39.88 / MAX: 213.87MIN: 39.57 / MAX: 214.93MIN: 39.54 / MAX: 230.791. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501231632486480SE +/- 3.10, N = 12SE +/- 5.68, N = 12SE +/- 6.02, N = 972.0673.8670.92MIN: 39.39 / MAX: 562.21MIN: 40.4 / MAX: 546.35MIN: 39.2 / MAX: 557.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet123816243240SE +/- 1.15, N = 12SE +/- 1.09, N = 12SE +/- 1.85, N = 933.5632.3935.93MIN: 15.36 / MAX: 104.32MIN: 17.56 / MAX: 91.67MIN: 17.55 / MAX: 96.421. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet181231326395265SE +/- 3.22, N = 12SE +/- 5.95, N = 12SE +/- 3.64, N = 955.8953.1956.35MIN: 23.8 / MAX: 226.22MIN: 21.74 / MAX: 222.51MIN: 22.77 / MAX: 219.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1612320406080100SE +/- 2.14, N = 12SE +/- 2.43, N = 12SE +/- 2.35, N = 9103.41100.73100.17MIN: 63 / MAX: 216.59MIN: 65.03 / MAX: 221.49MIN: 63.35 / MAX: 242.131. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123918273645SE +/- 2.33, N = 12SE +/- 1.77, N = 12SE +/- 1.78, N = 941.0838.9038.32MIN: 28.69 / MAX: 513.63MIN: 28.91 / MAX: 532.2MIN: 28.17 / MAX: 505.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1233691215SE +/- 1.94, N = 12SE +/- 0.76, N = 12SE +/- 0.51, N = 99.827.407.09MIN: 6.19 / MAX: 229.58MIN: 6.14 / MAX: 215.25MIN: 6.15 / MAX: 204.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b0123510152025SE +/- 0.25, N = 12SE +/- 0.70, N = 12SE +/- 0.66, N = 919.1419.2820.49MIN: 17.51 / MAX: 352.05MIN: 16.96 / MAX: 430.49MIN: 17.45 / MAX: 438.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet12348121620SE +/- 0.70, N = 11SE +/- 1.06, N = 12SE +/- 0.35, N = 915.6416.2613.77MIN: 12.3 / MAX: 352.98MIN: 12.28 / MAX: 390.46MIN: 12.4 / MAX: 378.941. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v212348121620SE +/- 0.51, N = 12SE +/- 0.14, N = 11SE +/- 0.93, N = 914.9214.6515.75MIN: 13.15 / MAX: 283.76MIN: 13.43 / MAX: 114.64MIN: 12.97 / MAX: 295.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v312348121620SE +/- 0.55, N = 12SE +/- 0.34, N = 12SE +/- 1.16, N = 914.5814.2215.95MIN: 12.63 / MAX: 382.37MIN: 12.46 / MAX: 387.96MIN: 12.47 / MAX: 359.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v212348121620SE +/- 0.35, N = 12SE +/- 0.63, N = 12SE +/- 0.24, N = 915.5916.3014.97MIN: 12.98 / MAX: 359.55MIN: 13.31 / MAX: 389.24MIN: 13.4 / MAX: 357.211. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet123918273645SE +/- 1.11, N = 12SE +/- 0.72, N = 12SE +/- 1.11, N = 937.3033.7834.39MIN: 29.33 / MAX: 427.64MIN: 29.45 / MAX: 408.23MIN: 29.33 / MAX: 412.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.11232K4K6K8K10KSE +/- 13.38, N = 3SE +/- 49.17, N = 3SE +/- 6.74, N = 39412945194171. (CXX) g++ options: -O3 -pthread

AI Benchmark Alpha

Device AI Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI Score12500100015002000250022702317

AI Benchmark Alpha

Device Training Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training Score122004006008001000966998

AI Benchmark Alpha

Device Inference Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference Score123006009001200150013041319

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1233K6K9K12K15KSE +/- 48.11, N = 3SE +/- 123.89, N = 10SE +/- 122.96, N = 1113991.113295.013669.2MIN: 13812.5MIN: 12213.4MIN: 12754.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search1234080120160200SE +/- 1.51, N = 10SE +/- 0.23, N = 3SE +/- 0.19, N = 3163.94162.56161.631. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12360120180240300SE +/- 0.54, N = 3SE +/- 0.29, N = 3SE +/- 0.70, N = 3296.80298.31295.79

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1233K6K9K12K15KSE +/- 141.29, N = 3SE +/- 53.39, N = 3SE +/- 128.38, N = 1213826.213783.313668.4MIN: 13450.2MIN: 13567.9MIN: 12362.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1260K120K180K240K300K2868472910131. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj123510152025SE +/- 0.25, N = 15SE +/- 0.37, N = 15SE +/- 0.44, N = 1218.2419.3019.40MIN: 16.42 / MAX: 21.1MIN: 16.68 / MAX: 22.31MIN: 16.59 / MAX: 22.2

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj12348121620SE +/- 0.39, N = 12SE +/- 0.39, N = 12SE +/- 0.30, N = 1517.6817.2417.40MIN: 15.66 / MAX: 20.6MIN: 14.95 / MAX: 20.29MIN: 15.12 / MAX: 19.79

Zstd Compression

Compression Level: 19

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 191231020304050SE +/- 0.37, N = 15SE +/- 0.34, N = 3SE +/- 0.39, N = 333.942.036.21. (CC) gcc options: -O3 -pthread -lz -llzma

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 20.12, N = 15SE +/- 4.46, N = 3SE +/- 18.60, N = 39106.99217.29130.51. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231122334455SE +/- 0.43, N = 15SE +/- 0.77, N = 3SE +/- 0.01, N = 346.6047.5545.941. (CC) gcc options: -O3

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth12316M32M48M64M80MSE +/- 738089.20, N = 3SE +/- 126998.98, N = 3SE +/- 633798.86, N = 3738040017489518772630737

Zstd Compression

Compression Level: 3

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 31238001600240032004000SE +/- 88.27, N = 12SE +/- 129.73, N = 12SE +/- 107.86, N = 153606.73798.93781.21. (CC) gcc options: -O3 -pthread -lz -llzma

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon123510152025SE +/- 0.31, N = 15SE +/- 0.25, N = 15SE +/- 0.26, N = 1521.9622.5022.27MIN: 19.03 / MAX: 24.66MIN: 20.59 / MAX: 25.14MIN: 20.37 / MAX: 25.26

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1233K6K9K12K15KSE +/- 110.93, N = 3SE +/- 161.18, N = 3SE +/- 132.58, N = 313861.213529.013753.9MIN: 13490.9MIN: 13190.1MIN: 12649.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1238001600240032004000SE +/- 10.92, N = 3SE +/- 46.19, N = 3SE +/- 38.84, N = 83802.523821.243711.96MIN: 3624.28MIN: 3746.24MIN: 3503.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VKMark

Resolution: 1280 x 1024

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1280 x 102412313002600390052006500SE +/- 2.19, N = 3SE +/- 5.90, N = 3SE +/- 2.33, N = 35850564456401. (CXX) g++ options: -ldl -pipe -std=c++14 -fPIC -MD -MQ -MF

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801239001800270036004500SE +/- 2.96, N = 3SE +/- 2.65, N = 3SE +/- 2.85, N = 34384426742681. (CXX) g++ options: -ldl -pipe -std=c++14 -fPIC -MD -MQ -MF

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1231.13782.27563.41344.55125.689SE +/- 0.025, N = 3SE +/- 0.054, N = 12SE +/- 0.019, N = 35.0364.9945.057

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark123246810SE +/- 0.03, N = 3SE +/- 0.12, N = 4SE +/- 0.04, N = 38.478.258.211. Nodejs v12.18.2

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.41810.83621.25431.67242.0905SE +/- 0.004, N = 3SE +/- 0.004, N = 3SE +/- 0.003, N = 21.8341.8581.8371. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile1220406080100SE +/- 0.24, N = 3SE +/- 0.21, N = 396.2096.34

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon123510152025SE +/- 0.09, N = 3SE +/- 0.26, N = 15SE +/- 0.19, N = 1521.3221.6721.56MIN: 20.36 / MAX: 22.38MIN: 19.41 / MAX: 23.55MIN: 19.33 / MAX: 23.88

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile12320406080100SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 393.2592.9893.06

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1238001600240032004000SE +/- 3.43, N = 3SE +/- 13.99, N = 3SE +/- 10.39, N = 33808.573780.473777.65MIN: 3705.89MIN: 3731.84MIN: 3644.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1238001600240032004000SE +/- 16.86, N = 3SE +/- 8.72, N = 3SE +/- 8.76, N = 33749.183782.363783.40MIN: 3631.66MIN: 3756.39MIN: 3758.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive1231632486480SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.10, N = 373.3673.1473.101. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.84641.69282.53923.38564.232SE +/- 0.07703, N = 15SE +/- 0.07933, N = 15SE +/- 0.01146, N = 33.696573.761643.41699MIN: 3.25MIN: 3.26MIN: 3.221. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075SE +/- 0.47, N = 3SE +/- 0.25, N = 3SE +/- 0.13, N = 367.2167.4667.251. (CC) gcc options: -O2 -ldl -lz -lpthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 64.14, N = 3SE +/- 19.51, N = 3SE +/- 57.74, N = 39128.29191.99188.31. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231122334455SE +/- 0.54, N = 3SE +/- 0.25, N = 3SE +/- 0.29, N = 347.4647.1246.871. (CC) gcc options: -O3

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.21470.42940.64410.85881.0735SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.004, N = 30.9540.9510.953

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.0720.1440.2160.2880.36SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 30.3150.3200.318

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1233691215SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.03, N = 311.0410.9611.04

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.47150.9431.41451.8862.3575SE +/- 0.05725, N = 15SE +/- 0.05017, N = 15SE +/- 0.03417, N = 151.657672.095672.00984MIN: 1.11MIN: 1.4MIN: 1.391. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown123612182430SE +/- 0.33, N = 15SE +/- 0.22, N = 3SE +/- 0.31, N = 324.0224.6224.36MIN: 19.33 / MAX: 25.81MIN: 23.95 / MAX: 25.61MIN: 23 / MAX: 25.59

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1233691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 310.2910.3810.321. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.090.180.270.360.45SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.401. (CXX) g++ options: -O3 -pthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 310.4710.5610.531. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.11480.22960.34440.45920.574SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.510.500.511. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.1170.2340.3510.4680.585SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.520.520.521. (CXX) g++ options: -O3 -pthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 31653.62, N = 15SE +/- 31097.88, N = 15SE +/- 27660.05, N = 152201882.032099666.522091130.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1231224364860SE +/- 0.57, N = 3SE +/- 0.20, N = 2SE +/- 0.68, N = 351.751.651.81. (CC) gcc options: -fopenmp -O3 -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.0990.1980.2970.3960.495SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.440.440.441. (CXX) g++ options: -O3 -pthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1231122334455SE +/- 0.22, N = 3SE +/- 0.10, N = 3SE +/- 0.24, N = 347.6047.6048.161. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.28670.57340.86011.14681.4335SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 31.2701.2731.274

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time12312M24M36M48M60MSE +/- 796222.48, N = 3SE +/- 577141.31, N = 3SE +/- 749500.39, N = 34982125455404772549840121. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 44065.81, N = 15SE +/- 132410.87, N = 12SE +/- 139985.77, N = 122431710.352058658.411998586.161. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123714212835SE +/- 0.06, N = 4SE +/- 0.17, N = 4SE +/- 0.07, N = 430.8531.1531.031. (CC) gcc options: -O2 -std=c99

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 20792.00, N = 15SE +/- 25792.40, N = 15SE +/- 16528.72, N = 31892562.211826296.851974193.461. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K12348121620SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 316.6215.2616.551. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1231020304050SE +/- 0.18, N = 3SE +/- 0.15, N = 3SE +/- 0.18, N = 346.0830.9232.70

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lap123100200300400500SE +/- 3.42, N = 3SE +/- 3.25, N = 3SE +/- 1.25, N = 3448.24441.37446.381. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect123400800120016002000SE +/- 0.03, N = 3SE +/- 0.18, N = 3SE +/- 0.10, N = 31762.981762.781762.551. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute12360120180240300SE +/- 0.83, N = 3SE +/- 0.53, N = 3SE +/- 0.38, N = 3268.72268.36267.901. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavy1234080120160200SE +/- 0.36, N = 3SE +/- 0.37, N = 3SE +/- 0.29, N = 3192.54192.43192.221. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lSPIRV-Tools -lSPIRV-Tools-opt -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite12120K240K360K480K600KSE +/- 2547.76, N = 3SE +/- 145.08, N = 3577229579360

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 34.06, N = 3SE +/- 9.81, N = 3SE +/- 47.16, N = 39506.29647.99540.01. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed1232K4K6K8K10KSE +/- 65.90, N = 3SE +/- 79.09, N = 3SE +/- 51.76, N = 38518.048619.848508.641. (CC) gcc options: -O3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.64331.28661.92992.57323.2165SE +/- 0.006, N = 3SE +/- 0.018, N = 3SE +/- 0.012, N = 32.8592.8302.853

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.19, N = 12SE +/- 0.15, N = 15SE +/- 0.01, N = 311.4711.2110.93MIN: 10.94MIN: 10.67MIN: 10.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest1233691215SE +/- 0.24, N = 15SE +/- 0.02, N = 3SE +/- 0.04, N = 312.7012.5412.551. (CXX) g++ options: -O3 -O2 -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.33520.67041.00561.34081.676SE +/- 0.01123, N = 15SE +/- 0.00191, N = 3SE +/- 0.00013, N = 31.469281.485491.48957MIN: 1.34MIN: 1.45MIN: 1.441. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown123510152025SE +/- 0.21, N = 3SE +/- 0.38, N = 3SE +/- 0.22, N = 322.7322.2422.65MIN: 21.97 / MAX: 23.66MIN: 20.82 / MAX: 23.48MIN: 21.43 / MAX: 23.62

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.6M3.2M4.8M6.4M8MSE +/- 7194.80, N = 3SE +/- 6808.24, N = 3SE +/- 5337.04, N = 37329531737137473667011. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second123200K400K600K800K1000KSE +/- 2731.02, N = 3SE +/- 686.93, N = 3SE +/- 1713.53, N = 31146059.291148000.041146667.411. (CC) gcc options: -O2 -lrt" -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast123612182430SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 323.5423.5723.611. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123612182430SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 325.2325.2225.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest1233691215SE +/- 0.23, N = 15SE +/- 0.01, N = 3SE +/- 0.01, N = 310.7510.5210.531. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE12348121620SE +/- 0.06, N = 5SE +/- 0.06, N = 5SE +/- 0.06, N = 514.0314.0114.041. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis120.17750.3550.53250.710.8875SE +/- 0.004, N = 3SE +/- 0.011, N = 150.7890.731MIN: 0.57 / MAX: 1.63MIN: 0.49 / MAX: 1.57

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123612182430SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.18, N = 326.9927.1326.921. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack123691215SE +/- 0.01, N = 5SE +/- 0.01, N = 513.2013.221. (CXX) g++ options: -rdynamic

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123714212835SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 327.6927.8427.781. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.67891.35782.03672.71563.3945SE +/- 0.02534, N = 3SE +/- 0.01913, N = 3SE +/- 0.01010, N = 32.995593.017213.01645MIN: 2.82MIN: 2.84MIN: 2.851. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single1231122334455SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 349.0149.0049.021. (CXX) g++ options: -O3 -pthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.04, N = 3SE +/- 0.15, N = 6SE +/- 0.12, N = 312.1512.2612.161. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212348121620SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 315.7915.7915.761. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast123918273645SE +/- 0.28, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 339.6839.6739.391. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU123246810SE +/- 0.05437, N = 3SE +/- 0.08570, N = 3SE +/- 0.09143, N = 36.277156.250116.21623MIN: 5.63MIN: 5.72MIN: 5.671. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.47840.95681.43521.91362.392SE +/- 0.00488, N = 3SE +/- 0.00369, N = 3SE +/- 0.00796, N = 32.124082.108932.12611MIN: 2.04MIN: 2.03MIN: 2.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123918273645SE +/- 0.23, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 341.0240.3641.061. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1233691215SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 39.549.499.451. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode123246810SE +/- 0.012, N = 5SE +/- 0.012, N = 5SE +/- 0.021, N = 57.7537.7957.7741. (CXX) g++ options: -fvisibility=hidden -logg -lm

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 18400.26, N = 3SE +/- 19080.84, N = 4SE +/- 13973.47, N = 31368828.881340079.291366164.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 19958.53, N = 3SE +/- 23714.82, N = 4SE +/- 22941.42, N = 31675529.381654163.601577289.791. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 310.0410.0610.07

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast1231428425670SE +/- 0.21, N = 3SE +/- 0.20, N = 3SE +/- 0.26, N = 360.6960.6960.791. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.78991.57982.36973.15963.9495SE +/- 0.02421, N = 3SE +/- 0.04824, N = 3SE +/- 0.03734, N = 33.453703.490643.51087MIN: 1.93MIN: 2.01MIN: 21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 0123246810SE +/- 0.057, N = 3SE +/- 0.021, N = 3SE +/- 0.019, N = 37.5957.6267.6001. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1233691215SE +/- 0.36, N = 12SE +/- 0.30, N = 15SE +/- 0.15, N = 313.4012.9112.551. (CXX) g++ options: -O3 -pthread -lm

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108012320406080100SE +/- 1.02, N = 3SE +/- 1.01, N = 3SE +/- 0.93, N = 3106.6107.1106.21. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium123246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.346.326.351. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123612182430SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 325.1625.1025.10MIN: 24.12MIN: 24.22MIN: 23.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 319.9720.1520.25MIN: 14.86MIN: 19.19MIN: 19.161. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1231.17452.3493.52354.6985.8725SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 35.225.225.201. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123306090120150SE +/- 0.15, N = 3SE +/- 0.66, N = 3SE +/- 0.26, N = 3116.80116.09116.681. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

yquake2

Renderer: OpenGL 1.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 1080123150300450600750SE +/- 4.44, N = 3SE +/- 6.33, N = 8SE +/- 5.54, N = 15683.8619.9623.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No10.46170.92341.38511.84682.30852.052

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.78051.5612.34153.1223.9025SE +/- 0.00942, N = 3SE +/- 0.00155, N = 3SE +/- 0.02456, N = 33.437273.449803.46909MIN: 3.34MIN: 3.36MIN: 3.351. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1231.33622.67244.00865.34486.681SE +/- 0.01022, N = 3SE +/- 0.00211, N = 3SE +/- 0.00240, N = 35.926045.938565.92300MIN: 5.7MIN: 5.71MIN: 5.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

yquake2

Renderer: OpenGL 3.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10801232004006008001000SE +/- 12.14, N = 3SE +/- 13.22, N = 3SE +/- 10.37, N = 3949.7972.2972.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1232K4K6K8K10KSE +/- 250.74, N = 3SE +/- 181.12, N = 3SE +/- 206.43, N = 310862.0410831.9710972.491. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.13360.26720.40080.53440.668SE +/- 0.00203, N = 3SE +/- 0.01337, N = 3SE +/- 0.00691, N = 30.571200.583000.593851. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.34580.69161.03741.38321.729SE +/- 0.00435, N = 3SE +/- 0.00474, N = 3SE +/- 0.00358, N = 31.536811.534531.535091. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00640.01280.01920.02560.032SE +/- 0.00038, N = 3SE +/- 0.00067, N = 3SE +/- 0.00018, N = 30.028190.027980.028661. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1230.30110.60220.90331.20441.5055SE +/- 0.05676, N = 3SE +/- 0.03714, N = 3SE +/- 0.07867, N = 31.309661.307711.338201. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.97671.95342.93013.90684.8835SE +/- 0.25728, N = 3SE +/- 0.03755, N = 3SE +/- 0.08946, N = 34.001644.341064.174281. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM1233691215SE +/- 0.86774, N = 3SE +/- 0.14154, N = 3SE +/- 0.34471, N = 312.617278.8030313.034171. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1233691215SE +/- 0.04480, N = 3SE +/- 0.17781, N = 3SE +/- 0.14488, N = 310.046699.740839.676461. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4