Core i7 4790K 202

Intel Core i7-4790K testing with a Gigabyte Z97-HD3P (F4 BIOS) and Gigabyte Intel Haswell Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012207-HA-COREI747935&grs&sor.

Core i7 4790K 202ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i7-4790K @ 4.40GHz (4 Cores / 8 Threads)Gigabyte Z97-HD3P (F4 BIOS)Intel 4th Gen Core DRAM16GB120GB OCZ TRION100Gigabyte Intel Haswell Desktop 2GB (1250MHz)Intel Xeon E3-1200 v3/4thLG Ultra HDRealtek RTL8111/8168/8411Ubuntu 19.105.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.5 Mesa 19.2.81.1.102GCC 9.2.1 20191008ext43840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 1.9 Java Details- OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10)Python Details- Python 2.7.17 + Python 3.7.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Core i7 4790K 202onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUncnn: CPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v2-v2 - mobilenet-v2onednn: IP Shapes 3D - f32 - CPUwaifu2x-ncnn: 2x - 3 - Yesonednn: Deconvolution Batch shapes_3d - f32 - CPUhpcc: EP-DGEMMncnn: CPU - efficientnet-b0ncnn: Vulkan GPU - yolov4-tinyhpcc: Max Ping Pong Bandwidthonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUncnn: CPU - squeezenet_ssdsimdjson: DistinctUserIDncnn: CPU - blazefacencnn: Vulkan GPU - efficientnet-b0hpcc: G-Rand Accessrav1e: 10rav1e: 6ncnn: CPU - mnasnetbasis: UASTC Level 2 + RDO Post-Processingsimdjson: PartialTweetsncnn: CPU-v3-v3 - mobilenet-v3yquake2: Software CPU - 3840 x 2160onednn: Convolution Batch Shapes Auto - f32 - CPUncnn: CPU - mobilenetncnn: Vulkan GPU - mnasnetrav1e: 1compress-lz4: 9 - Compression Speedhpcc: G-HPLbasis: UASTC Level 0sunflow: Global Illumination + Image Synthesislibplacebo: hdr_peakdetectnode-web-tooling: ncnn: Vulkan GPU - googlenetncnn: CPU - yolov4-tinyyquake2: Software CPU - 2560 x 1440ncnn: CPU - googlenetbuild2: Time To Compilerav1e: 5ncnn: CPU - resnet50onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUsqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - vgg16redis: SADDonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUembree: Pathtracer ISPC - Asian Dragoncoremark: CoreMark Size 666 - Iterations Per Secondncnn: Vulkan GPU - mobilenetncnn: CPU - resnet18ncnn: Vulkan GPU-v3-v3 - mobilenet-v3gromacs: Water Benchmarkncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - squeezenet_ssdmafft: Multiple Sequence Alignment - LSU RNAncnn: CPU - regnety_400monednn: Recurrent Neural Network Inference - f32 - CPUbasis: ETC1Sonednn: Recurrent Neural Network Training - f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUembree: Pathtracer ISPC - Crowncompress-lz4: 3 - Compression Speedyquake2: OpenGL 1.x - 2560 x 1440x265: Bosphorus 1080pstockfish: Total Timeonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUbrl-cad: VGR Performance Metriclibplacebo: polar_nocomputencnn: CPU - alexnetnumpy: embree: Pathtracer - Asian Dragononednn: IP Shapes 1D - u8s8f32 - CPUbuild-ffmpeg: Time To Compilencnn: Vulkan GPU - shufflenet-v2embree: Pathtracer ISPC - Asian Dragon Objyquake2: OpenGL 1.x - 1920 x 1080astcenc: Fastonednn: Recurrent Neural Network Training - u8s8f32 - CPUx265: Bosphorus 4Kembree: Pathtracer - Crownkvazaar: Bosphorus 1080p - Ultra Fastonednn: Recurrent Neural Network Inference - u8s8f32 - CPUhpcc: EP-STREAM Triadncnn: Vulkan GPU - vgg16indigobench: CPU - Bedroomonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-wavpack: WAV To WavPackddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapasmfish: 1024 Hash Memory, 26 Depthlibplacebo: deband_heavyembree: Pathtracer - Asian Dragon Objcompress-lz4: 1 - Compression Speedkvazaar: Bosphorus 4K - Mediumencode-ape: WAV To APEindigobench: CPU - Supercaryquake2: Software CPU - 1920 x 1080yquake2: OpenGL 3.x - 1920 x 1080compress-lz4: 1 - Decompression Speedkvazaar: Bosphorus 4K - Ultra Fastyquake2: OpenGL 3.x - 3840 x 2160ncnn: CPU - shufflenet-v2yquake2: OpenGL 1.x - 3840 x 2160basis: UASTC Level 2ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - Multeasymapkvazaar: Bosphorus 1080p - Mediumbuild-eigen: Time To Compilekvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 1080p - Very Fastncnn: Vulkan GPU - resnet18ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2phpbench: PHP Benchmark Suiteddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2kvazaar: Bosphorus 1080p - Slowcompress-lz4: 3 - Decompression Speedastcenc: Exhaustivehmmer: Pfam Database Searchyquake2: OpenGL 3.x - 2560 x 1440compress-lz4: 9 - Decompression Speedbasis: UASTC Level 3astcenc: Thoroughlibplacebo: av1_grain_lapastcenc: Mediumkvazaar: Bosphorus 4K - Slowsimdjson: LargeRandsimdjson: Kostyaclomp: Static OMP Speedupncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - blazefaceredis: SETredis: GETredis: LPUSHredis: LPOPespeak: Text-To-Speech Synthesisonednn: IP Shapes 3D - u8s8f32 - CPUlammps: Rhodopsin Proteinhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Ptranshpcc: G-Fftebetsy: ETC2 RGB - Highestbetsy: ETC1 - Highestwaifu2x-ncnn: 2x - 3 - No1236.865088.258.3514.25216.95016.540443.7908711.2846.1719281.57212.393933.220.772.5711.330.026502.5141.0746.83945.4750.736.9127.826.418531.476.800.25551.2890.3130810.5802.28048607.6011.0623.6344.0860.323.93252.4560.82850.696.1677169.413109.272112933.0226.071511.32646.5662159958.25751431.3525.787.050.41022.1433.7712.04516.394730.4073.3908495.759.145294741.155.388852.25130.429.9479901128489.114817513186.9122.24313.685.58065.21940124.8319.525.8902206.87.658480.056.554.750743.434741.204.03790109.450.75010.746013.794101.151224916118.955.20355671.512.2112.1881.68498.0200.96623.310.9862.89.4767.174.38729.589.7370.6616.0624.5725.7913.0371029345.469.466530.1568.11123.727125.96532.7145.88469.57711.8610.392.160.450.681.316.5751.242.581884493.332540423.881611833.092687331.8349.7334.259142.3183.393650.278790.493982.065266.7116.97524.7816.943589.039.1015.07416.53517.533442.5374711.8045.8019517.86012.603634.570.742.6411.710.026252.5951.1087.04949.4050.717.0928.026.922632.226.960.26151.2890.1368210.7222.29349578.6211.2723.5344.9059.724.27250.2640.82351.566.0641869.860110.792140255.0026.140011.28606.6290159234.48205931.7626.117.140.41322.4134.1811.92516.584693.2472.6198410.769.096584713.765.346952.40129.830.1679862878413.384823713223.2022.43313.695.61395.17862125.7939.455.9266208.27.708426.986.594.742343.684714.604.04885109.700.75410.721513.726100.651225497318.955.21015687.382.2212.2431.68797.6200.16643.610.9662.69.4567.274.24829.609.7570.7316.0724.5625.8113.0570996545.409.466527.5567.56123.806125.86537.8145.79469.61711.5110.392.160.450.681.117.4160.767.671808870.972339268.781622814.521682927.1651.4674.164332.2572.829670.327000.495181.956846.6686.28125.9408.223588.548.5915.42156.82416.634744.6802711.6044.1618703.02812.097433.460.742.5511.560.027132.5521.0796.84972.1340.737.1028.527.058831.646.810.25950.1192.1595710.4952.32949012.3211.2223.9744.4560.823.85254.6480.81451.486.0840270.595108.962104918.2625.743511.16416.5417161328.40394531.5325.787.060.40822.2834.1811.90416.454743.2973.2798446.759.187434758.635.339251.92131.029.8979187998482.734859913109.5522.25316.255.56965.18374125.6839.495.8849207.67.708475.516.564.722543.614729.754.06038109.110.75210.778413.724100.851230948619.045.18605697.272.2112.2171.69197.8200.56648.611.0062.79.4867.074.43729.539.7370.6016.0724.6025.7713.0370928245.449.456523.6568.09123.832125.96536.6145.87069.61711.7610.392.160.450.681.316.5952.102.611916548.162314175.991655968.921695087.3351.2135.296262.2583.213550.278470.479801.893106.4146.37425.933OpenBenchmarking.org

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time321714212835Min: 4.01 / Avg: 22.02 / Max: 31.07Min: 5.55 / Avg: 22.02 / Max: 28.23Min: 3.55 / Avg: 22.02 / Max: 28.051. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time312510152025Min: 4.46 / Avg: 9.94 / Max: 17Min: 4.45 / Avg: 9.95 / Max: 18.41Min: 4.4 / Avg: 9.98 / Max: 16.991. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time2131326395265Min: 5.76 / Avg: 33.96 / Max: 57.53Min: 6.24 / Avg: 33.97 / Max: 65.29Min: 5.77 / Avg: 34.02 / Max: 65.991. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810SE +/- 0.00679, N = 3SE +/- 0.09722, N = 3SE +/- 0.11002, N = 156.865086.943588.22358MIN: 6.69MIN: 6.67MIN: 7.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

MinAvgMax271.676.684.9372.076.782.4172.376.885.1OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time204060801001. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21323691215SE +/- 0.10, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 38.258.549.03MIN: 8.02 / MAX: 11.6MIN: 8.16 / MAX: 12.1MIN: 8.75 / MAX: 11.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21323691215SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 38.358.599.10MIN: 7.99 / MAX: 21.73MIN: 8.33 / MAX: 13.54MIN: 8.89 / MAX: 12.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU12348121620SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 1514.2515.0715.42MIN: 14.13MIN: 14.72MIN: 14.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes231246810SE +/- 0.096, N = 4SE +/- 0.085, N = 15SE +/- 0.059, N = 126.5356.8246.950

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU13248121620SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.26, N = 316.5416.6317.53MIN: 16.27MIN: 16.28MIN: 16.551. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM3121020304050SE +/- 0.66, N = 3SE +/- 0.53, N = 3SE +/- 0.10, N = 344.6843.7942.541. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01323691215SE +/- 0.05, N = 3SE +/- 0.13, N = 3SE +/- 0.10, N = 311.2811.6011.80MIN: 10.83 / MAX: 19.59MIN: 11.37 / MAX: 26.59MIN: 11.52 / MAX: 12.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny3211020304050SE +/- 0.15, N = 3SE +/- 0.74, N = 3SE +/- 0.68, N = 344.1645.8046.17MIN: 43.39 / MAX: 57.44MIN: 43.83 / MAX: 58.33MIN: 44.33 / MAX: 611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth2134K8K12K16K20KSE +/- 639.68, N = 3SE +/- 146.25, N = 3SE +/- 289.88, N = 319517.8619281.5718703.031. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU3123691215SE +/- 0.06, N = 3SE +/- 0.20, N = 3SE +/- 0.18, N = 312.1012.3912.60MIN: 10.95MIN: 11.1MIN: 11.171. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd132816243240SE +/- 0.07, N = 3SE +/- 0.27, N = 3SE +/- 0.57, N = 333.2233.4634.57MIN: 32.95 / MAX: 34.34MIN: 32.94 / MAX: 43.71MIN: 33.72 / MAX: 36.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1320.17330.34660.51990.69320.8665SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 40.770.740.741. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface3120.5941.1881.7822.3762.97SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 32.552.572.64MIN: 2.49 / MAX: 2.61MIN: 2.37 / MAX: 2.66MIN: 2.6 / MAX: 2.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b01323691215SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 311.3311.5611.71MIN: 11.18 / MAX: 14.87MIN: 11.28 / MAX: 25.04MIN: 11.47 / MAX: 14.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access3120.00610.01220.01830.02440.0305SE +/- 0.00050, N = 3SE +/- 0.00052, N = 3SE +/- 0.00077, N = 30.027130.026500.026251. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 102310.58391.16781.75172.33562.9195SE +/- 0.044, N = 3SE +/- 0.015, N = 3SE +/- 0.012, N = 32.5952.5522.514

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 62310.24930.49860.74790.99721.2465SE +/- 0.004, N = 3SE +/- 0.013, N = 3SE +/- 0.011, N = 31.1081.0791.074

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet132246810SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 36.836.847.04MIN: 6.61 / MAX: 7.1MIN: 6.59 / MAX: 10.33MIN: 6.85 / MAX: 9.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 1.28, N = 3SE +/- 2.85, N = 3SE +/- 12.55, N = 4945.48949.41972.131. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3120.16430.32860.49290.65720.8215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 30.730.730.711. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 36.917.097.10MIN: 6.69 / MAX: 12.06MIN: 6.9 / MAX: 10.15MIN: 6.88 / MAX: 9.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

Renderer: Software CPU - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 3840 x 2160321714212835SE +/- 0.03, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 328.528.027.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 326.4226.9227.06MIN: 26.19MIN: 26.51MIN: 26.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet132714212835SE +/- 0.26, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 331.4731.6432.22MIN: 30.7 / MAX: 45.93MIN: 31.25 / MAX: 55.14MIN: 31.89 / MAX: 45.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet132246810SE +/- 0.06, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 36.806.816.96MIN: 6.62 / MAX: 10.18MIN: 6.53 / MAX: 7.12MIN: 6.68 / MAX: 21.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 12310.05870.11740.17610.23480.2935SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 30.2610.2590.255

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed2131224364860SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.85, N = 351.2851.2850.111. (CC) gcc options: -O3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL31220406080100SE +/- 1.45, N = 3SE +/- 1.21, N = 4SE +/- 1.01, N = 692.1690.3190.141. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 03123691215SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 310.5010.5810.721. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.5241.0481.5722.0962.62SE +/- 0.012, N = 3SE +/- 0.019, N = 3SE +/- 0.024, N = 32.2802.2932.329MIN: 2.17 / MAX: 2.91MIN: 2.17 / MAX: 2.99MIN: 2.21 / MAX: 3.04

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect23111K22K33K44K55KSE +/- 90.21, N = 3SE +/- 107.39, N = 3SE +/- 736.78, N = 349578.6249012.3248607.601. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark2313691215SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 311.2711.2211.061. Nodejs v10.15.2

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet213612182430SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 323.5323.6323.97MIN: 23.31 / MAX: 26.84MIN: 23.33 / MAX: 36.77MIN: 23.72 / MAX: 36.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1321020304050SE +/- 0.72, N = 3SE +/- 0.38, N = 3SE +/- 0.29, N = 344.0844.4544.90MIN: 42.69 / MAX: 58.93MIN: 43.34 / MAX: 46.59MIN: 44.01 / MAX: 52.51. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

Renderer: Software CPU - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 2560 x 14403121428425670SE +/- 0.19, N = 3SE +/- 0.31, N = 3SE +/- 0.43, N = 360.860.359.71. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet312612182430SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.12, N = 323.8523.9324.27MIN: 23.58 / MAX: 37.59MIN: 23.66 / MAX: 24.5MIN: 23.92 / MAX: 271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile21360120180240300SE +/- 1.16, N = 3SE +/- 1.13, N = 3SE +/- 2.21, N = 3250.26252.46254.65

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.18630.37260.55890.74520.9315SE +/- 0.005, N = 3SE +/- 0.013, N = 3SE +/- 0.006, N = 30.8280.8230.814

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501321224364860SE +/- 0.10, N = 3SE +/- 0.47, N = 3SE +/- 0.27, N = 350.6951.4851.56MIN: 50.36 / MAX: 63.24MIN: 50.22 / MAX: 66.07MIN: 50.54 / MAX: 64.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU231246810SE +/- 0.01547, N = 3SE +/- 0.06654, N = 3SE +/- 0.01016, N = 36.064186.084026.16771MIN: 5.69MIN: 5.81MIN: 5.881. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231632486480SE +/- 0.25, N = 3SE +/- 0.12, N = 3SE +/- 0.54, N = 369.4169.8670.601. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1631220406080100SE +/- 0.12, N = 3SE +/- 0.57, N = 3SE +/- 0.82, N = 3108.96109.27110.79MIN: 108.21 / MAX: 115.52MIN: 108.04 / MAX: 121.43MIN: 109.24 / MAX: 127.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD213500K1000K1500K2000K2500KSE +/- 8400.74, N = 3SE +/- 33704.56, N = 12SE +/- 30741.46, N = 132140255.002112933.022104918.261. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU312612182430SE +/- 0.09, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 325.7426.0726.14MIN: 25.45MIN: 25.78MIN: 25.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU3213691215SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.1611.2911.33MIN: 10.46MIN: 10.69MIN: 10.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon213246810SE +/- 0.0431, N = 3SE +/- 0.0417, N = 3SE +/- 0.0287, N = 36.62906.56626.5417MIN: 6.51 / MAX: 6.81MIN: 6.46 / MAX: 6.79MIN: 6.45 / MAX: 6.74

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second31230K60K90K120K150KSE +/- 104.15, N = 3SE +/- 270.17, N = 3SE +/- 1169.64, N = 3161328.40159958.26159234.481. (CC) gcc options: -O2 -lrt" -lrt

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet132714212835SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.24, N = 331.3531.5331.76MIN: 31.01 / MAX: 43.41MIN: 31.22 / MAX: 33.07MIN: 31.13 / MAX: 48.021. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18132612182430SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 325.7825.7826.11MIN: 25.38 / MAX: 39.91MIN: 25.3 / MAX: 26.66MIN: 25.75 / MAX: 41.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3132246810SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.057.067.14MIN: 6.84 / MAX: 8.55MIN: 6.87 / MAX: 10.89MIN: 6.95 / MAX: 21.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark2130.09290.18580.27870.37160.4645SE +/- 0.003, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 40.4130.4100.4081. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet132510152025SE +/- 0.13, N = 3SE +/- 0.10, N = 3SE +/- 0.17, N = 322.1422.2822.41MIN: 21.81 / MAX: 24.68MIN: 21.96 / MAX: 22.64MIN: 21.96 / MAX: 34.321. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd123816243240SE +/- 0.38, N = 3SE +/- 0.27, N = 3SE +/- 0.14, N = 333.7734.1834.18MIN: 32.69 / MAX: 43.46MIN: 32.78 / MAX: 44.2MIN: 33.24 / MAX: 46.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA3213691215SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 311.9011.9312.051. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m13248121620SE +/- 0.05, N = 3SE +/- 0.12, N = 3SE +/- 0.05, N = 316.3916.4516.58MIN: 16.23 / MAX: 18.48MIN: 16.16 / MAX: 17.78MIN: 16.23 / MAX: 31.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU21310002000300040005000SE +/- 15.09, N = 3SE +/- 0.95, N = 3SE +/- 3.10, N = 34693.244730.404743.29MIN: 4657.74MIN: 4721.36MIN: 4730.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S2311632486480SE +/- 0.68, N = 3SE +/- 0.38, N = 3SE +/- 0.54, N = 372.6273.2873.391. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU2312K4K6K8K10KSE +/- 20.81, N = 3SE +/- 15.35, N = 3SE +/- 17.63, N = 38410.768446.758495.75MIN: 8366.44MIN: 8413.69MIN: 8463.891. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2133691215SE +/- 0.14569, N = 3SE +/- 0.03949, N = 3SE +/- 0.04734, N = 39.096589.145299.18743MIN: 8.69MIN: 8.74MIN: 8.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU21310002000300040005000SE +/- 16.47, N = 3SE +/- 15.27, N = 3SE +/- 24.18, N = 34713.764741.154758.63MIN: 4686.24MIN: 4705.91MIN: 4700.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown1231.21252.4253.63754.856.0625SE +/- 0.0366, N = 3SE +/- 0.0293, N = 3SE +/- 0.0582, N = 35.38885.34695.3392MIN: 5.28 / MAX: 5.51MIN: 5.26 / MAX: 5.47MIN: 5.19 / MAX: 5.51

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed2131224364860SE +/- 0.02, N = 3SE +/- 0.09, N = 3SE +/- 0.23, N = 352.4052.2551.921. (CC) gcc options: -O3

yquake2

Renderer: OpenGL 1.x - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 2560 x 1440312306090120150SE +/- 0.12, N = 3SE +/- 0.37, N = 3SE +/- 0.55, N = 3131.0130.4129.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p213714212835SE +/- 0.14, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 330.1629.9429.891. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1232M4M6M8M10MSE +/- 42720.90, N = 3SE +/- 53934.69, N = 3SE +/- 100608.19, N = 37990112798628779187991. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU2312K4K6K8K10KSE +/- 13.64, N = 3SE +/- 20.61, N = 3SE +/- 11.29, N = 38413.388482.738489.11MIN: 8383.33MIN: 8441.98MIN: 8460.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric32110K20K30K40K50K4859948237481751. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute2133K6K9K12K15KSE +/- 37.92, N = 3SE +/- 29.01, N = 3SE +/- 44.87, N = 313223.2013186.9113109.551. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet132510152025SE +/- 0.12, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 322.2422.2522.43MIN: 21.89 / MAX: 24.72MIN: 22 / MAX: 36.1MIN: 21.97 / MAX: 35.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark32170140210280350SE +/- 0.35, N = 3SE +/- 0.64, N = 3SE +/- 0.12, N = 3316.25313.69313.68

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon2131.26312.52623.78935.05246.3155SE +/- 0.0161, N = 3SE +/- 0.0040, N = 3SE +/- 0.0036, N = 35.61395.58065.5696MIN: 5.54 / MAX: 5.72MIN: 5.53 / MAX: 5.66MIN: 5.5 / MAX: 5.65

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU2311.17442.34883.52324.69765.872SE +/- 0.00946, N = 3SE +/- 0.01621, N = 3SE +/- 0.00525, N = 35.178625.183745.21940MIN: 4.94MIN: 4.84MIN: 4.851. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile132306090120150SE +/- 0.10, N = 3SE +/- 0.37, N = 3SE +/- 0.58, N = 3124.83125.68125.79

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v22313691215SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 39.459.499.52MIN: 9.35 / MAX: 12.21MIN: 9.39 / MAX: 11.92MIN: 9.42 / MAX: 11.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj2131.33352.6674.00055.3346.6675SE +/- 0.0196, N = 3SE +/- 0.0176, N = 3SE +/- 0.0272, N = 35.92665.89025.8849MIN: 5.87 / MAX: 6.03MIN: 5.83 / MAX: 6MIN: 5.81 / MAX: 5.99

yquake2

Renderer: OpenGL 1.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 108023150100150200250SE +/- 0.55, N = 3SE +/- 0.32, N = 3SE +/- 0.42, N = 3208.2207.6206.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.657.707.701. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU2312K4K6K8K10KSE +/- 7.59, N = 3SE +/- 25.14, N = 3SE +/- 6.10, N = 38426.988475.518480.05MIN: 8406.64MIN: 8432.62MIN: 8460.681. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K231246810SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 36.596.566.551. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1231.06892.13783.20674.27565.3445SE +/- 0.0091, N = 3SE +/- 0.0037, N = 3SE +/- 0.0074, N = 34.75074.74234.7225MIN: 4.71 / MAX: 4.83MIN: 4.71 / MAX: 4.82MIN: 4.69 / MAX: 4.82

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast2311020304050SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 343.6843.6143.431. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU23110002000300040005000SE +/- 5.23, N = 3SE +/- 19.23, N = 3SE +/- 22.26, N = 34714.604729.754741.20MIN: 4700.85MIN: 4687.53MIN: 4710.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad3210.91361.82722.74083.65444.568SE +/- 0.01163, N = 3SE +/- 0.00333, N = 3SE +/- 0.01197, N = 34.060384.048854.037901. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1631220406080100SE +/- 0.21, N = 3SE +/- 0.26, N = 3SE +/- 0.32, N = 3109.11109.45109.70MIN: 108.06 / MAX: 125.36MIN: 108.54 / MAX: 123.94MIN: 108.72 / MAX: 136.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom2310.16970.33940.50910.67880.8485SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.7540.7520.750

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU2133691215SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 310.7210.7510.78MIN: 9.95MIN: 10.34MIN: 10.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack32148121620SE +/- 0.00, N = 5SE +/- 0.00, N = 5SE +/- 0.04, N = 513.7213.7313.791. (CXX) g++ options: -rdynamic

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap13220406080100SE +/- 0.26, N = 3SE +/- 0.07, N = 3SE +/- 0.12, N = 3101.15100.85100.65MIN: 54.32 / MAX: 224.92MIN: 37.89 / MAX: 224.47MIN: 58.03 / MAX: 227.271. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth3213M6M9M12M15MSE +/- 130255.47, N = 3SE +/- 117949.09, N = 3SE +/- 93407.88, N = 3123094861225497312249161

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavy321510152025SE +/- 0.09, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 319.0418.9518.951. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj2131.17232.34463.51694.68925.8615SE +/- 0.0125, N = 3SE +/- 0.0107, N = 3SE +/- 0.0051, N = 35.21015.20355.1860MIN: 5.17 / MAX: 5.28MIN: 5.16 / MAX: 5.26MIN: 5.15 / MAX: 5.26

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed32112002400360048006000SE +/- 2.69, N = 3SE +/- 5.97, N = 3SE +/- 7.62, N = 35697.275687.385671.511. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium2310.49950.9991.49851.9982.4975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.222.212.211. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1323691215SE +/- 0.02, N = 5SE +/- 0.06, N = 5SE +/- 0.04, N = 512.1912.2212.241. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar3210.38050.7611.14151.5221.9025SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.005, N = 31.6911.6871.684

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108013220406080100SE +/- 0.87, N = 3SE +/- 0.38, N = 3SE +/- 0.98, N = 398.097.897.61. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 3.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10801324080120160200SE +/- 0.53, N = 3SE +/- 0.95, N = 3SE +/- 0.23, N = 3200.9200.5200.11. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed32114002800420056007000SE +/- 1.39, N = 3SE +/- 4.53, N = 3SE +/- 10.10, N = 36648.66643.66623.31. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast3123691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.0010.9810.961. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

yquake2

Renderer: OpenGL 3.x - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 3840 x 21601321428425670SE +/- 0.12, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 362.862.762.61. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v22133691215SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 39.459.479.48MIN: 9.32 / MAX: 12.68MIN: 9.36 / MAX: 12.12MIN: 9.35 / MAX: 13.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

Renderer: OpenGL 1.x - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 3840 x 21602131530456075SE +/- 0.22, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 367.267.167.01. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 221320406080100SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.22, N = 374.2574.3974.441. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap213714212835SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 329.6029.5829.53MIN: 14.76 / MAX: 161.92MIN: 14.88 / MAX: 175.93MIN: 15.15 / MAX: 181.721. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium2313691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 39.759.739.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile3121632486480SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.20, N = 370.6070.6670.73

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast321246810SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 36.076.076.061. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast312612182430SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 324.6024.5724.561. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18312612182430SE +/- 0.32, N = 3SE +/- 0.29, N = 3SE +/- 0.19, N = 325.7725.7925.81MIN: 25 / MAX: 26.95MIN: 24.99 / MAX: 28.18MIN: 25.13 / MAX: 40.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore22313691215SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 313.0513.0313.03MIN: 10.32 / MAX: 14.01MIN: 11.77 / MAX: 14.01MIN: 11.75 / MAX: 13.981. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123150K300K450K600K750KSE +/- 448.34, N = 3SE +/- 2148.66, N = 3SE +/- 709.37, N = 3710293709965709282

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore21321020304050SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 345.4645.4445.40MIN: 25.16 / MAX: 257.86MIN: 25.99 / MAX: 180.9MIN: 33.02 / MAX: 50.631. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow2133691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 39.469.469.451. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed12314002800420056007000SE +/- 1.59, N = 3SE +/- 1.58, N = 3SE +/- 1.37, N = 36530.16527.56523.61. (CC) gcc options: -O3

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive231120240360480600SE +/- 0.07, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3567.56568.09568.111. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3123.73123.81123.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

yquake2

Renderer: OpenGL 3.x - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 2560 x 1440312306090120150SE +/- 0.17, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3125.9125.9125.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed23114002800420056007000SE +/- 5.32, N = 3SE +/- 1.35, N = 3SE +/- 3.54, N = 36537.86536.66532.71. (CC) gcc options: -O3

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3231306090120150SE +/- 0.28, N = 3SE +/- 0.06, N = 3SE +/- 0.14, N = 3145.79145.87145.881. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1231530456075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 369.5769.6169.611. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lap132150300450600750SE +/- 0.24, N = 3SE +/- 0.22, N = 3SE +/- 0.36, N = 3711.86711.76711.511. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.3910.3910.391. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow3210.4860.9721.4581.9442.43SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.162.162.161. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.10130.20260.30390.40520.5065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.450.450.451. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.1530.3060.4590.6120.765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.680.680.681. (CXX) g++ options: -O3 -pthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup3120.29250.5850.87751.171.4625SE +/- 0.03, N = 12SE +/- 0.05, N = 12SE +/- 0.05, N = 121.31.31.11. (CC) gcc options: -fopenmp -O3 -lm

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m13248121620SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.68, N = 316.5716.5917.41MIN: 16.28 / MAX: 16.95MIN: 16.38 / MAX: 20.04MIN: 16.52 / MAX: 269.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501321428425670SE +/- 0.54, N = 3SE +/- 0.55, N = 3SE +/- 8.74, N = 351.2452.1060.76MIN: 50.5 / MAX: 66.36MIN: 50.52 / MAX: 64.77MIN: 51.25 / MAX: 1056.171. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface132246810SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 5.13, N = 32.582.617.67MIN: 2.47 / MAX: 2.74MIN: 2.53 / MAX: 2.69MIN: 2.41 / MAX: 416.671. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET312400K800K1200K1600K2000KSE +/- 25199.46, N = 3SE +/- 29746.00, N = 3SE +/- 49673.53, N = 151916548.161884493.331808870.971. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 21911.47, N = 13SE +/- 23853.74, N = 8SE +/- 48510.25, N = 122540423.882339268.782314175.991. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH321400K800K1200K1600K2000KSE +/- 5741.58, N = 3SE +/- 20312.72, N = 15SE +/- 35412.98, N = 121655968.921622814.521611833.091. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP132600K1200K1800K2400K3000KSE +/- 37569.90, N = 3SE +/- 2760.10, N = 3SE +/- 86194.23, N = 132687331.831695087.331682927.161. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1321224364860SE +/- 1.23, N = 16SE +/- 0.37, N = 4SE +/- 0.86, N = 1749.7351.2151.471. (CC) gcc options: -O2 -std=c99

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU2131.19172.38343.57514.76685.9585SE +/- 0.00978, N = 3SE +/- 0.00331, N = 3SE +/- 0.09775, N = 154.164334.259145.29626MIN: 4.05MIN: 4.15MIN: 4.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1320.52161.04321.56482.08642.608SE +/- 0.064, N = 12SE +/- 0.031, N = 15SE +/- 0.052, N = 142.3182.2582.2571. (CXX) g++ options: -O3 -pthread -lm

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1320.76361.52722.29083.05443.818SE +/- 0.09620, N = 3SE +/- 0.24447, N = 3SE +/- 0.10194, N = 33.393653.213552.829671. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency3120.07360.14720.22080.29440.368SE +/- 0.04588, N = 3SE +/- 0.04635, N = 3SE +/- 0.00582, N = 30.278470.278790.327001. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans2130.11140.22280.33420.44560.557SE +/- 0.03821, N = 3SE +/- 0.01258, N = 3SE +/- 0.01039, N = 30.495180.493980.479801. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1230.46470.92941.39411.85882.3235SE +/- 0.08108, N = 3SE +/- 0.13431, N = 3SE +/- 0.07276, N = 32.065261.956841.893101. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest321246810SE +/- 0.127, N = 12SE +/- 0.117, N = 15SE +/- 0.431, N = 156.4146.6686.7111. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest231246810SE +/- 0.134, N = 12SE +/- 0.143, N = 12SE +/- 0.422, N = 156.2816.3746.9751. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No132612182430SE +/- 1.00, N = 12SE +/- 0.06, N = 3SE +/- 0.11, N = 324.7825.9325.94


Phoronix Test Suite v10.8.5