Core i7 4790K 202

Intel Core i7-4790K testing with a Gigabyte Z97-HD3P (F4 BIOS) and Gigabyte Intel Haswell Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012207-HA-COREI747935&grs.

Core i7 4790K 202ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution123Intel Core i7-4790K @ 4.40GHz (4 Cores / 8 Threads)Gigabyte Z97-HD3P (F4 BIOS)Intel 4th Gen Core DRAM16GB120GB OCZ TRION100Gigabyte Intel Haswell Desktop 2GB (1250MHz)Intel Xeon E3-1200 v3/4thLG Ultra HDRealtek RTL8111/8168/8411Ubuntu 19.105.9.0-050900rc8daily20201009-generic (x86_64) 20201008GNOME Shell 3.34.1X Server 1.20.5modesetting 1.20.54.5 Mesa 19.2.81.1.102GCC 9.2.1 20191008ext43840x2160OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 1.9 Java Details- OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10)Python Details- Python 2.7.17 + Python 3.7.5Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Core i7 4790K 202onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUncnn: CPU-v2-v2 - mobilenet-v2ncnn: Vulkan GPU-v2-v2 - mobilenet-v2onednn: IP Shapes 3D - f32 - CPUwaifu2x-ncnn: 2x - 3 - Yesonednn: Deconvolution Batch shapes_3d - f32 - CPUhpcc: EP-DGEMMncnn: CPU - efficientnet-b0ncnn: Vulkan GPU - yolov4-tinyhpcc: Max Ping Pong Bandwidthonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUncnn: CPU - squeezenet_ssdsimdjson: DistinctUserIDncnn: CPU - blazefacencnn: Vulkan GPU - efficientnet-b0hpcc: G-Rand Accessrav1e: 10rav1e: 6ncnn: CPU - mnasnetbasis: UASTC Level 2 + RDO Post-Processingsimdjson: PartialTweetsncnn: CPU-v3-v3 - mobilenet-v3yquake2: Software CPU - 3840 x 2160onednn: Convolution Batch Shapes Auto - f32 - CPUncnn: CPU - mobilenetncnn: Vulkan GPU - mnasnetrav1e: 1compress-lz4: 9 - Compression Speedhpcc: G-HPLbasis: UASTC Level 0sunflow: Global Illumination + Image Synthesislibplacebo: hdr_peakdetectnode-web-tooling: ncnn: Vulkan GPU - googlenetncnn: CPU - yolov4-tinyyquake2: Software CPU - 2560 x 1440ncnn: CPU - googlenetbuild2: Time To Compilerav1e: 5ncnn: CPU - resnet50onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUsqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - vgg16redis: SADDonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUembree: Pathtracer ISPC - Asian Dragoncoremark: CoreMark Size 666 - Iterations Per Secondncnn: Vulkan GPU - mobilenetncnn: CPU - resnet18ncnn: Vulkan GPU-v3-v3 - mobilenet-v3gromacs: Water Benchmarkncnn: Vulkan GPU - alexnetncnn: Vulkan GPU - squeezenet_ssdmafft: Multiple Sequence Alignment - LSU RNAncnn: CPU - regnety_400monednn: Recurrent Neural Network Inference - f32 - CPUbasis: ETC1Sonednn: Recurrent Neural Network Training - f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUembree: Pathtracer ISPC - Crowncompress-lz4: 3 - Compression Speedyquake2: OpenGL 1.x - 2560 x 1440x265: Bosphorus 1080pstockfish: Total Timeonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUbrl-cad: VGR Performance Metriclibplacebo: polar_nocomputencnn: CPU - alexnetnumpy: embree: Pathtracer - Asian Dragononednn: IP Shapes 1D - u8s8f32 - CPUbuild-ffmpeg: Time To Compilencnn: Vulkan GPU - shufflenet-v2embree: Pathtracer ISPC - Asian Dragon Objyquake2: OpenGL 1.x - 1920 x 1080astcenc: Fastonednn: Recurrent Neural Network Training - u8s8f32 - CPUx265: Bosphorus 4Kembree: Pathtracer - Crownkvazaar: Bosphorus 1080p - Ultra Fastonednn: Recurrent Neural Network Inference - u8s8f32 - CPUhpcc: EP-STREAM Triadncnn: Vulkan GPU - vgg16indigobench: CPU - Bedroomonednn: Deconvolution Batch shapes_1d - f32 - CPUencode-wavpack: WAV To WavPackddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapasmfish: 1024 Hash Memory, 26 Depthlibplacebo: deband_heavyembree: Pathtracer - Asian Dragon Objcompress-lz4: 1 - Compression Speedkvazaar: Bosphorus 4K - Mediumencode-ape: WAV To APEindigobench: CPU - Supercaryquake2: Software CPU - 1920 x 1080yquake2: OpenGL 3.x - 1920 x 1080compress-lz4: 1 - Decompression Speedkvazaar: Bosphorus 4K - Ultra Fastyquake2: OpenGL 3.x - 3840 x 2160ncnn: CPU - shufflenet-v2yquake2: OpenGL 1.x - 3840 x 2160basis: UASTC Level 2ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - Multeasymapkvazaar: Bosphorus 1080p - Mediumbuild-eigen: Time To Compilekvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 1080p - Very Fastncnn: Vulkan GPU - resnet18ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2phpbench: PHP Benchmark Suiteddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2kvazaar: Bosphorus 1080p - Slowcompress-lz4: 3 - Decompression Speedastcenc: Exhaustivehmmer: Pfam Database Searchyquake2: OpenGL 3.x - 2560 x 1440compress-lz4: 9 - Decompression Speedbasis: UASTC Level 3astcenc: Thoroughlibplacebo: av1_grain_lapastcenc: Mediumkvazaar: Bosphorus 4K - Slowsimdjson: LargeRandsimdjson: Kostyaclomp: Static OMP Speedupncnn: Vulkan GPU - regnety_400mncnn: Vulkan GPU - resnet50ncnn: Vulkan GPU - blazefaceredis: SETredis: GETredis: LPUSHredis: LPOPespeak: Text-To-Speech Synthesisonednn: IP Shapes 3D - u8s8f32 - CPUlammps: Rhodopsin Proteinhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Ptranshpcc: G-Fftebetsy: ETC2 RGB - Highestbetsy: ETC1 - Highestwaifu2x-ncnn: 2x - 3 - No1236.865088.258.3514.25216.95016.540443.7908711.2846.1719281.57212.393933.220.772.5711.330.026502.5141.0746.83945.4750.736.9127.826.418531.476.800.25551.2890.3130810.5802.28048607.6011.0623.6344.0860.323.93252.4560.82850.696.1677169.413109.272112933.0226.071511.32646.5662159958.25751431.3525.787.050.41022.1433.7712.04516.394730.4073.3908495.759.145294741.155.388852.25130.429.9479901128489.114817513186.9122.24313.685.58065.21940124.8319.525.8902206.87.658480.056.554.750743.434741.204.03790109.450.75010.746013.794101.151224916118.955.20355671.512.2112.1881.68498.0200.96623.310.9862.89.4767.174.38729.589.7370.6616.0624.5725.7913.0371029345.469.466530.1568.11123.727125.96532.7145.88469.57711.8610.392.160.450.681.316.5751.242.581884493.332540423.881611833.092687331.8349.7334.259142.3183.393650.278790.493982.065266.7116.97524.7816.943589.039.1015.07416.53517.533442.5374711.8045.8019517.86012.603634.570.742.6411.710.026252.5951.1087.04949.4050.717.0928.026.922632.226.960.26151.2890.1368210.7222.29349578.6211.2723.5344.9059.724.27250.2640.82351.566.0641869.860110.792140255.0026.140011.28606.6290159234.48205931.7626.117.140.41322.4134.1811.92516.584693.2472.6198410.769.096584713.765.346952.40129.830.1679862878413.384823713223.2022.43313.695.61395.17862125.7939.455.9266208.27.708426.986.594.742343.684714.604.04885109.700.75410.721513.726100.651225497318.955.21015687.382.2212.2431.68797.6200.16643.610.9662.69.4567.274.24829.609.7570.7316.0724.5625.8113.0570996545.409.466527.5567.56123.806125.86537.8145.79469.61711.5110.392.160.450.681.117.4160.767.671808870.972339268.781622814.521682927.1651.4674.164332.2572.829670.327000.495181.956846.6686.28125.9408.223588.548.5915.42156.82416.634744.6802711.6044.1618703.02812.097433.460.742.5511.560.027132.5521.0796.84972.1340.737.1028.527.058831.646.810.25950.1192.1595710.4952.32949012.3211.2223.9744.4560.823.85254.6480.81451.486.0840270.595108.962104918.2625.743511.16416.5417161328.40394531.5325.787.060.40822.2834.1811.90416.454743.2973.2798446.759.187434758.635.339251.92131.029.8979187998482.734859913109.5522.25316.255.56965.18374125.6839.495.8849207.67.708475.516.564.722543.614729.754.06038109.110.75210.778413.724100.851230948619.045.18605697.272.2112.2171.69197.8200.56648.611.0062.79.4867.074.43729.539.7370.6016.0724.6025.7713.0370928245.449.456523.6568.09123.832125.96536.6145.87069.61711.7610.392.160.450.681.316.5952.102.611916548.162314175.991655968.921695087.3351.2135.296262.2583.213550.278470.479801.893106.4146.37425.933OpenBenchmarking.org

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time123714212835Min: 3.55 / Avg: 22.02 / Max: 28.05Min: 5.55 / Avg: 22.02 / Max: 28.23Min: 4.01 / Avg: 22.02 / Max: 31.071. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time123510152025Min: 4.45 / Avg: 9.95 / Max: 18.41Min: 4.4 / Avg: 9.98 / Max: 16.99Min: 4.46 / Avg: 9.94 / Max: 171. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time1231326395265Min: 6.24 / Avg: 33.97 / Max: 65.29Min: 5.76 / Avg: 33.96 / Max: 57.53Min: 5.77 / Avg: 34.02 / Max: 65.991. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810SE +/- 0.00679, N = 3SE +/- 0.09722, N = 3SE +/- 0.11002, N = 156.865086.943588.22358MIN: 6.69MIN: 6.67MIN: 7.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

MinAvgMax172.376.885.1271.676.684.9372.076.782.4OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time204060801001. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.10, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 38.259.038.54MIN: 8.02 / MAX: 11.6MIN: 8.75 / MAX: 11.63MIN: 8.16 / MAX: 12.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21233691215SE +/- 0.13, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 38.359.108.59MIN: 7.99 / MAX: 21.73MIN: 8.89 / MAX: 12.03MIN: 8.33 / MAX: 13.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU12348121620SE +/- 0.01, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 1514.2515.0715.42MIN: 14.13MIN: 14.72MIN: 14.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: Yes

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: Yes123246810SE +/- 0.059, N = 12SE +/- 0.096, N = 4SE +/- 0.085, N = 156.9506.5356.824

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12348121620SE +/- 0.05, N = 3SE +/- 0.26, N = 3SE +/- 0.12, N = 316.5417.5316.63MIN: 16.27MIN: 16.55MIN: 16.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM1231020304050SE +/- 0.53, N = 3SE +/- 0.10, N = 3SE +/- 0.66, N = 343.7942.5444.681. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01233691215SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.13, N = 311.2811.8011.60MIN: 10.83 / MAX: 19.59MIN: 11.52 / MAX: 12.14MIN: 11.37 / MAX: 26.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1231020304050SE +/- 0.68, N = 3SE +/- 0.74, N = 3SE +/- 0.15, N = 346.1745.8044.16MIN: 44.33 / MAX: 61MIN: 43.83 / MAX: 58.33MIN: 43.39 / MAX: 57.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1234K8K12K16K20KSE +/- 146.25, N = 3SE +/- 639.68, N = 3SE +/- 289.88, N = 319281.5719517.8618703.031. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.20, N = 3SE +/- 0.18, N = 3SE +/- 0.06, N = 312.3912.6012.10MIN: 11.1MIN: 11.17MIN: 10.951. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd123816243240SE +/- 0.07, N = 3SE +/- 0.57, N = 3SE +/- 0.27, N = 333.2234.5733.46MIN: 32.95 / MAX: 34.34MIN: 33.72 / MAX: 36.89MIN: 32.94 / MAX: 43.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.17330.34660.51990.69320.8665SE +/- 0.01, N = 3SE +/- 0.01, N = 4SE +/- 0.01, N = 30.770.740.741. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1230.5941.1881.7822.3762.97SE +/- 0.02, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 32.572.642.55MIN: 2.37 / MAX: 2.66MIN: 2.6 / MAX: 2.69MIN: 2.49 / MAX: 2.611. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b01233691215SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 311.3311.7111.56MIN: 11.18 / MAX: 14.87MIN: 11.47 / MAX: 14.37MIN: 11.28 / MAX: 25.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00610.01220.01830.02440.0305SE +/- 0.00052, N = 3SE +/- 0.00077, N = 3SE +/- 0.00050, N = 30.026500.026250.027131. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.58391.16781.75172.33562.9195SE +/- 0.012, N = 3SE +/- 0.044, N = 3SE +/- 0.015, N = 32.5142.5952.552

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.24930.49860.74790.99721.2465SE +/- 0.011, N = 3SE +/- 0.004, N = 3SE +/- 0.013, N = 31.0741.1081.079

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet123246810SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 36.837.046.84MIN: 6.61 / MAX: 7.1MIN: 6.85 / MAX: 9.1MIN: 6.59 / MAX: 10.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 1.28, N = 3SE +/- 2.85, N = 3SE +/- 12.55, N = 4945.48949.41972.131. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.16430.32860.49290.65720.8215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 30.730.710.731. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 36.917.097.10MIN: 6.69 / MAX: 12.06MIN: 6.9 / MAX: 10.15MIN: 6.88 / MAX: 9.91. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

Renderer: Software CPU - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 3840 x 2160123714212835SE +/- 0.09, N = 3SE +/- 0.24, N = 3SE +/- 0.03, N = 327.828.028.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 326.4226.9227.06MIN: 26.19MIN: 26.51MIN: 26.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet123714212835SE +/- 0.26, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 331.4732.2231.64MIN: 30.7 / MAX: 45.93MIN: 31.89 / MAX: 45.17MIN: 31.25 / MAX: 55.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet123246810SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.10, N = 36.806.966.81MIN: 6.62 / MAX: 10.18MIN: 6.68 / MAX: 21.89MIN: 6.53 / MAX: 7.121. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.05870.11740.17610.23480.2935SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 30.2550.2610.259

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231224364860SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.85, N = 351.2851.2850.111. (CC) gcc options: -O3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL12320406080100SE +/- 1.21, N = 4SE +/- 1.01, N = 6SE +/- 1.45, N = 390.3190.1492.161. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 310.5810.7210.501. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.5241.0481.5722.0962.62SE +/- 0.012, N = 3SE +/- 0.019, N = 3SE +/- 0.024, N = 32.2802.2932.329MIN: 2.17 / MAX: 2.91MIN: 2.17 / MAX: 2.99MIN: 2.21 / MAX: 3.04

Libplacebo

Test: hdr_peakdetect

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: hdr_peakdetect12311K22K33K44K55KSE +/- 736.78, N = 3SE +/- 90.21, N = 3SE +/- 107.39, N = 348607.6049578.6249012.321. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 311.0611.2711.221. Nodejs v10.15.2

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet123612182430SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 323.6323.5323.97MIN: 23.33 / MAX: 36.77MIN: 23.31 / MAX: 26.84MIN: 23.72 / MAX: 36.581. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1231020304050SE +/- 0.72, N = 3SE +/- 0.29, N = 3SE +/- 0.38, N = 344.0844.9044.45MIN: 42.69 / MAX: 58.93MIN: 44.01 / MAX: 52.5MIN: 43.34 / MAX: 46.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

Renderer: Software CPU - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 2560 x 14401231428425670SE +/- 0.31, N = 3SE +/- 0.43, N = 3SE +/- 0.19, N = 360.359.760.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet123612182430SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 323.9324.2723.85MIN: 23.66 / MAX: 24.5MIN: 23.92 / MAX: 27MIN: 23.58 / MAX: 37.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12360120180240300SE +/- 1.13, N = 3SE +/- 1.16, N = 3SE +/- 2.21, N = 3252.46250.26254.65

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.18630.37260.55890.74520.9315SE +/- 0.005, N = 3SE +/- 0.013, N = 3SE +/- 0.006, N = 30.8280.8230.814

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501231224364860SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.47, N = 350.6951.5651.48MIN: 50.36 / MAX: 63.24MIN: 50.54 / MAX: 64.38MIN: 50.22 / MAX: 66.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01016, N = 3SE +/- 0.01547, N = 3SE +/- 0.06654, N = 36.167716.064186.08402MIN: 5.88MIN: 5.69MIN: 5.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231632486480SE +/- 0.25, N = 3SE +/- 0.12, N = 3SE +/- 0.54, N = 369.4169.8670.601. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1612320406080100SE +/- 0.57, N = 3SE +/- 0.82, N = 3SE +/- 0.12, N = 3109.27110.79108.96MIN: 108.04 / MAX: 121.43MIN: 109.24 / MAX: 127.6MIN: 108.21 / MAX: 115.521. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123500K1000K1500K2000K2500KSE +/- 33704.56, N = 12SE +/- 8400.74, N = 3SE +/- 30741.46, N = 132112933.022140255.002104918.261. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123612182430SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 326.0726.1425.74MIN: 25.78MIN: 25.95MIN: 25.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 311.3311.2911.16MIN: 10.76MIN: 10.69MIN: 10.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon123246810SE +/- 0.0417, N = 3SE +/- 0.0431, N = 3SE +/- 0.0287, N = 36.56626.62906.5417MIN: 6.46 / MAX: 6.79MIN: 6.51 / MAX: 6.81MIN: 6.45 / MAX: 6.74

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 270.17, N = 3SE +/- 1169.64, N = 3SE +/- 104.15, N = 3159958.26159234.48161328.401. (CC) gcc options: -O2 -lrt" -lrt

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet123714212835SE +/- 0.06, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 331.3531.7631.53MIN: 31.01 / MAX: 43.41MIN: 31.13 / MAX: 48.02MIN: 31.22 / MAX: 33.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18123612182430SE +/- 0.07, N = 3SE +/- 0.14, N = 3SE +/- 0.08, N = 325.7826.1125.78MIN: 25.38 / MAX: 39.91MIN: 25.75 / MAX: 41.28MIN: 25.3 / MAX: 26.661. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3123246810SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.057.147.06MIN: 6.84 / MAX: 8.55MIN: 6.95 / MAX: 21.34MIN: 6.87 / MAX: 10.891. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.09290.18580.27870.37160.4645SE +/- 0.005, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 40.4100.4130.4081. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet123510152025SE +/- 0.13, N = 3SE +/- 0.17, N = 3SE +/- 0.10, N = 322.1422.4122.28MIN: 21.81 / MAX: 24.68MIN: 21.96 / MAX: 34.32MIN: 21.96 / MAX: 22.641. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd123816243240SE +/- 0.38, N = 3SE +/- 0.27, N = 3SE +/- 0.14, N = 333.7734.1834.18MIN: 32.69 / MAX: 43.46MIN: 32.78 / MAX: 44.2MIN: 33.24 / MAX: 46.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1233691215SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 312.0511.9311.901. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m12348121620SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.12, N = 316.3916.5816.45MIN: 16.23 / MAX: 18.48MIN: 16.23 / MAX: 31.03MIN: 16.16 / MAX: 17.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12310002000300040005000SE +/- 0.95, N = 3SE +/- 15.09, N = 3SE +/- 3.10, N = 34730.404693.244743.29MIN: 4721.36MIN: 4657.74MIN: 4730.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1231632486480SE +/- 0.54, N = 3SE +/- 0.68, N = 3SE +/- 0.38, N = 373.3972.6273.281. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 17.63, N = 3SE +/- 20.81, N = 3SE +/- 15.35, N = 38495.758410.768446.75MIN: 8463.89MIN: 8366.44MIN: 8413.691. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.03949, N = 3SE +/- 0.14569, N = 3SE +/- 0.04734, N = 39.145299.096589.18743MIN: 8.74MIN: 8.69MIN: 8.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 15.27, N = 3SE +/- 16.47, N = 3SE +/- 24.18, N = 34741.154713.764758.63MIN: 4705.91MIN: 4686.24MIN: 4700.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown1231.21252.4253.63754.856.0625SE +/- 0.0366, N = 3SE +/- 0.0293, N = 3SE +/- 0.0582, N = 35.38885.34695.3392MIN: 5.28 / MAX: 5.51MIN: 5.26 / MAX: 5.47MIN: 5.19 / MAX: 5.51

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231224364860SE +/- 0.09, N = 3SE +/- 0.02, N = 3SE +/- 0.23, N = 352.2552.4051.921. (CC) gcc options: -O3

yquake2

Renderer: OpenGL 1.x - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 2560 x 1440123306090120150SE +/- 0.37, N = 3SE +/- 0.55, N = 3SE +/- 0.12, N = 3130.4129.8131.01. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123714212835SE +/- 0.08, N = 3SE +/- 0.14, N = 3SE +/- 0.07, N = 329.9430.1629.891. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1232M4M6M8M10MSE +/- 42720.90, N = 3SE +/- 53934.69, N = 3SE +/- 100608.19, N = 37990112798628779187991. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 11.29, N = 3SE +/- 13.64, N = 3SE +/- 20.61, N = 38489.118413.388482.73MIN: 8460.3MIN: 8383.33MIN: 8441.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric12310K20K30K40K50K4817548237485991. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Libplacebo

Test: polar_nocompute

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: polar_nocompute1233K6K9K12K15KSE +/- 29.01, N = 3SE +/- 37.92, N = 3SE +/- 44.87, N = 313186.9113223.2013109.551. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet123510152025SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 322.2422.4322.25MIN: 21.89 / MAX: 24.72MIN: 21.97 / MAX: 35.74MIN: 22 / MAX: 36.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.12, N = 3SE +/- 0.64, N = 3SE +/- 0.35, N = 3313.68313.69316.25

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon1231.26312.52623.78935.05246.3155SE +/- 0.0040, N = 3SE +/- 0.0161, N = 3SE +/- 0.0036, N = 35.58065.61395.5696MIN: 5.53 / MAX: 5.66MIN: 5.54 / MAX: 5.72MIN: 5.5 / MAX: 5.65

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1231.17442.34883.52324.69765.872SE +/- 0.00525, N = 3SE +/- 0.00946, N = 3SE +/- 0.01621, N = 35.219405.178625.18374MIN: 4.85MIN: 4.94MIN: 4.841. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150SE +/- 0.10, N = 3SE +/- 0.58, N = 3SE +/- 0.37, N = 3124.83125.79125.68

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21233691215SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 39.529.459.49MIN: 9.42 / MAX: 11.19MIN: 9.35 / MAX: 12.21MIN: 9.39 / MAX: 11.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj1231.33352.6674.00055.3346.6675SE +/- 0.0176, N = 3SE +/- 0.0196, N = 3SE +/- 0.0272, N = 35.89025.92665.8849MIN: 5.83 / MAX: 6MIN: 5.87 / MAX: 6.03MIN: 5.81 / MAX: 5.99

yquake2

Renderer: OpenGL 1.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 1920 x 108012350100150200250SE +/- 0.42, N = 3SE +/- 0.55, N = 3SE +/- 0.32, N = 3206.8208.2207.61. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 37.657.707.701. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 6.10, N = 3SE +/- 7.59, N = 3SE +/- 25.14, N = 38480.058426.988475.51MIN: 8460.68MIN: 8406.64MIN: 8432.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K123246810SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 36.556.596.561. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1231.06892.13783.20674.27565.3445SE +/- 0.0091, N = 3SE +/- 0.0037, N = 3SE +/- 0.0074, N = 34.75074.74234.7225MIN: 4.71 / MAX: 4.83MIN: 4.71 / MAX: 4.82MIN: 4.69 / MAX: 4.82

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast1231020304050SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 343.4343.6843.611. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12310002000300040005000SE +/- 22.26, N = 3SE +/- 5.23, N = 3SE +/- 19.23, N = 34741.204714.604729.75MIN: 4710.31MIN: 4700.85MIN: 4687.531. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1230.91361.82722.74083.65444.568SE +/- 0.01197, N = 3SE +/- 0.00333, N = 3SE +/- 0.01163, N = 34.037904.048854.060381. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1612320406080100SE +/- 0.26, N = 3SE +/- 0.32, N = 3SE +/- 0.21, N = 3109.45109.70109.11MIN: 108.54 / MAX: 123.94MIN: 108.72 / MAX: 136.67MIN: 108.06 / MAX: 125.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.16970.33940.50910.67880.8485SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.7500.7540.752

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 310.7510.7210.78MIN: 10.34MIN: 9.95MIN: 10.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack12348121620SE +/- 0.04, N = 5SE +/- 0.00, N = 5SE +/- 0.00, N = 513.7913.7313.721. (CXX) g++ options: -rdynamic

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap12320406080100SE +/- 0.26, N = 3SE +/- 0.12, N = 3SE +/- 0.07, N = 3101.15100.65100.85MIN: 54.32 / MAX: 224.92MIN: 58.03 / MAX: 227.27MIN: 37.89 / MAX: 224.471. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1233M6M9M12M15MSE +/- 93407.88, N = 3SE +/- 117949.09, N = 3SE +/- 130255.47, N = 3122491611225497312309486

Libplacebo

Test: deband_heavy

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: deband_heavy123510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.09, N = 318.9518.9519.041. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj1231.17232.34463.51694.68925.8615SE +/- 0.0107, N = 3SE +/- 0.0125, N = 3SE +/- 0.0051, N = 35.20355.21015.1860MIN: 5.16 / MAX: 5.26MIN: 5.17 / MAX: 5.28MIN: 5.15 / MAX: 5.26

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12312002400360048006000SE +/- 7.62, N = 3SE +/- 5.97, N = 3SE +/- 2.69, N = 35671.515687.385697.271. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.49950.9991.49851.9982.4975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.212.222.211. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1233691215SE +/- 0.02, N = 5SE +/- 0.04, N = 5SE +/- 0.06, N = 512.1912.2412.221. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.38050.7611.14151.5221.9025SE +/- 0.005, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 31.6841.6871.691

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 108012320406080100SE +/- 0.87, N = 3SE +/- 0.98, N = 3SE +/- 0.38, N = 398.097.697.81. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2

Renderer: OpenGL 3.x - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 1920 x 10801234080120160200SE +/- 0.53, N = 3SE +/- 0.23, N = 3SE +/- 0.95, N = 3200.9200.1200.51. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed12314002800420056007000SE +/- 10.10, N = 3SE +/- 4.53, N = 3SE +/- 1.39, N = 36623.36643.66648.61. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 310.9810.9611.001. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

yquake2

Renderer: OpenGL 3.x - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 3840 x 21601231428425670SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 362.862.662.71. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21233691215SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 39.479.459.48MIN: 9.36 / MAX: 12.12MIN: 9.32 / MAX: 12.68MIN: 9.35 / MAX: 13.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

yquake2

Renderer: OpenGL 1.x - Resolution: 3840 x 2160

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 1.x - Resolution: 3840 x 21601231530456075SE +/- 0.09, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 367.167.267.01. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212320406080100SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.22, N = 374.3974.2574.441. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap123714212835SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 329.5829.6029.53MIN: 14.88 / MAX: 175.93MIN: 14.76 / MAX: 161.92MIN: 15.15 / MAX: 181.721. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.739.759.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1231632486480SE +/- 0.05, N = 3SE +/- 0.20, N = 3SE +/- 0.04, N = 370.6670.7370.60

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast123246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 36.066.076.071. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123612182430SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 324.5724.5624.601. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet18123612182430SE +/- 0.29, N = 3SE +/- 0.19, N = 3SE +/- 0.32, N = 325.7925.8125.77MIN: 24.99 / MAX: 28.18MIN: 25.13 / MAX: 40.98MIN: 25 / MAX: 26.951. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

DDraceNetwork

Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore21233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.02, N = 313.0313.0513.03MIN: 11.75 / MAX: 13.98MIN: 10.32 / MAX: 14.01MIN: 11.77 / MAX: 14.011. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123150K300K450K600K750KSE +/- 448.34, N = 3SE +/- 2148.66, N = 3SE +/- 709.37, N = 3710293709965709282

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore21231020304050SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 345.4645.4045.44MIN: 25.16 / MAX: 257.86MIN: 33.02 / MAX: 50.63MIN: 25.99 / MAX: 180.91. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow1233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.469.469.451. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed12314002800420056007000SE +/- 1.59, N = 3SE +/- 1.58, N = 3SE +/- 1.37, N = 36530.16527.56523.61. (CC) gcc options: -O3

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123120240360480600SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.25, N = 3568.11567.56568.091. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3123.73123.81123.831. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

yquake2

Renderer: OpenGL 3.x - Resolution: 2560 x 1440

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: OpenGL 3.x - Resolution: 2560 x 1440123306090120150SE +/- 0.12, N = 3SE +/- 0.07, N = 3SE +/- 0.17, N = 3125.9125.8125.91. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed12314002800420056007000SE +/- 3.54, N = 3SE +/- 5.32, N = 3SE +/- 1.35, N = 36532.76537.86536.61. (CC) gcc options: -O3

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150SE +/- 0.14, N = 3SE +/- 0.28, N = 3SE +/- 0.06, N = 3145.88145.79145.871. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1231530456075SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 369.5769.6169.611. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Libplacebo

Test: av1_grain_lap

OpenBenchmarking.orgFPS, More Is BetterLibplacebo 2.72.2Test: av1_grain_lap123150300450600750SE +/- 0.24, N = 3SE +/- 0.36, N = 3SE +/- 0.22, N = 3711.86711.51711.761. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 310.3910.3910.391. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.4860.9721.4581.9442.43SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.162.162.161. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.10130.20260.30390.40520.5065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.450.450.451. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1530.3060.4590.6120.765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.680.680.681. (CXX) g++ options: -O3 -pthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1230.29250.5850.87751.171.4625SE +/- 0.05, N = 12SE +/- 0.05, N = 12SE +/- 0.03, N = 121.31.11.31. (CC) gcc options: -fopenmp -O3 -lm

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m12348121620SE +/- 0.10, N = 3SE +/- 0.68, N = 3SE +/- 0.05, N = 316.5717.4116.59MIN: 16.28 / MAX: 16.95MIN: 16.52 / MAX: 269.38MIN: 16.38 / MAX: 20.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet501231428425670SE +/- 0.54, N = 3SE +/- 8.74, N = 3SE +/- 0.55, N = 351.2460.7652.10MIN: 50.5 / MAX: 66.36MIN: 51.25 / MAX: 1056.17MIN: 50.52 / MAX: 64.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface123246810SE +/- 0.02, N = 3SE +/- 5.13, N = 3SE +/- 0.03, N = 32.587.672.61MIN: 2.47 / MAX: 2.74MIN: 2.41 / MAX: 416.67MIN: 2.53 / MAX: 2.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 29746.00, N = 3SE +/- 49673.53, N = 15SE +/- 25199.46, N = 31884493.331808870.971916548.161. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 21911.47, N = 13SE +/- 23853.74, N = 8SE +/- 48510.25, N = 122540423.882339268.782314175.991. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123400K800K1200K1600K2000KSE +/- 35412.98, N = 12SE +/- 20312.72, N = 15SE +/- 5741.58, N = 31611833.091622814.521655968.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123600K1200K1800K2400K3000KSE +/- 37569.90, N = 3SE +/- 86194.23, N = 13SE +/- 2760.10, N = 32687331.831682927.161695087.331. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1231224364860SE +/- 1.23, N = 16SE +/- 0.86, N = 17SE +/- 0.37, N = 449.7351.4751.211. (CC) gcc options: -O2 -std=c99

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1231.19172.38343.57514.76685.9585SE +/- 0.00331, N = 3SE +/- 0.00978, N = 3SE +/- 0.09775, N = 154.259144.164335.29626MIN: 4.15MIN: 4.05MIN: 4.251. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.52161.04321.56482.08642.608SE +/- 0.064, N = 12SE +/- 0.052, N = 14SE +/- 0.031, N = 152.3182.2572.2581. (CXX) g++ options: -O3 -pthread -lm

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.76361.52722.29083.05443.818SE +/- 0.09620, N = 3SE +/- 0.10194, N = 3SE +/- 0.24447, N = 33.393652.829673.213551. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.07360.14720.22080.29440.368SE +/- 0.04635, N = 3SE +/- 0.00582, N = 3SE +/- 0.04588, N = 30.278790.327000.278471. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.11140.22280.33420.44560.557SE +/- 0.01258, N = 3SE +/- 0.03821, N = 3SE +/- 0.01039, N = 30.493980.495180.479801. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1230.46470.92941.39411.85882.3235SE +/- 0.08108, N = 3SE +/- 0.13431, N = 3SE +/- 0.07276, N = 32.065261.956841.893101. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 3.1.3

Betsy GPU Compressor

Codec: ETC2 RGB - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC2 RGB - Quality: Highest123246810SE +/- 0.431, N = 15SE +/- 0.117, N = 15SE +/- 0.127, N = 126.7116.6686.4141. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Betsy GPU Compressor

Codec: ETC1 - Quality: Highest

OpenBenchmarking.orgSeconds, Fewer Is BetterBetsy GPU Compressor 1.1 BetaCodec: ETC1 - Quality: Highest123246810SE +/- 0.422, N = 15SE +/- 0.134, N = 12SE +/- 0.143, N = 126.9756.2816.3741. (CXX) g++ options: -O3 -O2 -lpthread -ldl

Waifu2x-NCNN Vulkan

Scale: 2x - Denoise: 3 - TAA: No

OpenBenchmarking.orgSeconds, Fewer Is BetterWaifu2x-NCNN Vulkan 20200818Scale: 2x - Denoise: 3 - TAA: No123612182430SE +/- 1.00, N = 12SE +/- 0.11, N = 3SE +/- 0.06, N = 324.7825.9425.93


Phoronix Test Suite v10.8.5