Core i7 4790K 202

Intel Core i7-4790K testing with a Gigabyte Z97-HD3P (F4 BIOS) and Gigabyte Intel Haswell Desktop 2GB on Ubuntu 19.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012207-HA-COREI747935

Run Management

Run | Date | Test Duration
1 | December 19 2020 | 11 Hours, 50 Minutes
2 | December 19 2020 | 12 Hours, 43 Minutes
3 | December 20 2020 | 11 Hours, 42 Minutes
Average | | 12 Hours, 5 Minutes
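The 12 Hours, 5 Minutes figure that follows the three runs is consistent with the mean of their durations. A minimal sketch of that arithmetic follows; the helper name `mean_duration` is made up for illustration and is not part of the Phoronix Test Suite.

```python
# Durations of the three runs as listed in the run table, in (hours, minutes).
run_durations = [(11, 50), (12, 43), (11, 42)]


def mean_duration(durations):
    """Return the mean of (hours, minutes) pairs as an (hours, minutes) pair."""
    total_minutes = sum(h * 60 + m for h, m in durations)
    mean_minutes = round(total_minutes / len(durations))
    return divmod(mean_minutes, 60)


hours, minutes = mean_duration(run_durations)
print(f"{hours} Hours, {minutes} Minutes")
```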


Core i7 4790K 202 - System Details (runs 1, 2, 3)

Processor: Intel Core i7-4790K @ 4.40GHz (4 Cores / 8 Threads)
Motherboard: Gigabyte Z97-HD3P (F4 BIOS)
Chipset: Intel 4th Gen Core DRAM
Memory: 16GB
Disk: 120GB OCZ TRION100
Graphics: Gigabyte Intel Haswell Desktop 2GB (1250MHz)
Audio: Intel Xeon E3-1200 v3/4th
Monitor: LG Ultra HD
Network: Realtek RTL8111/8168/8411
OS: Ubuntu 19.10
Kernel: 5.9.0-050900rc8daily20201009-generic (x86_64) 20201008
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 4.5 Mesa 19.2.8
Vulkan: 1.1.102
Compiler: GCC 9.2.1 20191008
File-System: ext4
Screen Resolution: 3840x2160

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-offload-targets=nvptx-none,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details: Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0x28 - Thermald 1.9
Java Details: OpenJDK Runtime Environment (build 11.0.7+10-post-Ubuntu-2ubuntu219.10)
Python Details: Python 2.7.17 + Python 3.7.5
Security Details: itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): chart of the relative performance of runs 1, 2 and 3 per test; axis ticks at 100%, 105%, 109%, 114%.
Tests shown: CLOMP, Redis, Betsy GPU Compressor, NCNN, HPC Challenge, eSpeak-NG Speech Engine, LAMMPS Molecular Dynamics Simulator, oneDNN, Waifu2x-NCNN Vulkan, Sunflow Rendering System, rav1e, Node.js V8 Web Tooling Benchmark, Build2, simdjson, SQLite Speedtest, Coremark, GROMACS, Timed MAFFT Alignment, Stockfish, BRL-CAD, Numpy Benchmark, Timed FFmpeg Compilation, x265, Embree, Libplacebo, LZ4 Compression, WavPack Audio Encoding, yquake2, asmFish, Monkey Audio Encoding, Basis Universal, IndigoBench, ASTC Encoder, Kvazaar, PHPBench, Timed Eigen Compilation, DDraceNetwork, Timed HMMer Search.
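A percentage view like the Result Overview can be approximated by scaling each test so that its slowest run is 100% and then combining tests with a geometric mean. The sketch below does this for three tests from this result file. It assumes that normalization scheme for illustration only; the exact method used by the Phoronix Test Suite is not stated here, and the names `relative_to_slowest` and `geometric_mean` are hypothetical helpers.

```python
import math

# Values for runs 1, 2, 3 copied from this result file, plus whether higher is better.
results = {
    "hpcc: G-HPL (GFLOPS)": ([90.31, 90.14, 92.16], True),
    "basis: UASTC Level 2 + RDO Post-Processing (s)": ([945.48, 949.41, 972.13], False),
    "clomp: Static OMP Speedup": ([1.3, 1.1, 1.3], True),
}


def relative_to_slowest(values, higher_is_better):
    """Scale runs so the slowest run is 1.0 and faster runs are above 1.0."""
    if higher_is_better:
        worst = min(values)
        return [v / worst for v in values]
    worst = max(values)
    return [worst / v for v in values]


def geometric_mean(values):
    """Geometric mean, the usual way to average ratios across tests."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


per_test = {
    name: relative_to_slowest(values, higher_is_better)
    for name, (values, higher_is_better) in results.items()
}

for run_index in range(3):
    overall = geometric_mean([rel[run_index] for rel in per_test.values()])
    print(f"run {run_index + 1}: {overall * 100:.1f}% of slowest-run baseline")
```

The direction flag matters: for "Fewer Is Better" tests the ratio is inverted, so a higher percentage always means a faster run.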

Core i7 4790K 202 - Results Table

Test | 1 | 2 | 3
hpcc: G-HPL | 90.31308 | 90.13682 | 92.15957
basis: UASTC Level 2 + RDO Post-Processing | 945.475 | 949.405 | 972.134
astcenc: Exhaustive | 568.11 | 567.56 | 568.09
clomp: Static OMP Speedup | 1.3 | 1.1 | 1.3
gromacs: Water Benchmark | 0.410 | 0.413 | 0.408
kvazaar: Bosphorus 4K - Slow | 2.16 | 2.16 | 2.16
kvazaar: Bosphorus 4K - Medium | 2.21 | 2.22 | 2.21
build2: Time To Compile | 252.456 | 250.264 | 254.648
brl-cad: VGR Performance Metric | 48175 | 48237 | 48599
asmfish: 1024 Hash Memory, 26 Depth | 12249161 | 12254973 | 12309486
espeak: Text-To-Speech Synthesis | 49.733 | 51.467 | 51.213
numpy | 313.68 | 313.69 | 316.25
basis: UASTC Level 3 | 145.884 | 145.794 | 145.870
embree: Pathtracer - Asian Dragon Obj | 5.2035 | 5.2101 | 5.1860
embree: Pathtracer - Crown | 4.7507 | 4.7423 | 4.7225
build-ffmpeg: Time To Compile | 124.831 | 125.793 | 125.683
hmmer: Pfam Database Search | 123.727 | 123.806 | 123.832
embree: Pathtracer ISPC - Asian Dragon Obj | 5.8902 | 5.9266 | 5.8849
embree: Pathtracer ISPC - Crown | 5.3888 | 5.3469 | 5.3392
ncnn: Vulkan GPU - regnety_400m | 16.57 | 17.41 | 16.59
ncnn: Vulkan GPU - squeezenet_ssd | 33.77 | 34.18 | 34.18
ncnn: Vulkan GPU - yolov4-tiny | 46.17 | 45.80 | 44.16
ncnn: Vulkan GPU - resnet50 | 51.24 | 60.76 | 52.10
ncnn: Vulkan GPU - alexnet | 22.14 | 22.41 | 22.28
ncnn: Vulkan GPU - resnet18 | 25.79 | 25.81 | 25.77
ncnn: Vulkan GPU - vgg16 | 109.45 | 109.70 | 109.11
ncnn: Vulkan GPU - googlenet | 23.63 | 23.53 | 23.97
ncnn: Vulkan GPU - blazeface | 2.58 | 7.67 | 2.61
ncnn: Vulkan GPU - efficientnet-b0 | 11.33 | 11.71 | 11.56
ncnn: Vulkan GPU - mnasnet | 6.80 | 6.96 | 6.81
ncnn: Vulkan GPU - shufflenet-v2 | 9.52 | 9.45 | 9.49
ncnn: Vulkan GPU-v3-v3 - mobilenet-v3 | 7.05 | 7.14 | 7.06
ncnn: Vulkan GPU-v2-v2 - mobilenet-v2 | 8.35 | 9.10 | 8.59
ncnn: Vulkan GPU - mobilenet | 31.35 | 31.76 | 31.53
embree: Pathtracer - Asian Dragon | 5.5806 | 5.6139 | 5.5696
ncnn: CPU - regnety_400m | 16.39 | 16.58 | 16.45
ncnn: CPU - squeezenet_ssd | 33.22 | 34.57 | 33.46
ncnn: CPU - yolov4-tiny | 44.08 | 44.90 | 44.45
ncnn: CPU - resnet50 | 50.69 | 51.56 | 51.48
ncnn: CPU - alexnet | 22.24 | 22.43 | 22.25
ncnn: CPU - resnet18 | 25.78 | 26.11 | 25.78
ncnn: CPU - vgg16 | 109.27 | 110.79 | 108.96
ncnn: CPU - googlenet | 23.93 | 24.27 | 23.85
ncnn: CPU - blazeface | 2.57 | 2.64 | 2.55
ncnn: CPU - efficientnet-b0 | 11.28 | 11.80 | 11.60
ncnn: CPU - mnasnet | 6.83 | 7.04 | 6.84
ncnn: CPU - shufflenet-v2 | 9.47 | 9.45 | 9.48
ncnn: CPU-v3-v3 - mobilenet-v3 | 6.91 | 7.09 | 7.10
ncnn: CPU-v2-v2 - mobilenet-v2 | 8.25 | 9.03 | 8.54
ncnn: CPU - mobilenet | 31.47 | 32.22 | 31.64
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 8480.05 | 8426.98 | 8475.51
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 8489.11 | 8413.38 | 8482.73
onednn: Recurrent Neural Network Training - f32 - CPU | 8495.75 | 8410.76 | 8446.75
kvazaar: Bosphorus 4K - Very Fast | 6.06 | 6.07 | 6.07
embree: Pathtracer ISPC - Asian Dragon | 6.5662 | 6.6290 | 6.5417
x265: Bosphorus 4K | 6.55 | 6.59 | 6.56
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 4741.15 | 4713.76 | 4758.63
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 4741.20 | 4714.60 | 4729.75
onednn: Recurrent Neural Network Inference - f32 - CPU | 4730.40 | 4693.24 | 4743.29
stockfish: Total Time | 7990112 | 7986287 | 7918799
rav1e: 1 | 0.255 | 0.261 | 0.259
node-web-tooling | 11.06 | 11.27 | 11.22
basis: UASTC Level 2 | 74.387 | 74.248 | 74.437
astcenc: Thorough | 69.57 | 69.61 | 69.61
basis: ETC1S | 73.390 | 72.619 | 73.279
rav1e: 5 | 0.828 | 0.823 | 0.814
build-eigen: Time To Compile | 70.661 | 70.731 | 70.601
sqlite-speedtest: Timed Time - Size 1,000 | 69.413 | 69.860 | 70.595
simdjson: Kostya | 0.68 | 0.68 | 0.68
kvazaar: Bosphorus 1080p - Slow | 9.46 | 9.46 | 9.45
indigobench: CPU - Bedroom | 0.750 | 0.754 | 0.752
indigobench: CPU - Supercar | 1.684 | 1.687 | 1.691
kvazaar: Bosphorus 1080p - Medium | 9.73 | 9.75 | 9.73
ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2 | 45.46 | 45.40 | 45.44
ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymap | 101.15 | 100.65 | 100.85
ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - Multeasymap | 29.58 | 29.60 | 29.53
ddnet: 3840 x 2160 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2 | 13.03 | 13.05 | 13.03
compress-lz4: 9 - Decompression Speed | 6532.7 | 6537.8 | 6536.6
compress-lz4: 9 - Compression Speed | 51.28 | 51.28 | 50.11
compress-lz4: 3 - Decompression Speed | 6530.1 | 6527.5 | 6523.6
compress-lz4: 3 - Compression Speed | 52.25 | 52.40 | 51.92
simdjson: DistinctUserID | 0.77 | 0.74 | 0.74
simdjson: LargeRand | 0.45 | 0.45 | 0.45
rav1e: 6 | 1.074 | 1.108 | 1.079
kvazaar: Bosphorus 4K - Ultra Fast | 10.98 | 10.96 | 11.00
simdjson: PartialTweets | 0.73 | 0.71 | 0.73
waifu2x-ncnn: 2x - 3 - No | 24.781 | 25.940 | 25.933
libplacebo: av1_grain_lap | 711.86 | 711.51 | 711.76
libplacebo: hdr_peakdetect | 48607.60 | 49578.62 | 49012.32
libplacebo: polar_nocompute | 13186.91 | 13223.20 | 13109.55
libplacebo: deband_heavy | 18.95 | 18.95 | 19.04
redis: GET | 2540423.88 | 2339268.78 | 2314175.99
lammps: Rhodopsin Protein | 2.318 | 2.257 | 2.258
redis: LPUSH | 1611833.09 | 1622814.52 | 1655968.92
rav1e: 10 | 2.514 | 2.595 | 2.552
redis: SADD | 2112933.02 | 2140255.00 | 2104918.26
compress-lz4: 1 - Decompression Speed | 6623.3 | 6643.6 | 6648.6
compress-lz4: 1 - Compression Speed | 5671.51 | 5687.38 | 5697.27
betsy: ETC2 RGB - Highest | 6.711 | 6.668 | 6.414
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 6.86508 | 6.94358 | 8.22358
betsy: ETC1 - Highest | 6.975 | 6.281 | 6.374
phpbench: PHP Benchmark Suite | 710293 | 709965 | 709282
redis: SET | 1884493.33 | 1808870.97 | 1916548.16
kvazaar: Bosphorus 1080p - Very Fast | 24.57 | 24.56 | 24.60
yquake2: Software CPU - 3840 x 2160 | 27.8 | 28.0 | 28.5
coremark: CoreMark Size 666 - Iterations Per Second | 159958.257514 | 159234.482059 | 161328.403945
waifu2x-ncnn: 2x - 3 - Yes | 6.950 | 6.535 | 6.824
sunflow: Global Illumination + Image Synthesis | 2.280 | 2.293 | 2.329
encode-wavpack: WAV To WavPack | 13.794 | 13.726 | 13.724
onednn: IP Shapes 3D - u8s8f32 - CPU | 4.25914 | 4.16433 | 5.29626
redis: LPOP | 2687331.83 | 1682927.16 | 1695087.33
onednn: IP Shapes 3D - f32 - CPU | 14.2521 | 15.0741 | 15.4215
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 12.3939 | 12.6036 | 12.0974
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 10.7460 | 10.7215 | 10.7784
encode-ape: WAV To APE | 12.188 | 12.243 | 12.217
x265: Bosphorus 1080p | 29.94 | 30.16 | 29.89
onednn: IP Shapes 1D - f32 - CPU | 9.14529 | 9.09658 | 9.18743
onednn: IP Shapes 1D - u8s8f32 - CPU | 5.21940 | 5.17862 | 5.18374
astcenc: Medium | 10.39 | 10.39 | 10.39
kvazaar: Bosphorus 1080p - Ultra Fast | 43.43 | 43.68 | 43.61
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 6.16771 | 6.06418 | 6.08402
mafft: Multiple Sequence Alignment - LSU RNA | 12.045 | 11.925 | 11.904
yquake2: OpenGL 3.x - 3840 x 2160 | 62.8 | 62.6 | 62.7
yquake2: Software CPU - 2560 x 1440 | 60.3 | 59.7 | 60.8
yquake2: OpenGL 1.x - 3840 x 2160 | 67.1 | 67.2 | 67.0
basis: UASTC Level 0 | 10.580 | 10.722 | 10.495
astcenc: Fast | 7.65 | 7.70 | 7.70
yquake2: Software CPU - 1920 x 1080 | 98.0 | 97.6 | 97.8
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 26.0715 | 26.1400 | 25.7435
onednn: Convolution Batch Shapes Auto - f32 - CPU | 26.4185 | 26.9226 | 27.0588
yquake2: OpenGL 3.x - 2560 x 1440 | 125.9 | 125.8 | 125.9
yquake2: OpenGL 1.x - 2560 x 1440 | 130.4 | 129.8 | 131.0
yquake2: OpenGL 3.x - 1920 x 1080 | 200.9 | 200.1 | 200.5
yquake2: OpenGL 1.x - 1920 x 1080 | 206.8 | 208.2 | 207.6
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 11.3264 | 11.2860 | 11.1641
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 16.5404 | 17.5334 | 16.6347
hpcc: Max Ping Pong Bandwidth | 19281.572 | 19517.860 | 18703.028
hpcc: Rand Ring Bandwidth | 3.39365 | 2.82967 | 3.21355
hpcc: Rand Ring Latency | 0.27879 | 0.32700 | 0.27847
hpcc: G-Rand Access | 0.02650 | 0.02625 | 0.02713
hpcc: EP-STREAM Triad | 4.03790 | 4.04885 | 4.06038
hpcc: G-Ptrans | 0.49398 | 0.49518 | 0.47980
hpcc: EP-DGEMM | 43.79087 | 42.53747 | 44.68027
hpcc: G-Ffte | 2.06526 | 1.95684 | 1.89310

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

HPC Challenge 1.5.0 - Test / Class: G-HPL (GFLOPS, More Is Better)
Run 3: 92.16 (SE +/- 1.45, N = 3)
Run 2: 90.14 (SE +/- 1.01, N = 6)
Run 1: 90.31 (SE +/- 1.21, N = 4)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3
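Each result is reported with an "SE +/-" figure and a sample count N. Assuming SE denotes the standard error of the mean (sample standard deviation divided by the square root of N), it can be sketched as below. The per-trial readings are hypothetical, since this result file lists only the averages.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical GFLOPS readings for one run (NOT taken from this result file).
hypothetical_samples = [89.0, 90.0, 91.0, 92.0]

mean = statistics.mean(hypothetical_samples)
se = standard_error(hypothetical_samples)
print(f"{mean:.2f} GFLOPS, SE +/- {se:.2f}, N = {len(hypothetical_samples)}")
```

A larger N shrinks the standard error for the same spread, which is one reason noisier tests tend to be repeated more often.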

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 2 + RDO Post-Processing (Seconds, Fewer Is Better)
Run 3: 972.13 (SE +/- 12.55, N = 4)
Run 2: 949.41 (SE +/- 2.85, N = 3)
Run 1: 945.48 (SE +/- 1.28, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Exhaustive (Seconds, Fewer Is Better)
Run 3: 568.09 (SE +/- 0.25, N = 3)
Run 2: 567.56 (SE +/- 0.07, N = 3)
Run 1: 568.11 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

CLOMP 1.2 - Static OMP Speedup (Speedup, More Is Better)
Run 3: 1.3 (SE +/- 0.03, N = 12)
Run 2: 1.1 (SE +/- 0.05, N = 12)
Run 1: 1.3 (SE +/- 0.05, N = 12)
1. (CC) gcc options: -fopenmp -O3 -lm
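For context on the speed-up metric: a speed-up is conventionally the serial runtime divided by the parallel runtime, and dividing it by the thread count gives the parallel efficiency. The sketch below applies that to the reported values. It assumes CLOMP's figure follows this conventional definition and that all 8 hardware threads were in use; the timings and helper names are hypothetical.

```python
def speedup(serial_seconds: float, parallel_seconds: float) -> float:
    """Speed-up of a parallel run relative to the serial reference run."""
    return serial_seconds / parallel_seconds


def parallel_efficiency(speedup_value: float, threads: int) -> float:
    """Fraction of ideal linear scaling achieved (1.0 means perfect scaling)."""
    return speedup_value / threads


# Hypothetical timings, chosen only so the ratio lands on 1.3.
example = speedup(serial_seconds=2.6, parallel_seconds=2.0)

# Speed-ups reported in this result file for runs 1, 2 and 3.
reported = {1: 1.3, 2: 1.1, 3: 1.3}
THREADS = 8  # assumption: all 8 hardware threads of the i7-4790K were used

for run, value in reported.items():
    efficiency = parallel_efficiency(value, THREADS)
    print(f"run {run}: speed-up {value}, efficiency {efficiency:.1%}")
```

Under those assumptions, a speed-up near 1.3 on 8 threads means threading overhead consumes most of the potential gain in this workload, which is what CLOMP is designed to expose.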

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2020.3 - Water Benchmark (Ns Per Day, More Is Better)
Run 3: 0.408 (SE +/- 0.006, N = 4)
Run 2: 0.413 (SE +/- 0.003, N = 3)
Run 1: 0.410 (SE +/- 0.005, N = 3)
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
Run 3: 2.16 (SE +/- 0.00, N = 3)
Run 2: 2.16 (SE +/- 0.00, N = 3)
Run 1: 2.16 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
Run 3: 2.21 (SE +/- 0.00, N = 3)
Run 2: 2.22 (SE +/- 0.00, N = 3)
Run 1: 2.21 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code that offers Cargo-like features. Learn more via the OpenBenchmarking.org test page.

Build2 0.13 - Time To Compile (Seconds, Fewer Is Better)
Run 3: 254.65 (SE +/- 2.21, N = 3)
Run 2: 250.26 (SE +/- 1.16, N = 3)
Run 1: 252.46 (SE +/- 1.13, N = 3)

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with a built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

BRL-CAD 7.30.8 - VGR Performance Metric (VGR Performance Metric, More Is Better)
Run 3: 48599
Run 2: 48237
Run 1: 48175
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better)
Run 3: 12309486 (SE +/- 130255.47, N = 3)
Run 2: 12254973 (SE +/- 117949.09, N = 3)
Run 1: 12249161 (SE +/- 93407.88, N = 3)

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

eSpeak-NG Speech Engine 20200907 - Text-To-Speech Synthesis (Seconds, Fewer Is Better)
Run 3: 51.21 (SE +/- 0.37, N = 4)
Run 2: 51.47 (SE +/- 0.86, N = 17)
Run 1: 49.73 (SE +/- 1.23, N = 16)
1. (CC) gcc options: -O2 -std=c99

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
Run 3: 316.25 (SE +/- 0.35, N = 3)
Run 2: 313.69 (SE +/- 0.64, N = 3)
Run 1: 313.68 (SE +/- 0.12, N = 3)

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 3 (Seconds, Fewer Is Better)
Run 3: 145.87 (SE +/- 0.06, N = 3)
Run 2: 145.79 (SE +/- 0.28, N = 3)
Run 1: 145.88 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
Run 3: 5.1860 (SE +/- 0.0051, N = 3; MIN: 5.15 / MAX: 5.26)
Run 2: 5.2101 (SE +/- 0.0125, N = 3; MIN: 5.17 / MAX: 5.28)
Run 1: 5.2035 (SE +/- 0.0107, N = 3; MIN: 5.16 / MAX: 5.26)

Embree 3.9.0 - Binary: Pathtracer - Model: Crown (Frames Per Second, More Is Better)
Run 3: 4.7225 (SE +/- 0.0074, N = 3; MIN: 4.69 / MAX: 4.82)
Run 2: 4.7423 (SE +/- 0.0037, N = 3; MIN: 4.71 / MAX: 4.82)
Run 1: 4.7507 (SE +/- 0.0091, N = 3; MIN: 4.71 / MAX: 4.83)

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

Timed FFmpeg Compilation 4.2.2 - Time To Compile (Seconds, Fewer Is Better)
Run 3: 125.68 (SE +/- 0.37, N = 3)
Run 2: 125.79 (SE +/- 0.58, N = 3)
Run 1: 124.83 (SE +/- 0.10, N = 3)

Timed HMMer Search

This test searches through the Pfam database of profile hidden Markov models. The search finds the domain structure of the Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

Timed HMMer Search 3.3.1 - Pfam Database Search (Seconds, Fewer Is Better)
Run 3: 123.83 (SE +/- 0.06, N = 3)
Run 2: 123.81 (SE +/- 0.12, N = 3)
Run 1: 123.73 (SE +/- 0.12, N = 3)
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
Run 3: 5.8849 (SE +/- 0.0272, N = 3; MIN: 5.81 / MAX: 5.99)
Run 2: 5.9266 (SE +/- 0.0196, N = 3; MIN: 5.87 / MAX: 6.03)
Run 1: 5.8902 (SE +/- 0.0176, N = 3; MIN: 5.83 / MAX: 6)

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
Run 3: 5.3392 (SE +/- 0.0582, N = 3; MIN: 5.19 / MAX: 5.51)
Run 2: 5.3469 (SE +/- 0.0293, N = 3; MIN: 5.26 / MAX: 5.47)
Run 1: 5.3888 (SE +/- 0.0366, N = 3; MIN: 5.28 / MAX: 5.51)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: Vulkan GPU - Model: regnety_400m (ms, Fewer Is Better)
Run 3: 16.59 (SE +/- 0.05, N = 3; MIN: 16.38 / MAX: 20.04)
Run 2: 17.41 (SE +/- 0.68, N = 3; MIN: 16.52 / MAX: 269.38)
Run 1: 16.57 (SE +/- 0.10, N = 3; MIN: 16.28 / MAX: 16.95)

NCNN 20201218 - Target: Vulkan GPU - Model: squeezenet_ssd (ms, Fewer Is Better)
Run 3: 34.18 (SE +/- 0.14, N = 3; MIN: 33.24 / MAX: 46.06)
Run 2: 34.18 (SE +/- 0.27, N = 3; MIN: 32.78 / MAX: 44.2)
Run 1: 33.77 (SE +/- 0.38, N = 3; MIN: 32.69 / MAX: 43.46)

NCNN 20201218 - Target: Vulkan GPU - Model: yolov4-tiny (ms, Fewer Is Better)
Run 3: 44.16 (SE +/- 0.15, N = 3; MIN: 43.39 / MAX: 57.44)
Run 2: 45.80 (SE +/- 0.74, N = 3; MIN: 43.83 / MAX: 58.33)
Run 1: 46.17 (SE +/- 0.68, N = 3; MIN: 44.33 / MAX: 61)

NCNN 20201218 - Target: Vulkan GPU - Model: resnet50 (ms, Fewer Is Better)
Run 3: 52.10 (SE +/- 0.55, N = 3; MIN: 50.52 / MAX: 64.77)
Run 2: 60.76 (SE +/- 8.74, N = 3; MIN: 51.25 / MAX: 1056.17)
Run 1: 51.24 (SE +/- 0.54, N = 3; MIN: 50.5 / MAX: 66.36)

NCNN 20201218 - Target: Vulkan GPU - Model: alexnet (ms, Fewer Is Better)
Run 3: 22.28 (SE +/- 0.10, N = 3; MIN: 21.96 / MAX: 22.64)
Run 2: 22.41 (SE +/- 0.17, N = 3; MIN: 21.96 / MAX: 34.32)
Run 1: 22.14 (SE +/- 0.13, N = 3; MIN: 21.81 / MAX: 24.68)

NCNN 20201218 - Target: Vulkan GPU - Model: resnet18 (ms, Fewer Is Better)
Run 3: 25.77 (SE +/- 0.32, N = 3; MIN: 25 / MAX: 26.95)
Run 2: 25.81 (SE +/- 0.19, N = 3; MIN: 25.13 / MAX: 40.98)
Run 1: 25.79 (SE +/- 0.29, N = 3; MIN: 24.99 / MAX: 28.18)

NCNN 20201218 - Target: Vulkan GPU - Model: vgg16 (ms, Fewer Is Better)
Run 3: 109.11 (SE +/- 0.21, N = 3; MIN: 108.06 / MAX: 125.36)
Run 2: 109.70 (SE +/- 0.32, N = 3; MIN: 108.72 / MAX: 136.67)
Run 1: 109.45 (SE +/- 0.26, N = 3; MIN: 108.54 / MAX: 123.94)

NCNN 20201218 - Target: Vulkan GPU - Model: googlenet (ms, Fewer Is Better)
Run 3: 23.97 (SE +/- 0.07, N = 3; MIN: 23.72 / MAX: 36.58)
Run 2: 23.53 (SE +/- 0.06, N = 3; MIN: 23.31 / MAX: 26.84)
Run 1: 23.63 (SE +/- 0.08, N = 3; MIN: 23.33 / MAX: 36.77)

NCNN 20201218 - Target: Vulkan GPU - Model: blazeface (ms, Fewer Is Better)
Run 3: 2.61 (SE +/- 0.03, N = 3; MIN: 2.53 / MAX: 2.69)
Run 2: 7.67 (SE +/- 5.13, N = 3; MIN: 2.41 / MAX: 416.67)
Run 1: 2.58 (SE +/- 0.02, N = 3; MIN: 2.47 / MAX: 2.74)

NCNN 20201218 - Target: Vulkan GPU - Model: efficientnet-b0 (ms, Fewer Is Better)
Run 3: 11.56 (SE +/- 0.03, N = 3; MIN: 11.28 / MAX: 25.04)
Run 2: 11.71 (SE +/- 0.08, N = 3; MIN: 11.47 / MAX: 14.37)
Run 1: 11.33 (SE +/- 0.03, N = 3; MIN: 11.18 / MAX: 14.87)

NCNN 20201218 - Target: Vulkan GPU - Model: mnasnet (ms, Fewer Is Better)
Run 3: 6.81 (SE +/- 0.10, N = 3; MIN: 6.53 / MAX: 7.12)
Run 2: 6.96 (SE +/- 0.09, N = 3; MIN: 6.68 / MAX: 21.89)
Run 1: 6.80 (SE +/- 0.06, N = 3; MIN: 6.62 / MAX: 10.18)

NCNN 20201218 - Target: Vulkan GPU - Model: shufflenet-v2 (ms, Fewer Is Better)
Run 3: 9.49 (SE +/- 0.00, N = 3; MIN: 9.39 / MAX: 11.92)
Run 2: 9.45 (SE +/- 0.03, N = 3; MIN: 9.35 / MAX: 12.21)
Run 1: 9.52 (SE +/- 0.02, N = 3; MIN: 9.42 / MAX: 11.19)

NCNN 20201218 - Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Run 3: 7.06 (SE +/- 0.02, N = 3; MIN: 6.87 / MAX: 10.89)
Run 2: 7.14 (SE +/- 0.02, N = 3; MIN: 6.95 / MAX: 21.34)
Run 1: 7.05 (SE +/- 0.05, N = 3; MIN: 6.84 / MAX: 8.55)

NCNN 20201218 - Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Run 3: 8.59 (SE +/- 0.04, N = 3; MIN: 8.33 / MAX: 13.54)
Run 2: 9.10 (SE +/- 0.01, N = 3; MIN: 8.89 / MAX: 12.03)
Run 1: 8.35 (SE +/- 0.13, N = 3; MIN: 7.99 / MAX: 21.73)

NCNN 20201218 - Target: Vulkan GPU - Model: mobilenet (ms, Fewer Is Better)
Run 3: 31.53 (SE +/- 0.09, N = 3; MIN: 31.22 / MAX: 33.07)
Run 2: 31.76 (SE +/- 0.24, N = 3; MIN: 31.13 / MAX: 48.02)
Run 1: 31.35 (SE +/- 0.06, N = 3; MIN: 31.01 / MAX: 43.41)

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread (applies to every NCNN Vulkan GPU result above)

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
Run 3: 5.5696 (SE +/- 0.0036, N = 3; MIN: 5.5 / MAX: 5.65)
Run 2: 5.6139 (SE +/- 0.0161, N = 3; MIN: 5.54 / MAX: 5.72)
Run 1: 5.5806 (SE +/- 0.0040, N = 3; MIN: 5.53 / MAX: 5.66)

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

NCNN 20201218 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better)
Run 3: 16.45 (SE +/- 0.12, N = 3; MIN: 16.16 / MAX: 17.78)
Run 2: 16.58 (SE +/- 0.05, N = 3; MIN: 16.23 / MAX: 31.03)
Run 1: 16.39 (SE +/- 0.05, N = 3; MIN: 16.23 / MAX: 18.48)

NCNN 20201218 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better)
Run 3: 33.46 (SE +/- 0.27, N = 3; MIN: 32.94 / MAX: 43.71)
Run 2: 34.57 (SE +/- 0.57, N = 3; MIN: 33.72 / MAX: 36.89)
Run 1: 33.22 (SE +/- 0.07, N = 3; MIN: 32.95 / MAX: 34.34)

NCNN 20201218 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better)
Run 3: 44.45 (SE +/- 0.38, N = 3; MIN: 43.34 / MAX: 46.59)
Run 2: 44.90 (SE +/- 0.29, N = 3; MIN: 44.01 / MAX: 52.5)
Run 1: 44.08 (SE +/- 0.72, N = 3; MIN: 42.69 / MAX: 58.93)

NCNN 20201218 - Target: CPU - Model: resnet50 (ms, Fewer Is Better)
Run 3: 51.48 (SE +/- 0.47, N = 3; MIN: 50.22 / MAX: 66.07)
Run 2: 51.56 (SE +/- 0.27, N = 3; MIN: 50.54 / MAX: 64.38)
Run 1: 50.69 (SE +/- 0.10, N = 3; MIN: 50.36 / MAX: 63.24)

NCNN 20201218 - Target: CPU - Model: alexnet (ms, Fewer Is Better)
Run 3: 22.25 (SE +/- 0.05, N = 3; MIN: 22 / MAX: 36.1)
Run 2: 22.43 (SE +/- 0.04, N = 3; MIN: 21.97 / MAX: 35.74)
Run 1: 22.24 (SE +/- 0.12, N = 3; MIN: 21.89 / MAX: 24.72)

NCNN 20201218 - Target: CPU - Model: resnet18 (ms, Fewer Is Better)
Run 3: 25.78 (SE +/- 0.08, N = 3; MIN: 25.3 / MAX: 26.66)
Run 2: 26.11 (SE +/- 0.14, N = 3; MIN: 25.75 / MAX: 41.28)
Run 1: 25.78 (SE +/- 0.07, N = 3; MIN: 25.38 / MAX: 39.91)

NCNN 20201218 - Target: CPU - Model: vgg16 (ms, Fewer Is Better)
Run 3: 108.96 (SE +/- 0.12, N = 3; MIN: 108.21 / MAX: 115.52)
Run 2: 110.79 (SE +/- 0.82, N = 3; MIN: 109.24 / MAX: 127.6)
Run 1: 109.27 (SE +/- 0.57, N = 3; MIN: 108.04 / MAX: 121.43)

NCNN 20201218 - Target: CPU - Model: googlenet (ms, Fewer Is Better)
Run 3: 23.85 (SE +/- 0.07, N = 3; MIN: 23.58 / MAX: 37.59)
Run 2: 24.27 (SE +/- 0.12, N = 3; MIN: 23.92 / MAX: 27)
Run 1: 23.93 (SE +/- 0.14, N = 3; MIN: 23.66 / MAX: 24.5)

NCNN 20201218 - Target: CPU - Model: blazeface (ms, Fewer Is Better)
Run 3: 2.55 (SE +/- 0.01, N = 3; MIN: 2.49 / MAX: 2.61)
Run 2: 2.64 (SE +/- 0.00, N = 3; MIN: 2.6 / MAX: 2.69)
Run 1: 2.57 (SE +/- 0.02, N = 3; MIN: 2.37 / MAX: 2.66)

NCNN 20201218 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better)
Run 3: 11.60 (SE +/- 0.13, N = 3; MIN: 11.37 / MAX: 26.59)
Run 2: 11.80 (SE +/- 0.10, N = 3; MIN: 11.52 / MAX: 12.14)
Run 1: 11.28 (SE +/- 0.05, N = 3; MIN: 10.83 / MAX: 19.59)

NCNN 20201218 - Target: CPU - Model: mnasnet (ms, Fewer Is Better)
Run 3: 6.84 (SE +/- 0.08, N = 3; MIN: 6.59 / MAX: 10.33)
Run 2: 7.04 (SE +/- 0.05, N = 3; MIN: 6.85 / MAX: 9.1)
Run 1: 6.83 (SE +/- 0.09, N = 3; MIN: 6.61 / MAX: 7.1)

NCNN 20201218 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better)
Run 3: 9.48 (SE +/- 0.02, N = 3; MIN: 9.35 / MAX: 13.34)
Run 2: 9.45 (SE +/- 0.04, N = 3; MIN: 9.32 / MAX: 12.68)
Run 1: 9.47 (SE +/- 0.04, N = 3; MIN: 9.36 / MAX: 12.12)

NCNN 20201218 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better)
Run 3: 7.10 (SE +/- 0.06, N = 3; MIN: 6.88 / MAX: 9.9)
Run 2: 7.09 (SE +/- 0.04, N = 3; MIN: 6.9 / MAX: 10.15)
Run 1: 6.91 (SE +/- 0.05, N = 3; MIN: 6.69 / MAX: 12.06)

NCNN 20201218 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better)
Run 3: 8.54 (SE +/- 0.13, N = 3; MIN: 8.16 / MAX: 12.1)
Run 2: 9.03 (SE +/- 0.05, N = 3; MIN: 8.75 / MAX: 11.63)
Run 1: 8.25 (SE +/- 0.10, N = 3; MIN: 8.02 / MAX: 11.6)

NCNN 20201218 - Target: CPU - Model: mobilenet (ms, Fewer Is Better)
Run 3: 31.64 (SE +/- 0.15, N = 3; MIN: 31.25 / MAX: 55.14)
Run 2: 32.22 (SE +/- 0.03, N = 3; MIN: 31.89 / MAX: 45.17)
Run 1: 31.47 (SE +/- 0.26, N = 3; MIN: 30.7 / MAX: 45.93)

1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread (applies to every NCNN CPU result above)

oneDNN

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 8475.51 (SE +/- 25.14, N = 3; MIN: 8432.62)
Run 2: 8426.98 (SE +/- 7.59, N = 3; MIN: 8406.64)
Run 1: 8480.05 (SE +/- 6.10, N = 3; MIN: 8460.68)

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Run 3: 8482.73 (SE +/- 20.61, N = 3; MIN: 8441.98)
Run 2: 8413.38 (SE +/- 13.64, N = 3; MIN: 8383.33)
Run 1: 8489.11 (SE +/- 11.29, N = 3; MIN: 8460.3)

oneDNN 2.0 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 8446.75 (SE +/- 15.35, N = 3; MIN: 8413.69)
Run 2: 8410.76 (SE +/- 20.81, N = 3; MIN: 8366.44)
Run 1: 8495.75 (SE +/- 17.63, N = 3; MIN: 8463.89)

1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread (applies to the three oneDNN results above)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
Run 3: 6.07 (SE +/- 0.00, N = 3)
Run 2: 6.07 (SE +/- 0.00, N = 3)
Run 1: 6.06 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.9.0 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
Run 3: 6.5417 (SE +/- 0.0287, N = 3; MIN: 6.45 / MAX: 6.74)
Run 2: 6.6290 (SE +/- 0.0431, N = 3; MIN: 6.51 / MAX: 6.81)
Run 1: 6.5662 (SE +/- 0.0417, N = 3; MIN: 6.46 / MAX: 6.79)

x265

This is a simple test of the x265 encoder run on the CPU, with 1080p and 4K options for measuring H.265 video encode performance. Learn more via the OpenBenchmarking.org test page.

x265 3.4 - Video Input: Bosphorus 4K (Frames Per Second, More Is Better)
Run 3: 6.56 (SE +/- 0.04, N = 3)
Run 2: 6.59 (SE +/- 0.03, N = 3)
Run 1: 6.55 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better)
Run 3: 4758.63 (SE +/- 24.18, N = 3; MIN: 4700.29)
Run 2: 4713.76 (SE +/- 16.47, N = 3; MIN: 4686.24)
Run 1: 4741.15 (SE +/- 15.27, N = 3; MIN: 4705.91)

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 4729.75 (SE +/- 19.23, N = 3; MIN: 4687.53)
Run 2: 4714.60 (SE +/- 5.23, N = 3; MIN: 4700.85)
Run 1: 4741.20 (SE +/- 22.26, N = 3; MIN: 4710.31)

oneDNN 2.0 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 4743.29 (SE +/- 3.10, N = 3; MIN: 4730.58)
Run 2: 4693.24 (SE +/- 15.09, N = 3; MIN: 4657.74)
Run 1: 4730.40 (SE +/- 0.95, N = 3; MIN: 4721.36)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
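One rough way to read the "SE +/-" figures is to express the gap between two runs in units of their combined standard error. The sketch below does this for the f32 Recurrent Neural Network Inference results of runs 3 and 2. It assumes that SE is the standard error of the mean over the N trials; the helper name `separation` is hypothetical, and with only N = 3 trials per run the figure is a loose heuristic, not a significance test.

```python
import math


def separation(mean_a, se_a, mean_b, se_b):
    """Gap between two means, in units of their combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# oneDNN RNN Inference, f32 (ms): run 3 vs run 2, values taken from the results above.
z = separation(4743.29, 3.10, 4693.24, 15.09)
print(f"runs 3 and 2 differ by {z:.1f} combined standard errors")
```

A value near or below 1 suggests the runs are indistinguishable at this sample size; larger values suggest the gap is more than trial-to-trial noise.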

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

Stockfish 12 - Total Time (Nodes Per Second, More Is Better)
Run 3: 7918799 (SE +/- 100608.19, N = 3)
Run 2: 7986287 (SE +/- 53934.69, N = 3)
Run 1: 7990112 (SE +/- 42720.90, N = 3)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 1 (Frames Per Second, More Is Better)
Run 3: 0.259 (SE +/- 0.002, N = 3)
Run 2: 0.261 (SE +/- 0.004, N = 3)
Run 1: 0.255 (SE +/- 0.003, N = 3)

Node.js V8 Web Tooling Benchmark

This test runs the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript workloads common to web developers, such as Babel, TypeScript, and Babylon. This test profile measures the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

Node.js V8 Web Tooling Benchmark (runs/s, More Is Better)
Run 3: 11.22 (SE +/- 0.04, N = 3)
Run 2: 11.27 (SE +/- 0.02, N = 3)
Run 1: 11.06 (SE +/- 0.09, N = 3)
1. Nodejs v10.15.2

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 2 (Seconds, Fewer Is Better)
Run 3: 74.44 (SE +/- 0.22, N = 3)
Run 2: 74.25 (SE +/- 0.08, N = 3)
Run 1: 74.39 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Thorough (Seconds, Fewer Is Better)
Run 3: 69.61 (SE +/- 0.01, N = 3)
Run 2: 69.61 (SE +/- 0.01, N = 3)
Run 1: 69.57 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: ETC1S (Seconds, Fewer Is Better)
Run 3: 73.28 (SE +/- 0.38, N = 3)
Run 2: 72.62 (SE +/- 0.68, N = 3)
Run 1: 73.39 (SE +/- 0.54, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 5 (Frames Per Second, More Is Better)
Run 3: 0.814 (SE +/- 0.006, N = 3)
Run 2: 0.823 (SE +/- 0.013, N = 3)
Run 1: 0.828 (SE +/- 0.005, N = 3)

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

Timed Eigen Compilation 3.3.9 - Time To Compile (Seconds, Fewer Is Better)
Run 3: 70.60 (SE +/- 0.04, N = 3)
Run 2: 70.73 (SE +/- 0.20, N = 3)
Run 1: 70.66 (SE +/- 0.05, N = 3)

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

SQLite Speedtest 3.30 - Timed Time - Size 1,000 (Seconds, Fewer Is Better)
Run 3: 70.60 (SE +/- 0.54, N = 3)
Run 2: 69.86 (SE +/- 0.12, N = 3)
Run 1: 69.41 (SE +/- 0.25, N = 3)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 - Throughput Test: Kostya (GB/s, More Is Better)
Run 3: 0.68 (SE +/- 0.00, N = 3)
Run 2: 0.68 (SE +/- 0.00, N = 3)
Run 1: 0.68 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Slow (Frames Per Second, More Is Better)
Run 3: 9.45 (SE +/- 0.01, N = 3)
Run 2: 9.46 (SE +/- 0.00, N = 3)
Run 1: 9.46 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better)
Run 3: 0.752 (SE +/- 0.001, N = 3)
Run 2: 0.754 (SE +/- 0.001, N = 3)
Run 1: 0.750 (SE +/- 0.002, N = 3)

IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better)
Run 3: 1.691 (SE +/- 0.002, N = 3)
Run 2: 1.687 (SE +/- 0.002, N = 3)
Run 1: 1.684 (SE +/- 0.005, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Medium (Frames Per Second, More Is Better)
Run 3: 9.73 (SE +/- 0.01, N = 3)
Run 2: 9.75 (SE +/- 0.00, N = 3)
Run 1: 9.73 (SE +/- 0.00, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

DDraceNetwork

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time (Milliseconds, Fewer Is Better)
Run 3: Min: 4.01 / Avg: 22.02 / Max: 31.07
Run 2: Min: 5.55 / Avg: 22.02 / Max: 28.23
Run 1: Min: 3.55 / Avg: 22.02 / Max: 28.05

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 (Frames Per Second, More Is Better)
Run 3: 45.44 (SE +/- 0.01, N = 3; MIN: 25.99 / MAX: 180.9)
Run 2: 45.40 (SE +/- 0.01, N = 3; MIN: 33.02 / MAX: 50.63)
Run 1: 45.46 (SE +/- 0.07, N = 3; MIN: 25.16 / MAX: 257.86)

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time (Milliseconds, Fewer Is Better)
Run 3: Min: 4.46 / Avg: 9.94 / Max: 17
Run 2: Min: 4.4 / Avg: 9.98 / Max: 16.99
Run 1: Min: 4.45 / Avg: 9.95 / Max: 18.41

DDraceNetwork 15.2.3 - Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap (Frames Per Second, More Is Better)
Run 3: 100.85 (SE +/- 0.07, N = 3; MIN: 37.89 / MAX: 224.47)
Run 2: 100.65 (SE +/- 0.12, N = 3; MIN: 58.03 / MAX: 227.27)
Run 1: 101.15 (SE +/- 0.26, N = 3; MIN: 54.32 / MAX: 224.92)

DDraceNetwork 15.2.3 - Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time (Milliseconds, Fewer Is Better)
Run 3: Min: 5.77 / Avg: 34.02 / Max: 65.99
Run 2: Min: 5.76 / Avg: 33.96 / Max: 57.53
Run 1: Min: 6.24 / Avg: 33.97 / Max: 65.29

DDraceNetwork 15.2.3 - Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap (Frames Per Second, More Is Better)
Run 3: 29.53 (SE +/- 0.03, N = 3; MIN: 15.15 / MAX: 181.72)
Run 2: 29.60 (SE +/- 0.01, N = 3; MIN: 14.76 / MAX: 161.92)
Run 1: 29.58 (SE +/- 0.01, N = 3; MIN: 14.88 / MAX: 175.93)

DDraceNetwork 15.2.3 - Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time (Milliseconds, Fewer Is Better)
Run 3: Min: 72.0 / Avg: 76.7 / Max: 82.4
Run 2: Min: 71.6 / Avg: 76.6 / Max: 84.9
Run 1: Min: 72.3 / Avg: 76.8 / Max: 85.1

DDraceNetwork 15.2.3 - Resolution: 3840 x 2160 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 (Frames Per Second, More Is Better)
Run 3: 13.03 (SE +/- 0.02, N = 3; MIN: 11.77 / MAX: 14.01)
Run 2: 13.05 (SE +/- 0.00, N = 3; MIN: 10.32 / MAX: 14.01)
Run 1: 13.03 (SE +/- 0.01, N = 3; MIN: 11.75 / MAX: 13.98)
1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0
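The DDraceNetwork results report both an average total frame time and a frame rate for each configuration, and the two are approximately reciprocal. As a quick cross-check, the sketch below converts the 22.02 ms average frame time of the 1920 x 1080 RaiNyMore2 demo into frames per second and compares it with the reported rates. The helper name `fps_from_frame_time` is hypothetical, and small gaps are expected because the two figures are averaged and rounded separately.

```python
def fps_from_frame_time(avg_ms):
    """Convert an average frame time in milliseconds to frames per second."""
    return 1000.0 / avg_ms


# 1920 x 1080, RaiNyMore2: the average frame time was 22.02 ms in all three runs.
estimated = fps_from_frame_time(22.02)
reported = [45.44, 45.40, 45.46]  # runs 3, 2, 1

for fps in reported:
    print(f"reported {fps:.2f} FPS, estimated {estimated:.2f} FPS, gap {fps - estimated:+.2f}")
```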

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 9 - Decompression Speed (MB/s, More Is Better)
Run 3: 6536.6 (SE +/- 1.35, N = 3)
Run 2: 6537.8 (SE +/- 5.32, N = 3)
Run 1: 6532.7 (SE +/- 3.54, N = 3)

LZ4 Compression 1.9.3 - Compression Level: 9 - Compression Speed (MB/s, More Is Better)
Run 3: 50.11 (SE +/- 0.85, N = 3)
Run 2: 51.28 (SE +/- 0.01, N = 3)
Run 1: 51.28 (SE +/- 0.01, N = 3)

LZ4 Compression 1.9.3 - Compression Level: 3 - Decompression Speed (MB/s, More Is Better)
Run 3: 6523.6 (SE +/- 1.37, N = 3)
Run 2: 6527.5 (SE +/- 1.58, N = 3)
Run 1: 6530.1 (SE +/- 1.59, N = 3)

LZ4 Compression 1.9.3 - Compression Level: 3 - Compression Speed (MB/s, More Is Better)
Run 3: 51.92 (SE +/- 0.23, N = 3)
Run 2: 52.40 (SE +/- 0.02, N = 3)
Run 1: 52.25 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -O3

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 - Throughput Test: DistinctUserID (GB/s, More Is Better)
Run 3: 0.74 (SE +/- 0.01, N = 3)
Run 2: 0.74 (SE +/- 0.01, N = 4)
Run 1: 0.77 (SE +/- 0.01, N = 3)

simdjson 0.7.1 - Throughput Test: LargeRandom (GB/s, More Is Better)
Run 3: 0.45 (SE +/- 0.00, N = 3)
Run 2: 0.45 (SE +/- 0.00, N = 3)
Run 1: 0.45 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 6 (Frames Per Second, More Is Better)
Run 3: 1.079 (SE +/- 0.013, N = 3)
Run 2: 1.108 (SE +/- 0.004, N = 3)
Run 1: 1.074 (SE +/- 0.011, N = 3)

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
Run 3: 11.00 (SE +/- 0.02, N = 3)
Run 2: 10.96 (SE +/- 0.01, N = 3)
Run 1: 10.98 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.7.1 - Throughput Test: PartialTweets (GB/s, More Is Better)
Run 3: 0.73 (SE +/- 0.00, N = 3)
Run 2: 0.71 (SE +/- 0.01, N = 3)
Run 1: 0.73 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -pthread

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: No (Seconds, Fewer Is Better)
Run 3: 25.93 (SE +/- 0.06, N = 3)
Run 2: 25.94 (SE +/- 0.11, N = 3)
Run 1: 24.78 (SE +/- 1.00, N = 12)

Libplacebo

Libplacebo is a multimedia rendering library based on the core rendering code of the MPV player. The libplacebo benchmark relies on the Vulkan API and tests various primitives. Learn more via the OpenBenchmarking.org test page.

Libplacebo 2.72.2 - Test: av1_grain_lap (FPS, More Is Better)
Run 3: 711.76 (SE +/- 0.22, N = 3)
Run 2: 711.51 (SE +/- 0.36, N = 3)
Run 1: 711.86 (SE +/- 0.24, N = 3)

Libplacebo 2.72.2 - Test: hdr_peakdetect (FPS, More Is Better)
Run 3: 49012.32 (SE +/- 107.39, N = 3)
Run 2: 49578.62 (SE +/- 90.21, N = 3)
Run 1: 48607.60 (SE +/- 736.78, N = 3)

Libplacebo 2.72.2 - Test: polar_nocompute (FPS, More Is Better)
Run 3: 13109.55 (SE +/- 44.87, N = 3)
Run 2: 13223.20 (SE +/- 37.92, N = 3)
Run 1: 13186.91 (SE +/- 29.01, N = 3)

Libplacebo 2.72.2 - Test: deband_heavy (FPS, More Is Better)
Run 3: 19.04 (SE +/- 0.09, N = 3)
Run 2: 18.95 (SE +/- 0.00, N = 3)
Run 1: 18.95 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -lm -lglslang -lHLSL -lOGLCompiler -lOSDependent -lSPIRV -lSPVRemapper -lpthread -pthread -pipe -std=c++11 -fvisibility=hidden -fPIC -MD -MQ -MF

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: GET (Requests Per Second, More Is Better)
Run 3: 2314175.99 (SE +/- 48510.25, N = 12)
Run 2: 2339268.78 (SE +/- 23853.74, N = 8)
Run 1: 2540423.88 (SE +/- 21911.47, N = 13)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 - Model: Rhodopsin Protein (ns/day, More Is Better)
Run 3: 2.258 (SE +/- 0.031, N = 15)
Run 2: 2.257 (SE +/- 0.052, N = 14)
Run 1: 2.318 (SE +/- 0.064, N = 12)
1. (CXX) g++ options: -O3 -pthread -lm

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: LPUSH (Requests Per Second, More Is Better)
Run 3: 1655968.92 (SE +/- 5741.58, N = 3)
Run 2: 1622814.52 (SE +/- 20312.72, N = 15)
Run 1: 1611833.09 (SE +/- 35412.98, N = 12)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.4 Alpha - Speed: 10 (Frames Per Second, More Is Better)
Run 3: 2.552 (SE +/- 0.015, N = 3)
Run 2: 2.595 (SE +/- 0.044, N = 3)
Run 1: 2.514 (SE +/- 0.012, N = 3)

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: SADD (Requests Per Second, More Is Better)
Run 3: 2104918.26 (SE +/- 30741.46, N = 13)
Run 2: 2140255.00 (SE +/- 8400.74, N = 3)
Run 1: 2112933.02 (SE +/- 33704.56, N = 12)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

LZ4 Compression 1.9.3 - Compression Level: 1 - Decompression Speed (MB/s, More Is Better)
Run 3: 6648.6 (SE +/- 1.39, N = 3)
Run 2: 6643.6 (SE +/- 4.53, N = 3)
Run 1: 6623.3 (SE +/- 10.10, N = 3)

LZ4 Compression 1.9.3 - Compression Level: 1 - Compression Speed (MB/s, More Is Better)
Run 3: 5697.27 (SE +/- 2.69, N = 3)
Run 2: 5687.38 (SE +/- 5.97, N = 3)
Run 1: 5671.51 (SE +/- 7.62, N = 3)
1. (CC) gcc options: -O3

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

Betsy GPU Compressor 1.1 Beta - Codec: ETC2 RGB - Quality: Highest (Seconds, Fewer Is Better)
Run 3: 6.414 (SE +/- 0.127, N = 12)
Run 2: 6.668 (SE +/- 0.117, N = 15)
Run 1: 6.711 (SE +/- 0.431, N = 15)
1. (CXX) g++ options: -O3 -O2 -lpthread -ldl

oneDNN

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 8.22358 (SE +/- 0.11002, N = 15; MIN: 7.36)
Run 2: 6.94358 (SE +/- 0.09722, N = 3; MIN: 6.67)
Run 1: 6.86508 (SE +/- 0.00679, N = 3; MIN: 6.69)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
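The first-listed run of this oneDNN harness is visibly slower than the other two, and a percentage makes the size of that gap easier to judge. The sketch below expresses each result relative to the fastest one; it assumes the chart lists the runs in the order 3, 2, 1, and the helper name `percent_slower` is hypothetical.

```python
def percent_slower(value, best):
    """How much slower `value` is than `best`, in percent (lower-is-better metrics)."""
    return (value - best) / best * 100.0


# oneDNN Matrix Multiply Batch Shapes Transformer, f32 (ms, fewer is better).
results = {"run 3": 8.22358, "run 2": 6.94358, "run 1": 6.86508}
best = min(results.values())
slowdowns = {name: percent_slower(ms, best) for name, ms in results.items()}

for name, pct in slowdowns.items():
    print(f"{name}: {pct:.1f}% slower than the fastest run")
```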

Betsy GPU Compressor

Betsy is an open-source GPU compressor of various GPU compression techniques. Betsy is written in GLSL for Vulkan/OpenGL (compute shader) support for GPU-based texture compression. Learn more via the OpenBenchmarking.org test page.

Betsy GPU Compressor 1.1 Beta - Codec: ETC1 - Quality: Highest (Seconds, Fewer Is Better)
Run 3: 6.374 (SE +/- 0.143, N = 12)
Run 2: 6.281 (SE +/- 0.134, N = 12)
Run 1: 6.975 (SE +/- 0.422, N = 15)
1. (CXX) g++ options: -O3 -O2 -lpthread -ldl

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1 - PHP Benchmark Suite (Score, More Is Better)
Run 3: 709282 (SE +/- 709.37, N = 3)
Run 2: 709965 (SE +/- 2148.66, N = 3)
Run 1: 710293 (SE +/- 448.34, N = 3)

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: SET (Requests Per Second, More Is Better)
Run 3: 1916548.16 (SE +/- 25199.46, N = 3)
Run 2: 1808870.97 (SE +/- 49673.53, N = 15)
Run 1: 1884493.33 (SE +/- 29746.00, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Very Fast (Frames Per Second, More Is Better)
Run 3: 24.60 (SE +/- 0.01, N = 3)
Run 2: 24.56 (SE +/- 0.01, N = 3)
Run 1: 24.57 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

yquake2 7.45 - Renderer: Software CPU - Resolution: 3840 x 2160 (Frames Per Second, More Is Better)
Run 3: 28.5 (SE +/- 0.03, N = 3)
Run 2: 28.0 (SE +/- 0.24, N = 3)
Run 1: 27.8 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
Run 3: 161328.40 (SE +/- 104.15, N = 3)
Run 2: 159234.48 (SE +/- 1169.64, N = 3)
Run 1: 159958.26 (SE +/- 270.17, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

Waifu2x-NCNN Vulkan

Waifu2x-NCNN is an NCNN neural network implementation of the Waifu2x converter project and accelerated using the Vulkan API. NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. This test profile times how long it takes to increase the resolution of a sample image with Vulkan. Learn more via the OpenBenchmarking.org test page.

Waifu2x-NCNN Vulkan 20200818 - Scale: 2x - Denoise: 3 - TAA: Yes (Seconds, Fewer Is Better)
Run 3: 6.824 (SE +/- 0.085, N = 15)
Run 2: 6.535 (SE +/- 0.096, N = 4)
Run 1: 6.950 (SE +/- 0.059, N = 12)

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

Sunflow Rendering System 0.07.2 - Global Illumination + Image Synthesis (Seconds, Fewer Is Better)
Run 3: 2.329 (SE +/- 0.024, N = 3; MIN: 2.21 / MAX: 3.04)
Run 2: 2.293 (SE +/- 0.019, N = 3; MIN: 2.17 / MAX: 2.99)
Run 1: 2.280 (SE +/- 0.012, N = 3; MIN: 2.17 / MAX: 2.91)

WavPack Audio Encoding

WavPack Audio Encoding 5.3 - WAV To WavPack (Seconds, Fewer Is Better)
Run 3: 13.72 (SE +/- 0.00, N = 5)
Run 2: 13.73 (SE +/- 0.00, N = 5)
Run 1: 13.79 (SE +/- 0.04, N = 5)
1. (CXX) g++ options: -rdynamic

oneDNN

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 5.29626 (SE +/- 0.09775, N = 15; MIN: 4.25)
Run 2: 4.16433 (SE +/- 0.00978, N = 3; MIN: 4.05)
Run 1: 4.25914 (SE +/- 0.00331, N = 3; MIN: 4.15)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 6.0.9 - Test: LPOP (Requests Per Second, More Is Better)
Run 3: 1695087.33 (SE +/- 2760.10, N = 3)
Run 2: 1682927.16 (SE +/- 86194.23, N = 13)
Run 1: 2687331.83 (SE +/- 37569.90, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
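The Redis results show how the trial count N rises when results are noisy, and the SE figure alone hides how wide the underlying spread is. Assuming SE is the standard error of the mean (standard deviation divided by the square root of N), the per-trial standard deviation can be recovered as in this sketch, shown for the LPOP result with N = 13. The helper name `std_dev_from_se` is hypothetical.

```python
import math


def std_dev_from_se(se, n):
    """Recover the sample standard deviation from a standard error of the mean."""
    return se * math.sqrt(n)


# Redis LPOP result with N = 13: mean 1682927.16 requests/s, SE +/- 86194.23.
mean = 1682927.16
sd = std_dev_from_se(86194.23, 13)
cv = sd / mean  # coefficient of variation: spread relative to the mean

print(f"estimated standard deviation: {sd:,.0f} requests/s ({cv:.1%} of the mean)")
```

A per-trial spread of this size is worth keeping in mind when comparing this result with the three-trial results beside it.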

oneDNN

oneDNN 2.0 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 15.42 (SE +/- 0.15, N = 15; MIN: 14.53)
Run 2: 15.07 (SE +/- 0.13, N = 3; MIN: 14.72)
Run 1: 14.25 (SE +/- 0.01, N = 3; MIN: 14.13)

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 12.10 (SE +/- 0.06, N = 3; MIN: 10.95)
Run 2: 12.60 (SE +/- 0.18, N = 3; MIN: 11.17)
Run 1: 12.39 (SE +/- 0.20, N = 3; MIN: 11.1)

oneDNN 2.0 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 10.78 (SE +/- 0.11, N = 3; MIN: 10.43)
Run 2: 10.72 (SE +/- 0.01, N = 3; MIN: 9.95)
Run 1: 10.75 (SE +/- 0.03, N = 3; MIN: 10.34)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

Monkey Audio Encoding 3.99.6 - WAV To APE (Seconds, Fewer Is Better)
Run 3: 12.22 (SE +/- 0.06, N = 5)
Run 2: 12.24 (SE +/- 0.04, N = 5)
Run 1: 12.19 (SE +/- 0.02, N = 5)
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

x265 3.4 - Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Run 3: 29.89 (SE +/- 0.07, N = 3)
Run 2: 30.16 (SE +/- 0.14, N = 3)
Run 1: 29.94 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 9.18743 (SE +/- 0.04734, N = 3; MIN: 8.82)
Run 2: 9.09658 (SE +/- 0.14569, N = 3; MIN: 8.69)
Run 1: 9.14529 (SE +/- 0.03949, N = 3; MIN: 8.74)

oneDNN 2.0 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 5.18374 (SE +/- 0.01621, N = 3; MIN: 4.84)
Run 2: 5.17862 (SE +/- 0.00946, N = 3; MIN: 4.94)
Run 1: 5.21940 (SE +/- 0.00525, N = 3; MIN: 4.85)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Medium (Seconds, Fewer Is Better)
Run 3: 10.39 (SE +/- 0.00, N = 3)
Run 2: 10.39 (SE +/- 0.00, N = 3)
Run 1: 10.39 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.0 - Video Input: Bosphorus 1080p - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
Run 3: 43.61 (SE +/- 0.06, N = 3)
Run 2: 43.68 (SE +/- 0.06, N = 3)
Run 1: 43.43 (SE +/- 0.06, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

oneDNN 2.0 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 6.08402 (SE +/- 0.06654, N = 3; MIN: 5.81)
Run 2: 6.06418 (SE +/- 0.01547, N = 3; MIN: 5.69)
Run 1: 6.16771 (SE +/- 0.01016, N = 3; MIN: 5.88)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

Timed MAFFT Alignment 7.471 - Multiple Sequence Alignment - LSU RNA (Seconds, Fewer Is Better)
Run 3: 11.90 (SE +/- 0.04, N = 3)
Run 2: 11.93 (SE +/- 0.04, N = 3)
Run 1: 12.05 (SE +/- 0.05, N = 3)
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

yquake2 7.45 - Renderer: OpenGL 3.x - Resolution: 3840 x 2160 (Frames Per Second, More Is Better)
Run 3: 62.7 (SE +/- 0.00, N = 3)
Run 2: 62.6 (SE +/- 0.03, N = 3)
Run 1: 62.8 (SE +/- 0.12, N = 3)

yquake2 7.45 - Renderer: Software CPU - Resolution: 2560 x 1440 (Frames Per Second, More Is Better)
Run 3: 60.8 (SE +/- 0.19, N = 3)
Run 2: 59.7 (SE +/- 0.43, N = 3)
Run 1: 60.3 (SE +/- 0.31, N = 3)

yquake2 7.45 - Renderer: OpenGL 1.x - Resolution: 3840 x 2160 (Frames Per Second, More Is Better)
Run 3: 67.0 (SE +/- 0.06, N = 3)
Run 2: 67.2 (SE +/- 0.22, N = 3)
Run 1: 67.1 (SE +/- 0.09, N = 3)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Universal assets with various settings. Learn more via the OpenBenchmarking.org test page.

Basis Universal 1.12 - Settings: UASTC Level 0 (Seconds, Fewer Is Better)
Run 3: 10.50 (SE +/- 0.02, N = 3)
Run 2: 10.72 (SE +/- 0.16, N = 3)
Run 1: 10.58 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.0 - Preset: Fast (Seconds, Fewer Is Better)
Run 3: 7.70 (SE +/- 0.02, N = 3)
Run 2: 7.70 (SE +/- 0.02, N = 3)
Run 1: 7.65 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

yquake2

This is a test of Yamagi Quake II. Yamagi Quake II is an enhanced client for id Software's Quake II with focus on offline and coop gameplay. Learn more via the OpenBenchmarking.org test page.

yquake2 7.45 - Renderer: Software CPU - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
Run 3: 97.8 (SE +/- 0.38, N = 3)
Run 2: 97.6 (SE +/- 0.98, N = 3)
Run 1: 98.0 (SE +/- 0.87, N = 3)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

oneDNN

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 25.74 (SE +/- 0.09, N = 3; MIN: 25.45)
Run 2: 26.14 (SE +/- 0.02, N = 3; MIN: 25.95)
Run 1: 26.07 (SE +/- 0.03, N = 3; MIN: 25.78)

oneDNN 2.0 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Run 3: 27.06 (SE +/- 0.16, N = 3; MIN: 26.33)
Run 2: 26.92 (SE +/- 0.03, N = 3; MIN: 26.51)
Run 1: 26.42 (SE +/- 0.02, N = 3; MIN: 26.19)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

yquake2

yquake2 7.45 - Renderer: OpenGL 3.x - Resolution: 2560 x 1440 (Frames Per Second, More Is Better)
Results in chart order (runs 3, 2, 1): 125.9, 125.8, 125.9
SE +/- 0.17, 0.07, 0.12 (N = 3 each)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2 7.45 - Renderer: OpenGL 1.x - Resolution: 2560 x 1440 (Frames Per Second, More Is Better)
Results in chart order (runs 3, 2, 1): 131.0, 129.8, 130.4
SE +/- 0.12, 0.55, 0.37 (N = 3 each)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2 7.45 - Renderer: OpenGL 3.x - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
Results in chart order (runs 3, 2, 1): 200.5, 200.1, 200.9
SE +/- 0.95, 0.23, 0.53 (N = 3 each)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

yquake2 7.45 - Renderer: OpenGL 1.x - Resolution: 1920 x 1080 (Frames Per Second, More Is Better)
Results in chart order (runs 3, 2, 1): 207.6, 208.2, 206.8
SE +/- 0.32, 0.55, 0.42 (N = 3 each)
1. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

oneDNN

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Results in chart order (runs 3, 2, 1): 11.16, 11.29, 11.33
SE +/- 0.05, 0.01, 0.02 (N = 3 each)
MIN: 10.46, 10.69, 10.76
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN 2.0 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Results in chart order (runs 3, 2, 1): 16.63, 17.53, 16.54
SE +/- 0.12, 0.26, 0.05 (N = 3 each)
MIN: 16.28, 16.55, 16.27
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency tests. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files, though they can be modified. Learn more via the OpenBenchmarking.org test page.

HPC Challenge 1.5.0 - Test / Class: Max Ping Pong Bandwidth (MB/s, More Is Better)
Results in chart order (runs 3, 2, 1): 18703.03, 19517.86, 19281.57
SE +/- 289.88, 639.68, 146.25 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: Random Ring Bandwidth (GB/s, More Is Better)
Results in chart order (runs 3, 2, 1): 3.21355, 2.82967, 3.39365
SE +/- 0.24447, 0.10194, 0.09620 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: Random Ring Latency (usecs, Fewer Is Better)
Results in chart order (runs 3, 2, 1): 0.27847, 0.32700, 0.27879
SE +/- 0.04588, 0.00582, 0.04635 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: G-Random Access (GUP/s, More Is Better)
Results in chart order (runs 3, 2, 1): 0.02713, 0.02625, 0.02650
SE +/- 0.00050, 0.00077, 0.00052 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: EP-STREAM Triad (GB/s, More Is Better)
Results in chart order (runs 3, 2, 1): 4.06038, 4.04885, 4.03790
SE +/- 0.01163, 0.00333, 0.01197 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: G-Ptrans (GB/s, More Is Better)
Results in chart order (runs 3, 2, 1): 0.47980, 0.49518, 0.49398
SE +/- 0.01039, 0.03821, 0.01258 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: EP-DGEMM (GFLOPS, More Is Better)
Results in chart order (runs 3, 2, 1): 44.68, 42.54, 43.79
SE +/- 0.66, 0.10, 0.53 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3

HPC Challenge 1.5.0 - Test / Class: G-Ffte (GFLOPS, More Is Better)
Results in chart order (runs 3, 2, 1): 1.89310, 1.95684, 2.06526
SE +/- 0.07276, 0.13431, 0.08108 (N = 3 each)
1. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops
2. ATLAS + Open MPI 3.1.3
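
Since all three runs come from the same hardware, it can help to check whether the run-to-run differences in the HPC Challenge numbers are larger than the reported standard errors. The following is a minimal Python sketch of one way to do that, not part of the Phoronix Test Suite. The values and SE figures are copied from four of the charts above in chart order; the function names and the "gap larger than the two standard errors combined" rule are assumptions chosen for illustration, and a rough heuristic rather than a formal significance test.

```python
# Values and standard errors copied from the HPC Challenge charts above,
# in the order the charts list them. Only four of the eight tests are
# included to keep the sketch short.
RESULTS = {
    "EP-DGEMM (GFLOPS)": ([44.68, 42.54, 43.79], [0.66, 0.10, 0.53]),
    "EP-STREAM Triad (GB/s)": ([4.06038, 4.04885, 4.03790], [0.01163, 0.00333, 0.01197]),
    "Random Ring Bandwidth (GB/s)": ([3.21355, 2.82967, 3.39365], [0.24447, 0.10194, 0.09620]),
    "G-Ffte (GFLOPS)": ([1.89310, 1.95684, 2.06526], [0.07276, 0.13431, 0.08108]),
}


def spread_percent(values):
    """Return the max-min range of the values as a percentage of their mean."""
    mean = sum(values) / len(values)
    return (max(values) - min(values)) / mean * 100.0


def gap_exceeds_noise(values, errors):
    """Hypothetical heuristic: is the best-to-worst gap larger than the
    standard errors of those two runs added together?"""
    hi = values.index(max(values))
    lo = values.index(min(values))
    return (values[hi] - values[lo]) > (errors[hi] + errors[lo])


if __name__ == "__main__":
    for name, (values, errors) in RESULTS.items():
        verdict = "above" if gap_exceeds_noise(values, errors) else "within"
        print(f"{name}: spread {spread_percent(values):.2f}% of mean, {verdict} reported SE")
```

With N = 3 per result, a gap that only slightly exceeds the combined standard errors is weak evidence of a real difference, so the output is best read as a prompt for re-running a test rather than as a conclusion.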

148 Results Shown

HPC Challenge
Basis Universal
ASTC Encoder
CLOMP
GROMACS
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
Build2
BRL-CAD
asmFish
eSpeak-NG Speech Engine
Numpy Benchmark
Basis Universal
Embree:
  Pathtracer - Asian Dragon Obj
  Pathtracer - Crown
Timed FFmpeg Compilation
Timed HMMer Search
Embree:
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer ISPC - Crown
NCNN:
  Vulkan GPU - regnety_400m
  Vulkan GPU - squeezenet_ssd
  Vulkan GPU - yolov4-tiny
  Vulkan GPU - resnet50
  Vulkan GPU - alexnet
  Vulkan GPU - resnet18
  Vulkan GPU - vgg16
  Vulkan GPU - googlenet
  Vulkan GPU - blazeface
  Vulkan GPU - efficientnet-b0
  Vulkan GPU - mnasnet
  Vulkan GPU - shufflenet-v2
  Vulkan GPU-v3-v3 - mobilenet-v3
  Vulkan GPU-v2-v2 - mobilenet-v2
  Vulkan GPU - mobilenet
Embree
NCNN:
  CPU - regnety_400m
  CPU - squeezenet_ssd
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
oneDNN:
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Training - f32 - CPU
Kvazaar
Embree
x265
oneDNN:
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
Stockfish
rav1e
Node.js V8 Web Tooling Benchmark
Basis Universal
ASTC Encoder
Basis Universal
rav1e
Timed Eigen Compilation
SQLite Speedtest
simdjson
Kvazaar
IndigoBench:
  CPU - Bedroom
  CPU - Supercar
Kvazaar
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
DDraceNetwork
LZ4 Compression:
  9 - Decompression Speed
  9 - Compression Speed
  3 - Decompression Speed
  3 - Compression Speed
simdjson:
  DistinctUserID
  LargeRand
rav1e
Kvazaar
simdjson
Waifu2x-NCNN Vulkan
Libplacebo:
  av1_grain_lap
  hdr_peakdetect
  polar_nocompute
  deband_heavy
Redis
LAMMPS Molecular Dynamics Simulator
Redis
rav1e
Redis
LZ4 Compression:
  1 - Decompression Speed
  1 - Compression Speed
Betsy GPU Compressor
oneDNN
Betsy GPU Compressor
PHPBench
Redis
Kvazaar
yquake2
Coremark
Waifu2x-NCNN Vulkan
Sunflow Rendering System
WavPack Audio Encoding
oneDNN
Redis
oneDNN:
  IP Shapes 3D - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
Monkey Audio Encoding
x265
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
ASTC Encoder
Kvazaar
oneDNN
Timed MAFFT Alignment
yquake2:
  OpenGL 3.x - 3840 x 2160
  Software CPU - 2560 x 1440
  OpenGL 1.x - 3840 x 2160
Basis Universal
ASTC Encoder
yquake2
oneDNN:
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
yquake2:
  OpenGL 3.x - 2560 x 1440
  OpenGL 1.x - 2560 x 1440
  OpenGL 3.x - 1920 x 1080
  OpenGL 1.x - 1920 x 1080
oneDNN:
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
HPC Challenge:
  Max Ping Pong Bandwidth
  Rand Ring Bandwidth
  Rand Ring Latency
  G-Rand Access
  EP-STREAM Triad
  G-Ptrans
  EP-DGEMM
  G-Ffte