Core i7 6800K Ubuntu 20.10

Intel Core i7-6800K testing with a MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS) and Zotac NVIDIA NV137 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2101059-HA-COREI768045&grs.

Core i7 6800K Ubuntu 20.10ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionOpenGLRun 1Run 2Run 3Intel Core i7-6800K @ 3.80GHz (6 Cores / 12 Threads)MSI X99A WORKSTATION (MS-7A54) v1.0 (1.10 BIOS)Intel Xeon E7 v4/Xeon16GB120GB TOSHIBA TR150Zotac NVIDIA GeForce GTX 1050Realtek ALC1150G237HLIntel I218-LM + Intel I210Ubuntu 20.105.8.0-33-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.9modesetting 1.20.9GCC 10.2.0ext41920x1080Zotac NVIDIA NV137 2GB4.3 Mesa 20.2.1OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_cpufreq ondemand - CPU Microcode: 0xb000038Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

Core i7 6800K Ubuntu 20.10redis: LPOPhpcc: EP-STREAM Triadastcenc: Mediumastcenc: Thoroughlammps: Rhodopsin Proteinredis: SEThpcc: Rand Ring Latencyncnn: CPU - resnet18redis: GETonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - f32 - CPUredis: SADDonednn: IP Shapes 3D - u8s8f32 - CPUncnn: CPU - resnet50ncnn: CPU - regnety_400mncnn: CPU-v2-v2 - mobilenet-v2hpcc: G-Rand Accessbasis: UASTC Level 0onednn: IP Shapes 1D - u8s8f32 - CPUncnn: CPU - efficientnet-b0onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUx265: Bosphorus 4Konednn: Deconvolution Batch shapes_3d - f32 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondasmfish: 1024 Hash Memory, 26 Depthhpcc: EP-DGEMMx265: Bosphorus 1080prav1e: 5encode-opus: WAV To Opus Encodenode-web-tooling: embree: Pathtracer - Asian Dragonmafft: Multiple Sequence Alignment - LSU RNAbuild-ffmpeg: Time To Compilernnoise: astcenc: Exhaustiverav1e: 1ncnn: CPU-v3-v3 - mobilenet-v3onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUdeepspeech: CPUembree: Pathtracer ISPC - Asian Dragonbasis: UASTC Level 2 + RDO Post-Processinghpcc: Max Ping Pong Bandwidthindigobench: CPU - Supercarredis: LPUSHstockfish: Total Timeonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUncnn: CPU - mobilenetonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUbasis: ETC1Sembree: Pathtracer ISPC - Crownncnn: CPU - yolov4-tinyrav1e: 10encode-wavpack: WAV To WavPackgromacs: Water Benchmarkembree: Pathtracer - Crownncnn: CPU - alexnetrav1e: 6yquake2: Software CPU - 1920 x 1080indigobench: CPU - Bedroomai-benchmark: Device Inference Scorebuild-linux-kernel: Time To Compilebuild-eigen: Time To Compilencnn: CPU - googlenetncnn: CPU - squeezenet_ssdhpcc: G-HPLonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUhmmer: Pfam Database Searchhpcc: G-Ptransonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUcompress-lz4: 1 - Compression Speedphpbench: PHP Benchmark Suiteonednn: Deconvolution Batch shapes_1d - f32 - CPUdav1d: Summer Nature 1080pbuild2: Time To Compileonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUdav1d: Summer Nature 4Kkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 4K - Mediumai-benchmark: Device AI Scoreonednn: Convolution Batch Shapes Auto - f32 - CPUx264: H.264 Video Encodingkvazaar: Bosphorus 1080p - Ultra Fastcompress-zstd: 19crafty: Elapsed Timekvazaar: Bosphorus 4K - Very Fastdav1d: Chimera 1080pbrl-cad: VGR Performance Metricunpack-firefox: firefox-84.0.source.tar.xzdav1d: Chimera 1080p 10-bitonednn: Recurrent Neural Network Inference - u8s8f32 - CPUbasis: UASTC Level 3compress-lz4: 9 - Decompression Speedkvazaar: Bosphorus 1080p - Mediumcompress-zstd: 3sqlite-speedtest: Timed Time - Size 1,000ncnn: CPU - vgg16yafaray: Total Time For Sample Sceneencode-ape: WAV To APEkvazaar: Bosphorus 4K - Ultra Fastcompress-lz4: 1 - Decompression Speedai-benchmark: Device Training Scorecompress-lz4: 3 - Decompression Speedkvazaar: Bosphorus 1080p - Very Fastonednn: Recurrent Neural Network Training - u8s8f32 - CPUbasis: UASTC Level 2compress-lz4: 3 - Compression Speedcompress-lz4: 9 - Compression Speedastcenc: Fastkvazaar: Bosphorus 4K - Slowsimdjson: DistinctUserIDsimdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyaclomp: Static OMP Speedupncnn: CPU - blazefacencnn: CPU - mnasnetncnn: CPU - shufflenet-v2espeak: Text-To-Speech Synthesisonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUhpcc: Rand Ring Bandwidthhpcc: G-FfteRun 1Run 2Run 32200466.425.346058.8355.322.6031511980.130.4341255.652022289.5410.0500213.02741786977.083.81378173.2458.0727.040.0241010.8519.3283035.759.479298.6415.9999192969.4080091568364037.2311335.220.8289.4628.907.521211.416100.24825.152427.410.27717.2212.7573109.378009.2012957.16512775.5542.2751314301.941041422911020.075.0725.931068.9287.366479.512.37116.2730.5686.410635.401.09181.70.979815158.69499.30469.7759.7186.188727391.52131.7462.5027611058.97394.075400.2862211019.0877322.35264.64717.1744102.4511.002.75169616.945352.0552.7431.370348607.71361.597029123.66267.767400.87113.0126298.211.332971.679.758131.95272.34314.39714.046392.08816291.629.3811071.458.97943.2142.297.542.690.710.690.390.581.29.1117.0335.1239.6738.209942.511292.073871361545.285.133928.2052.222.7111574421.250.4487356.421965096.139.8073212.66051750815.543.77356169.8856.9727.220.0239710.8729.1674535.169.514038.6015.8302192628.8634871573916037.3829334.750.8189.5118.797.613711.554100.00924.959422.820.28017.0612.6702109.662309.1127966.39612753.0612.2541311290.871033690211062.674.5325.709068.6837.314679.192.36916.1570.5686.434035.201.08581.20.977820159.02198.96970.1760.0586.508607430.60131.0592.5128311071.27385.865389.9861952819.0626323.32265.49617.2067102.7411.042.76170216.999351.9752.9231.270472147.73362.527011623.60467.717404.31112.9286286.911.352966.579.637131.74271.97514.38114.046384.48826285.029.3511074.258.95143.2142.307.542.690.710.690.390.581.29.0118.1435.7940.5697.182952.360572.085541354046.585.544508.3152.172.6431558674.520.4363557.441960201.4610.095812.71061746188.193.73038171.1357.7027.560.0244111.0489.2105235.719.637478.7415.7450195702.0212031590280636.8729335.100.8299.5858.897.533011.472101.16125.239422.910.27817.0412.6246110.48519.1881962.77812872.7002.2741323367.411043042711119.075.1925.777469.2437.311578.922.3540.5646.453335.431.09281.30.983158.06499.54469.9959.7886.659727414.46131.3472.5007011111.77418.025377.6119.1393323.64265.69217.2416102.3411.012.7516.939452.1552.7831.270249697.71361.9267.607415.85113.1376295.411.332969.079.631131.81272.39514.37614.026385.26291.529.3611063.858.99443.2442.287.542.690.710.690.390.581.28.7017.2335.2640.8717.212892.449272.07874OpenBenchmarking.org

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOPRun 1Run 2Run 3500K1000K1500K2000K2500KSE +/- 25730.97, N = 3SE +/- 14358.36, N = 5SE +/- 6791.19, N = 32200466.421361545.281354046.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM TriadRun 1Run 2Run 31.24752.4953.74254.996.2375SE +/- 0.07070, N = 3SE +/- 0.09991, N = 3SE +/- 0.01514, N = 35.346055.133925.544501. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: MediumRun 1Run 2Run 3246810SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 38.838.208.311. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ThoroughRun 1Run 2Run 31224364860SE +/- 0.19, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 355.3252.2252.171. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin ProteinRun 1Run 2Run 30.611.221.832.443.05SE +/- 0.027, N = 15SE +/- 0.040, N = 15SE +/- 0.005, N = 32.6032.7112.6431. (CXX) g++ options: -O3 -pthread -lm

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SETRun 1Run 2Run 3300K600K900K1200K1500KSE +/- 8648.91, N = 3SE +/- 16434.34, N = 3SE +/- 14536.11, N = 71511980.131574421.251558674.521. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring LatencyRun 1Run 2Run 30.1010.2020.3030.4040.505SE +/- 0.01254, N = 3SE +/- 0.00111, N = 3SE +/- 0.01324, N = 30.434120.448730.436351. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet18Run 1Run 2Run 31326395265SE +/- 0.53, N = 3SE +/- 0.38, N = 3SE +/- 0.93, N = 455.6556.4257.44MIN: 29.24 / MAX: 174.71MIN: 29.6 / MAX: 196.71MIN: 32.72 / MAX: 234.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GETRun 1Run 2Run 3400K800K1200K1600K2000KSE +/- 12019.14, N = 3SE +/- 19126.07, N = 3SE +/- 21525.04, N = 32022289.541965096.131960201.461. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPURun 1Run 2Run 33691215SE +/- 0.09931, N = 3SE +/- 0.03591, N = 3SE +/- 0.04430, N = 310.050029.8073210.09580MIN: 8.56MIN: 8.52MIN: 8.611. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPURun 1Run 2Run 33691215SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 313.0312.6612.71MIN: 6.76MIN: 6.05MIN: 5.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADDRun 1Run 2Run 3400K800K1200K1600K2000KSE +/- 8709.31, N = 3SE +/- 5919.78, N = 3SE +/- 19777.69, N = 41786977.081750815.541746188.191. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 30.85811.71622.57433.43244.2905SE +/- 0.00333, N = 3SE +/- 0.00200, N = 3SE +/- 0.01990, N = 33.813783.773563.73038MIN: 3.12MIN: 3.18MIN: 3.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet50Run 1Run 2Run 34080120160200SE +/- 0.53, N = 3SE +/- 2.32, N = 3SE +/- 1.71, N = 4173.24169.88171.13MIN: 100.86 / MAX: 302.11MIN: 98.02 / MAX: 304.64MIN: 94.76 / MAX: 337.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400mRun 1Run 2Run 31326395265SE +/- 0.08, N = 3SE +/- 0.25, N = 3SE +/- 0.54, N = 458.0756.9757.70MIN: 51.31 / MAX: 195.89MIN: 51.81 / MAX: 108.77MIN: 51.85 / MAX: 196.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v2Run 1Run 2Run 3612182430SE +/- 0.47, N = 3SE +/- 0.64, N = 3SE +/- 0.60, N = 427.0427.2227.56MIN: 15.02 / MAX: 98.6MIN: 6.36 / MAX: 114.32MIN: 7.54 / MAX: 160.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random AccessRun 1Run 2Run 30.00550.0110.01650.0220.0275SE +/- 0.00028, N = 3SE +/- 0.00072, N = 3SE +/- 0.00023, N = 30.024100.023970.024411. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 0Run 1Run 2Run 33691215SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.11, N = 1510.8510.8711.051. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 33691215SE +/- 0.06357, N = 3SE +/- 0.03003, N = 3SE +/- 0.04781, N = 39.328309.167459.21052MIN: 3.98MIN: 3.97MIN: 3.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b0Run 1Run 2Run 3816243240SE +/- 0.37, N = 3SE +/- 0.38, N = 3SE +/- 0.39, N = 435.7535.1635.71MIN: 19.97 / MAX: 132.42MIN: 21.2 / MAX: 117.05MIN: 21.19 / MAX: 116.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 33691215SE +/- 0.03038, N = 3SE +/- 0.01988, N = 3SE +/- 0.03653, N = 39.479299.514039.63747MIN: 4.26MIN: 4.29MIN: 4.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KRun 1Run 2Run 3246810SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 38.648.608.741. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPURun 1Run 2Run 348121620SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 316.0015.8315.75MIN: 12.18MIN: 11.82MIN: 11.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondRun 1Run 2Run 340K80K120K160K200KSE +/- 1296.05, N = 3SE +/- 1995.67, N = 4SE +/- 441.31, N = 3192969.41192628.86195702.021. (CC) gcc options: -O2 -lrt" -lrt

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 DepthRun 1Run 2Run 33M6M9M12M15MSE +/- 154965.11, N = 6SE +/- 44605.31, N = 3SE +/- 137366.03, N = 7156836401573916015902806

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMMRun 1Run 2Run 3918273645SE +/- 0.60, N = 3SE +/- 0.40, N = 3SE +/- 0.49, N = 337.2337.3836.871. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pRun 1Run 2Run 3816243240SE +/- 0.11, N = 3SE +/- 0.28, N = 3SE +/- 0.11, N = 335.2234.7535.101. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 5Run 1Run 2Run 30.18650.3730.55950.7460.9325SE +/- 0.011, N = 3SE +/- 0.006, N = 3SE +/- 0.003, N = 30.8280.8180.829

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeRun 1Run 2Run 33691215SE +/- 0.024, N = 5SE +/- 0.038, N = 5SE +/- 0.058, N = 259.4629.5119.5851. (CXX) g++ options: -fvisibility=hidden -logg -lm

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling BenchmarkRun 1Run 2Run 3246810SE +/- 0.10, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 38.908.798.891. Nodejs v12.18.2

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian DragonRun 1Run 2Run 3246810SE +/- 0.0019, N = 3SE +/- 0.0742, N = 3SE +/- 0.0113, N = 37.52127.61377.5330MIN: 7.48 / MAX: 7.63MIN: 7.43 / MAX: 7.81MIN: 7.49 / MAX: 7.62

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNARun 1Run 2Run 33691215SE +/- 0.13, N = 3SE +/- 0.16, N = 3SE +/- 0.12, N = 311.4211.5511.471. (CC) gcc options: -std=c99 -O3 -lm -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To CompileRun 1Run 2Run 320406080100SE +/- 0.37, N = 3SE +/- 0.11, N = 3SE +/- 0.88, N = 3100.25100.01101.16

RNNoise

OpenBenchmarking.orgSeconds, Fewer Is BetterRNNoise 2020-06-28Run 1Run 2Run 3612182430SE +/- 0.21, N = 8SE +/- 0.10, N = 3SE +/- 0.28, N = 525.1524.9625.241. (CC) gcc options: -O2 -pedantic -fvisibility=hidden

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: ExhaustiveRun 1Run 2Run 390180270360450SE +/- 4.31, N = 6SE +/- 0.16, N = 3SE +/- 0.03, N = 3427.41422.82422.911. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 1Run 1Run 2Run 30.0630.1260.1890.2520.315SE +/- 0.004, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.2770.2800.278

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v3Run 1Run 2Run 348121620SE +/- 0.10, N = 3SE +/- 0.20, N = 3SE +/- 0.10, N = 417.2217.0617.04MIN: 9.81 / MAX: 65.85MIN: 9.88 / MAX: 85.95MIN: 9.51 / MAX: 72.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 33691215SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 312.7612.6712.62MIN: 8.27MIN: 8.03MIN: 7.631. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

DeepSpeech

Acceleration: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterDeepSpeech 0.6Acceleration: CPURun 1Run 2Run 320406080100SE +/- 0.34, N = 3SE +/- 0.33, N = 3SE +/- 0.23, N = 3109.38109.66110.49

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian DragonRun 1Run 2Run 33691215SE +/- 0.0541, N = 3SE +/- 0.0329, N = 3SE +/- 0.0422, N = 39.20129.11279.1881MIN: 9.04 / MAX: 9.42MIN: 9.01 / MAX: 9.31MIN: 9.09 / MAX: 9.41

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-ProcessingRun 1Run 2Run 32004006008001000SE +/- 4.07, N = 3SE +/- 1.57, N = 3SE +/- 11.07, N = 3957.17966.40962.781. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong BandwidthRun 1Run 2Run 33K6K9K12K15KSE +/- 41.78, N = 3SE +/- 19.51, N = 3SE +/- 46.85, N = 312775.5512753.0612872.701. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: SupercarRun 1Run 2Run 30.51191.02381.53572.04762.5595SE +/- 0.002, N = 3SE +/- 0.010, N = 3SE +/- 0.000, N = 32.2752.2542.274

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSHRun 1Run 2Run 3300K600K900K1200K1500KSE +/- 9622.95, N = 11SE +/- 15165.81, N = 3SE +/- 13263.10, N = 31314301.941311290.871323367.411. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total TimeRun 1Run 2Run 32M4M6M8M10MSE +/- 125432.49, N = 3SE +/- 55842.23, N = 3SE +/- 41101.79, N = 31041422910336902104304271. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPURun 1Run 2Run 32K4K6K8K10KSE +/- 10.72, N = 3SE +/- 26.91, N = 3SE +/- 17.48, N = 311020.011062.611119.0MIN: 10766MIN: 10773.5MIN: 10848.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenetRun 1Run 2Run 320406080100SE +/- 0.43, N = 3SE +/- 0.53, N = 3SE +/- 0.88, N = 475.0774.5375.19MIN: 52.17 / MAX: 259.67MIN: 53.79 / MAX: 180.72MIN: 52.93 / MAX: 204.621. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 3612182430SE +/- 0.09, N = 3SE +/- 0.01, N = 3SE +/- 0.18, N = 325.9325.7125.78MIN: 9.72MIN: 9.67MIN: 9.781. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1SRun 1Run 2Run 31530456075SE +/- 0.29, N = 3SE +/- 0.38, N = 3SE +/- 0.50, N = 368.9368.6869.241. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: CrownRun 1Run 2Run 3246810SE +/- 0.0045, N = 3SE +/- 0.0251, N = 3SE +/- 0.0111, N = 37.36647.31467.3115MIN: 7.29 / MAX: 7.5MIN: 7.22 / MAX: 7.48MIN: 7.24 / MAX: 7.45

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tinyRun 1Run 2Run 320406080100SE +/- 0.35, N = 3SE +/- 0.30, N = 3SE +/- 0.35, N = 479.5179.1978.92MIN: 57.15 / MAX: 128.6MIN: 59.35 / MAX: 182.68MIN: 59.82 / MAX: 157.441. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 10Run 1Run 2Run 30.53351.0671.60052.1342.6675SE +/- 0.013, N = 3SE +/- 0.025, N = 3SE +/- 0.015, N = 32.3712.3692.354

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPackRun 1Run 248121620SE +/- 0.13, N = 5SE +/- 0.00, N = 516.2716.161. (CXX) g++ options: -rdynamic

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water BenchmarkRun 1Run 2Run 30.12780.25560.38340.51120.639SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.004, N = 30.5680.5680.5641. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: CrownRun 1Run 2Run 3246810SE +/- 0.0067, N = 3SE +/- 0.0076, N = 3SE +/- 0.0142, N = 36.41066.43406.4533MIN: 6.35 / MAX: 6.51MIN: 6.37 / MAX: 6.53MIN: 6.39 / MAX: 6.57

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnetRun 1Run 2Run 3816243240SE +/- 0.39, N = 3SE +/- 0.17, N = 3SE +/- 0.33, N = 435.4035.2035.43MIN: 24.51 / MAX: 94.32MIN: 25.77 / MAX: 83.7MIN: 24.78 / MAX: 113.741. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 6Run 1Run 2Run 30.24570.49140.73710.98281.2285SE +/- 0.002, N = 3SE +/- 0.010, N = 3SE +/- 0.007, N = 31.0911.0851.092

yquake2

Renderer: Software CPU - Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is Betteryquake2 7.45Renderer: Software CPU - Resolution: 1920 x 1080Run 1Run 2Run 320406080100SE +/- 0.78, N = 3SE +/- 0.93, N = 3SE +/- 0.26, N = 381.781.281.31. (CC) gcc options: -lm -ldl -rdynamic -shared -lSDL2 -O2 -pipe -fomit-frame-pointer -std=gnu99 -fno-strict-aliasing -fwrapv -fvisibility=hidden -MMD -mfpmath=sse -fPIC

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: BedroomRun 1Run 2Run 30.22120.44240.66360.88481.106SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.9790.9770.983

AI Benchmark Alpha

Device Inference Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreRun 1Run 22004006008001000815820

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.4Time To CompileRun 1Run 2Run 34080120160200SE +/- 1.34, N = 3SE +/- 1.17, N = 3SE +/- 0.84, N = 3158.69159.02158.06

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To CompileRun 1Run 2Run 320406080100SE +/- 0.43, N = 3SE +/- 0.16, N = 3SE +/- 0.48, N = 399.3098.9799.54

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenetRun 1Run 2Run 31632486480SE +/- 0.42, N = 3SE +/- 0.96, N = 3SE +/- 0.44, N = 469.7770.1769.99MIN: 40.57 / MAX: 187.95MIN: 40.33 / MAX: 223.67MIN: 40.48 / MAX: 183.361. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssdRun 1Run 2Run 31326395265SE +/- 0.45, N = 3SE +/- 0.40, N = 3SE +/- 0.35, N = 459.7160.0559.78MIN: 43.82 / MAX: 136.53MIN: 43.01 / MAX: 117.51MIN: 44.03 / MAX: 108.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPLRun 1Run 2Run 320406080100SE +/- 0.94, N = 9SE +/- 1.20, N = 3SE +/- 0.89, N = 986.1986.5186.661. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPURun 1Run 2Run 316003200480064008000SE +/- 7.28, N = 3SE +/- 42.16, N = 3SE +/- 14.92, N = 37391.527430.607414.46MIN: 7132.96MIN: 7131.37MIN: 7129.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database SearchRun 1Run 2Run 3306090120150SE +/- 0.62, N = 3SE +/- 0.21, N = 3SE +/- 0.12, N = 3131.75131.06131.351. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-PtransRun 1Run 2Run 30.56541.13081.69622.26162.827SE +/- 0.00379, N = 3SE +/- 0.00766, N = 3SE +/- 0.00646, N = 32.502762.512832.500701. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPURun 1Run 2Run 32K4K6K8K10KSE +/- 10.33, N = 3SE +/- 4.29, N = 3SE +/- 7.07, N = 311058.911071.211111.7MIN: 10774.1MIN: 10804.1MIN: 10816.91. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPURun 1Run 2Run 316003200480064008000SE +/- 20.69, N = 3SE +/- 11.50, N = 3SE +/- 6.04, N = 37394.077385.867418.02MIN: 7102.56MIN: 7090.33MIN: 7150.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression SpeedRun 1Run 2Run 312002400360048006000SE +/- 2.58, N = 3SE +/- 1.99, N = 3SE +/- 0.16, N = 35400.285389.985377.611. (CC) gcc options: -O3

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark SuiteRun 1Run 2130K260K390K520K650KSE +/- 357.11, N = 3SE +/- 1039.69, N = 3622110619528

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPURun 1Run 2Run 3510152025SE +/- 0.16, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 319.0919.0619.14MIN: 8.36MIN: 8.04MIN: 8.131. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080pRun 1Run 2Run 370140210280350SE +/- 0.61, N = 3SE +/- 0.18, N = 3SE +/- 0.82, N = 3322.35323.32323.64MIN: 273.49 / MAX: 351.84MIN: 283.06 / MAX: 351.54MIN: 272 / MAX: 354.891. (CC) gcc options: -pthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To CompileRun 1Run 2Run 360120180240300SE +/- 0.63, N = 3SE +/- 0.57, N = 3SE +/- 0.98, N = 3264.65265.50265.69

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 348121620SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 317.1717.2117.24MIN: 12.71MIN: 13.27MIN: 12.661. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4KRun 1Run 2Run 320406080100SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.15, N = 3102.45102.74102.34MIN: 96.59 / MAX: 115.32MIN: 96.51 / MAX: 116.33MIN: 96.39 / MAX: 115.071. (CC) gcc options: -pthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: SlowRun 1Run 2Run 33691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 311.0011.0411.011. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: MediumRun 1Run 2Run 30.6211.2421.8632.4843.105SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.752.762.751. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

AI Benchmark Alpha

Device AI Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreRun 1Run 240080012001600200016961702

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPURun 1Run 2Run 348121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 316.9517.0016.94MIN: 14.04MIN: 13.62MIN: 13.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video EncodingRun 1Run 2Run 31224364860SE +/- 0.41, N = 9SE +/- 0.43, N = 8SE +/- 0.42, N = 952.0551.9752.151. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra FastRun 1Run 2Run 31224364860SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 352.7452.9252.781. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Zstd Compression

Compression Level: 19

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 19Run 1Run 2Run 3714212835SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 331.331.231.21. (CC) gcc options: -O3 -pthread -lz -llzma

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed TimeRun 1Run 2Run 31.5M3M4.5M6M7.5MSE +/- 16316.81, N = 3SE +/- 5337.82, N = 3SE +/- 13562.60, N = 37034860704721470249691. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very FastRun 1Run 2Run 3246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 37.717.737.711. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080pRun 1Run 2Run 380160240320400SE +/- 0.56, N = 3SE +/- 1.27, N = 3SE +/- 1.19, N = 3361.59362.52361.92MIN: 267 / MAX: 561.79MIN: 267.49 / MAX: 564.49MIN: 266.52 / MAX: 567.251. (CC) gcc options: -pthread

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance MetricRun 1Run 215K30K45K60K75K70291701161. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xzRun 1Run 2612182430SE +/- 0.22, N = 20SE +/- 0.20, N = 2023.6623.60

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bitRun 1Run 2Run 31530456075SE +/- 0.07, N = 3SE +/- 0.13, N = 3SE +/- 0.01, N = 367.7667.7167.60MIN: 44.17 / MAX: 170.52MIN: 44.16 / MAX: 166.83MIN: 44.14 / MAX: 165.371. (CC) gcc options: -pthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 316003200480064008000SE +/- 12.71, N = 3SE +/- 0.26, N = 3SE +/- 4.87, N = 37400.877404.317415.85MIN: 7138.98MIN: 7147.81MIN: 7149.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3Run 1Run 2Run 3306090120150SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3113.01112.93113.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedRun 1Run 2Run 313002600390052006500SE +/- 2.81, N = 3SE +/- 4.84, N = 3SE +/- 3.26, N = 36298.26286.96295.41. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: MediumRun 1Run 2Run 33691215SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 311.3311.3511.331. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Zstd Compression

Compression Level: 3

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 3Run 1Run 2Run 36001200180024003000SE +/- 3.47, N = 3SE +/- 4.29, N = 3SE +/- 7.85, N = 32971.62966.52969.01. (CC) gcc options: -O3 -pthread -lz -llzma

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,000Run 1Run 2Run 320406080100SE +/- 0.26, N = 3SE +/- 0.02, N = 3SE +/- 0.12, N = 379.7679.6479.631. (CC) gcc options: -O2 -ldl -lz -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg16Run 1Run 2Run 3306090120150SE +/- 0.58, N = 3SE +/- 0.47, N = 3SE +/- 0.72, N = 4131.95131.74131.81MIN: 105.72 / MAX: 192.42MIN: 104.27 / MAX: 191.62MIN: 98.07 / MAX: 183.141. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

YafaRay

Total Time For Sample Scene

OpenBenchmarking.orgSeconds, Fewer Is BetterYafaRay 3.4.1Total Time For Sample SceneRun 1Run 2Run 360120180240300SE +/- 0.45, N = 3SE +/- 0.37, N = 3SE +/- 0.28, N = 3272.34271.98272.401. (CXX) g++ options: -std=c++11 -O3 -ffast-math -rdynamic -ldl -lImath -lIlmImf -lIex -lHalf -lz -lIlmThread -lxml2 -lfreetype -lpthread

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APERun 1Run 2Run 348121620SE +/- 0.07, N = 5SE +/- 0.06, N = 24SE +/- 0.08, N = 514.4014.3814.381. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra FastRun 1Run 2Run 348121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 314.0414.0414.021. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression SpeedRun 1Run 2Run 314002800420056007000SE +/- 1.39, N = 3SE +/- 3.86, N = 3SE +/- 0.96, N = 36392.06384.46385.21. (CC) gcc options: -O3

AI Benchmark Alpha

Device Training Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreRun 1Run 22004006008001000881882

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedRun 1Run 2Run 313002600390052006500SE +/- 1.24, N = 3SE +/- 2.97, N = 3SE +/- 3.87, N = 36291.66285.06291.51. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very FastRun 1Run 2Run 3714212835SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.09, N = 329.3829.3529.361. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPURun 1Run 2Run 32K4K6K8K10KSE +/- 8.16, N = 3SE +/- 10.87, N = 3SE +/- 11.56, N = 311071.411074.211063.8MIN: 10803.3MIN: 10796.5MIN: 10764.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2Run 1Run 2Run 31326395265SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 358.9858.9558.991. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedRun 1Run 2Run 31020304050SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 343.2143.2143.241. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedRun 1Run 2Run 31020304050SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 342.2942.3042.281. (CC) gcc options: -O3

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: FastRun 1Run 2Run 3246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 37.547.547.541. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: SlowRun 1Run 2Run 30.60531.21061.81592.42123.0265SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.692.692.691. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserIDRun 1Run 2Run 30.15980.31960.47940.63920.799SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.710.710.711. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweetsRun 1Run 2Run 30.15530.31060.46590.62120.7765SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.690.690.691. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandomRun 1Run 2Run 30.08780.17560.26340.35120.439SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.390.390.391. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: KostyaRun 1Run 2Run 30.13050.2610.39150.5220.6525SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.580.580.581. (CXX) g++ options: -O3 -pthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP SpeedupRun 1Run 2Run 30.270.540.811.081.35SE +/- 0.01, N = 15SE +/- 0.00, N = 3SE +/- 0.01, N = 151.21.21.21. (CC) gcc options: -fopenmp -O3 -lm

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazefaceRun 1Run 2Run 33691215SE +/- 0.48, N = 3SE +/- 0.72, N = 3SE +/- 0.31, N = 49.119.018.70MIN: 2.68 / MAX: 132.97MIN: 2.68 / MAX: 92.41MIN: 2.68 / MAX: 149.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnetRun 1Run 2Run 348121620SE +/- 0.07, N = 3SE +/- 0.75, N = 3SE +/- 0.24, N = 417.0318.1417.23MIN: 13.56 / MAX: 48.96MIN: 5.59 / MAX: 157.6MIN: 13.21 / MAX: 64.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v2Run 1Run 2Run 3816243240SE +/- 0.29, N = 3SE +/- 1.24, N = 3SE +/- 0.25, N = 435.1235.7935.26MIN: 20.41 / MAX: 103.78MIN: 20.38 / MAX: 148.98MIN: 17.61 / MAX: 123.061. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech SynthesisRun 1Run 2Run 3918273645SE +/- 0.53, N = 20SE +/- 0.92, N = 20SE +/- 0.84, N = 2039.6740.5740.871. (CC) gcc options: -O2 -std=c99

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPURun 1Run 2Run 3246810SE +/- 0.48864, N = 15SE +/- 0.02084, N = 3SE +/- 0.02417, N = 38.209947.182957.21289MIN: 3.29MIN: 3.37MIN: 3.381. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring BandwidthRun 1Run 2Run 30.5651.131.6952.262.825SE +/- 0.09513, N = 3SE +/- 0.01108, N = 3SE +/- 0.10589, N = 32.511292.360572.449271. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-FfteRun 1Run 2Run 30.46920.93841.40761.87682.346SE +/- 0.13748, N = 3SE +/- 0.05507, N = 3SE +/- 0.05394, N = 32.073872.085542.078741. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4