1260L v5 Skylake December

Intel Xeon E3-1260L v5 testing with a ASRock E3V5 WS (P7.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012197-HA-1260LV5SK58&sro&grr.

1260L v5 Skylake DecemberProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)ASRock E3V5 WS (P7.10 BIOS)Intel Xeon E3-1200 v5/E3-15008GB120GB INTEL SSDSC2BW12llvmpipeRealtek ALC892Intel I219-LMUbuntu 20.105.8.0-20-generic (x86_64)GNOME Shell 3.38.0X Server 1.20.8modesetting 1.20.83.3 Mesa 20.1.8 (LLVM 10.0.1 256 bits)GCC 10.2.0ext41024x768GNOME Shell 3.38.14.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc - Thermald 2.3Java Details- 1: OpenJDK Runtime Environment (build 11.0.8+10-post-Ubuntu-0ubuntu1)- 2: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)- 3: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

1260L v5 Skylake Decemberlammps: 20k Atomshpcc: G-HPLbasis: UASTC Level 2 + RDO Post-Processingastcenc: Exhaustivebuild2: Time To Compilegromacs: Water Benchmarkkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumbrl-cad: VGR Performance Metricnumpy: asmfish: 1024 Hash Memory, 26 Depthbasis: UASTC Level 3build-ffmpeg: Time To Compilehmmer: Pfam Database Searchkvazaar: Bosphorus 4K - Very Fastonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUstockfish: Total Timex265: Bosphorus 4Konednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUastcenc: Thoroughnode-web-tooling: basis: UASTC Level 2basis: ETC1Ssimdjson: Kostyasqlite-speedtest: Timed Time - Size 1,000kvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumcompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedkvazaar: Bosphorus 4K - Ultra Fastrav1e: 5indigobench: CPU - Bedroomindigobench: CPU - Supercarrav1e: 1simdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDrav1e: 6espeak: Text-To-Speech Synthesisrav1e: 10compress-lz4: 1 - Decompression Speedcompress-lz4: 1 - Compression Speedphpbench: PHP Benchmark Suitekvazaar: Bosphorus 1080p - Very Fastsunflow: Global Illumination + Image Synthesisredis: SETredis: GETcoremark: CoreMark Size 666 - Iterations Per Secondcrafty: Elapsed Timeredis: LPOPonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUx265: Bosphorus 1080pastcenc: Mediumkvazaar: Bosphorus 1080p - Ultra Fastonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUbasis: UASTC Level 0redis: LPUSHredis: SADDonednn: IP Shapes 3D - u8s8f32 - CPUastcenc: Fastonednn: IP Shapes 3D - f32 - CPUlammps: Rhodopsin Proteinonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUhpcc: Max Ping Pong Bandwidthhpcc: Rand Ring Bandwidthhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: EP-STREAM Triadhpcc: G-Ptranshpcc: EP-DGEMMhpcc: G-Ffte1232.62261.31747904.511641.49367.8440.4491.811.8542083314.6411038178158.285136.907123.9255.138905.028915.608905.5569879746.374627.434630.544632.1580.269.9680.06077.3180.4875.2927.978.197862.944.687858.945.739.410.9450.6591.4850.3310.370.60.621.26931.7532.6717828.56684.4565671420.672.7631703146.702301015.75138255.21232973859712395940.2412.570512.636228.9912.3137.599.457574.303435.426977.4175811.5941510008.081951090.962.989109.4711.66432.62321.479622.76488.2720516.562111177.5872.618940.263500.023954.847130.8018932.131302.772522.62158.31242904.494641.37362.1230.4481.811.8541832314.3210934873158.303136.969123.8905.148951.158951.958959.9471384406.374651.694658.804652.8680.1710.0380.05377.6420.4875.3557.998.227849.944.697838.245.649.360.9570.6571.4960.3300.370.60.621.25132.4462.6107882.96669.4065545420.662.7811714807.712132136.12137667.82306173906021524653.4612.490112.421928.6812.3037.719.337804.314035.472757.3978111.6211527380.461959806.673.010239.5212.25862.61421.271522.42998.2195416.526110433.2592.499890.343750.022304.720800.5252727.988971.974852.62060.70520903.735641.22368.0920.4501.811.8541850312.0611016030158.278136.816123.8795.148949.818947.598944.0569469166.384650.304652.314657.3080.2110.0580.14077.9070.4876.2458.008.227859.644.397841.145.739.410.9500.6581.4900.3320.370.60.621.26431.4872.5687879.26661.7965546720.732.7741704781.922125085.74137386.87801474044001516168.5012.670012.441028.8012.3037.969.400664.289455.428167.4172711.6221518627.291951427.422.965519.5811.86652.61221.318622.45508.2294316.486911232.4662.596180.261530.023584.857640.8033432.569472.76829OpenBenchmarking.org

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1230.591.181.772.362.95SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 32.6222.6212.6201. (CXX) g++ options: -O3 -pthread -lm

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1231428425670SE +/- 0.47, N = 3SE +/- 1.67, N = 9SE +/- 0.83, N = 461.3258.3160.711. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 0.14, N = 3SE +/- 0.24, N = 3SE +/- 0.19, N = 3904.51904.49903.741. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123140280420560700SE +/- 0.12, N = 3SE +/- 0.21, N = 3SE +/- 0.03, N = 3641.49641.37641.221. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12380160240320400SE +/- 3.60, N = 9SE +/- 0.88, N = 3SE +/- 4.62, N = 3367.84362.12368.09

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.10130.20260.30390.40520.5065SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 30.4490.4480.4501. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.40730.81461.22191.62922.0365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.811.811.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.41630.83261.24891.66522.0815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.851.851.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1239K18K27K36K45K4208341832418501. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.32, N = 3SE +/- 0.53, N = 3SE +/- 0.27, N = 3314.64314.32312.06

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1232M4M6M8M10MSE +/- 112387.29, N = 3SE +/- 122181.06, N = 3SE +/- 60107.63, N = 3110381781093487311016030

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3158.29158.30158.281. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3136.91136.97136.82

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3123.93123.89123.881. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1231.15652.3133.46954.6265.7825SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.135.145.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 1.58, N = 3SE +/- 3.04, N = 3SE +/- 24.18, N = 38905.028951.158949.81MIN: 8882.16MIN: 8921.6MIN: 8898.571. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 3.12, N = 3SE +/- 5.20, N = 3SE +/- 5.53, N = 38915.608951.958947.59MIN: 8894.86MIN: 8924.86MIN: 8923.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 5.25, N = 3SE +/- 3.26, N = 3SE +/- 6.92, N = 38905.558959.948944.05MIN: 8887.6MIN: 8942.77MIN: 8922.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1231.5M3M4.5M6M7.5MSE +/- 68860.30, N = 3SE +/- 38176.62, N = 3SE +/- 31151.41, N = 36987974713844069469161. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.376.376.381. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12310002000300040005000SE +/- 7.67, N = 3SE +/- 1.59, N = 3SE +/- 1.90, N = 34627.434651.694650.30MIN: 4596.02MIN: 4632.21MIN: 4632.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12310002000300040005000SE +/- 1.40, N = 3SE +/- 1.79, N = 3SE +/- 5.10, N = 34630.544658.804652.31MIN: 4614MIN: 4642.81MIN: 4626.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 7.69, N = 3SE +/- 3.42, N = 3SE +/- 2.70, N = 34632.154652.864657.30MIN: 4606.26MIN: 4635.53MIN: 4635.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough12320406080100SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 380.2680.1780.211. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 39.9610.0310.051. Nodejs v12.18.2

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212320406080100SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 380.0680.0580.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S12320406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 377.3277.6477.911. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.53, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 375.2975.3676.251. (CC) gcc options: -O2 -ldl -lz -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 37.977.998.001. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 38.198.228.221. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 2.95, N = 3SE +/- 1.88, N = 3SE +/- 5.68, N = 37862.97849.97859.61. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 344.6844.6944.391. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 2.36, N = 3SE +/- 3.51, N = 3SE +/- 4.55, N = 37858.97838.27841.11. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231020304050SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 345.7345.6445.731. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.419.369.411. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.21530.43060.64590.86121.0765SE +/- 0.007, N = 3SE +/- 0.005, N = 3SE +/- 0.007, N = 30.9450.9570.950

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.14830.29660.44490.59320.7415SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.6590.6570.658

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.33660.67321.00981.34641.683SE +/- 0.008, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 31.4851.4961.490

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.07470.14940.22410.29880.3735SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3310.3300.332

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.620.621. (CXX) g++ options: -O3 -pthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.28550.5710.85651.1421.4275SE +/- 0.006, N = 3SE +/- 0.015, N = 3SE +/- 0.004, N = 31.2691.2511.264

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123816243240SE +/- 0.36, N = 4SE +/- 0.40, N = 5SE +/- 0.17, N = 431.7532.4531.491. (CC) gcc options: -O2 -std=c99

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.6011.2021.8032.4043.005SE +/- 0.004, N = 3SE +/- 0.019, N = 3SE +/- 0.029, N = 32.6712.6102.568

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 35.72, N = 3SE +/- 5.94, N = 3SE +/- 1.63, N = 37828.57882.97879.21. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12314002800420056007000SE +/- 15.48, N = 3SE +/- 3.14, N = 3SE +/- 5.22, N = 36684.456669.406661.791. (CC) gcc options: -O3

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123140K280K420K560K700KSE +/- 858.56, N = 3SE +/- 578.17, N = 3SE +/- 915.50, N = 3656714655454655467

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123510152025SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 320.6720.6620.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.62571.25141.87712.50283.1285SE +/- 0.046, N = 3SE +/- 0.032, N = 3SE +/- 0.002, N = 32.7632.7812.774MIN: 2.57 / MAX: 3.48MIN: 2.61 / MAX: 3.45MIN: 2.61 / MAX: 3.64

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 30192.95, N = 12SE +/- 17021.33, N = 3SE +/- 17337.60, N = 81703146.701714807.711704781.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 6176.61, N = 3SE +/- 27951.91, N = 5SE +/- 32022.09, N = 152301015.752132136.122125085.741. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 994.84, N = 3SE +/- 1278.58, N = 3SE +/- 423.97, N = 3138255.21137667.82137386.881. (CC) gcc options: -O2 -lrt" -lrt

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.6M3.2M4.8M6.4M8MSE +/- 18336.90, N = 3SE +/- 21649.68, N = 3SE +/- 9014.23, N = 37385971739060274044001. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 54405.26, N = 12SE +/- 10692.18, N = 3SE +/- 5643.18, N = 32395940.241524653.461516168.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 312.5712.4912.67MIN: 11.06MIN: 11.04MIN: 11.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.6412.4212.44MIN: 11.41MIN: 11.31MIN: 11.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123714212835SE +/- 0.02, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 328.9928.6828.801. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.3112.3012.301. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123918273645SE +/- 0.42, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 337.5937.7137.961. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.01261, N = 3SE +/- 0.01956, N = 3SE +/- 0.01972, N = 39.457579.337809.40066MIN: 8.28MIN: 8.31MIN: 8.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.97071.94142.91213.88284.8535SE +/- 0.00682, N = 3SE +/- 0.02231, N = 3SE +/- 0.00714, N = 34.303434.314034.28945MIN: 3.79MIN: 3.81MIN: 3.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1231.23142.46283.69424.92566.157SE +/- 0.00422, N = 3SE +/- 0.00939, N = 3SE +/- 0.01688, N = 35.426975.472755.42816MIN: 5.34MIN: 5.37MIN: 5.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.00986, N = 3SE +/- 0.01057, N = 3SE +/- 0.01212, N = 37.417587.397817.41727MIN: 6.58MIN: 6.58MIN: 6.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.5911.6211.621. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 21655.16, N = 3SE +/- 8440.76, N = 3SE +/- 3955.37, N = 31510008.081527380.461518627.291. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 19659.30, N = 3SE +/- 5504.90, N = 3SE +/- 19831.43, N = 31951090.961959806.671951427.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.67731.35462.03192.70923.3865SE +/- 0.01485, N = 3SE +/- 0.00799, N = 3SE +/- 0.00894, N = 32.989103.010232.96551MIN: 2.87MIN: 2.92MIN: 2.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 39.479.529.581. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.6612.2611.87MIN: 11.49MIN: 12.09MIN: 11.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.59021.18041.77062.36082.951SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.010, N = 32.6232.6142.6121. (CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 321.4821.2721.32MIN: 21.27MIN: 20.98MIN: 21.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 322.7622.4322.46MIN: 22.59MIN: 22.34MIN: 22.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01822, N = 3SE +/- 0.01580, N = 3SE +/- 0.00792, N = 38.272058.219548.22943MIN: 7.77MIN: 7.73MIN: 7.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12348121620SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.5616.5316.49MIN: 15.42MIN: 15.31MIN: 15.321. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1232K4K6K8K10KSE +/- 141.61, N = 3SE +/- 244.43, N = 3SE +/- 130.50, N = 311177.5910433.2611232.471. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.58931.17861.76792.35722.9465SE +/- 0.02632, N = 3SE +/- 0.04035, N = 3SE +/- 0.00727, N = 32.618942.499892.596181. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.07730.15460.23190.30920.3865SE +/- 0.00081, N = 3SE +/- 0.04057, N = 3SE +/- 0.00123, N = 30.263500.343750.261531. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00540.01080.01620.02160.027SE +/- 0.00038, N = 3SE +/- 0.00091, N = 3SE +/- 0.00040, N = 30.023950.022300.023581. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1231.0932.1863.2794.3725.465SE +/- 0.00206, N = 3SE +/- 0.10428, N = 3SE +/- 0.00099, N = 34.847134.720804.857641. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.18080.36160.54240.72320.904SE +/- 0.00067, N = 3SE +/- 0.00058, N = 3SE +/- 0.00136, N = 30.801890.525270.803341. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM123816243240SE +/- 0.09, N = 3SE +/- 0.55, N = 3SE +/- 0.06, N = 332.1327.9932.571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1230.62381.24761.87142.49523.119SE +/- 0.00451, N = 3SE +/- 0.03153, N = 3SE +/- 0.00546, N = 32.772521.974852.768291. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4