1260L v5 Skylake December

Intel Xeon E3-1260L v5 testing with a ASRock E3V5 WS (P7.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012197-HA-1260LV5SK58&sro&grs.

1260L v5 Skylake DecemberProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)ASRock E3V5 WS (P7.10 BIOS)Intel Xeon E3-1200 v5/E3-15008GB120GB INTEL SSDSC2BW12llvmpipeRealtek ALC892Intel I219-LMUbuntu 20.105.8.0-20-generic (x86_64)GNOME Shell 3.38.0X Server 1.20.8modesetting 1.20.83.3 Mesa 20.1.8 (LLVM 10.0.1 256 bits)GCC 10.2.0ext41024x768GNOME Shell 3.38.14.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc - Thermald 2.3Java Details- 1: OpenJDK Runtime Environment (build 11.0.8+10-post-Ubuntu-0ubuntu1)- 2: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)- 3: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

1260L v5 Skylake Decemberhpcc: G-Ptranshpcc: G-Fftehpcc: EP-DGEMMredis: GEThpcc: Max Ping Pong Bandwidthonednn: IP Shapes 3D - f32 - CPUhpcc: Rand Ring Bandwidthrav1e: 10espeak: Text-To-Speech Synthesishpcc: EP-STREAM Triadstockfish: Total Timeonednn: Deconvolution Batch shapes_1d - f32 - CPUbuild2: Time To Compileonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUrav1e: 6onednn: IP Shapes 1D - f32 - CPUrav1e: 5sqlite-speedtest: Timed Time - Size 1,000astcenc: Fastredis: LPUSHx265: Bosphorus 1080pkvazaar: Bosphorus 1080p - Ultra Fastonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUasmfish: 1024 Hash Memory, 26 Depthnode-web-tooling: onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUnumpy: basis: ETC1Sindigobench: CPU - Supercarcompress-lz4: 1 - Decompression Speedcompress-lz4: 9 - Compression Speedsunflow: Global Illumination + Image Synthesisonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUrav1e: 1brl-cad: VGR Performance Metriconednn: IP Shapes 1D - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUkvazaar: Bosphorus 4K - Ultra Fastonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUredis: SADDgromacs: Water Benchmarklammps: Rhodopsin Proteinonednn: Recurrent Neural Network Training - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumcompress-lz4: 1 - Compression Speedkvazaar: Bosphorus 1080p - Very Fastindigobench: CPU - Bedroomonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcompress-lz4: 3 - Decompression Speedcrafty: Elapsed Timebasis: UASTC Level 0compress-lz4: 3 - Compression Speedkvazaar: Bosphorus 4K - Very Fastphpbench: PHP Benchmark Suitecompress-lz4: 9 - Decompression Speedx265: Bosphorus 4Kastcenc: Thoroughbuild-ffmpeg: Time To Compilebasis: UASTC Level 2basis: UASTC Level 2 + RDO Post-Processingastcenc: Mediumlammps: 20k Atomsastcenc: Exhaustivehmmer: Pfam Database Searchbasis: UASTC Level 3kvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Slowsimdjson: DistinctUserIDsimdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyaredis: SETredis: LPOPhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: G-HPL1230.801892.7725232.131302301015.7511177.58711.66432.618942.67131.7534.84713698797412.6362367.8442.9891022.764812.57051.2699.457570.94575.2929.471510008.0828.9937.5921.4796110381789.965.42697314.6477.3181.4857828.544.682.7638.27205138255.2123298905.554630.540.331420834.303434632.159.414627.438905.0216.56211951090.960.4492.6238915.607.978.196684.4520.670.6597.417587858.9738597111.59445.735.136567147862.96.3780.26136.90780.060904.51112.312.622641.49123.925158.2851.851.810.620.60.370.481703146.702395940.240.263500.0239561.317470.525271.9748527.988972132136.1210433.25912.25862.499892.61032.4464.72080713844012.4219362.1233.0102322.429912.49011.2519.337800.95775.3559.521527380.4628.6837.7121.27151093487310.035.47275314.3277.6421.4967882.944.692.7818.21954137667.8230618959.944658.800.330418324.314034652.869.364651.698951.1516.52611959806.670.4482.6148951.957.998.226669.4020.660.6577.397817838.2739060211.62145.645.146554547849.96.3780.17136.96980.053904.49412.302.621641.37123.890158.3031.851.810.620.60.370.481714807.711524653.460.343750.0223058.312420.803342.7682932.569472125085.7411232.46611.86652.596182.56831.4874.85764694691612.4410368.0922.9655122.455012.67001.2649.400660.95076.2459.581518627.2928.8037.9621.31861101603010.055.42816312.0677.9071.4907879.244.392.7748.22943137386.8780148944.054652.310.332418504.289454657.309.414650.308949.8116.48691951427.420.4502.6128947.598.008.226661.7920.730.6587.417277841.1740440011.62245.735.146554677859.66.3880.21136.81680.140903.73512.302.620641.22123.879158.2781.851.810.620.60.370.481704781.921516168.500.261530.0235860.70520OpenBenchmarking.org

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.18080.36160.54240.72320.904SE +/- 0.00067, N = 3SE +/- 0.00058, N = 3SE +/- 0.00136, N = 30.801890.525270.803341. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1230.62381.24761.87142.49523.119SE +/- 0.00451, N = 3SE +/- 0.03153, N = 3SE +/- 0.00546, N = 32.772521.974852.768291. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM123816243240SE +/- 0.09, N = 3SE +/- 0.55, N = 3SE +/- 0.06, N = 332.1327.9932.571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 6176.61, N = 3SE +/- 27951.91, N = 5SE +/- 32022.09, N = 152301015.752132136.122125085.741. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1232K4K6K8K10KSE +/- 141.61, N = 3SE +/- 244.43, N = 3SE +/- 130.50, N = 311177.5910433.2611232.471. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.6612.2611.87MIN: 11.49MIN: 12.09MIN: 11.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.58931.17861.76792.35722.9465SE +/- 0.02632, N = 3SE +/- 0.04035, N = 3SE +/- 0.00727, N = 32.618942.499892.596181. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.6011.2021.8032.4043.005SE +/- 0.004, N = 3SE +/- 0.019, N = 3SE +/- 0.029, N = 32.6712.6102.568

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123816243240SE +/- 0.36, N = 4SE +/- 0.40, N = 5SE +/- 0.17, N = 431.7532.4531.491. (CC) gcc options: -O2 -std=c99

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1231.0932.1863.2794.3725.465SE +/- 0.00206, N = 3SE +/- 0.10428, N = 3SE +/- 0.00099, N = 34.847134.720804.857641. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1231.5M3M4.5M6M7.5MSE +/- 68860.30, N = 3SE +/- 38176.62, N = 3SE +/- 31151.41, N = 36987974713844069469161. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.6412.4212.44MIN: 11.41MIN: 11.31MIN: 11.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12380160240320400SE +/- 3.60, N = 9SE +/- 0.88, N = 3SE +/- 4.62, N = 3367.84362.12368.09

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.67731.35462.03192.70923.3865SE +/- 0.01485, N = 3SE +/- 0.00799, N = 3SE +/- 0.00894, N = 32.989103.010232.96551MIN: 2.87MIN: 2.92MIN: 2.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 322.7622.4322.46MIN: 22.59MIN: 22.34MIN: 22.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 312.5712.4912.67MIN: 11.06MIN: 11.04MIN: 11.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.28550.5710.85651.1421.4275SE +/- 0.006, N = 3SE +/- 0.015, N = 3SE +/- 0.004, N = 31.2691.2511.264

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.01261, N = 3SE +/- 0.01956, N = 3SE +/- 0.01972, N = 39.457579.337809.40066MIN: 8.28MIN: 8.31MIN: 8.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.21530.43060.64590.86121.0765SE +/- 0.007, N = 3SE +/- 0.005, N = 3SE +/- 0.007, N = 30.9450.9570.950

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.53, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 375.2975.3676.251. (CC) gcc options: -O2 -ldl -lz -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 39.479.529.581. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 21655.16, N = 3SE +/- 8440.76, N = 3SE +/- 3955.37, N = 31510008.081527380.461518627.291. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123714212835SE +/- 0.02, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 328.9928.6828.801. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123918273645SE +/- 0.42, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 337.5937.7137.961. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 321.4821.2721.32MIN: 21.27MIN: 20.98MIN: 21.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1232M4M6M8M10MSE +/- 112387.29, N = 3SE +/- 122181.06, N = 3SE +/- 60107.63, N = 3110381781093487311016030

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 39.9610.0310.051. Nodejs v12.18.2

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1231.23142.46283.69424.92566.157SE +/- 0.00422, N = 3SE +/- 0.00939, N = 3SE +/- 0.01688, N = 35.426975.472755.42816MIN: 5.34MIN: 5.37MIN: 5.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.32, N = 3SE +/- 0.53, N = 3SE +/- 0.27, N = 3314.64314.32312.06

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S12320406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 377.3277.6477.911. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.33660.67321.00981.34641.683SE +/- 0.008, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 31.4851.4961.490

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 35.72, N = 3SE +/- 5.94, N = 3SE +/- 1.63, N = 37828.57882.97879.21. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 344.6844.6944.391. (CC) gcc options: -O3

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.62571.25141.87712.50283.1285SE +/- 0.046, N = 3SE +/- 0.032, N = 3SE +/- 0.002, N = 32.7632.7812.774MIN: 2.57 / MAX: 3.48MIN: 2.61 / MAX: 3.45MIN: 2.61 / MAX: 3.64

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01822, N = 3SE +/- 0.01580, N = 3SE +/- 0.00792, N = 38.272058.219548.22943MIN: 7.77MIN: 7.73MIN: 7.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 994.84, N = 3SE +/- 1278.58, N = 3SE +/- 423.97, N = 3138255.21137667.82137386.881. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 5.25, N = 3SE +/- 3.26, N = 3SE +/- 6.92, N = 38905.558959.948944.05MIN: 8887.6MIN: 8942.77MIN: 8922.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12310002000300040005000SE +/- 1.40, N = 3SE +/- 1.79, N = 3SE +/- 5.10, N = 34630.544658.804652.31MIN: 4614MIN: 4642.81MIN: 4626.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.07470.14940.22410.29880.3735SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3310.3300.332

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1239K18K27K36K45K4208341832418501. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.97071.94142.91213.88284.8535SE +/- 0.00682, N = 3SE +/- 0.02231, N = 3SE +/- 0.00714, N = 34.303434.314034.28945MIN: 3.79MIN: 3.81MIN: 3.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 7.69, N = 3SE +/- 3.42, N = 3SE +/- 2.70, N = 34632.154652.864657.30MIN: 4606.26MIN: 4635.53MIN: 4635.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.419.369.411. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12310002000300040005000SE +/- 7.67, N = 3SE +/- 1.59, N = 3SE +/- 1.90, N = 34627.434651.694650.30MIN: 4596.02MIN: 4632.21MIN: 4632.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 1.58, N = 3SE +/- 3.04, N = 3SE +/- 24.18, N = 38905.028951.158949.81MIN: 8882.16MIN: 8921.6MIN: 8898.571. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12348121620SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.5616.5316.49MIN: 15.42MIN: 15.31MIN: 15.321. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 19659.30, N = 3SE +/- 5504.90, N = 3SE +/- 19831.43, N = 31951090.961959806.671951427.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.10130.20260.30390.40520.5065SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 30.4490.4480.4501. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.59021.18041.77062.36082.951SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.010, N = 32.6232.6142.6121. (CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 3.12, N = 3SE +/- 5.20, N = 3SE +/- 5.53, N = 38915.608951.958947.59MIN: 8894.86MIN: 8924.86MIN: 8923.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 37.977.998.001. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 38.198.228.221. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12314002800420056007000SE +/- 15.48, N = 3SE +/- 3.14, N = 3SE +/- 5.22, N = 36684.456669.406661.791. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123510152025SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 320.6720.6620.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.14830.29660.44490.59320.7415SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.6590.6570.658

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.00986, N = 3SE +/- 0.01057, N = 3SE +/- 0.01212, N = 37.417587.397817.41727MIN: 6.58MIN: 6.58MIN: 6.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 2.36, N = 3SE +/- 3.51, N = 3SE +/- 4.55, N = 37858.97838.27841.11. (CC) gcc options: -O3

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.6M3.2M4.8M6.4M8MSE +/- 18336.90, N = 3SE +/- 21649.68, N = 3SE +/- 9014.23, N = 37385971739060274044001. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.5911.6211.621. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231020304050SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 345.7345.6445.731. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1231.15652.3133.46954.6265.7825SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.135.145.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123140K280K420K560K700KSE +/- 858.56, N = 3SE +/- 578.17, N = 3SE +/- 915.50, N = 3656714655454655467

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 2.95, N = 3SE +/- 1.88, N = 3SE +/- 5.68, N = 37862.97849.97859.61. (CC) gcc options: -O3

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.376.376.381. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough12320406080100SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 380.2680.1780.211. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3136.91136.97136.82

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212320406080100SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 380.0680.0580.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 0.14, N = 3SE +/- 0.24, N = 3SE +/- 0.19, N = 3904.51904.49903.741. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.3112.3012.301. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1230.591.181.772.362.95SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 32.6222.6212.6201. (CXX) g++ options: -O3 -pthread -lm

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123140280420560700SE +/- 0.12, N = 3SE +/- 0.21, N = 3SE +/- 0.03, N = 3641.49641.37641.221. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3123.93123.89123.881. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3158.29158.30158.281. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.41630.83261.24891.66522.0815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.851.851.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.40730.81461.22191.62922.0365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.811.811.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.620.621. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 30192.95, N = 12SE +/- 17021.33, N = 3SE +/- 17337.60, N = 81703146.701714807.711704781.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 54405.26, N = 12SE +/- 10692.18, N = 3SE +/- 5643.18, N = 32395940.241524653.461516168.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.07730.15460.23190.30920.3865SE +/- 0.00081, N = 3SE +/- 0.04057, N = 3SE +/- 0.00123, N = 30.263500.343750.261531. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00540.01080.01620.02160.027SE +/- 0.00038, N = 3SE +/- 0.00091, N = 3SE +/- 0.00040, N = 30.023950.022300.023581. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1231428425670SE +/- 0.47, N = 3SE +/- 1.67, N = 9SE +/- 0.83, N = 461.3258.3160.711. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4