1260L v5 Skylake December

Intel Xeon E3-1260L v5 testing with a ASRock E3V5 WS (P7.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012197-HA-1260LV5SK58&sor&grs.

1260L v5 Skylake DecemberProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)ASRock E3V5 WS (P7.10 BIOS)Intel Xeon E3-1200 v5/E3-15008GB120GB INTEL SSDSC2BW12llvmpipeRealtek ALC892Intel I219-LMUbuntu 20.105.8.0-20-generic (x86_64)GNOME Shell 3.38.0X Server 1.20.8modesetting 1.20.83.3 Mesa 20.1.8 (LLVM 10.0.1 256 bits)GCC 10.2.0ext41024x768GNOME Shell 3.38.14.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc - Thermald 2.3Java Details- 1: OpenJDK Runtime Environment (build 11.0.8+10-post-Ubuntu-0ubuntu1)- 2: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)- 3: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

1260L v5 Skylake Decemberhpcc: G-Ptranshpcc: G-Fftehpcc: EP-DGEMMredis: GEThpcc: Max Ping Pong Bandwidthonednn: IP Shapes 3D - f32 - CPUhpcc: Rand Ring Bandwidthrav1e: 10espeak: Text-To-Speech Synthesishpcc: EP-STREAM Triadstockfish: Total Timeonednn: Deconvolution Batch shapes_1d - f32 - CPUbuild2: Time To Compileonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUrav1e: 6onednn: IP Shapes 1D - f32 - CPUrav1e: 5sqlite-speedtest: Timed Time - Size 1,000astcenc: Fastredis: LPUSHx265: Bosphorus 1080pkvazaar: Bosphorus 1080p - Ultra Fastonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUasmfish: 1024 Hash Memory, 26 Depthnode-web-tooling: onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUnumpy: basis: ETC1Sindigobench: CPU - Supercarcompress-lz4: 1 - Decompression Speedcompress-lz4: 9 - Compression Speedsunflow: Global Illumination + Image Synthesisonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUrav1e: 1brl-cad: VGR Performance Metriconednn: IP Shapes 1D - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUkvazaar: Bosphorus 4K - Ultra Fastonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUredis: SADDgromacs: Water Benchmarklammps: Rhodopsin Proteinonednn: Recurrent Neural Network Training - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumcompress-lz4: 1 - Compression Speedkvazaar: Bosphorus 1080p - Very Fastindigobench: CPU - Bedroomonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcompress-lz4: 3 - Decompression Speedcrafty: Elapsed Timebasis: UASTC Level 0compress-lz4: 3 - Compression Speedkvazaar: Bosphorus 4K - Very Fastphpbench: PHP Benchmark Suitecompress-lz4: 9 - Decompression Speedx265: Bosphorus 4Kastcenc: Thoroughbuild-ffmpeg: Time To Compilebasis: UASTC Level 2basis: UASTC Level 2 + RDO Post-Processingastcenc: Mediumlammps: 20k Atomsastcenc: Exhaustivehmmer: Pfam Database Searchbasis: UASTC Level 3kvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Slowsimdjson: DistinctUserIDsimdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyaredis: SETredis: LPOPhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: G-HPL1230.801892.7725232.131302301015.7511177.58711.66432.618942.67131.7534.84713698797412.6362367.8442.9891022.764812.57051.2699.457570.94575.2929.471510008.0828.9937.5921.4796110381789.965.42697314.6477.3181.4857828.544.682.7638.27205138255.2123298905.554630.540.331420834.303434632.159.414627.438905.0216.56211951090.960.4492.6238915.607.978.196684.4520.670.6597.417587858.9738597111.59445.735.136567147862.96.3780.26136.90780.060904.51112.312.622641.49123.925158.2851.851.810.620.60.370.481703146.702395940.240.263500.0239561.317470.525271.9748527.988972132136.1210433.25912.25862.499892.61032.4464.72080713844012.4219362.1233.0102322.429912.49011.2519.337800.95775.3559.521527380.4628.6837.7121.27151093487310.035.47275314.3277.6421.4967882.944.692.7818.21954137667.8230618959.944658.800.330418324.314034652.869.364651.698951.1516.52611959806.670.4482.6148951.957.998.226669.4020.660.6577.397817838.2739060211.62145.645.146554547849.96.3780.17136.96980.053904.49412.302.621641.37123.890158.3031.851.810.620.60.370.481714807.711524653.460.343750.0223058.312420.803342.7682932.569472125085.7411232.46611.86652.596182.56831.4874.85764694691612.4410368.0922.9655122.455012.67001.2649.400660.95076.2459.581518627.2928.8037.9621.31861101603010.055.42816312.0677.9071.4907879.244.392.7748.22943137386.8780148944.054652.310.332418504.289454657.309.414650.308949.8116.48691951427.420.4502.6128947.598.008.226661.7920.730.6587.417277841.1740440011.62245.735.146554677859.66.3880.21136.81680.140903.73512.302.620641.22123.879158.2781.851.810.620.60.370.481704781.921516168.500.261530.0235860.70520OpenBenchmarking.org

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans3120.18080.36160.54240.72320.904SE +/- 0.00136, N = 3SE +/- 0.00067, N = 3SE +/- 0.00058, N = 30.803340.801890.525271. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1320.62381.24761.87142.49523.119SE +/- 0.00451, N = 3SE +/- 0.00546, N = 3SE +/- 0.03153, N = 32.772522.768291.974851. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM312816243240SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.55, N = 332.5732.1327.991. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 6176.61, N = 3SE +/- 27951.91, N = 5SE +/- 32022.09, N = 152301015.752132136.122125085.741. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth3122K4K6K8K10KSE +/- 130.50, N = 3SE +/- 141.61, N = 3SE +/- 244.43, N = 311232.4711177.5910433.261. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1323691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 311.6611.8712.26MIN: 11.49MIN: 11.71MIN: 12.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1320.58931.17861.76792.35722.9465SE +/- 0.02632, N = 3SE +/- 0.00727, N = 3SE +/- 0.04035, N = 32.618942.596182.499891. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.6011.2021.8032.4043.005SE +/- 0.004, N = 3SE +/- 0.019, N = 3SE +/- 0.029, N = 32.6712.6102.568

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis312816243240SE +/- 0.17, N = 4SE +/- 0.36, N = 4SE +/- 0.40, N = 531.4931.7532.451. (CC) gcc options: -O2 -std=c99

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad3121.0932.1863.2794.3725.465SE +/- 0.00099, N = 3SE +/- 0.00206, N = 3SE +/- 0.10428, N = 34.857644.847134.720801. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time2131.5M3M4.5M6M7.5MSE +/- 38176.62, N = 3SE +/- 68860.30, N = 3SE +/- 31151.41, N = 37138440698797469469161. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU2313691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 312.4212.4412.64MIN: 11.31MIN: 11.36MIN: 11.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile21380160240320400SE +/- 0.88, N = 3SE +/- 3.60, N = 9SE +/- 4.62, N = 3362.12367.84368.09

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3120.67731.35462.03192.70923.3865SE +/- 0.00894, N = 3SE +/- 0.01485, N = 3SE +/- 0.00799, N = 32.965512.989103.01023MIN: 2.87MIN: 2.87MIN: 2.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU231510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 322.4322.4622.76MIN: 22.34MIN: 22.29MIN: 22.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU2133691215SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 312.4912.5712.67MIN: 11.04MIN: 11.06MIN: 11.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61320.28550.5710.85651.1421.4275SE +/- 0.006, N = 3SE +/- 0.004, N = 3SE +/- 0.015, N = 31.2691.2641.251

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2313691215SE +/- 0.01956, N = 3SE +/- 0.01972, N = 3SE +/- 0.01261, N = 39.337809.400669.45757MIN: 8.31MIN: 8.23MIN: 8.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 52310.21530.43060.64590.86121.0765SE +/- 0.005, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 30.9570.9500.945

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.53, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 375.2975.3676.251. (CC) gcc options: -O2 -ldl -lz -lpthread

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 39.479.529.581. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH231300K600K900K1200K1500KSE +/- 8440.76, N = 3SE +/- 3955.37, N = 3SE +/- 21655.16, N = 31527380.461518627.291510008.081. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p132714212835SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 328.9928.8028.681. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast321918273645SE +/- 0.05, N = 3SE +/- 0.16, N = 3SE +/- 0.42, N = 337.9637.7137.591. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU231510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 321.2721.3221.48MIN: 20.98MIN: 21.1MIN: 21.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1322M4M6M8M10MSE +/- 112387.29, N = 3SE +/- 60107.63, N = 3SE +/- 122181.06, N = 3110381781101603010934873

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark3213691215SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 310.0510.039.961. Nodejs v12.18.2

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1321.23142.46283.69424.92566.157SE +/- 0.00422, N = 3SE +/- 0.01688, N = 3SE +/- 0.00939, N = 35.426975.428165.47275MIN: 5.34MIN: 5.33MIN: 5.371. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.32, N = 3SE +/- 0.53, N = 3SE +/- 0.27, N = 3314.64314.32312.06

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S12320406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 377.3277.6477.911. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar2310.33660.67321.00981.34641.683SE +/- 0.003, N = 3SE +/- 0.005, N = 3SE +/- 0.008, N = 31.4961.4901.485

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed2312K4K6K8K10KSE +/- 5.94, N = 3SE +/- 1.63, N = 3SE +/- 35.72, N = 37882.97879.27828.51. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed2131020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 344.6944.6844.391. (CC) gcc options: -O3

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1320.62571.25141.87712.50283.1285SE +/- 0.046, N = 3SE +/- 0.002, N = 3SE +/- 0.032, N = 32.7632.7742.781MIN: 2.57 / MAX: 3.48MIN: 2.61 / MAX: 3.64MIN: 2.61 / MAX: 3.45

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU231246810SE +/- 0.01580, N = 3SE +/- 0.00792, N = 3SE +/- 0.01822, N = 38.219548.229438.27205MIN: 7.73MIN: 7.73MIN: 7.771. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 994.84, N = 3SE +/- 1278.58, N = 3SE +/- 423.97, N = 3138255.21137667.82137386.881. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1322K4K6K8K10KSE +/- 5.25, N = 3SE +/- 6.92, N = 3SE +/- 3.26, N = 38905.558944.058959.94MIN: 8887.6MIN: 8922.43MIN: 8942.771. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU13210002000300040005000SE +/- 1.40, N = 3SE +/- 5.10, N = 3SE +/- 1.79, N = 34630.544652.314658.80MIN: 4614MIN: 4626.31MIN: 4642.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 13120.07470.14940.22410.29880.3735SE +/- 0.000, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.3320.3310.330

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1329K18K27K36K45K4208341850418321. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3120.97071.94142.91213.88284.8535SE +/- 0.00714, N = 3SE +/- 0.00682, N = 3SE +/- 0.02231, N = 34.289454.303434.31403MIN: 3.79MIN: 3.79MIN: 3.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 7.69, N = 3SE +/- 3.42, N = 3SE +/- 2.70, N = 34632.154652.864657.30MIN: 4606.26MIN: 4635.53MIN: 4635.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast3123691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 39.419.419.361. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU13210002000300040005000SE +/- 7.67, N = 3SE +/- 1.90, N = 3SE +/- 1.59, N = 34627.434650.304651.69MIN: 4596.02MIN: 4632.98MIN: 4632.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1322K4K6K8K10KSE +/- 1.58, N = 3SE +/- 24.18, N = 3SE +/- 3.04, N = 38905.028949.818951.15MIN: 8882.16MIN: 8898.57MIN: 8921.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU32148121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 316.4916.5316.56MIN: 15.32MIN: 15.31MIN: 15.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD231400K800K1200K1600K2000KSE +/- 5504.90, N = 3SE +/- 19831.43, N = 3SE +/- 19659.30, N = 31959806.671951427.421951090.961. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark3120.10130.20260.30390.40520.5065SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.4500.4490.4481. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.59021.18041.77062.36082.951SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.010, N = 32.6232.6142.6121. (CXX) g++ options: -O3 -pthread -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1322K4K6K8K10KSE +/- 3.12, N = 3SE +/- 5.53, N = 3SE +/- 5.20, N = 38915.608947.598951.95MIN: 8894.86MIN: 8923.56MIN: 8924.861. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow321246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 38.007.997.971. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium321246810SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.228.228.191. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12314002800420056007000SE +/- 15.48, N = 3SE +/- 3.14, N = 3SE +/- 5.22, N = 36684.456669.406661.791. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast312510152025SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 320.7320.6720.661. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1320.14830.29660.44490.59320.7415SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.6590.6580.657

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU231246810SE +/- 0.01057, N = 3SE +/- 0.01212, N = 3SE +/- 0.00986, N = 37.397817.417277.41758MIN: 6.58MIN: 6.58MIN: 6.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1322K4K6K8K10KSE +/- 2.36, N = 3SE +/- 4.55, N = 3SE +/- 3.51, N = 37858.97841.17838.21. (CC) gcc options: -O3

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time3211.6M3.2M4.8M6.4M8MSE +/- 9014.23, N = 3SE +/- 21649.68, N = 3SE +/- 18336.90, N = 37404400739060273859711. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.5911.6211.621. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed3121020304050SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 345.7345.7345.641. (CC) gcc options: -O3

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast3211.15652.3133.46954.6265.7825SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.145.145.131. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite132140K280K420K560K700KSE +/- 858.56, N = 3SE +/- 915.50, N = 3SE +/- 578.17, N = 3656714655467655454

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1322K4K6K8K10KSE +/- 2.95, N = 3SE +/- 5.68, N = 3SE +/- 1.88, N = 37862.97859.67849.91. (CC) gcc options: -O3

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K321246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.386.376.371. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough23120406080100SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 380.1780.2180.261. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile312306090120150SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3136.82136.91136.97

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 221320406080100SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 380.0580.0680.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing3212004006008001000SE +/- 0.19, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 3903.74904.49904.511. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium2313691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 312.3012.3012.311. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1230.591.181.772.362.95SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 32.6222.6212.6201. (CXX) g++ options: -O3 -pthread -lm

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive321140280420560700SE +/- 0.03, N = 3SE +/- 0.21, N = 3SE +/- 0.12, N = 3641.22641.37641.491. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search321306090120150SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 3123.88123.89123.931. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3312306090120150SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3158.28158.29158.301. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium3210.41630.83261.24891.66522.0815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.851.851.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow3210.40730.81461.22191.62922.0365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.811.811.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.620.621. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3210.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET231400K800K1200K1600K2000KSE +/- 17021.33, N = 3SE +/- 17337.60, N = 8SE +/- 30192.95, N = 121714807.711704781.921703146.701. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 54405.26, N = 12SE +/- 10692.18, N = 3SE +/- 5643.18, N = 32395940.241524653.461516168.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency3120.07730.15460.23190.30920.3865SE +/- 0.00123, N = 3SE +/- 0.00081, N = 3SE +/- 0.04057, N = 30.261530.263500.343751. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1320.00540.01080.01620.02160.027SE +/- 0.00038, N = 3SE +/- 0.00040, N = 3SE +/- 0.00091, N = 30.023950.023580.022301. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1321428425670SE +/- 0.47, N = 3SE +/- 0.83, N = 4SE +/- 1.67, N = 961.3260.7158.311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3


Phoronix Test Suite v10.8.4