1260L v5 Skylake December

Intel Xeon E3-1260L v5 testing with a ASRock E3V5 WS (P7.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2012197-HA-1260LV5SK58&sor.

1260L v5 Skylake DecemberProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)ASRock E3V5 WS (P7.10 BIOS)Intel Xeon E3-1200 v5/E3-15008GB120GB INTEL SSDSC2BW12llvmpipeRealtek ALC892Intel I219-LMUbuntu 20.105.8.0-20-generic (x86_64)GNOME Shell 3.38.0X Server 1.20.8modesetting 1.20.83.3 Mesa 20.1.8 (LLVM 10.0.1 256 bits)GCC 10.2.0ext41024x768GNOME Shell 3.38.14.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc - Thermald 2.3Java Details- 1: OpenJDK Runtime Environment (build 11.0.8+10-post-Ubuntu-0ubuntu1)- 2: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)- 3: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

1260L v5 Skylake Decemberhpcc: G-HPLhpcc: G-Fftehpcc: EP-DGEMMhpcc: G-Ptranshpcc: EP-STREAM Triadhpcc: G-Rand Accesshpcc: Rand Ring Latencyhpcc: Rand Ring Bandwidthhpcc: Max Ping Pong Bandwidthhmmer: Pfam Database Searchlammps: 20k Atomslammps: Rhodopsin Proteinsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDcompress-lz4: 1 - Compression Speedcompress-lz4: 1 - Decompression Speedcompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedcrafty: Elapsed Timeonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Ultra Fastrav1e: 1rav1e: 5rav1e: 6rav1e: 10x265: Bosphorus 4Kx265: Bosphorus 1080pcoremark: CoreMark Size 666 - Iterations Per Secondstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthbuild-ffmpeg: Time To Compilebuild2: Time To Compilenumpy: espeak: Text-To-Speech Synthesisnode-web-tooling: gromacs: Water Benchmarkastcenc: Fastastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivebasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3basis: UASTC Level 2 + RDO Post-Processingsqlite-speedtest: Timed Time - Size 1,000redis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETindigobench: CPU - Bedroomindigobench: CPU - Supercarphpbench: PHP Benchmark Suitesunflow: Global Illumination + Image Synthesisbrl-cad: VGR Performance Metric12361.317472.7725232.131300.801894.847130.023950.263502.6189411177.587123.9252.6222.6230.480.370.60.626684.457828.545.737858.944.687862.973859719.4575711.66434.303432.9891022.764812.636216.562121.479612.57058.272058905.024630.548915.604627.435.426978905.554632.157.417581.811.857.978.195.139.4120.6737.590.3310.9451.2692.6716.3728.99138255.212329698797411038178136.907367.844314.6431.7539.960.4499.4712.3180.26641.4977.31811.59480.060158.285904.51175.2922395940.241951090.961510008.082301015.751703146.700.6591.4856567142.7634208358.312421.9748527.988970.525274.720800.022300.343752.4998910433.259123.8902.6212.6140.480.370.60.626669.407882.945.647838.244.697849.973906029.3378012.25864.314033.0102322.429912.421916.526121.271512.49018.219548951.154658.808951.954651.695.472758959.944652.867.397811.811.857.998.225.149.3620.6637.710.3300.9571.2512.6106.3728.68137667.823061713844010934873136.969362.123314.3232.44610.030.4489.5212.3080.17641.3777.64211.62180.053158.303904.49475.3551524653.461959806.671527380.462132136.121714807.710.6571.4966554542.7814183260.705202.7682932.569470.803344.857640.023580.261532.5961811232.466123.8792.6202.6120.480.370.60.626661.797879.245.737841.144.397859.674044009.4006611.86654.289452.9655122.455012.441016.486921.318612.67008.229438949.814652.318947.594650.305.428168944.054657.307.417271.811.858.008.225.149.4120.7337.960.3320.9501.2642.5686.3828.80137386.878014694691611016030136.816368.092312.0631.48710.050.4509.5812.3080.21641.2277.90711.62280.140158.278903.73576.2451516168.501951427.421518627.292125085.741704781.920.6581.4906554672.77441850OpenBenchmarking.org

HPC Challenge

Test / Class: G-HPL

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1321428425670SE +/- 0.47, N = 3SE +/- 0.83, N = 4SE +/- 1.67, N = 961.3260.7158.311. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ffte

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1320.62381.24761.87142.49523.119SE +/- 0.00451, N = 3SE +/- 0.00546, N = 3SE +/- 0.03153, N = 32.772522.768291.974851. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-DGEMM

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM312816243240SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.55, N = 332.5732.1327.991. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Ptrans

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans3120.18080.36160.54240.72320.904SE +/- 0.00136, N = 3SE +/- 0.00067, N = 3SE +/- 0.00058, N = 30.803340.801890.525271. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: EP-STREAM Triad

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad3121.0932.1863.2794.3725.465SE +/- 0.00099, N = 3SE +/- 0.00206, N = 3SE +/- 0.10428, N = 34.857644.847134.720801. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: G-Random Access

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1320.00540.01080.01620.02160.027SE +/- 0.00038, N = 3SE +/- 0.00040, N = 3SE +/- 0.00091, N = 30.023950.023580.022301. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Latency

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency3120.07730.15460.23190.30920.3865SE +/- 0.00123, N = 3SE +/- 0.00081, N = 3SE +/- 0.04057, N = 30.261530.263500.343751. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Random Ring Bandwidth

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1320.58931.17861.76792.35722.9465SE +/- 0.02632, N = 3SE +/- 0.00727, N = 3SE +/- 0.04035, N = 32.618942.596182.499891. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

HPC Challenge

Test / Class: Max Ping Pong Bandwidth

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth3122K4K6K8K10KSE +/- 130.50, N = 3SE +/- 141.61, N = 3SE +/- 244.43, N = 311232.4711177.5910433.261. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search321306090120150SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.08, N = 3123.88123.89123.931. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LAMMPS Molecular Dynamics Simulator

Model: 20k Atoms

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1230.591.181.772.362.95SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 32.6222.6212.6201. (CXX) g++ options: -O3 -pthread -lm

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.59021.18041.77062.36082.951SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.010, N = 32.6232.6142.6121. (CXX) g++ options: -O3 -pthread -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya3210.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom3210.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets3210.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID3210.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.620.621. (CXX) g++ options: -O3 -pthread

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12314002800420056007000SE +/- 15.48, N = 3SE +/- 3.14, N = 3SE +/- 5.22, N = 36684.456669.406661.791. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed2312K4K6K8K10KSE +/- 5.94, N = 3SE +/- 1.63, N = 3SE +/- 35.72, N = 37882.97879.27828.51. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed3121020304050SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 345.7345.7345.641. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1322K4K6K8K10KSE +/- 2.36, N = 3SE +/- 4.55, N = 3SE +/- 3.51, N = 37858.97841.17838.21. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed2131020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 344.6944.6844.391. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1322K4K6K8K10KSE +/- 2.95, N = 3SE +/- 5.68, N = 3SE +/- 1.88, N = 37862.97859.67849.91. (CC) gcc options: -O3

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time3211.6M3.2M4.8M6.4M8MSE +/- 9014.23, N = 3SE +/- 21649.68, N = 3SE +/- 18336.90, N = 37404400739060273859711. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU2313691215SE +/- 0.01956, N = 3SE +/- 0.01972, N = 3SE +/- 0.01261, N = 39.337809.400669.45757MIN: 8.31MIN: 8.23MIN: 8.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1323691215SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 311.6611.8712.26MIN: 11.49MIN: 11.71MIN: 12.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU3120.97071.94142.91213.88284.8535SE +/- 0.00714, N = 3SE +/- 0.00682, N = 3SE +/- 0.02231, N = 34.289454.303434.31403MIN: 3.79MIN: 3.79MIN: 3.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU3120.67731.35462.03192.70923.3865SE +/- 0.00894, N = 3SE +/- 0.01485, N = 3SE +/- 0.00799, N = 32.965512.989103.01023MIN: 2.87MIN: 2.87MIN: 2.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU231510152025SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 322.4322.4622.76MIN: 22.34MIN: 22.29MIN: 22.591. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU2313691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 312.4212.4412.64MIN: 11.31MIN: 11.36MIN: 11.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU32148121620SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 316.4916.5316.56MIN: 15.32MIN: 15.31MIN: 15.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU231510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 321.2721.3221.48MIN: 20.98MIN: 21.1MIN: 21.271. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU2133691215SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 312.4912.5712.67MIN: 11.04MIN: 11.06MIN: 11.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU231246810SE +/- 0.01580, N = 3SE +/- 0.00792, N = 3SE +/- 0.01822, N = 38.219548.229438.27205MIN: 7.73MIN: 7.73MIN: 7.771. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1322K4K6K8K10KSE +/- 1.58, N = 3SE +/- 24.18, N = 3SE +/- 3.04, N = 38905.028949.818951.15MIN: 8882.16MIN: 8898.57MIN: 8921.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU13210002000300040005000SE +/- 1.40, N = 3SE +/- 5.10, N = 3SE +/- 1.79, N = 34630.544652.314658.80MIN: 4614MIN: 4626.31MIN: 4642.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1322K4K6K8K10KSE +/- 3.12, N = 3SE +/- 5.53, N = 3SE +/- 5.20, N = 38915.608947.598951.95MIN: 8894.86MIN: 8923.56MIN: 8924.861. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU13210002000300040005000SE +/- 7.67, N = 3SE +/- 1.90, N = 3SE +/- 1.59, N = 34627.434650.304651.69MIN: 4596.02MIN: 4632.98MIN: 4632.211. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1321.23142.46283.69424.92566.157SE +/- 0.00422, N = 3SE +/- 0.01688, N = 3SE +/- 0.00939, N = 35.426975.428165.47275MIN: 5.34MIN: 5.33MIN: 5.371. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1322K4K6K8K10KSE +/- 5.25, N = 3SE +/- 6.92, N = 3SE +/- 3.26, N = 38905.558944.058959.94MIN: 8887.6MIN: 8922.43MIN: 8942.771. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 7.69, N = 3SE +/- 3.42, N = 3SE +/- 2.70, N = 34632.154652.864657.30MIN: 4606.26MIN: 4635.53MIN: 4635.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU231246810SE +/- 0.01057, N = 3SE +/- 0.01212, N = 3SE +/- 0.00986, N = 37.397817.417277.41758MIN: 6.58MIN: 6.58MIN: 6.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow3210.40730.81461.22191.62922.0365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.811.811.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium3210.41630.83261.24891.66522.0815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.851.851.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Slow

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow321246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 38.007.997.971. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium321246810SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 38.228.228.191. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast3211.15652.3133.46954.6265.7825SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.145.145.131. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast3123691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 39.419.419.361. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast312510152025SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 320.7320.6720.661. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Kvazaar

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast321918273645SE +/- 0.05, N = 3SE +/- 0.16, N = 3SE +/- 0.42, N = 337.9637.7137.591. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 13120.07470.14940.22410.29880.3735SE +/- 0.000, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.3320.3310.330

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 52310.21530.43060.64590.86121.0765SE +/- 0.005, N = 3SE +/- 0.007, N = 3SE +/- 0.007, N = 30.9570.9500.945

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61320.28550.5710.85651.1421.4275SE +/- 0.006, N = 3SE +/- 0.004, N = 3SE +/- 0.015, N = 31.2691.2641.251

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.6011.2021.8032.4043.005SE +/- 0.004, N = 3SE +/- 0.019, N = 3SE +/- 0.029, N = 32.6712.6102.568

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K321246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.386.376.371. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p132714212835SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.15, N = 328.9928.8028.681. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 994.84, N = 3SE +/- 1278.58, N = 3SE +/- 423.97, N = 3138255.21137667.82137386.881. (CC) gcc options: -O2 -lrt" -lrt

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time2131.5M3M4.5M6M7.5MSE +/- 38176.62, N = 3SE +/- 68860.30, N = 3SE +/- 31151.41, N = 37138440698797469469161. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1322M4M6M8M10MSE +/- 112387.29, N = 3SE +/- 60107.63, N = 3SE +/- 122181.06, N = 3110381781101603010934873

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile312306090120150SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.11, N = 3136.82136.91136.97

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile21380160240320400SE +/- 0.88, N = 3SE +/- 3.60, N = 9SE +/- 4.62, N = 3362.12367.84368.09

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.32, N = 3SE +/- 0.53, N = 3SE +/- 0.27, N = 3314.64314.32312.06

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis312816243240SE +/- 0.17, N = 4SE +/- 0.36, N = 4SE +/- 0.40, N = 531.4931.7532.451. (CC) gcc options: -O2 -std=c99

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark3213691215SE +/- 0.07, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 310.0510.039.961. Nodejs v12.18.2

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark3120.10130.20260.30390.40520.5065SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 30.4500.4490.4481. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 39.479.529.581. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium2313691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 312.3012.3012.311. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough23120406080100SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 380.1780.2180.261. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive321140280420560700SE +/- 0.03, N = 3SE +/- 0.21, N = 3SE +/- 0.12, N = 3641.22641.37641.491. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S12320406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 377.3277.6477.911. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.5911.6211.621. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 221320406080100SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 380.0580.0680.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3312306090120150SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3158.28158.29158.301. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing3212004006008001000SE +/- 0.19, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 3903.74904.49904.511. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.53, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 375.2975.3676.251. (CC) gcc options: -O2 -ldl -lz -lpthread

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 54405.26, N = 12SE +/- 10692.18, N = 3SE +/- 5643.18, N = 32395940.241524653.461516168.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD231400K800K1200K1600K2000KSE +/- 5504.90, N = 3SE +/- 19831.43, N = 3SE +/- 19659.30, N = 31959806.671951427.421951090.961. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH231300K600K900K1200K1500KSE +/- 8440.76, N = 3SE +/- 3955.37, N = 3SE +/- 21655.16, N = 31527380.461518627.291510008.081. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 6176.61, N = 3SE +/- 27951.91, N = 5SE +/- 32022.09, N = 152301015.752132136.122125085.741. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET231400K800K1200K1600K2000KSE +/- 17021.33, N = 3SE +/- 17337.60, N = 8SE +/- 30192.95, N = 121714807.711704781.921703146.701. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1320.14830.29660.44490.59320.7415SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.6590.6580.657

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar2310.33660.67321.00981.34641.683SE +/- 0.003, N = 3SE +/- 0.005, N = 3SE +/- 0.008, N = 31.4961.4901.485

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite132140K280K420K560K700KSE +/- 858.56, N = 3SE +/- 915.50, N = 3SE +/- 578.17, N = 3656714655467655454

Sunflow Rendering System

Global Illumination + Image Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1320.62571.25141.87712.50283.1285SE +/- 0.046, N = 3SE +/- 0.002, N = 3SE +/- 0.032, N = 32.7632.7742.781MIN: 2.57 / MAX: 3.48MIN: 2.61 / MAX: 3.64MIN: 2.61 / MAX: 3.45

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1329K18K27K36K45K4208341850418321. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm


Phoronix Test Suite v10.8.4