1260L v5 Skylake December

Intel Xeon E3-1260L v5 testing with a ASRock E3V5 WS (P7.10 BIOS) and llvmpipe on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012197-HA-1260LV5SK58
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Chess Test Suite 3 Tests
Timed Code Compilation 2 Tests
C/C++ Compiler Tests 9 Tests
CPU Massive 12 Tests
Creator Workloads 9 Tests
Database Test Suite 2 Tests
Encoding 3 Tests
Fortran Tests 2 Tests
Game Development 2 Tests
HPC - High Performance Computing 6 Tests
Machine Learning 2 Tests
Molecular Dynamics 2 Tests
MPI Benchmarks 3 Tests
Multi-Core 12 Tests
NVIDIA GPU Compute 2 Tests
OpenMPI Tests 3 Tests
Programmer / Developer System Benchmarks 6 Tests
Scientific Computing 4 Tests
Server 5 Tests
Server CPU Tests 7 Tests
Single-Threaded 4 Tests
Texture Compression 2 Tests
Video Encoding 3 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
December 17 2020
  9 Hours, 48 Minutes
2
December 18 2020
  10 Hours, 23 Minutes
3
December 18 2020
  9 Hours, 25 Minutes
Invert Hiding All Results Option
  9 Hours, 52 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


1260L v5 Skylake DecemberProcessorMotherboardChipsetMemoryDiskGraphicsAudioNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLCompilerFile-SystemScreen Resolution123Intel Xeon E3-1260L v5 @ 3.90GHz (4 Cores / 8 Threads)ASRock E3V5 WS (P7.10 BIOS)Intel Xeon E3-1200 v5/E3-15008GB120GB INTEL SSDSC2BW12llvmpipeRealtek ALC892Intel I219-LMUbuntu 20.105.8.0-20-generic (x86_64)GNOME Shell 3.38.0X Server 1.20.8modesetting 1.20.83.3 Mesa 20.1.8 (LLVM 10.0.1 256 bits)GCC 10.2.0ext41024x768GNOME Shell 3.38.14.5 Mesa 20.2.1 (LLVM 11.0.0 256 bits)OpenBenchmarking.orgCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xdc - Thermald 2.3Java Details- 1: OpenJDK Runtime Environment (build 11.0.8+10-post-Ubuntu-0ubuntu1)- 2: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)- 3: OpenJDK Runtime Environment (build 11.0.9.1+1-Ubuntu-0ubuntu1.20.10)Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Mitigation of Microcode + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

123Result OverviewPhoronix Test Suite100%104%109%113%117%HPC ChallengeRediseSpeak-NG Speech EngineStockfishBuild2SQLite SpeedtestasmFishNode.js V8 Web Tooling Benchmarkrav1eNumpy BenchmarkSunflow Rendering SystemCoremarkBRL-CADx265GROMACSKvazaarCraftyASTC EncoderLAMMPS Molecular Dynamics SimulatorIndigoBenchBasis UniversalPHPBenchoneDNNTimed FFmpeg CompilationLZ4 CompressionTimed HMMer Searchsimdjson

1260L v5 Skylake Decemberhpcc: G-Ptranshpcc: G-Fftehpcc: EP-DGEMMredis: GEThpcc: Max Ping Pong Bandwidthonednn: IP Shapes 3D - f32 - CPUhpcc: Rand Ring Bandwidthrav1e: 10espeak: Text-To-Speech Synthesishpcc: EP-STREAM Triadstockfish: Total Timeonednn: Deconvolution Batch shapes_1d - f32 - CPUbuild2: Time To Compileonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUrav1e: 6onednn: IP Shapes 1D - f32 - CPUrav1e: 5sqlite-speedtest: Timed Time - Size 1,000astcenc: Fastredis: LPUSHx265: Bosphorus 1080pkvazaar: Bosphorus 1080p - Ultra Fastonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUasmfish: 1024 Hash Memory, 26 Depthnode-web-tooling: onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUnumpy: basis: ETC1Sindigobench: CPU - Supercarcompress-lz4: 1 - Decompression Speedcompress-lz4: 9 - Compression Speedsunflow: Global Illumination + Image Synthesisonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUcoremark: CoreMark Size 666 - Iterations Per Secondonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUrav1e: 1brl-cad: VGR Performance Metriconednn: IP Shapes 1D - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUkvazaar: Bosphorus 4K - Ultra Fastonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUredis: SADDgromacs: Water Benchmarklammps: Rhodopsin Proteinonednn: Recurrent Neural Network Training - u8s8f32 - CPUkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumcompress-lz4: 1 - Compression Speedkvazaar: Bosphorus 1080p - Very Fastindigobench: CPU - Bedroomonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUcompress-lz4: 3 - Decompression Speedcrafty: Elapsed Timebasis: UASTC Level 0compress-lz4: 3 - Compression Speedkvazaar: Bosphorus 4K - Very Fastphpbench: PHP Benchmark Suitecompress-lz4: 9 - Decompression Speedx265: Bosphorus 4Kastcenc: Thoroughbuild-ffmpeg: Time To Compilebasis: UASTC Level 2basis: UASTC Level 2 + RDO Post-Processingastcenc: Mediumlammps: 20k Atomsastcenc: Exhaustivehmmer: Pfam Database Searchbasis: UASTC Level 3kvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Slowsimdjson: DistinctUserIDsimdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyaredis: SETredis: LPOPhpcc: Rand Ring Latencyhpcc: G-Rand Accesshpcc: G-HPL1230.801892.7725232.131302301015.7511177.58711.66432.618942.67131.7534.84713698797412.6362367.8442.9891022.764812.57051.2699.457570.94575.2929.471510008.0828.9937.5921.4796110381789.965.42697314.6477.3181.4857828.544.682.7638.27205138255.2123298905.554630.540.331420834.303434632.159.414627.438905.0216.56211951090.960.4492.6238915.607.978.196684.4520.670.6597.417587858.9738597111.59445.735.136567147862.96.3780.26136.90780.060904.51112.312.622641.49123.925158.2851.851.810.620.60.370.481703146.702395940.240.263500.0239561.317470.525271.9748527.988972132136.1210433.25912.25862.499892.61032.4464.72080713844012.4219362.1233.0102322.429912.49011.2519.337800.95775.3559.521527380.4628.6837.7121.27151093487310.035.47275314.3277.6421.4967882.944.692.7818.21954137667.8230618959.944658.800.330418324.314034652.869.364651.698951.1516.52611959806.670.4482.6148951.957.998.226669.4020.660.6577.397817838.2739060211.62145.645.146554547849.96.3780.17136.96980.053904.49412.302.621641.37123.890158.3031.851.810.620.60.370.481714807.711524653.460.343750.0223058.312420.803342.7682932.569472125085.7411232.46611.86652.596182.56831.4874.85764694691612.4410368.0922.9655122.455012.67001.2649.400660.95076.2459.581518627.2928.8037.9621.31861101603010.055.42816312.0677.9071.4907879.244.392.7748.22943137386.8780148944.054652.310.332418504.289454657.309.414650.308949.8116.48691951427.420.4502.6128947.598.008.226661.7920.730.6587.417277841.1740440011.62245.735.146554677859.66.3880.21136.81680.140903.73512.302.620641.22123.879158.2781.851.810.620.60.370.481704781.921516168.500.261530.0235860.70520OpenBenchmarking.org

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans1230.18080.36160.54240.72320.904SE +/- 0.00067, N = 3SE +/- 0.00058, N = 3SE +/- 0.00136, N = 30.801890.525270.803341. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ptrans123246810Min: 0.8 / Avg: 0.8 / Max: 0.8Min: 0.52 / Avg: 0.53 / Max: 0.53Min: 0.8 / Avg: 0.8 / Max: 0.811. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte1230.62381.24761.87142.49523.119SE +/- 0.00451, N = 3SE +/- 0.03153, N = 3SE +/- 0.00546, N = 32.772521.974852.768291. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-Ffte123246810Min: 2.77 / Avg: 2.77 / Max: 2.78Min: 1.92 / Avg: 1.97 / Max: 2.03Min: 2.76 / Avg: 2.77 / Max: 2.781. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM123816243240SE +/- 0.09, N = 3SE +/- 0.55, N = 3SE +/- 0.06, N = 332.1327.9932.571. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: EP-DGEMM123714212835Min: 32.02 / Avg: 32.13 / Max: 32.31Min: 27.24 / Avg: 27.99 / Max: 29.06Min: 32.46 / Avg: 32.57 / Max: 32.651. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123500K1000K1500K2000K2500KSE +/- 6176.61, N = 3SE +/- 27951.91, N = 5SE +/- 32022.09, N = 152301015.752132136.122125085.741. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET123400K800K1200K1600K2000KMin: 2289061.75 / Avg: 2301015.75 / Max: 2309690.5Min: 2041534.62 / Avg: 2132136.12 / Max: 2207788Min: 1758087.88 / Avg: 2125085.74 / Max: 22278131. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1232K4K6K8K10KSE +/- 141.61, N = 3SE +/- 244.43, N = 3SE +/- 130.50, N = 311177.5910433.2611232.471. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgMB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Max Ping Pong Bandwidth1232K4K6K8K10KMin: 10911.28 / Avg: 11177.59 / Max: 11394.26Min: 10184.21 / Avg: 10433.26 / Max: 10922.1Min: 11084.82 / Avg: 11232.47 / Max: 11492.671. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 311.6612.2611.87MIN: 11.49MIN: 12.09MIN: 11.711. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU12348121620Min: 11.64 / Avg: 11.66 / Max: 11.69Min: 12.24 / Avg: 12.26 / Max: 12.28Min: 11.83 / Avg: 11.87 / Max: 11.891. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth1230.58931.17861.76792.35722.9465SE +/- 0.02632, N = 3SE +/- 0.04035, N = 3SE +/- 0.00727, N = 32.618942.499892.596181. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Bandwidth123246810Min: 2.57 / Avg: 2.62 / Max: 2.66Min: 2.46 / Avg: 2.5 / Max: 2.58Min: 2.59 / Avg: 2.6 / Max: 2.611. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.6011.2021.8032.4043.005SE +/- 0.004, N = 3SE +/- 0.019, N = 3SE +/- 0.029, N = 32.6712.6102.568
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 10123246810Min: 2.67 / Avg: 2.67 / Max: 2.68Min: 2.58 / Avg: 2.61 / Max: 2.64Min: 2.52 / Avg: 2.57 / Max: 2.62

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123816243240SE +/- 0.36, N = 4SE +/- 0.40, N = 5SE +/- 0.17, N = 431.7532.4531.491. (CC) gcc options: -O2 -std=c99
OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis123714212835Min: 31.25 / Avg: 31.75 / Max: 32.81Min: 31.78 / Avg: 32.45 / Max: 33.91Min: 31.07 / Avg: 31.49 / Max: 31.91. (CC) gcc options: -O2 -std=c99

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad1231.0932.1863.2794.3725.465SE +/- 0.00206, N = 3SE +/- 0.10428, N = 3SE +/- 0.00099, N = 34.847134.720804.857641. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGB/s, More Is BetterHPC Challenge 1.5.0Test / Class: EP-STREAM Triad123246810Min: 4.84 / Avg: 4.85 / Max: 4.85Min: 4.52 / Avg: 4.72 / Max: 4.85Min: 4.86 / Avg: 4.86 / Max: 4.861. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1231.5M3M4.5M6M7.5MSE +/- 68860.30, N = 3SE +/- 38176.62, N = 3SE +/- 31151.41, N = 36987974713844069469161. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver
OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1231.2M2.4M3.6M4.8M6MMin: 6906391 / Avg: 6987974 / Max: 7124856Min: 7070213 / Avg: 7138440 / Max: 7202238Min: 6915643 / Avg: 6946916.33 / Max: 70092191. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1233691215SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 312.6412.4212.44MIN: 11.41MIN: 11.31MIN: 11.361. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU12348121620Min: 12.53 / Avg: 12.64 / Max: 12.8Min: 12.4 / Avg: 12.42 / Max: 12.44Min: 12.43 / Avg: 12.44 / Max: 12.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Build2

This test profile measures the time to bootstrap/install the build2 C++ build toolchain from source. Build2 is a cross-platform build toolchain for C/C++ code and features Cargo-like features. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12380160240320400SE +/- 3.60, N = 9SE +/- 0.88, N = 3SE +/- 4.62, N = 3367.84362.12368.09
OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12370140210280350Min: 361.25 / Avg: 367.84 / Max: 395.26Min: 360.83 / Avg: 362.12 / Max: 363.82Min: 362.13 / Avg: 368.09 / Max: 377.19

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.67731.35462.03192.70923.3865SE +/- 0.01485, N = 3SE +/- 0.00799, N = 3SE +/- 0.00894, N = 32.989103.010232.96551MIN: 2.87MIN: 2.92MIN: 2.871. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU123246810Min: 2.97 / Avg: 2.99 / Max: 3.02Min: 3 / Avg: 3.01 / Max: 3.02Min: 2.95 / Avg: 2.97 / Max: 2.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 322.7622.4322.46MIN: 22.59MIN: 22.34MIN: 22.291. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123510152025Min: 22.67 / Avg: 22.76 / Max: 22.81Min: 22.41 / Avg: 22.43 / Max: 22.47Min: 22.42 / Avg: 22.46 / Max: 22.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1233691215SE +/- 0.10, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 312.5712.4912.67MIN: 11.06MIN: 11.04MIN: 11.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU12348121620Min: 12.43 / Avg: 12.57 / Max: 12.76Min: 12.43 / Avg: 12.49 / Max: 12.52Min: 12.5 / Avg: 12.67 / Max: 12.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.28550.5710.85651.1421.4275SE +/- 0.006, N = 3SE +/- 0.015, N = 3SE +/- 0.004, N = 31.2691.2511.264
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 6123246810Min: 1.26 / Avg: 1.27 / Max: 1.28Min: 1.23 / Avg: 1.25 / Max: 1.28Min: 1.26 / Avg: 1.26 / Max: 1.27

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215SE +/- 0.01261, N = 3SE +/- 0.01956, N = 3SE +/- 0.01972, N = 39.457579.337809.40066MIN: 8.28MIN: 8.31MIN: 8.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1233691215Min: 9.44 / Avg: 9.46 / Max: 9.48Min: 9.3 / Avg: 9.34 / Max: 9.36Min: 9.37 / Avg: 9.4 / Max: 9.441. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.21530.43060.64590.86121.0765SE +/- 0.007, N = 3SE +/- 0.005, N = 3SE +/- 0.007, N = 30.9450.9570.950
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 5123246810Min: 0.93 / Avg: 0.95 / Max: 0.96Min: 0.95 / Avg: 0.96 / Max: 0.97Min: 0.94 / Avg: 0.95 / Max: 0.96

SQLite Speedtest

This is a benchmark of SQLite's speedtest1 benchmark program with an increased problem size of 1,000. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00012320406080100SE +/- 0.53, N = 3SE +/- 0.24, N = 3SE +/- 0.14, N = 375.2975.3676.251. (CC) gcc options: -O2 -ldl -lz -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,0001231530456075Min: 74.67 / Avg: 75.29 / Max: 76.34Min: 75.11 / Avg: 75.35 / Max: 75.84Min: 76.02 / Avg: 76.24 / Max: 76.511. (CC) gcc options: -O2 -ldl -lz -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 39.479.529.581. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast1233691215Min: 9.3 / Avg: 9.47 / Max: 9.57Min: 9.39 / Avg: 9.52 / Max: 9.61Min: 9.56 / Avg: 9.58 / Max: 9.621. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KSE +/- 21655.16, N = 3SE +/- 8440.76, N = 3SE +/- 3955.37, N = 31510008.081527380.461518627.291. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH123300K600K900K1200K1500KMin: 1466698 / Avg: 1510008.08 / Max: 1531785.62Min: 1515781.75 / Avg: 1527380.46 / Max: 1543802.5Min: 1510719 / Avg: 1518627.29 / Max: 1522751.881. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123714212835SE +/- 0.02, N = 3SE +/- 0.15, N = 3SE +/- 0.03, N = 328.9928.6828.801. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080p123612182430Min: 28.95 / Avg: 28.99 / Max: 29.01Min: 28.38 / Avg: 28.68 / Max: 28.86Min: 28.74 / Avg: 28.8 / Max: 28.841. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123918273645SE +/- 0.42, N = 3SE +/- 0.16, N = 3SE +/- 0.05, N = 337.5937.7137.961. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Ultra Fast123816243240Min: 36.75 / Avg: 37.59 / Max: 38.02Min: 37.53 / Avg: 37.71 / Max: 38.02Min: 37.87 / Avg: 37.96 / Max: 38.021. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 321.4821.2721.32MIN: 21.27MIN: 20.98MIN: 21.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123510152025Min: 21.38 / Avg: 21.48 / Max: 21.56Min: 21.25 / Avg: 21.27 / Max: 21.3Min: 21.29 / Avg: 21.32 / Max: 21.341. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1232M4M6M8M10MSE +/- 112387.29, N = 3SE +/- 122181.06, N = 3SE +/- 60107.63, N = 3110381781093487311016030
OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1232M4M6M8M10MMin: 10900913 / Avg: 11038178 / Max: 11260958Min: 10692529 / Avg: 10934873.33 / Max: 11083185Min: 10909339 / Avg: 11016030 / Max: 11117350

Node.js V8 Web Tooling Benchmark

Running the V8 project's Web-Tooling-Benchmark under Node.js. The Web-Tooling-Benchmark stresses JavaScript-related workloads common to web developers like Babel and TypeScript and Babylon. This test profile can test the system's JavaScript performance with Node.js. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215SE +/- 0.10, N = 3SE +/- 0.02, N = 3SE +/- 0.07, N = 39.9610.0310.051. Nodejs v12.18.2
OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1233691215Min: 9.78 / Avg: 9.96 / Max: 10.13Min: 10 / Avg: 10.03 / Max: 10.08Min: 9.94 / Avg: 10.05 / Max: 10.191. Nodejs v12.18.2

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1231.23142.46283.69424.92566.157SE +/- 0.00422, N = 3SE +/- 0.00939, N = 3SE +/- 0.01688, N = 35.426975.472755.42816MIN: 5.34MIN: 5.37MIN: 5.331. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123246810Min: 5.42 / Avg: 5.43 / Max: 5.43Min: 5.46 / Avg: 5.47 / Max: 5.49Min: 5.4 / Avg: 5.43 / Max: 5.461. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12370140210280350SE +/- 0.32, N = 3SE +/- 0.53, N = 3SE +/- 0.27, N = 3314.64314.32312.06
OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark12360120180240300Min: 314.05 / Avg: 314.64 / Max: 315.16Min: 313.46 / Avg: 314.32 / Max: 315.3Min: 311.79 / Avg: 312.06 / Max: 312.61

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S12320406080100SE +/- 0.07, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 377.3277.6477.911. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1231530456075Min: 77.23 / Avg: 77.32 / Max: 77.46Min: 77.42 / Avg: 77.64 / Max: 77.78Min: 77.75 / Avg: 77.91 / Max: 78.191. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar1230.33660.67321.00981.34641.683SE +/- 0.008, N = 3SE +/- 0.003, N = 3SE +/- 0.005, N = 31.4851.4961.490
OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar123246810Min: 1.47 / Avg: 1.49 / Max: 1.5Min: 1.49 / Avg: 1.5 / Max: 1.5Min: 1.48 / Avg: 1.49 / Max: 1.5

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed1232K4K6K8K10KSE +/- 35.72, N = 3SE +/- 5.94, N = 3SE +/- 1.63, N = 37828.57882.97879.21. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed12314002800420056007000Min: 7780.8 / Avg: 7828.5 / Max: 7898.4Min: 7874.5 / Avg: 7882.93 / Max: 7894.4Min: 7876 / Avg: 7879.2 / Max: 7881.31. (CC) gcc options: -O3

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed1231020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 344.6844.6944.391. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed123918273645Min: 44.67 / Avg: 44.68 / Max: 44.69Min: 44.68 / Avg: 44.69 / Max: 44.7Min: 44.16 / Avg: 44.39 / Max: 44.511. (CC) gcc options: -O3

Sunflow Rendering System

This test runs benchmarks of the Sunflow Rendering System. The Sunflow Rendering System is an open-source render engine for photo-realistic image synthesis with a ray-tracing core. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis1230.62571.25141.87712.50283.1285SE +/- 0.046, N = 3SE +/- 0.032, N = 3SE +/- 0.002, N = 32.7632.7812.774MIN: 2.57 / MAX: 3.48MIN: 2.61 / MAX: 3.45MIN: 2.61 / MAX: 3.64
OpenBenchmarking.orgSeconds, Fewer Is BetterSunflow Rendering System 0.07.2Global Illumination + Image Synthesis123246810Min: 2.69 / Avg: 2.76 / Max: 2.85Min: 2.74 / Avg: 2.78 / Max: 2.84Min: 2.77 / Avg: 2.77 / Max: 2.78

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.01822, N = 3SE +/- 0.01580, N = 3SE +/- 0.00792, N = 38.272058.219548.22943MIN: 7.77MIN: 7.73MIN: 7.731. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1233691215Min: 8.24 / Avg: 8.27 / Max: 8.3Min: 8.19 / Avg: 8.22 / Max: 8.24Min: 8.21 / Avg: 8.23 / Max: 8.241. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12330K60K90K120K150KSE +/- 994.84, N = 3SE +/- 1278.58, N = 3SE +/- 423.97, N = 3138255.21137667.82137386.881. (CC) gcc options: -O2 -lrt" -lrt
OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12320K40K60K80K100KMin: 136501.3 / Avg: 138255.21 / Max: 139945.77Min: 136112.29 / Avg: 137667.82 / Max: 140203.29Min: 136647.02 / Avg: 137386.88 / Max: 138115.591. (CC) gcc options: -O2 -lrt" -lrt

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1232K4K6K8K10KSE +/- 5.25, N = 3SE +/- 3.26, N = 3SE +/- 6.92, N = 38905.558959.948944.05MIN: 8887.6MIN: 8942.77MIN: 8922.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU12316003200480064008000Min: 8898.95 / Avg: 8905.55 / Max: 8915.92Min: 8954.57 / Avg: 8959.94 / Max: 8965.84Min: 8936.97 / Avg: 8944.05 / Max: 8957.891. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU12310002000300040005000SE +/- 1.40, N = 3SE +/- 1.79, N = 3SE +/- 5.10, N = 34630.544658.804652.31MIN: 4614MIN: 4642.81MIN: 4626.311. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1238001600240032004000Min: 4628.08 / Avg: 4630.54 / Max: 4632.94Min: 4656.52 / Avg: 4658.8 / Max: 4662.33Min: 4642.22 / Avg: 4652.31 / Max: 4658.621. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.07470.14940.22410.29880.3735SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3310.3300.332
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 112312345Min: 0.33 / Avg: 0.33 / Max: 0.33Min: 0.33 / Avg: 0.33 / Max: 0.33Min: 0.33 / Avg: 0.33 / Max: 0.33

BRL-CAD

BRL-CAD 7.28.0 is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric1239K18K27K36K45K4208341832418501. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.97071.94142.91213.88284.8535SE +/- 0.00682, N = 3SE +/- 0.02231, N = 3SE +/- 0.00714, N = 34.303434.314034.28945MIN: 3.79MIN: 3.81MIN: 3.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU123246810Min: 4.29 / Avg: 4.3 / Max: 4.32Min: 4.29 / Avg: 4.31 / Max: 4.36Min: 4.28 / Avg: 4.29 / Max: 4.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12310002000300040005000SE +/- 7.69, N = 3SE +/- 3.42, N = 3SE +/- 2.70, N = 34632.154652.864657.30MIN: 4606.26MIN: 4635.53MIN: 4635.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1238001600240032004000Min: 4622.41 / Avg: 4632.15 / Max: 4647.33Min: 4646.63 / Avg: 4652.86 / Max: 4658.43Min: 4652.12 / Avg: 4657.3 / Max: 4661.221. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 39.419.369.411. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Ultra Fast1233691215Min: 9.39 / Avg: 9.41 / Max: 9.43Min: 9.36 / Avg: 9.36 / Max: 9.36Min: 9.4 / Avg: 9.41 / Max: 9.421. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12310002000300040005000SE +/- 7.67, N = 3SE +/- 1.59, N = 3SE +/- 1.90, N = 34627.434651.694650.30MIN: 4596.02MIN: 4632.21MIN: 4632.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1238001600240032004000Min: 4615.4 / Avg: 4627.43 / Max: 4641.69Min: 4648.52 / Avg: 4651.69 / Max: 4653.51Min: 4646.68 / Avg: 4650.3 / Max: 4653.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1232K4K6K8K10KSE +/- 1.58, N = 3SE +/- 3.04, N = 3SE +/- 24.18, N = 38905.028951.158949.81MIN: 8882.16MIN: 8921.6MIN: 8898.571. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12316003200480064008000Min: 8902.2 / Avg: 8905.02 / Max: 8907.65Min: 8946.49 / Avg: 8951.15 / Max: 8956.86Min: 8922.41 / Avg: 8949.81 / Max: 8998.031. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12348121620SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 316.5616.5316.49MIN: 15.42MIN: 15.31MIN: 15.321. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12348121620Min: 16.52 / Avg: 16.56 / Max: 16.63Min: 16.5 / Avg: 16.53 / Max: 16.58Min: 16.44 / Avg: 16.49 / Max: 16.551. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123400K800K1200K1600K2000KSE +/- 19659.30, N = 3SE +/- 5504.90, N = 3SE +/- 19831.43, N = 31951090.961959806.671951427.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD123300K600K900K1200K1500KMin: 1912351.75 / Avg: 1951090.96 / Max: 1976284.62Min: 1949692 / Avg: 1959806.67 / Max: 1968629.88Min: 1912351.75 / Avg: 1951427.42 / Max: 1976853.881. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark1230.10130.20260.30390.40520.5065SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 30.4490.4480.4501. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark12312345Min: 0.45 / Avg: 0.45 / Max: 0.45Min: 0.44 / Avg: 0.45 / Max: 0.45Min: 0.45 / Avg: 0.45 / Max: 0.451. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein1230.59021.18041.77062.36082.951SE +/- 0.004, N = 3SE +/- 0.003, N = 3SE +/- 0.010, N = 32.6232.6142.6121. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein123246810Min: 2.61 / Avg: 2.62 / Max: 2.63Min: 2.61 / Avg: 2.61 / Max: 2.62Min: 2.59 / Avg: 2.61 / Max: 2.631. (CXX) g++ options: -O3 -pthread -lm

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1232K4K6K8K10KSE +/- 3.12, N = 3SE +/- 5.20, N = 3SE +/- 5.53, N = 38915.608951.958947.59MIN: 8894.86MIN: 8924.86MIN: 8923.561. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12316003200480064008000Min: 8910.65 / Avg: 8915.6 / Max: 8921.37Min: 8942.61 / Avg: 8951.95 / Max: 8960.6Min: 8940.2 / Avg: 8947.59 / Max: 8958.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow123246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 37.977.998.001. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Slow1233691215Min: 7.94 / Avg: 7.97 / Max: 8.02Min: 7.96 / Avg: 7.99 / Max: 8.01Min: 7.97 / Avg: 8 / Max: 8.031. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 38.198.228.221. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Medium1233691215Min: 8.17 / Avg: 8.19 / Max: 8.22Min: 8.19 / Avg: 8.22 / Max: 8.25Min: 8.21 / Avg: 8.22 / Max: 8.221. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12314002800420056007000SE +/- 15.48, N = 3SE +/- 3.14, N = 3SE +/- 5.22, N = 36684.456669.406661.791. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed12312002400360048006000Min: 6653.76 / Avg: 6684.45 / Max: 6703.39Min: 6663.64 / Avg: 6669.4 / Max: 6674.43Min: 6656.54 / Avg: 6661.79 / Max: 6672.231. (CC) gcc options: -O3

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123510152025SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 320.6720.6620.731. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 1080p - Video Preset: Very Fast123510152025Min: 20.55 / Avg: 20.67 / Max: 20.75Min: 20.63 / Avg: 20.66 / Max: 20.7Min: 20.67 / Avg: 20.73 / Max: 20.821. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom1230.14830.29660.44490.59320.7415SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.6590.6570.658
OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom123246810Min: 0.66 / Avg: 0.66 / Max: 0.66Min: 0.66 / Avg: 0.66 / Max: 0.66Min: 0.66 / Avg: 0.66 / Max: 0.66

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123246810SE +/- 0.00986, N = 3SE +/- 0.01057, N = 3SE +/- 0.01212, N = 37.417587.397817.41727MIN: 6.58MIN: 6.58MIN: 6.581. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1233691215Min: 7.4 / Avg: 7.42 / Max: 7.43Min: 7.38 / Avg: 7.4 / Max: 7.42Min: 7.39 / Avg: 7.42 / Max: 7.431. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1232K4K6K8K10KSE +/- 2.36, N = 3SE +/- 3.51, N = 3SE +/- 4.55, N = 37858.97838.27841.11. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed12314002800420056007000Min: 7854.3 / Avg: 7858.9 / Max: 7862.1Min: 7833.8 / Avg: 7838.17 / Max: 7845.1Min: 7836.3 / Avg: 7841.1 / Max: 7850.21. (CC) gcc options: -O3

Crafty

This is a performance test of Crafty, an advanced open-source chess engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.6M3.2M4.8M6.4M8MSE +/- 18336.90, N = 3SE +/- 21649.68, N = 3SE +/- 9014.23, N = 37385971739060274044001. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm
OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time1231.3M2.6M3.9M5.2M6.5MMin: 7366065 / Avg: 7385971.33 / Max: 7422599Min: 7365218 / Avg: 7390602.33 / Max: 7433673Min: 7386423 / Avg: 7404400.33 / Max: 74145641. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 311.5911.6211.621. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01233691215Min: 11.49 / Avg: 11.59 / Max: 11.66Min: 11.53 / Avg: 11.62 / Max: 11.69Min: 11.54 / Avg: 11.62 / Max: 11.671. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed1231020304050SE +/- 0.04, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 345.7345.6445.731. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed123918273645Min: 45.67 / Avg: 45.73 / Max: 45.8Min: 45.49 / Avg: 45.64 / Max: 45.81Min: 45.65 / Avg: 45.73 / Max: 45.831. (CC) gcc options: -O3

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast1231.15652.3133.46954.6265.7825SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.135.145.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Very Fast123246810Min: 5.12 / Avg: 5.13 / Max: 5.14Min: 5.12 / Avg: 5.14 / Max: 5.16Min: 5.12 / Avg: 5.14 / Max: 5.151. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123140K280K420K560K700KSE +/- 858.56, N = 3SE +/- 578.17, N = 3SE +/- 915.50, N = 3656714655454655467
OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite123110K220K330K440K550KMin: 655008 / Avg: 656713.67 / Max: 657738Min: 654303 / Avg: 655454.33 / Max: 656123Min: 654509 / Avg: 655466.67 / Max: 657297

LZ4 Compression

This test measures the time needed to compress/decompress a sample file (an Ubuntu ISO) using LZ4 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1232K4K6K8K10KSE +/- 2.95, N = 3SE +/- 1.88, N = 3SE +/- 5.68, N = 37862.97849.97859.61. (CC) gcc options: -O3
OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed12314002800420056007000Min: 7857.7 / Avg: 7862.87 / Max: 7867.9Min: 7848 / Avg: 7849.93 / Max: 7853.7Min: 7851 / Avg: 7859.57 / Max: 7870.31. (CC) gcc options: -O3

x265

This is a simple test of the x265 encoder run on the CPU with 1080p and 4K options for H.265 video encode performance with x265. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K123246810SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 36.376.376.381. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4K1233691215Min: 6.35 / Avg: 6.37 / Max: 6.4Min: 6.34 / Avg: 6.37 / Max: 6.4Min: 6.36 / Avg: 6.38 / Max: 6.41. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough12320406080100SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 380.2680.1780.211. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1231530456075Min: 80.14 / Avg: 80.26 / Max: 80.35Min: 80.11 / Avg: 80.17 / Max: 80.2Min: 80.16 / Avg: 80.21 / Max: 80.241. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed FFmpeg Compilation

This test times how long it takes to build the FFmpeg multimedia library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150SE +/- 0.04, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 3136.91136.97136.82
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile123306090120150Min: 136.85 / Avg: 136.91 / Max: 136.98Min: 136.85 / Avg: 136.97 / Max: 137.18Min: 136.69 / Avg: 136.82 / Max: 136.97

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 212320406080100SE +/- 0.05, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 380.0680.0580.141. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 21231530456075Min: 79.96 / Avg: 80.06 / Max: 80.14Min: 80.05 / Avg: 80.05 / Max: 80.06Min: 80.04 / Avg: 80.14 / Max: 80.251. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing1232004006008001000SE +/- 0.14, N = 3SE +/- 0.24, N = 3SE +/- 0.19, N = 3904.51904.49903.741. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing123160320480640800Min: 904.32 / Avg: 904.51 / Max: 904.79Min: 904.02 / Avg: 904.49 / Max: 904.73Min: 903.37 / Avg: 903.74 / Max: 903.991. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1233691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 312.3112.3012.301. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium12348121620Min: 12.3 / Avg: 12.31 / Max: 12.31Min: 12.29 / Avg: 12.3 / Max: 12.31Min: 12.3 / Avg: 12.3 / Max: 12.311. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms1230.591.181.772.362.95SE +/- 0.011, N = 3SE +/- 0.005, N = 3SE +/- 0.002, N = 32.6222.6212.6201. (CXX) g++ options: -O3 -pthread -lm
OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms123246810Min: 2.6 / Avg: 2.62 / Max: 2.64Min: 2.61 / Avg: 2.62 / Max: 2.63Min: 2.62 / Avg: 2.62 / Max: 2.631. (CXX) g++ options: -O3 -pthread -lm

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123140280420560700SE +/- 0.12, N = 3SE +/- 0.21, N = 3SE +/- 0.03, N = 3641.49641.37641.221. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive123110220330440550Min: 641.26 / Avg: 641.49 / Max: 641.66Min: 640.98 / Avg: 641.37 / Max: 641.7Min: 641.16 / Avg: 641.22 / Max: 641.251. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Timed HMMer Search

This test searches through the Pfam database of profile hidden markov models. The search finds the domain structure of Drosophila Sevenless protein. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search123306090120150SE +/- 0.08, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3123.93123.89123.881. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search12320406080100Min: 123.82 / Avg: 123.93 / Max: 124.08Min: 123.81 / Avg: 123.89 / Max: 124.01Min: 123.84 / Avg: 123.88 / Max: 123.951. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

Basis Universal

Basis Universal is a GPU texture codoec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3158.29158.30158.281. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 3123306090120150Min: 158.28 / Avg: 158.29 / Max: 158.3Min: 158.26 / Avg: 158.3 / Max: 158.35Min: 158.2 / Avg: 158.28 / Max: 158.331. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Kvazaar

This is a test of Kvazaar as a CPU-based H.265 video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium1230.41630.83261.24891.66522.0815SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.851.851.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Medium123246810Min: 1.85 / Avg: 1.85 / Max: 1.85Min: 1.85 / Avg: 1.85 / Max: 1.85Min: 1.85 / Avg: 1.85 / Max: 1.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow1230.40730.81461.22191.62922.0365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.811.811.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.0Video Input: Bosphorus 4K - Video Preset: Slow123246810Min: 1.81 / Avg: 1.81 / Max: 1.81Min: 1.81 / Avg: 1.81 / Max: 1.81Min: 1.81 / Avg: 1.81 / Max: 1.811. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1230.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.620.621. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID123246810Min: 0.61 / Avg: 0.62 / Max: 0.62Min: 0.61 / Avg: 0.62 / Max: 0.62Min: 0.62 / Avg: 0.62 / Max: 0.621. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1230.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets123246810Min: 0.6 / Avg: 0.6 / Max: 0.6Min: 0.6 / Avg: 0.6 / Max: 0.6Min: 0.6 / Avg: 0.6 / Max: 0.61. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom1230.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom12312345Min: 0.37 / Avg: 0.37 / Max: 0.37Min: 0.37 / Avg: 0.37 / Max: 0.37Min: 0.37 / Avg: 0.37 / Max: 0.371. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya1230.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya12312345Min: 0.48 / Avg: 0.48 / Max: 0.49Min: 0.48 / Avg: 0.48 / Max: 0.48Min: 0.48 / Avg: 0.48 / Max: 0.481. (CXX) g++ options: -O3 -pthread

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123400K800K1200K1600K2000KSE +/- 30192.95, N = 12SE +/- 17021.33, N = 3SE +/- 17337.60, N = 81703146.701714807.711704781.921. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET123300K600K900K1200K1500KMin: 1374197.88 / Avg: 1703146.7 / Max: 1748251.62Min: 1683771.12 / Avg: 1714807.71 / Max: 1742439Min: 1595049.5 / Avg: 1704781.92 / Max: 1745200.621. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123500K1000K1500K2000K2500KSE +/- 54405.26, N = 12SE +/- 10692.18, N = 3SE +/- 5643.18, N = 32395940.241524653.461516168.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP123400K800K1200K1600K2000KMin: 1808318.38 / Avg: 2395940.24 / Max: 2487641.75Min: 1510622.38 / Avg: 1524653.46 / Max: 1545644.5Min: 1508295.62 / Avg: 1516168.5 / Max: 1527108.51. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

HPC Challenge

HPC Challenge (HPCC) is a cluster-focused benchmark consisting of the HPL Linpack TPP benchmark, DGEMM, STREAM, PTRANS, RandomAccess, FFT, and communication bandwidth and latency. This HPC Challenge test profile attempts to ship with standard yet versatile configuration/input files though they can be modified. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency1230.07730.15460.23190.30920.3865SE +/- 0.00081, N = 3SE +/- 0.04057, N = 3SE +/- 0.00123, N = 30.263500.343750.261531. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgusecs, Fewer Is BetterHPC Challenge 1.5.0Test / Class: Random Ring Latency12312345Min: 0.26 / Avg: 0.26 / Max: 0.26Min: 0.26 / Avg: 0.34 / Max: 0.39Min: 0.26 / Avg: 0.26 / Max: 0.261. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access1230.00540.01080.01620.02160.027SE +/- 0.00038, N = 3SE +/- 0.00091, N = 3SE +/- 0.00040, N = 30.023950.022300.023581. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGUP/s, More Is BetterHPC Challenge 1.5.0Test / Class: G-Random Access12312345Min: 0.02 / Avg: 0.02 / Max: 0.02Min: 0.02 / Avg: 0.02 / Max: 0.02Min: 0.02 / Avg: 0.02 / Max: 0.021. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1231428425670SE +/- 0.47, N = 3SE +/- 1.67, N = 9SE +/- 0.83, N = 461.3258.3160.711. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3
OpenBenchmarking.orgGFLOPS, More Is BetterHPC Challenge 1.5.0Test / Class: G-HPL1231224364860Min: 60.41 / Avg: 61.32 / Max: 61.96Min: 49.15 / Avg: 58.31 / Max: 64.26Min: 59.52 / Avg: 60.71 / Max: 63.031. (CC) gcc options: -lblas -lm -pthread -lmpi -fomit-frame-pointer -funroll-loops2. ATLAS + Open MPI 4.0.3

84 Results Shown

HPC Challenge:
  G-Ptrans
  G-Ffte
  EP-DGEMM
Redis
HPC Challenge
oneDNN
HPC Challenge
rav1e
eSpeak-NG Speech Engine
HPC Challenge
Stockfish
oneDNN
Build2
oneDNN:
  IP Shapes 3D - u8s8f32 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
rav1e
oneDNN
rav1e
SQLite Speedtest
ASTC Encoder
Redis
x265
Kvazaar
oneDNN
asmFish
Node.js V8 Web Tooling Benchmark
oneDNN
Numpy Benchmark
Basis Universal
IndigoBench
LZ4 Compression:
  1 - Decompression Speed
  9 - Compression Speed
Sunflow Rendering System
oneDNN
Coremark
oneDNN:
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - f32 - CPU
rav1e
BRL-CAD
oneDNN:
  IP Shapes 1D - u8s8f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Kvazaar
oneDNN:
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
Redis
GROMACS
LAMMPS Molecular Dynamics Simulator
oneDNN
Kvazaar:
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
LZ4 Compression
Kvazaar
IndigoBench
oneDNN
LZ4 Compression
Crafty
Basis Universal
LZ4 Compression
Kvazaar
PHPBench
LZ4 Compression
x265
ASTC Encoder
Timed FFmpeg Compilation
Basis Universal:
  UASTC Level 2
  UASTC Level 2 + RDO Post-Processing
ASTC Encoder
LAMMPS Molecular Dynamics Simulator
ASTC Encoder
Timed HMMer Search
Basis Universal
Kvazaar:
  Bosphorus 4K - Medium
  Bosphorus 4K - Slow
simdjson:
  DistinctUserID
  PartialTweets
  LargeRand
  Kostya
Redis:
  SET
  LPOP
HPC Challenge:
  Rand Ring Latency
  G-Rand Access
  G-HPL