eMAG

Ampere eMAG ARMv8 testing with a AmpereComputing OSPREY (4.8.19 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2012272-NE-EMAG7677119
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

Audio Encoding 2 Tests
AV1 2 Tests
Chess Test Suite 3 Tests
C/C++ Compiler Tests 5 Tests
CPU Massive 7 Tests
Creator Workloads 7 Tests
Encoding 5 Tests
HPC - High Performance Computing 3 Tests
Machine Learning 2 Tests
Multi-Core 8 Tests
Programmer / Developer System Benchmarks 2 Tests
Server CPU Tests 5 Tests
Single-Threaded 2 Tests
Video Encoding 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
1
December 26 2020
  6 Hours, 4 Minutes
2
December 27 2020
  6 Hours, 55 Minutes
3
December 27 2020
  4 Hours, 25 Minutes
4
December 27 2020
  1 Hour, 21 Minutes
Invert Hiding All Results Option
  4 Hours, 41 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


eMAGProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen Resolution1234Ampere eMAG ARMv8 @ 3.00GHz (32 Cores)AmpereComputing OSPREY (4.8.19 BIOS)Applied Micro Circuits X-Gene126GB256GB Samsung SSD 860ASPEEDVE228Intel I210Ubuntu 20.045.7.0-050700-generic (aarch64)GNOME Shell 3.36.3X Server 1.20.8modesetting 1.20.8GCC 9.3.0ext41920x1080OpenBenchmarking.orgCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-nls --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- Scaling Governor: cppc_cpufreq ondemandPython Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Mitigation of PTI + spec_store_bypass: Vulnerable + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Vulnerable + tsx_async_abort: Not affected

1234Result OverviewPhoronix Test Suite100%101%102%103%oneDNNCLOMPTimed MAFFT AlignmentsimdjsonTSCP

eMAGclomp: Static OMP Speedupmafft: Multiple Sequence Alignment - LSU RNAsimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDtscp: AI Chess Performanceonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUrav1e: 1rav1e: 5rav1e: 6rav1e: 10x264: H.264 Video Encodingcoremark: CoreMark Size 666 - Iterations Per Secondstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthavifenc: 0avifenc: 2avifenc: 8avifenc: 10numpy: build-eigen: Time To Compileencode-ape: WAV To APEencode-opus: WAV To Opus Encodeespeak: Text-To-Speech Synthesis12347.135.6410.480.230.550.5651590329.014823.6825183.917418.33093.4725191.83760.3671113.020184.796114.06631838.217700.430113.816255.919.817629827.817140.136.61480.0840.1880.2230.42532.13385397.3703631569146933037962404.737250.16522.30221.88391.65357.47496.13048.30887.8317.235.3440.480.230.550.5651590333.224422.8008184.877421.22198.2804194.62063.4163112.310185.965113.51130600.517313.530971.016777.021.007731520.316556.738.37060.0840.1870.2210.41932.67385035.2074661541712733135767403.850250.70522.33221.83291.86357.14039.91948.26684.9027.036.3550.480.230.550.5651571032.284022.8470183.393424.64176.3515173.53955.9390112.550185.162113.11231978.216864.330446.516436.820.484730673.717065.538.49440.0840.1870.2220.42032.78385080.514289156176077.235.9990.480.230.550.5751570930.892524.4468183.137419.39284.3690173.05059.7828112.924183.153112.61330770.0OpenBenchmarking.org

CLOMP

CLOMP is the C version of the Livermore OpenMP benchmark developed to measure OpenMP overheads and other performance impacts due to threading in order to influence future system designs. This particular test profile configuration is currently set to look at the OpenMP static schedule speed-up across all available CPU cores using the recommended test configuration. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup1234246810SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 37.17.27.07.21. (CC) gcc options: -fopenmp -O3 -lm
OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup12343691215Min: 7 / Avg: 7.1 / Max: 7.2Min: 7.2 / Avg: 7.23 / Max: 7.3Min: 6.9 / Avg: 7 / Max: 7.1Min: 7.1 / Avg: 7.23 / Max: 7.31. (CC) gcc options: -fopenmp -O3 -lm

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1234816243240SE +/- 0.31, N = 3SE +/- 0.11, N = 3SE +/- 0.16, N = 3SE +/- 0.49, N = 335.6435.3436.3636.001. (CC) gcc options: -std=c99 -O3 -lm -lpthread
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA1234816243240Min: 35.16 / Avg: 35.64 / Max: 36.23Min: 35.23 / Avg: 35.34 / Max: 35.57Min: 36.04 / Avg: 36.36 / Max: 36.53Min: 35.07 / Avg: 36 / Max: 36.711. (CC) gcc options: -std=c99 -O3 -lm -lpthread

simdjson

This is a benchmark of SIMDJSON, a high performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya12340.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.480.481. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya123412345Min: 0.48 / Avg: 0.48 / Max: 0.48Min: 0.48 / Avg: 0.48 / Max: 0.48Min: 0.48 / Avg: 0.48 / Max: 0.48Min: 0.48 / Avg: 0.48 / Max: 0.481. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom12340.05180.10360.15540.20720.259SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.230.230.230.231. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom123412345Min: 0.23 / Avg: 0.23 / Max: 0.23Min: 0.23 / Avg: 0.23 / Max: 0.23Min: 0.23 / Avg: 0.23 / Max: 0.23Min: 0.23 / Avg: 0.23 / Max: 0.231. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets12340.12380.24760.37140.49520.619SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.550.550.550.551. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets1234246810Min: 0.55 / Avg: 0.55 / Max: 0.55Min: 0.55 / Avg: 0.55 / Max: 0.55Min: 0.55 / Avg: 0.55 / Max: 0.55Min: 0.55 / Avg: 0.55 / Max: 0.551. (CXX) g++ options: -O3 -pthread

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID12340.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.560.560.560.571. (CXX) g++ options: -O3 -pthread
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID1234246810Min: 0.56 / Avg: 0.56 / Max: 0.57Min: 0.56 / Avg: 0.56 / Max: 0.56Min: 0.56 / Avg: 0.56 / Max: 0.57Min: 0.56 / Avg: 0.57 / Max: 0.571. (CXX) g++ options: -O3 -pthread

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance1234110K220K330K440K550KSE +/- 327.56, N = 5SE +/- 328.25, N = 5SE +/- 264.73, N = 55159035159035157105157091. (CC) gcc options: -O3 -march=native
OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess Performance123490K180K270K360K450KMin: 514745 / Avg: 515903.4 / Max: 516677Min: 515227 / Avg: 515903.2 / Max: 517162Min: 515227 / Avg: 515709.8 / Max: 5166771. (CC) gcc options: -O3 -march=native

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1234816243240SE +/- 1.27, N = 15SE +/- 1.03, N = 15SE +/- 1.36, N = 15SE +/- 1.62, N = 1229.0133.2232.2830.89MIN: 15.13MIN: 15.35MIN: 15.14MIN: 15.091. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1234714212835Min: 22.33 / Avg: 29.01 / Max: 36.43Min: 24.42 / Avg: 33.22 / Max: 36.86Min: 23.22 / Avg: 32.28 / Max: 37.25Min: 23.09 / Avg: 30.89 / Max: 36.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1234612182430SE +/- 0.67, N = 15SE +/- 0.68, N = 12SE +/- 0.69, N = 15SE +/- 1.03, N = 1523.6822.8022.8524.45MIN: 17.03MIN: 17.06MIN: 17.02MIN: 17.031. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1234612182430Min: 20.05 / Avg: 23.68 / Max: 29.54Min: 19.76 / Avg: 22.8 / Max: 27.15Min: 19.26 / Avg: 22.85 / Max: 28.32Min: 19.16 / Avg: 24.45 / Max: 33.921. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU12344080120160200SE +/- 1.68, N = 3SE +/- 2.81, N = 3SE +/- 1.41, N = 3SE +/- 0.18, N = 3183.92184.88183.39183.14MIN: 127.82MIN: 135.46MIN: 133.71MIN: 130.481. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1234306090120150Min: 181.73 / Avg: 183.92 / Max: 187.22Min: 179.6 / Avg: 184.88 / Max: 189.16Min: 180.58 / Avg: 183.39 / Max: 184.97Min: 182.78 / Avg: 183.14 / Max: 183.321. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU123490180270360450SE +/- 2.55, N = 3SE +/- 0.17, N = 3SE +/- 1.42, N = 3SE +/- 1.34, N = 3418.33421.22424.64419.39MIN: 379.92MIN: 376.78MIN: 380.56MIN: 371.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU123480160240320400Min: 413.81 / Avg: 418.33 / Max: 422.63Min: 420.93 / Avg: 421.22 / Max: 421.53Min: 421.85 / Avg: 424.64 / Max: 426.49Min: 417.18 / Avg: 419.39 / Max: 421.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123420406080100SE +/- 7.37, N = 15SE +/- 7.70, N = 12SE +/- 6.62, N = 15SE +/- 8.19, N = 1593.4798.2876.3584.37MIN: 30.81MIN: 30.76MIN: 30.78MIN: 30.781. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU123420406080100Min: 47.44 / Avg: 93.47 / Max: 119.35Min: 31.33 / Avg: 98.28 / Max: 121.87Min: 39.59 / Avg: 76.35 / Max: 115.17Min: 32.81 / Avg: 84.37 / Max: 121.061. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU12344080120160200SE +/- 15.29, N = 12SE +/- 13.95, N = 15SE +/- 12.91, N = 15SE +/- 12.10, N = 15191.84194.62173.54173.05MIN: 115.28MIN: 115.45MIN: 116.21MIN: 1161. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU12344080120160200Min: 134.35 / Avg: 191.84 / Max: 256.86Min: 127.31 / Avg: 194.62 / Max: 255.27Min: 131.13 / Avg: 173.54 / Max: 259.58Min: 132.43 / Avg: 173.05 / Max: 261.051. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12341428425670SE +/- 6.33, N = 15SE +/- 7.31, N = 15SE +/- 7.43, N = 12SE +/- 6.08, N = 1560.3763.4255.9459.78MIN: 26.98MIN: 26.98MIN: 26.97MIN: 26.971. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU12341224364860Min: 29.23 / Avg: 60.37 / Max: 87.83Min: 27.02 / Avg: 63.42 / Max: 89.94Min: 27.02 / Avg: 55.94 / Max: 90.26Min: 32.53 / Avg: 59.78 / Max: 88.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1234306090120150SE +/- 0.34, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.27, N = 3113.02112.31112.55112.92MIN: 98.24MIN: 103.46MIN: 102.72MIN: 104.761. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU123420406080100Min: 112.35 / Avg: 113.02 / Max: 113.38Min: 112.26 / Avg: 112.31 / Max: 112.41Min: 112.34 / Avg: 112.55 / Max: 112.71Min: 112.5 / Avg: 112.92 / Max: 113.421. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU12344080120160200SE +/- 0.92, N = 3SE +/- 1.31, N = 3SE +/- 1.64, N = 3SE +/- 2.05, N = 3184.80185.97185.16183.15MIN: 121.06MIN: 114.9MIN: 116.12MIN: 1111. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1234306090120150Min: 183.54 / Avg: 184.8 / Max: 186.58Min: 183.82 / Avg: 185.97 / Max: 188.35Min: 183.14 / Avg: 185.16 / Max: 188.4Min: 179.42 / Avg: 183.15 / Max: 186.471. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1234306090120150SE +/- 0.92, N = 3SE +/- 0.48, N = 3SE +/- 1.41, N = 3SE +/- 1.75, N = 3114.07113.51113.11112.61MIN: 93.84MIN: 91.33MIN: 92.39MIN: 90.721. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU123420406080100Min: 112.31 / Avg: 114.07 / Max: 115.42Min: 112.58 / Avg: 113.51 / Max: 114.19Min: 111.68 / Avg: 113.11 / Max: 115.94Min: 109.4 / Avg: 112.61 / Max: 115.411. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12347K14K21K28K35KSE +/- 1283.37, N = 9SE +/- 724.02, N = 10SE +/- 171.48, N = 3SE +/- 405.43, N = 1231838.230600.531978.230770.0MIN: 23896.8MIN: 23693.3MIN: 25646MIN: 24555.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU12346K12K18K24K30KMin: 28315 / Avg: 31838.16 / Max: 41666.1Min: 27903.5 / Avg: 30600.53 / Max: 33846.5Min: 31661.6 / Avg: 31978.2 / Max: 32250.7Min: 28665 / Avg: 30770 / Max: 33455.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1234K8K12K16K20KSE +/- 343.55, N = 11SE +/- 395.99, N = 12SE +/- 324.82, N = 1217700.417313.516864.3MIN: 13035MIN: 12871.9MIN: 12759.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU1233K6K9K12K15KMin: 15835.1 / Avg: 17700.4 / Max: 20087Min: 15023.9 / Avg: 17313.53 / Max: 19405.4Min: 15806.1 / Avg: 16864.34 / Max: 18991.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1237K14K21K28K35KSE +/- 165.59, N = 3SE +/- 500.69, N = 12SE +/- 588.64, N = 1230113.830971.030446.5MIN: 24215.3MIN: 23442.3MIN: 23892.71. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1235K10K15K20K25KMin: 29847.1 / Avg: 30113.8 / Max: 30417.2Min: 27739.1 / Avg: 30970.97 / Max: 33470.1Min: 27483.5 / Avg: 30446.53 / Max: 34427.61. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1234K8K12K16K20KSE +/- 188.21, N = 3SE +/- 307.63, N = 12SE +/- 378.61, N = 916255.916777.016436.8MIN: 13038.5MIN: 12597.8MIN: 13044.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU1233K6K9K12K15KMin: 15907.6 / Avg: 16255.9 / Max: 16553.7Min: 15394.7 / Avg: 16777.03 / Max: 19122.5Min: 14374.2 / Avg: 16436.76 / Max: 17944.21. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123510152025SE +/- 1.04, N = 15SE +/- 0.93, N = 15SE +/- 0.93, N = 1519.8221.0120.48MIN: 8.13MIN: 8.13MIN: 8.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU123510152025Min: 12.18 / Avg: 19.82 / Max: 24.56Min: 15.38 / Avg: 21.01 / Max: 26.1Min: 15.73 / Avg: 20.48 / Max: 25.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1237K14K21K28K35KSE +/- 438.76, N = 12SE +/- 470.00, N = 3SE +/- 360.00, N = 329827.831520.330673.7MIN: 23657MIN: 24350.2MIN: 24396.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU1235K10K15K20K25KMin: 27611.7 / Avg: 29827.75 / Max: 31991.6Min: 30936.7 / Avg: 31520.33 / Max: 32450.3Min: 30046.4 / Avg: 30673.7 / Max: 31293.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1234K8K12K16K20KSE +/- 244.89, N = 3SE +/- 327.46, N = 12SE +/- 506.17, N = 917140.116556.717065.5MIN: 13604.9MIN: 12882.3MIN: 13244.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU1233K6K9K12K15KMin: 16718.1 / Avg: 17140.1 / Max: 17566.4Min: 15193.1 / Avg: 16556.68 / Max: 18858.9Min: 15491.3 / Avg: 17065.53 / Max: 20329.51. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123918273645SE +/- 1.48, N = 15SE +/- 0.94, N = 15SE +/- 0.82, N = 1536.6138.3738.49MIN: 19.74MIN: 19.74MIN: 19.791. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU123816243240Min: 25.62 / Avg: 36.61 / Max: 41.46Min: 27.96 / Avg: 38.37 / Max: 41.9Min: 32.92 / Avg: 38.49 / Max: 41.451. (CXX) g++ options: -O3 -std=c++11 -fopenmp -mcpu=native -fPIC -pie -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 11230.01890.03780.05670.07560.0945SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0840.0840.084
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 112312345Min: 0.08 / Avg: 0.08 / Max: 0.08Min: 0.08 / Avg: 0.08 / Max: 0.08Min: 0.08 / Avg: 0.08 / Max: 0.08

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 51230.04230.08460.12690.16920.2115SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1880.1870.187
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 512312345Min: 0.19 / Avg: 0.19 / Max: 0.19Min: 0.19 / Avg: 0.19 / Max: 0.19Min: 0.19 / Avg: 0.19 / Max: 0.19

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 61230.05020.10040.15060.20080.251SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.2230.2210.222
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 612312345Min: 0.22 / Avg: 0.22 / Max: 0.22Min: 0.22 / Avg: 0.22 / Max: 0.22Min: 0.22 / Avg: 0.22 / Max: 0.22

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 101230.09560.19120.28680.38240.478SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.4250.4190.420
OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4 AlphaSpeed: 1012312345Min: 0.43 / Avg: 0.43 / Max: 0.43Min: 0.42 / Avg: 0.42 / Max: 0.42Min: 0.42 / Avg: 0.42 / Max: 0.42

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video Encoding123816243240SE +/- 0.38, N = 5SE +/- 0.30, N = 3SE +/- 0.51, N = 332.1332.6732.781. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -lm -lpthread
OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2019-12-17H.264 Video Encoding123714212835Min: 30.76 / Avg: 32.13 / Max: 32.88Min: 32.32 / Avg: 32.67 / Max: 33.27Min: 31.85 / Avg: 32.78 / Max: 33.61. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -lm -lpthread

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12380K160K240K320K400KSE +/- 656.44, N = 3SE +/- 798.90, N = 3SE +/- 664.33, N = 3385397.37385035.21385080.511. (CC) gcc options: -O2 -lrt" -lrt
OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second12370K140K210K280K350KMin: 384084.5 / Avg: 385397.37 / Max: 386053.81Min: 383440.18 / Avg: 385035.21 / Max: 385914.13Min: 383785.08 / Avg: 385080.51 / Max: 385983.961. (CC) gcc options: -O2 -lrt" -lrt

Stockfish

This is a test of Stockfish, an advanced C++11 chess benchmark that can scale up to 128 CPU cores. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1233M6M9M12M15MSE +/- 125793.16, N = 3SE +/- 184845.46, N = 6SE +/- 177988.27, N = 151569146915417127156176071. (CXX) g++ options: -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -flto -flto=jobserver
OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time1233M6M9M12M15MMin: 15447324 / Avg: 15691469.33 / Max: 15866140Min: 14832073 / Avg: 15417127.33 / Max: 16203593Min: 14728243 / Avg: 15617607.13 / Max: 170277641. (CXX) g++ options: -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -flto -flto=jobserver

asmFish

This is a test of asmFish, an advanced chess benchmark written in Assembly. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth127M14M21M28M35MSE +/- 299925.52, N = 3SE +/- 393577.81, N = 33303796233135767
OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth126M12M18M24M30MMin: 32608036 / Avg: 33037962.33 / Max: 33615194Min: 32438964 / Avg: 33135766.67 / Max: 33801280

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 01290180270360450SE +/- 1.37, N = 3SE +/- 0.42, N = 3404.74403.851. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 01270140210280350Min: 402.89 / Avg: 404.74 / Max: 407.4Min: 403.19 / Avg: 403.85 / Max: 404.621. (CXX) g++ options: -O3 -fPIC

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21250100150200250SE +/- 0.06, N = 3SE +/- 0.40, N = 3250.17250.711. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 21250100150200250Min: 250.04 / Avg: 250.17 / Max: 250.26Min: 249.93 / Avg: 250.7 / Max: 251.291. (CXX) g++ options: -O3 -fPIC

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 812510152025SE +/- 0.05, N = 3SE +/- 0.02, N = 322.3022.331. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 812510152025Min: 22.21 / Avg: 22.3 / Max: 22.38Min: 22.29 / Avg: 22.33 / Max: 22.361. (CXX) g++ options: -O3 -fPIC

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 1012510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 321.8821.831. (CXX) g++ options: -O3 -fPIC
OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.7.3Encoder Speed: 1012510152025Min: 21.85 / Avg: 21.88 / Max: 21.93Min: 21.81 / Avg: 21.83 / Max: 21.861. (CXX) g++ options: -O3 -fPIC

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark1220406080100SE +/- 0.30, N = 3SE +/- 0.27, N = 391.6591.86
OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark1220406080100Min: 91.08 / Avg: 91.65 / Max: 92.11Min: 91.32 / Avg: 91.86 / Max: 92.18

Timed Eigen Compilation

This test times how long it takes to build all Eigen examples. The Eigen examples are compiled serially. Eigen is a C++ template library for linear algebra. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1280160240320400SE +/- 1.89, N = 3SE +/- 0.65, N = 3357.47357.14
OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile1260120180240300Min: 353.74 / Avg: 357.47 / Max: 359.82Min: 356.03 / Avg: 357.14 / Max: 358.27

Monkey Audio Encoding

This test times how long it takes to encode a sample WAV file to Monkey's Audio APE format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1220406080100SE +/- 0.05, N = 5SE +/- 0.06, N = 596.1339.921. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt
OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1220406080100Min: 96.02 / Avg: 96.13 / Max: 96.29Min: 39.82 / Avg: 39.92 / Max: 40.171. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

Opus Codec Encoding

Opus is an open audio codec. Opus is a lossy audio compression format designed primarily for interactive real-time applications over the Internet. This test uses Opus-Tools and measures the time required to encode a WAV file to Opus. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode121122334455SE +/- 0.03, N = 5SE +/- 0.01, N = 548.3148.271. (CXX) g++ options: -fvisibility=hidden -logg -lm
OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode121020304050Min: 48.24 / Avg: 48.31 / Max: 48.4Min: 48.24 / Avg: 48.27 / Max: 48.311. (CXX) g++ options: -fvisibility=hidden -logg -lm

eSpeak-NG Speech Engine

This test times how long it takes the eSpeak speech synthesizer to read Project Gutenberg's The Outline of Science and output to a WAV file. This test profile is now tracking the eSpeak-NG version of eSpeak. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1220406080100SE +/- 0.12, N = 4SE +/- 0.21, N = 487.8384.901. (CC) gcc options: -O2 -std=c99
OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1220406080100Min: 87.69 / Avg: 87.83 / Max: 88.19Min: 84.47 / Avg: 84.9 / Max: 85.431. (CC) gcc options: -O2 -std=c99