xeon-platinum-8380-2p-smoke-run

2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2105012-IB-XEONPLATI04
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
r1
April 28 2021
  1 Day, 1 Minute
r1a
April 29 2021
  11 Hours, 50 Minutes
r2
April 29 2021
  1 Minute
r2a
April 29 2021
  1 Hour, 9 Minutes
r2b
April 29 2021
  18 Hours, 2 Minutes
r3
April 30 2021
  17 Hours, 57 Minutes
r4
April 30 2021
  17 Hours, 55 Minutes
r5
May 01 2021
  46 Minutes
Invert Behavior (Only Show Selected Data)
  11 Hours, 28 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


xeon-platinum-8380-2p-smoke-run ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolutionr1r1ar2r2ar2br3r4r52 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Device 099816 x 32 GB DDR4-3200MT/s Hynix HMA84GR7CJR4N-XN2 x 7682GB INTEL SSDPF2KX076TZ + 2 x 800GB INTEL SSDPF21Q800GB + 3841GB Micron_9300_MTFDHAL3T8TDP + 960GB INTEL SSDSC2KG96ASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 20.045.11.0-051100-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8GCC 9.3.0ext41920x10801024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- r1: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270- r1a: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270- r2: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270- r2a: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r2b: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r3: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r4: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r5: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270Python Details- Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

xeon-platinum-8380-2p-smoke-run toybrot: TBBtoybrot: OpenMPintel-mlc: Peak Injection Bandwidth - 1:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - Stream-Triad Likeintel-mlc: Peak Injection Bandwidth - 3:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - Stream-Triad Likeintel-mlc: Peak Injection Bandwidth - All Readsintel-mlc: Max Bandwidth - 1:1 Reads-Writesintel-mlc: Max Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - 3:1 Reads-Writestoybrot: C++ Taskstoybrot: C++ Threadsbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptbasis: ETC1Sbasis: UASTC Level 0basis: UASTC Level 2basis: UASTC Level 3tjbench: Decompression Throughputastcenc: Mediumastcenc: Thoroughastcenc: Exhaustivedraco: Liondraco: Church Facadetoktx: UASTC 3toktx: Zstd Compression 9toktx: Zstd Compression 19toktx: UASTC 3 + Zstd Compression 19toktx: UASTC 4 + Zstd Compression 19vosk: securemark: SecureMark-TLSxmrig: Monero - 1Mxmrig: Wownero - 1Mmnn: SqueezeNetV1.0mnn: resnet-v2-50mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3onednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: IP Shapes 1D - bf16bf16bf16 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUincompact3d: X3D-benchmarking input.i3dincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directioncp2k: Fayalite-FISTstockfish: Total Timebuild-llvm: Ninjabuild-llvm: Unix Makefilesbuild-linux-kernel: Time To Compilesysbench: RAM / Memorysysbench: CPUaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Kintel-mlc: Max Bandwidth - All Readsaom-av1: Speed 0 Two-Pass - Bosphorus 4Kaom-av1: Speed 4 Two-Pass - Bosphorus 4Kaom-av1: Speed 0 Two-Pass - Bosphorus 1080paom-av1: Speed 4 Two-Pass - Bosphorus 1080paom-av1: Speed 6 Realtime - Bosphorus 1080paom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 8 Realtime - Bosphorus 1080paom-av1: Speed 9 Realtime - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080phelsing: 14 digitsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080pblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Fishy Cat - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyavifenc: 0avifenc: 2avifenc: 6avifenc: 10hammerdb-mariadb: 32 - 500avifenc: 6, Losslesshammerdb-mariadb: 16 - 250hammerdb-mariadb: 8 - 500hammerdb-mariadb: 16 - 250hammerdb-mariadb: 8 - 250hammerdb-mariadb: 8 - 500hammerdb-mariadb: 32 - 250hammerdb-mariadb: 32 - 500avifenc: 10, Losslessluxcorerender: DLSC - CPUluxcorerender: Danish Mood - CPUluxcorerender: Orange Juice - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Rainbow Colors and Prism - CPUbuild-erlang: Time To Compilebuild-mesa: Time To Compilebuild-nodejs: Time To Compilehammerdb-mariadb: 16 - 500hammerdb-mariadb: 8 - 250hammerdb-mariadb: 32 - 250hammerdb-mariadb: 16 - 500build-wasmer: Time To Compileviennacl: CPU BLAS - sCOPYhammerdb-mariadb: 64 - 500hammerdb-mariadb: 128 - 500hammerdb-mariadb: 128 - 500hammerdb-mariadb: 128 - 250hammerdb-mariadb: 128 - 250hammerdb-mariadb: 64 - 500viennacl: CPU BLAS - sAXPYhammerdb-mariadb: 64 - 250hammerdb-mariadb: 64 - 250viennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTgnuradio: Five Back to Back FIR Filtersgnuradio: Signal Source (Cosine)gnuradio: FIR Filtergnuradio: IIR Filtergnuradio: FM Deemphasis Filtergnuradio: Hilbert Transformliquid-dsp: 1 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 8 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 128 - 256 - 57liquid-dsp: 160 - 256 - 57luaradio: Five Back to Back FIR Filtersluaradio: FM Deemphasis Filterluaradio: Hilbert Transformluaradio: Complex Phasesrslte: OFDM_Testsrslte: PHY_DL_Testsrslte: PHY_DL_Testmysqlslap: 1mysqlslap: 4mysqlslap: 8mysqlslap: 16mysqlslap: 32mysqlslap: 64mysqlslap: 128mysqlslap: 256mysqlslap: 512gmpbench: Total Timeintel-mlc: Idle Latencyliquid-dsp: 64 - 256 - 57r1r1ar2r2ar2br3r4r568507318442422.3324377.2425933.7459038.6325766.94356476.2439496.74459455.38426148.967879701877.28774.3205669.7005663.055289.126292.736363.038363.255115.972116.074623.494619.458161.63461935.91822541219299.548051.50.9185681.248091.215940.3982822.961351.800461.109917.494670.8641640.8778150.3383270.210919801.409447.971792.8312.079443.530263.57247445.5190.239989804.392445.1440.2151150.593042313.9204512.7437099611.3586022181644819145.717216.32324.38215.097.3729.2033.07357285.28401.2977.872386.29327.8736.91290.67499.2357.97531.53913.2475.47720841932.113192913285984637572900829437969054688188.8529.707.4214.367.8417.04114.55020.952101.101195258957682092546447762.1601834642985719017328855415167809194684100319139763279620843105872072.371973.575.676.076.31024.32183.5603.0610.6734.0459.3577920001107133332176433334419533338853200001735100000341593333331448000001094.8410.080.3546.8120300000183.476.94642.135.1326713333369647308442843.2323924.2424096.6456260.3325184.58358385.5441408.09456629.89424612.627724698077.31074.2885670.8095663.612288.852292.374363.615363.326115.970116.069623.198619.538156.96901635.00922536619452.050166.10.9122791.252671.222780.3955882.968571.798811.122247.500590.8632140.8791370.3416630.210728804.323447.308791.9272.085323.543673.57662447.4360.240122793.363446.9360.2136430.595661311.9607852.7385909611.2727114186263552145.550215.76024.36015.197.5528.9932.51358364.560.194.170.516.8928.6621.25103.92125.25408.2478.159393.46329.5337.34288.99493.5157.71031.47913.3285.50531.6248.8129.617.5514.268.0413.34113.80020.379100.44661.930504623115724217322818876137027733539237163.631972.376.877.477.21015.22175.3604.8609.5727.4459.18902733331736800000335273333331620666671094.5409.680.3548.2120133333184.277.34642.833.0326370000067.5442144.2323826.9424077.3456408.6325260.41358269.7442460.05456545.88424818.831374.663358456.0932.569847412440454.7324209.8425925.6459309.8325409.99357742.9441732.77459226.53425997.228050714976.28674.2755606.9675662.763288.562292.396362.926363.196114.663116.080615.806612.43834.23711.25113.97917.163160.2625597.18879.290716.3621612670015.6643.47019.78110.01156.66036.42422534319311.149908.37.17448.7324.0783.21353.0730.9436241.253131.237960.4034093.004641.817741.1187428.40230.8740800.8699780.3418930.210324808.289446.389789.8362.117123.531213.64232447.7010.243026791.695447.2870.2168060.602122307.6221083.0228199211.5617158181554218148.484226.44027.99712510.56214210.835.973.2212.0314.30357774.430.142.010.323.3010.397.4536.2043.26182.1778.33182.26164.3227.80158.16234.5129.5671.7846.38110.0288.5764.97138.37216.0656.65638.39510.2829.275.7314.285.8413.42191.74621.575110.93071.928691474349422.2507.1447.6562.3389.961.959.862.354.7111.21684.4470.0498.2645.8357.456230333110173333213203333428100000862890000169933333334000666673131866667804.5370.178.2458.7120733333181.675.033361614141312648854031921601664524.5322743333370037439449554.1324227.4424904.5457190.5325218.50358463.7440939.22457141.24424925.848048720376.40774.3095593.3665662.342286.180292.827359.452363.314114.517115.723616.501612.149159.18703835.58122529120652.949813.40.9369411.241761.245080.4068773.009291.843391.1457828.18150.8749680.9018230.3419550.218349796.689450.648793.0802.108413.562243.64033446.9170.243308793.916447.1440.2165860.602314386.3900013.5659277414.5982965189214499147.163226.19928.0185.973.2011.9414.06358268.000.152.050.333.3610.397.3836.0643.42181.5278.079185.53164.5128.22157.83234.3965.96038.31316.6156.59738.59010.0889.245.6513.895.9216.47192.24521.369111.79071.13011358625329131024.2713.4764.364766.468.966.961.7580.51723.9502.0487.4621.0408.057197667111510000215343333432170000865410000170450000034110000003143300000662.8370.378.2458.2120833333181.676.134581580142012628874041894504.567.6323270000070167429446396.0324112.8425822.1458941.9325314.62358110.5440315.41458790.96425848.098037714176.40374.2925611.9955650.139286.004292.610359.573363.279114.646116.070619.638615.97534.42011.22614.15917.185159.2377527.14729.309116.3729617070825.5623.69720.08210.02956.77035.50322274720574.649937.37.17048.0414.1003.36252.2270.9407141.242221.241160.4029193.009071.819131.1181128.46130.8762270.8754210.3402430.217941792.296446.536792.0492.108373.547833.64319447.9580.242450811.941448.9060.2150850.602038389.6982803.5727815314.6577489186013261146.909224.29028.09412553.44214241.346.003.2312.1014.73357925.980.142.100.333.3610.547.4336.3542.37179.1378.539184.07162.2128.01156.26233.9629.6972.2946.73109.9688.6865.88837.79616.2116.74638.50710.2089.255.6813.945.8714.79193.83921.313111.67370.7581167855535936115876570.264770.872.467.663.7487.91619.2515.6487.7622.0373.855251667109430000216773333432013333860046667169750000033988000003140266667706.1368.078.4452.7120666667183.778.34525.767.83245666667448800.1324234.5425508.1458830.6325312.30357722.7440205.22458756.46425467.51357550.8268.1OpenBenchmarking.org

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBr1r1ar2br3r415003000450060007500SE +/- 59.06, N = 15SE +/- 80.68, N = 3SE +/- 73.83, N = 15SE +/- 69.20, N = 15SE +/- 81.70, N = 15685069646984700370161. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPr1ar1r2br4r316003200480064008000SE +/- 0.88, N = 3SE +/- 5.13, N = 3SE +/- 101.59, N = 3SE +/- 91.12, N = 4SE +/- 85.45, N = 4730873187412742974391. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

Intel Memory Latency Checker

Intel Memory Latency Checker (MLC) is a binary-only system memory bandwidth and latency benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - 1:1 Reads-Writesr3r5r4r1ar1r2ar2b100K200K300K400K500KSE +/- 138.13, N = 3SE +/- 847.23, N = 3SE +/- 1601.80, N = 3SE +/- 148.63, N = 3SE +/- 1187.16, N = 3SE +/- 212.40, N = 3SE +/- 314.54, N = 3449554.1448800.1446396.0442843.2442422.3442144.2440454.7

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - Stream-Triad Liker1r5r3r2br4r1ar2a70K140K210K280K350KSE +/- 177.93, N = 3SE +/- 55.81, N = 3SE +/- 32.03, N = 3SE +/- 12.95, N = 3SE +/- 60.42, N = 3SE +/- 38.10, N = 3SE +/- 34.05, N = 3324377.2324234.5324227.4324209.8324112.8323924.2323826.9

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - 3:1 Reads-Writesr1r2br4r5r3r1ar2a90K180K270K360K450KSE +/- 163.24, N = 3SE +/- 25.04, N = 3SE +/- 23.30, N = 3SE +/- 23.30, N = 3SE +/- 88.34, N = 3SE +/- 94.95, N = 3SE +/- 236.99, N = 3425933.7425925.6425822.1425508.1424904.5424096.6424077.3

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - 2:1 Reads-Writesr2br1r4r5r3r2ar1a100K200K300K400K500KSE +/- 64.32, N = 3SE +/- 274.15, N = 3SE +/- 36.24, N = 3SE +/- 12.06, N = 3SE +/- 73.04, N = 3SE +/- 115.55, N = 3SE +/- 130.28, N = 3459309.8459038.6458941.9458830.6457190.5456408.6456260.3

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - Stream-Triad Liker1r2br4r5r2ar3r1a70K140K210K280K350KSE +/- 25.05, N = 3SE +/- 50.20, N = 3SE +/- 7.71, N = 3SE +/- 22.58, N = 3SE +/- 53.08, N = 3SE +/- 50.80, N = 3SE +/- 11.61, N = 3325766.94325409.99325314.62325312.30325260.41325218.50325184.58

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - All Readsr3r1ar2ar4r2br5r180K160K240K320K400KSE +/- 24.95, N = 3SE +/- 14.58, N = 3SE +/- 37.47, N = 3SE +/- 26.62, N = 3SE +/- 14.54, N = 3SE +/- 23.85, N = 3SE +/- 709.43, N = 3358463.7358385.5358269.7358110.5357742.9357722.7356476.2

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - 1:1 Reads-Writesr2ar2br1ar3r4r5r190K180K270K360K450KSE +/- 1844.14, N = 3SE +/- 3117.58, N = 3SE +/- 1093.30, N = 3SE +/- 276.68, N = 3SE +/- 2322.32, N = 3SE +/- 1051.98, N = 3SE +/- 821.19, N = 3442460.05441732.77441408.09440939.22440315.41440205.22439496.74

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - 2:1 Reads-Writesr1r2br4r5r3r1ar2a100K200K300K400K500KSE +/- 33.49, N = 3SE +/- 51.02, N = 3SE +/- 8.60, N = 3SE +/- 53.22, N = 3SE +/- 89.89, N = 3SE +/- 129.26, N = 3SE +/- 54.98, N = 3459455.38459226.53458790.96458756.46457141.24456629.89456545.88

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - 3:1 Reads-Writesr1r2br4r5r3r2ar1a90K180K270K360K450KSE +/- 105.41, N = 3SE +/- 71.38, N = 3SE +/- 67.02, N = 3SE +/- 133.64, N = 3SE +/- 109.66, N = 3SE +/- 392.90, N = 3SE +/- 465.24, N = 3426148.96425997.22425848.09425467.51424925.84424818.83424612.62

toyBrot Fractal Generator

ToyBrot is a Mandelbrot fractal generator supporting C++ threads/tasks, OpenMP, Intel Threaded Building Blocks (TBB), and other targets. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasksr1ar1r4r3r2b2K4K6K8K10KSE +/- 80.44, N = 4SE +/- 43.45, N = 3SE +/- 85.46, N = 4SE +/- 93.55, N = 4SE +/- 102.03, N = 3772478798037804880501. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threadsr1ar1r4r2br315003000450060007500SE +/- 29.96, N = 3SE +/- 49.12, N = 3SE +/- 76.94, N = 4SE +/- 89.67, N = 3SE +/- 98.76, N = 3698070187141714972031. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

Botan

Botan is a BSD-licensed cross-platform open-source C++ crypto library "cryptography toolkit" that supports most publicly known cryptographic algorithms. The project's stated goal is to be "the best option for cryptography in C++ by offering the tools necessary to implement a range of practical systems, such as TLS protocol, X.509 certificates, modern AEAD ciphers, PKCS#11 and TPM hardware support, password hashing, and post quantum crypto schemes." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIr1ar1r3r4r2b20406080100SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.77, N = 3SE +/- 0.87, N = 3SE +/- 1.01, N = 377.3177.2976.4176.4076.291. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decryptr1r3r4r1ar2b20406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 374.3274.3174.2974.2974.281. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256r1ar1r4r2br312002400360048006000SE +/- 0.28, N = 3SE +/- 0.92, N = 3SE +/- 51.03, N = 3SE +/- 55.60, N = 3SE +/- 42.23, N = 35670.815669.705612.005606.975593.371. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decryptr1ar1r2br3r412002400360048006000SE +/- 0.12, N = 3SE +/- 1.20, N = 3SE +/- 0.94, N = 3SE +/- 1.10, N = 3SE +/- 12.66, N = 35663.615663.065662.765662.345650.141. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofishr1r1ar2br3r460120180240300SE +/- 0.14, N = 3SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 2.66, N = 3SE +/- 2.83, N = 3289.13288.85288.56286.18286.001. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decryptr3r1r4r2br1a60120180240300SE +/- 0.06, N = 3SE +/- 0.14, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 3292.83292.74292.61292.40292.371. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfishr1ar1r2br4r380160240320400SE +/- 0.05, N = 3SE +/- 0.56, N = 3SE +/- 0.11, N = 3SE +/- 3.51, N = 3SE +/- 3.73, N = 3363.62363.04362.93359.57359.451. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decryptr1ar3r4r1r2b80160240320400SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3363.33363.31363.28363.26363.201. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256r1r1ar2br4r3306090120150SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 1.15, N = 3SE +/- 1.17, N = 3SE +/- 1.33, N = 3115.97115.97114.66114.65114.521. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decryptr2br1r4r1ar3306090120150SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.35, N = 3116.08116.07116.07116.07115.721. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305r1r1ar4r3r2b130260390520650SE +/- 0.03, N = 3SE +/- 0.17, N = 3SE +/- 2.98, N = 3SE +/- 3.19, N = 3SE +/- 3.48, N = 3623.49623.20619.64616.50615.811. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decryptr1ar1r4r2br3130260390520650SE +/- 0.57, N = 3SE +/- 0.40, N = 3SE +/- 2.81, N = 3SE +/- 3.49, N = 3SE +/- 3.74, N = 3619.54619.46615.98612.44612.151. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Basis Universal

Basis Universal is a GPU texture codec. This test times how long it takes to convert sRGB PNGs into Basis Univeral assets with various settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1Sr2br4816243240SE +/- 0.21, N = 3SE +/- 0.42, N = 334.2434.421. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0r4r2b3691215SE +/- 0.08, N = 3SE +/- 0.08, N = 1511.2311.251. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2r2br448121620SE +/- 0.18, N = 3SE +/- 0.15, N = 313.9814.161. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3r2br448121620SE +/- 0.02, N = 3SE +/- 0.01, N = 317.1617.191. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo, a JPEG image codec library optimized for SIMD instructions on modern CPU architectures. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughputr1r2br4r3r1a4080120160200SE +/- 0.15, N = 3SE +/- 0.07, N = 3SE +/- 0.47, N = 3SE +/- 1.04, N = 3SE +/- 0.39, N = 3161.63160.26159.24159.19156.971. (CC) gcc options: -O3 -rdynamic

ASTC Encoder

ASTC Encoder (astcenc) is for the Adaptive Scalable Texture Compression (ASTC) format commonly used with OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile does a coding test of both compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Mediumr4r2b246810SE +/- 0.0290, N = 3SE +/- 0.0906, N = 157.14727.18871. (CXX) g++ options: -O3 -flto -pthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thoroughr2br43691215SE +/- 0.0796, N = 8SE +/- 0.0879, N = 79.29079.30911. (CXX) g++ options: -O3 -flto -pthread

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustiver2br448121620SE +/- 0.00, N = 3SE +/- 0.02, N = 316.3616.371. (CXX) g++ options: -O3 -flto -pthread

Google Draco

Draco is a library developed by Google for compressing/decompressing 3D geometric meshes and point clouds. This test profile uses some Artec3D PLY models as the sample 3D model input formats for Draco compression/decompression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Lionr2br413002600390052006500SE +/- 25.21, N = 3SE +/- 21.15, N = 3612661701. (CXX) g++ options: -O3

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Church Facader2br415003000450060007500SE +/- 20.01, N = 3SE +/- 3.33, N = 3700170821. (CXX) g++ options: -O3

KTX-Software toktx

This is a benchmark of The Khronos Group's KTX-Software library and tools. KTX-Software provides "toktx" for converting/creating in the KTX container format for image textures. This benchmark times how long it takes to convert to KTX 2.0 format with various settings using a reference PNG sample input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3r4r2b1.27442.54883.82325.09766.372SE +/- 0.008, N = 3SE +/- 0.053, N = 155.5625.664

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 9r2br40.83181.66362.49543.32724.159SE +/- 0.003, N = 3SE +/- 0.064, N = 153.4703.697

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 19r2br4510152025SE +/- 0.22, N = 3SE +/- 0.20, N = 319.7820.08

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3 + Zstd Compression 19r2br43691215SE +/- 0.06, N = 3SE +/- 0.11, N = 510.0110.03

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 4 + Zstd Compression 19r2br41326395265SE +/- 0.68, N = 4SE +/- 0.74, N = 356.6656.77

VOSK Speech Recognition Toolkit

VOSK is an open-source offline speech recognition API/toolkit. VOSK supports speech recognition in 17 languages and has a variety of models available and interfaces for different programming languages. This test profile times the speech-to-text process for a roughly three minute audio recording. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterVOSK Speech Recognition Toolkit 0.3.21r1ar4r3r1r2b816243240SE +/- 0.29, N = 8SE +/- 0.32, N = 3SE +/- 0.43, N = 3SE +/- 0.32, N = 3SE +/- 0.43, N = 335.0135.5035.5835.9236.42

SecureMark

SecureMark is an objective, standardized benchmarking framework for measuring the efficiency of cryptographic processing solutions developed by EEMBC. SecureMark-TLS is benchmarking Transport Layer Security performance with a focus on IoT/edge computing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSr1r1ar2br3r450K100K150K200K250KSE +/- 234.37, N = 3SE +/- 236.12, N = 3SE +/- 84.15, N = 3SE +/- 267.95, N = 3SE +/- 2769.20, N = 32254122253662253432252912227471. (CC) gcc options: -pedantic -O3

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmlrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1Mr3r4r1ar2br14K8K12K16K20KSE +/- 245.77, N = 3SE +/- 243.31, N = 15SE +/- 20.55, N = 3SE +/- 151.73, N = 3SE +/- 23.28, N = 320652.920574.619452.019311.119299.51. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1Mr1ar4r2br3r111K22K33K44K55KSE +/- 588.34, N = 3SE +/- 235.04, N = 3SE +/- 238.38, N = 3SE +/- 358.18, N = 3SE +/- 425.40, N = 750166.149937.349908.349813.448051.51. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0r4r2b246810SE +/- 0.078, N = 12SE +/- 0.002, N = 37.1707.174MIN: 6.38 / MAX: 9.97MIN: 6.95 / MAX: 7.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50r4r2b1122334455SE +/- 1.07, N = 12SE +/- 2.59, N = 348.0448.73MIN: 42.13 / MAX: 145.2MIN: 43.19 / MAX: 69.591. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224r2br40.92251.8452.76753.694.6125SE +/- 0.333, N = 3SE +/- 0.135, N = 124.0784.100MIN: 2.9 / MAX: 13.17MIN: 2.97 / MAX: 12.981. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.0r2br40.75651.5132.26953.0263.7825SE +/- 0.089, N = 3SE +/- 0.021, N = 123.2133.362MIN: 2.8 / MAX: 6.7MIN: 2.98 / MAX: 6.661. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3r4r2b1224364860SE +/- 0.75, N = 12SE +/- 1.54, N = 352.2353.07MIN: 47.47 / MAX: 94.69MIN: 49.59 / MAX: 69.621. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

oneDNN

This is a test of the Intel oneDNN as an Intel-optimized library for Deep Neural Networks and making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUr1ar1r3r4r2b0.21230.42460.63690.84921.0615SE +/- 0.002111, N = 3SE +/- 0.002101, N = 3SE +/- 0.007264, N = 3SE +/- 0.008450, N = 3SE +/- 0.011253, N = 30.9122790.9185680.9369410.9407140.943624MIN: 0.86MIN: 0.85MIN: 0.85MIN: 0.86MIN: 0.861. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUr3r4r1r1ar2b0.2820.5640.8461.1281.41SE +/- 0.01211, N = 3SE +/- 0.01282, N = 3SE +/- 0.00180, N = 3SE +/- 0.01592, N = 15SE +/- 0.00964, N = 31.241761.242221.248091.252671.25313MIN: 1.18MIN: 1.19MIN: 1.2MIN: 1.19MIN: 1.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUr1r1ar2br4r30.28010.56020.84031.12041.4005SE +/- 0.01080, N = 15SE +/- 0.01126, N = 15SE +/- 0.01174, N = 15SE +/- 0.00891, N = 15SE +/- 0.01066, N = 151.215941.222781.237961.241161.24508MIN: 0.84MIN: 0.85MIN: 0.87MIN: 0.85MIN: 0.891. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUr1ar1r4r2br30.09150.1830.27450.3660.4575SE +/- 0.001124, N = 3SE +/- 0.001135, N = 3SE +/- 0.002415, N = 14SE +/- 0.004259, N = 4SE +/- 0.003204, N = 100.3955880.3982820.4029190.4034090.406877MIN: 0.36MIN: 0.37MIN: 0.36MIN: 0.36MIN: 0.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUr1r1ar2br4r30.67711.35422.03132.70843.3855SE +/- 0.00128, N = 3SE +/- 0.00276, N = 3SE +/- 0.02287, N = 13SE +/- 0.02449, N = 14SE +/- 0.02478, N = 142.961352.968573.004643.009073.00929MIN: 2.84MIN: 2.84MIN: 2.84MIN: 2.84MIN: 2.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUr1ar1r2br4r30.41480.82961.24441.65922.074SE +/- 0.00121, N = 3SE +/- 0.00580, N = 3SE +/- 0.01382, N = 3SE +/- 0.00968, N = 3SE +/- 0.02043, N = 31.798811.800461.817741.819131.84339MIN: 1.69MIN: 1.68MIN: 1.69MIN: 1.68MIN: 1.671. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUr1r4r2br1ar30.25780.51560.77341.03121.289SE +/- 0.00274, N = 3SE +/- 0.01182, N = 3SE +/- 0.00330, N = 3SE +/- 0.00124, N = 3SE +/- 0.00975, N = 31.109911.118111.118741.122241.14578MIN: 1.02MIN: 1.02MIN: 1.02MIN: 1.02MIN: 1.041. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUr1r1ar3r2br4714212835SE +/- 0.02080, N = 3SE +/- 0.01835, N = 3SE +/- 0.30585, N = 15SE +/- 0.31773, N = 13SE +/- 0.38629, N = 127.494677.5005928.1815028.4023028.46130MIN: 6.98MIN: 6.91MIN: 14.34MIN: 14.66MIN: 14.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUr1ar1r2br3r40.19720.39440.59160.78880.986SE +/- 0.002055, N = 3SE +/- 0.002419, N = 3SE +/- 0.008361, N = 14SE +/- 0.007890, N = 14SE +/- 0.007461, N = 140.8632140.8641640.8740800.8749680.876227MIN: 0.84MIN: 0.84MIN: 0.83MIN: 0.84MIN: 0.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUr2br4r1r1ar30.20290.40580.60870.81161.0145SE +/- 0.004902, N = 3SE +/- 0.005244, N = 3SE +/- 0.006225, N = 3SE +/- 0.003986, N = 3SE +/- 0.006631, N = 30.8699780.8754210.8778150.8791370.901823MIN: 0.82MIN: 0.82MIN: 0.82MIN: 0.83MIN: 0.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUr1r4r1ar2br30.07690.15380.23070.30760.3845SE +/- 0.000853, N = 3SE +/- 0.004121, N = 3SE +/- 0.002562, N = 3SE +/- 0.003448, N = 5SE +/- 0.003372, N = 60.3383270.3402430.3416630.3418930.341955MIN: 0.3MIN: 0.3MIN: 0.31MIN: 0.3MIN: 0.311. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUr2br1ar1r4r30.04910.09820.14730.19640.2455SE +/- 0.004449, N = 15SE +/- 0.001109, N = 3SE +/- 0.002205, N = 15SE +/- 0.004970, N = 15SE +/- 0.003384, N = 150.2103240.2107280.2109190.2179410.218349MIN: 0.18MIN: 0.2MIN: 0.19MIN: 0.19MIN: 0.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUr4r3r1r1ar2b2004006008001000SE +/- 2.67, N = 3SE +/- 1.09, N = 3SE +/- 7.46, N = 3SE +/- 4.49, N = 3SE +/- 9.76, N = 3792.30796.69801.41804.32808.29MIN: 763.96MIN: 771.28MIN: 767.38MIN: 765.37MIN: 767.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUr2br4r1ar1r3100200300400500SE +/- 0.78, N = 3SE +/- 1.10, N = 3SE +/- 0.90, N = 3SE +/- 0.58, N = 3SE +/- 2.40, N = 3446.39446.54447.31447.97450.65MIN: 432.04MIN: 429.71MIN: 432.33MIN: 433.22MIN: 432.961. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUr2br1ar4r1r32004006008001000SE +/- 1.48, N = 3SE +/- 3.65, N = 3SE +/- 1.96, N = 3SE +/- 2.07, N = 3SE +/- 2.18, N = 3789.84791.93792.05792.83793.08MIN: 767.03MIN: 765.01MIN: 765.9MIN: 763.76MIN: 768.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUr1r1ar4r3r2b0.47640.95281.42921.90562.382SE +/- 0.00138, N = 3SE +/- 0.00168, N = 3SE +/- 0.01801, N = 3SE +/- 0.01943, N = 3SE +/- 0.01980, N = 32.079442.085322.108372.108412.11712MIN: 2.03MIN: 2.03MIN: 2.03MIN: 2.03MIN: 2.031. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUr1r2br1ar4r30.80151.6032.40453.2064.0075SE +/- 0.00193, N = 3SE +/- 0.00854, N = 3SE +/- 0.00732, N = 3SE +/- 0.00650, N = 3SE +/- 0.01280, N = 33.530263.531213.543673.547833.56224MIN: 3.38MIN: 3.37MIN: 3.38MIN: 3.37MIN: 3.391. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUr1r1ar3r2br40.81971.63942.45913.27884.0985SE +/- 0.00924, N = 3SE +/- 0.00795, N = 3SE +/- 0.05675, N = 14SE +/- 0.05421, N = 14SE +/- 0.05617, N = 143.572473.576623.640333.642323.64319MIN: 3.53MIN: 3.5MIN: 3.47MIN: 3.51MIN: 3.51. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUr1r3r1ar2br4100200300400500SE +/- 0.85, N = 3SE +/- 0.04, N = 3SE +/- 2.18, N = 3SE +/- 1.13, N = 3SE +/- 2.63, N = 3445.52446.92447.44447.70447.96MIN: 431.18MIN: 433.64MIN: 429.4MIN: 433.04MIN: 429.991. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUr1r1ar4r2br30.05470.10940.16410.21880.2735SE +/- 0.000856, N = 3SE +/- 0.000662, N = 3SE +/- 0.002245, N = 7SE +/- 0.003187, N = 3SE +/- 0.002507, N = 50.2399890.2401220.2424500.2430260.243308MIN: 0.22MIN: 0.23MIN: 0.22MIN: 0.22MIN: 0.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUr2br1ar3r1r42004006008001000SE +/- 0.61, N = 3SE +/- 1.56, N = 3SE +/- 0.83, N = 3SE +/- 7.01, N = 3SE +/- 16.86, N = 14791.70793.36793.92804.39811.94MIN: 769.61MIN: 765.14MIN: 769MIN: 763.49MIN: 761.611. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUr1r1ar3r2br4100200300400500SE +/- 0.58, N = 3SE +/- 1.79, N = 3SE +/- 1.24, N = 3SE +/- 0.65, N = 3SE +/- 3.51, N = 3445.14446.94447.14447.29448.91MIN: 431.52MIN: 430.47MIN: 432.42MIN: 433.06MIN: 431.331. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUr1ar4r1r3r2b0.04880.09760.14640.19520.244SE +/- 0.000781, N = 3SE +/- 0.001544, N = 12SE +/- 0.000867, N = 3SE +/- 0.002019, N = 7SE +/- 0.001893, N = 80.2136430.2150850.2151150.2165860.216806MIN: 0.19MIN: 0.19MIN: 0.19MIN: 0.19MIN: 0.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUr1r1ar4r2br30.13550.2710.40650.5420.6775SE +/- 0.001703, N = 3SE +/- 0.000780, N = 3SE +/- 0.003648, N = 3SE +/- 0.004180, N = 3SE +/- 0.004400, N = 30.5930420.5956610.6020380.6021220.602314MIN: 0.56MIN: 0.56MIN: 0.56MIN: 0.56MIN: 0.561. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many as you need scalar transport equations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dr2br1ar1r3r480160240320400SE +/- 2.73, N = 9SE +/- 0.12, N = 3SE +/- 0.46, N = 3SE +/- 4.39, N = 9SE +/- 3.91, N = 9307.62311.96313.92386.39389.701. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionr1ar1r2br3r40.80391.60782.41173.21564.0195SE +/- 0.01532048, N = 3SE +/- 0.00774937, N = 3SE +/- 0.02799890, N = 3SE +/- 0.03072276, N = 15SE +/- 0.02850005, N = 152.738590962.743709963.022819923.565927743.572781531. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionr1ar1r2br3r448121620SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 311.2711.3611.5614.6014.661. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Input: Fayalite-FISTr2a300600900120015001374.66

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timer3r1ar4r1r2b40M80M120M160M200MSE +/- 1924842.52, N = 3SE +/- 2404481.41, N = 3SE +/- 2183262.34, N = 4SE +/- 1585265.68, N = 15SE +/- 1982639.48, N = 31892144991862635521860132611816448191815542181. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

Timed LLVM Compilation

This test times how long it takes to build the LLVM compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 12.0Build System: Ninjar1ar1r4r3r2b306090120150SE +/- 0.75, N = 3SE +/- 0.52, N = 3SE +/- 0.56, N = 3SE +/- 0.32, N = 3SE +/- 1.12, N = 3145.55145.72146.91147.16148.48

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 12.0Build System: Unix Makefilesr1ar1r4r3r2b50100150200250SE +/- 0.80, N = 3SE +/- 0.91, N = 3SE +/- 0.43, N = 3SE +/- 1.24, N = 3SE +/- 0.77, N = 3215.76216.32224.29226.20226.44

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compiler1ar1r2br3r4714212835SE +/- 0.28, N = 4SE +/- 0.30, N = 4SE +/- 0.32, N = 14SE +/- 0.41, N = 14SE +/- 0.37, N = 1424.3624.3828.0028.0228.09

Sysbench

This is a benchmark of Sysbench with the built-in CPU and memory sub-tests. Sysbench is a scriptable multi-threaded benchmark tool based on LuaJIT. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memoryr4r2b3K6K9K12K15KSE +/- 118.72, N = 15SE +/- 125.16, N = 1512553.4412510.561. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUr4r2b50K100K150K200K250KSE +/- 269.51, N = 3SE +/- 247.29, N = 3214241.34214210.831. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4Kr1ar1r4r3r2b48121620SE +/- 0.03, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 12SE +/- 0.06, N = 315.1915.096.005.975.971. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4Kr1ar1r4r2br3246810SE +/- 0.06, N = 3SE +/- 0.09, N = 15SE +/- 0.03, N = 5SE +/- 0.03, N = 9SE +/- 0.04, N = 37.557.373.233.223.201. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4Kr1r1ar4r2br3714212835SE +/- 0.19, N = 3SE +/- 0.29, N = 5SE +/- 0.17, N = 3SE +/- 0.08, N = 15SE +/- 0.12, N = 1529.2028.9912.1012.0311.941. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4Kr1r1ar4r2br3816243240SE +/- 0.28, N = 3SE +/- 0.28, N = 3SE +/- 0.08, N = 3SE +/- 0.15, N = 15SE +/- 0.18, N = 433.0732.5114.7314.3014.061. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Intel Memory Latency Checker

Intel Memory Latency Checker (MLC) is a binary-only system memory bandwidth and latency benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - All Readsr2ar1ar3r4r2br5r180K160K240K320K400KSE +/- 107.35, N = 3SE +/- 142.76, N = 3SE +/- 59.61, N = 3SE +/- 83.70, N = 3SE +/- 83.63, N = 3SE +/- 46.23, N = 3SE +/- 67.01, N = 3358456.09358364.56358268.00357925.98357774.43357550.82357285.28

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4Kr1ar3r4r2b0.04280.08560.12840.17120.214SE +/- 0.00, N = 5SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 120.190.150.140.141. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4Kr1ar4r3r2b0.93831.87662.81493.75324.6915SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 9SE +/- 0.03, N = 34.172.102.052.011. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080pr1ar4r3r2b0.11480.22960.34440.45920.574SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.510.330.330.321. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080pr1ar4r3r2b246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 5SE +/- 0.03, N = 36.893.363.363.301. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080pr1ar4r3r2b714212835SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 328.6610.5410.3910.391. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080pr1ar2br4r3510152025SE +/- 0.17, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 321.257.457.437.381. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080pr1ar4r2br320406080100SE +/- 1.01, N = 15SE +/- 0.27, N = 3SE +/- 0.19, N = 3SE +/- 0.26, N = 3103.9236.3536.2036.061. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080pr1ar3r2br4306090120150SE +/- 0.82, N = 15SE +/- 0.31, N = 15SE +/- 0.49, N = 3SE +/- 0.28, N = 3125.2543.4243.2642.371. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pr1ar1r2br3r490180270360450SE +/- 0.66, N = 3SE +/- 1.44, N = 3SE +/- 0.90, N = 3SE +/- 2.25, N = 3SE +/- 0.47, N = 3408.24401.29182.17181.52179.131. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Helsing

Helsing is an open-source POSIX vampire number generator. This test profile measures the time it takes to generate vampire numbers between varying numbers of digits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitr1r3r1ar2br42040608010077.8778.0878.1678.3378.541. (CC) gcc options: -O2 -pthread -lcrypto

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pr1ar1r3r4r2b90180270360450SE +/- 16.03, N = 12SE +/- 15.40, N = 12SE +/- 1.57, N = 3SE +/- 0.65, N = 3SE +/- 4.05, N = 12393.46386.29185.53184.07182.261. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pr1ar1r3r2br470140210280350SE +/- 1.10, N = 3SE +/- 1.20, N = 3SE +/- 1.63, N = 3SE +/- 1.13, N = 3SE +/- 1.59, N = 3329.53327.87164.51164.32162.211. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pr1ar1r3r4r2b918273645SE +/- 0.24, N = 3SE +/- 0.29, N = 3SE +/- 0.14, N = 3SE +/- 0.31, N = 3SE +/- 0.09, N = 337.3436.9128.2228.0127.801. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pr1r1ar2br3r460120180240300SE +/- 1.68, N = 3SE +/- 1.37, N = 3SE +/- 1.76, N = 5SE +/- 1.64, N = 3SE +/- 1.22, N = 3290.67288.99158.16157.83156.261. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pr1r1ar2br3r4110220330440550SE +/- 3.80, N = 3SE +/- 4.78, N = 3SE +/- 2.64, N = 4SE +/- 1.80, N = 10SE +/- 1.14, N = 3499.23493.51234.51234.39233.961. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-Onlyr2br4714212835SE +/- 0.08, N = 3SE +/- 0.32, N = 329.5629.69

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CPU-Onlyr2br41632486480SE +/- 0.08, N = 3SE +/- 0.13, N = 371.7872.29

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-Onlyr2br41122334455SE +/- 0.15, N = 3SE +/- 0.25, N = 346.3846.73

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CPU-Onlyr4r2b20406080100SE +/- 0.59, N = 3SE +/- 0.18, N = 3109.96110.02

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CPU-Onlyr2br420406080100SE +/- 0.08, N = 3SE +/- 0.28, N = 388.5788.68

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0r1ar1r2br4r31530456075SE +/- 0.24, N = 3SE +/- 0.21, N = 3SE +/- 0.22, N = 3SE +/- 0.68, N = 3SE +/- 0.20, N = 357.7157.9864.9765.8965.961. (CXX) g++ options: -O3 -fPIC -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2r1ar1r4r3r2b918273645SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.20, N = 3SE +/- 0.40, N = 331.4831.5437.8038.3138.371. (CXX) g++ options: -O3 -fPIC -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6r1r1ar2br4r348121620SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.23, N = 3SE +/- 0.12, N = 15SE +/- 0.13, N = 1513.2513.3316.0716.2116.621. (CXX) g++ options: -O3 -fPIC -lm

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10r1r1ar3r2br4246810SE +/- 0.038, N = 3SE +/- 0.014, N = 3SE +/- 0.145, N = 15SE +/- 0.116, N = 15SE +/- 0.130, N = 155.4775.5056.5976.6566.7461. (CXX) g++ options: -O3 -fPIC -lm

HammerDB - MariaDB

This is a MariaDB MySQL database server benchmark making use of the HammerDB benchmarking / load testing tool. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 500r140K80K120K160K200KSE +/- 2885.40, N = 92084191. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Losslessr1ar1r2br4r3918273645SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.24, N = 3SE +/- 0.36, N = 6SE +/- 0.35, N = 331.6232.1138.4038.5138.591. (CXX) g++ options: -O3 -fPIC -lm

HammerDB - MariaDB

This is a MariaDB MySQL database server benchmark making use of the HammerDB benchmarking / load testing tool. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 250r140K80K120K160K200KSE +/- 2649.02, N = 31929131. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 500r160K120K180K240K300KSE +/- 2338.98, N = 32859841. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 250r114K28K42K56K70KSE +/- 880.35, N = 3637571. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 250r160K120K180K240K300KSE +/- 2006.72, N = 32900821. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 500r120K40K60K80K100KSE +/- 693.36, N = 3943791. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 250r115K30K45K60K75KSE +/- 1078.76, N = 9690541. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 500r115K30K45K60K75KSE +/- 921.11, N = 9688181. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Losslessr1ar1r3r4r2b3691215SE +/- 0.016, N = 3SE +/- 0.036, N = 3SE +/- 0.130, N = 15SE +/- 0.157, N = 15SE +/- 0.154, N = 158.8128.85210.08810.20810.2821. (CXX) g++ options: -O3 -fPIC -lm

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: CPUr1r1ar2br4r33691215SE +/- 0.09, N = 3SE +/- 0.09, N = 15SE +/- 0.08, N = 15SE +/- 0.09, N = 3SE +/- 0.10, N = 39.709.619.279.259.24MIN: 8.98 / MAX: 12.22MIN: 8 / MAX: 12.27MIN: 8.31 / MAX: 11.98MIN: 8.59 / MAX: 11.4MIN: 8.74 / MAX: 11.37

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: CPUr1ar1r2br4r3246810SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 37.557.425.735.685.65MIN: 3.28 / MAX: 8.86MIN: 3.2 / MAX: 8.74MIN: 1.3 / MAX: 7.65MIN: 1.26 / MAX: 7.6MIN: 1.24 / MAX: 7.63

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: CPUr1r2br1ar4r348121620SE +/- 0.13, N = 3SE +/- 0.18, N = 3SE +/- 0.21, N = 3SE +/- 0.13, N = 15SE +/- 0.12, N = 1514.3614.2814.2613.9413.89MIN: 11.58 / MAX: 19.44MIN: 11.93 / MAX: 17.73MIN: 11.6 / MAX: 19.3MIN: 11.06 / MAX: 17.84MIN: 11.08 / MAX: 17.77

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: CPUr1ar1r3r4r2b246810SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 38.047.845.925.875.84MIN: 3.51 / MAX: 9.33MIN: 3.44 / MAX: 9.2MIN: 1.15 / MAX: 7.98MIN: 1.15 / MAX: 7.95MIN: 1.16 / MAX: 7.97

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: CPUr1r3r4r2br1a48121620SE +/- 1.05, N = 15SE +/- 1.13, N = 12SE +/- 0.79, N = 12SE +/- 0.87, N = 13SE +/- 0.47, N = 1517.0416.4714.7913.4213.34MIN: 11.27 / MAX: 22.05MIN: 10.39 / MAX: 21.43MIN: 9.85 / MAX: 20.95MIN: 8.28 / MAX: 21.15MIN: 10.32 / MAX: 17.45

Timed Erlang/OTP Compilation

This test times how long it takes to compile Erlang/OTP. Erlang is a programming language and run-time for massively scalable soft real-time systems with high availability requirements. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compiler1ar1r2br3r44080120160200SE +/- 0.37, N = 3SE +/- 0.18, N = 3SE +/- 1.08, N = 3SE +/- 0.31, N = 3SE +/- 1.56, N = 3113.80114.55191.75192.25193.84

Timed Mesa Compilation

This test profile times how long it takes to compile Mesa with Meson/Ninja. For minimizing build dependencies and avoid versioning conflicts, test this is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compiler1ar1r4r3r2b510152025SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.11, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 320.3820.9521.3121.3721.58

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compiler1ar1r2br4r3306090120150SE +/- 0.29, N = 3SE +/- 0.27, N = 3SE +/- 0.50, N = 3SE +/- 0.78, N = 3SE +/- 0.68, N = 3100.45101.10110.93111.67111.79

HammerDB - MariaDB

This is a MariaDB MySQL database server benchmark making use of the HammerDB benchmarking / load testing tool. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 500r140K80K120K160K200KSE +/- 3159.46, N = 91952581. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 250r120K40K60K80K100KSE +/- 675.05, N = 3957681. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 250r140K80K120K160K200KSE +/- 3390.81, N = 92092541. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 500r114K28K42K56K70KSE +/- 1031.07, N = 9644771. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

Timed Wasmer Compilation

This test times how long it takes to compile Wasmer. Wasmer is written in the Rust programming language and is a WebAssembly runtime implementation that supports WASI and EmScripten. This test profile builds Wasmer with the Cranelift and Singlepast compiler features enabled. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Wasmer Compilation 1.0.2Time To Compiler1ar1r4r3r2b1632486480SE +/- 0.62, N = 3SE +/- 0.22, N = 3SE +/- 0.51, N = 3SE +/- 0.66, N = 7SE +/- 0.42, N = 361.9362.1670.7671.1371.931. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYr1r4r3r2br1a400800120016002000SE +/- 16.63, N = 14SE +/- 54.62, N = 15SE +/- 51.32, N = 15SE +/- 22.07, N = 15SE +/- 4.10, N = 31834116711356915041. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

HammerDB - MariaDB

This is a MariaDB MySQL database server benchmark making use of the HammerDB benchmarking / load testing tool. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 500r1r1a14K28K42K56K70KSE +/- 620.04, N = 3SE +/- 730.55, N = 964298623111. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 500r1ar112K24K36K48K60KSE +/- 484.29, N = 9SE +/- 891.59, N = 957242571901. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 500r1r1a40K80K120K160K200KSE +/- 2691.06, N = 9SE +/- 1389.03, N = 91732881732281. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 250r112K24K36K48K60KSE +/- 857.30, N = 9554151. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 250r140K80K120K160K200KSE +/- 2616.54, N = 91678091. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 500r1r1a40K80K120K160K200KSE +/- 2149.33, N = 3SE +/- 2084.32, N = 91946841887611. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYr1r3r4r2br1a2004006008001000SE +/- 6.62, N = 14SE +/- 8.11, N = 15SE +/- 11.35, N = 15SE +/- 10.36, N = 15SE +/- 15.25, N = 310038628554743701. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

HammerDB - MariaDB

This is a MariaDB MySQL database server benchmark making use of the HammerDB benchmarking / load testing tool. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 250r140K80K120K160K200KSE +/- 2831.11, N = 91913971. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 250r114K28K42K56K70KSE +/- 937.55, N = 9632791. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTr1r4r3r2br1a130260390520650SE +/- 2.34, N = 14SE +/- 2.45, N = 15SE +/- 2.55, N = 15SE +/- 5.60, N = 15SE +/- 11.67, N = 36205355323492771. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYr4r3r1r2br1a2004006008001000SE +/- 9.73, N = 15SE +/- 26.97, N = 15SE +/- 25.47, N = 14SE +/- 35.11, N = 15SE +/- 29.90, N = 3936.0913.0843.0422.2335.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYr4r1r3r2br1a2004006008001000SE +/- 5.62, N = 15SE +/- 20.63, N = 14SE +/- 82.34, N = 15SE +/- 40.80, N = 15SE +/- 23.02, N = 31158.01058.01024.2507.1392.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTr4r1r3r2br1a160320480640800SE +/- 2.76, N = 15SE +/- 6.43, N = 14SE +/- 50.57, N = 15SE +/- 34.40, N = 14SE +/- 34.44, N = 3765.00720.00713.47447.65371.001. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-Nr1r4r3r1ar2b1632486480SE +/- 0.36, N = 14SE +/- 0.25, N = 15SE +/- 3.93, N = 15SE +/- 2.90, N = 3SE +/- 3.75, N = 1572.370.264.363.662.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-Tr1r4r3r2br1a160320480640800SE +/- 2.46, N = 13SE +/- 3.20, N = 15SE +/- 2.02, N = 15SE +/- 27.49, N = 15SE +/- 5.04, N = 3719.0647.0647.0389.9319.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNr1r1ar4r3r2b1632486480SE +/- 1.42, N = 14SE +/- 3.11, N = 3SE +/- 1.95, N = 15SE +/- 2.18, N = 15SE +/- 2.06, N = 1573.572.370.866.461.91. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTr1ar1r4r3r2b20406080100SE +/- 1.01, N = 3SE +/- 1.88, N = 13SE +/- 1.98, N = 15SE +/- 1.99, N = 15SE +/- 1.14, N = 1576.875.672.468.959.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNr1ar1r4r3r2b20406080100SE +/- 0.69, N = 3SE +/- 1.67, N = 13SE +/- 2.43, N = 14SE +/- 1.88, N = 15SE +/- 2.02, N = 1577.476.067.666.962.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTr1ar1r4r3r2b20406080100SE +/- 0.90, N = 3SE +/- 1.45, N = 13SE +/- 2.94, N = 15SE +/- 2.33, N = 15SE +/- 1.75, N = 1577.276.363.761.754.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filtersr1r1ar3r4r2b2004006008001000SE +/- 2.54, N = 3SE +/- 2.30, N = 3SE +/- 39.63, N = 9SE +/- 48.36, N = 9SE +/- 1.12, N = 31024.31015.2580.5487.9111.21. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)r1r1ar3r2br45001000150020002500SE +/- 0.93, N = 3SE +/- 2.24, N = 3SE +/- 72.44, N = 9SE +/- 168.17, N = 3SE +/- 82.03, N = 92183.52175.31723.91684.41619.21. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filterr1ar1r4r3r2b130260390520650SE +/- 0.20, N = 3SE +/- 1.45, N = 3SE +/- 11.25, N = 9SE +/- 16.19, N = 9SE +/- 44.41, N = 3604.8603.0515.6502.0470.01. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filterr1r1ar2br4r3130260390520650SE +/- 0.38, N = 3SE +/- 0.46, N = 3SE +/- 45.07, N = 3SE +/- 25.67, N = 9SE +/- 26.49, N = 9610.6609.5498.2487.7487.41. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filterr1r1ar2br4r3160320480640800SE +/- 1.94, N = 3SE +/- 1.04, N = 3SE +/- 53.33, N = 3SE +/- 32.02, N = 9SE +/- 31.57, N = 9734.0727.4645.8622.0621.01. 3.8.1.0

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transformr1r1ar3r4r2b100200300400500SE +/- 2.02, N = 3SE +/- 1.66, N = 3SE +/- 17.46, N = 9SE +/- 24.71, N = 9SE +/- 47.90, N = 3459.3459.1408.0373.8357.41. 3.8.1.0

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57r1r3r2br412M24M36M48M60MSE +/- 173700.89, N = 3SE +/- 550708.74, N = 3SE +/- 613156.95, N = 3SE +/- 534784.17, N = 3577920005719766756230333552516671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 57r3r1r2br420M40M60M80M100MSE +/- 430348.70, N = 3SE +/- 729984.78, N = 3SE +/- 907677.13, N = 3SE +/- 132035.35, N = 31115100001107133331101733331094300001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 57r1r4r3r2b50M100M150M200M250MSE +/- 1090112.12, N = 3SE +/- 1956802.95, N = 3SE +/- 1663583.82, N = 3SE +/- 824809.74, N = 32176433332167733332153433332132033331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57r1r3r4r2b90M180M270M360M450MSE +/- 422150.58, N = 3SE +/- 1240739.03, N = 3SE +/- 2739929.03, N = 3SE +/- 2458908.97, N = 34419533334321700004320133334281000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57r1ar1r3r2br4200M400M600M800M1000MSE +/- 669162.00, N = 3SE +/- 691953.76, N = 3SE +/- 859903.10, N = 3SE +/- 3620722.76, N = 3SE +/- 10609570.10, N = 38902733338853200008654100008628900008600466671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57r1ar1r3r2br4400M800M1200M1600M2000MSE +/- 2515949.13, N = 3SE +/- 3951371.07, N = 3SE +/- 10121648.97, N = 3SE +/- 6582552.70, N = 3173680000017351000001704500000169933333316975000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57r1r3r2br4r1a700M1400M2100M2800M3500MSE +/- 8088331.79, N = 3SE +/- 6896617.53, N = 3SE +/- 14312737.14, N = 3SE +/- 16537936.19, N = 3SE +/- 38975091.76, N = 3341593333334110000003400066667339880000033527333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 160 - Buffer Length: 256 - Filter Length: 57r1ar1r3r4r2b700M1400M2100M2800M3500MSE +/- 2062630.47, N = 3SE +/- 17047384.94, N = 3SE +/- 14901789.60, N = 3SE +/- 16411005.79, N = 3SE +/- 14685858.66, N = 3316206666731448000003143300000314026666731318666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filtersr1r1ar2br4r32004006008001000SE +/- 2.24, N = 3SE +/- 0.62, N = 3SE +/- 22.87, N = 9SE +/- 73.21, N = 6SE +/- 74.31, N = 61094.81094.5804.5706.1662.8

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filterr1r1ar3r2br490180270360450SE +/- 0.21, N = 3SE +/- 1.40, N = 3SE +/- 4.83, N = 6SE +/- 5.30, N = 9SE +/- 1.19, N = 6410.0409.6370.3370.1368.0

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transformr1ar1r4r3r2b20406080100SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.61, N = 6SE +/- 0.47, N = 6SE +/- 0.41, N = 980.380.378.478.278.2

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phaser1ar1r2br3r4120240360480600SE +/- 0.71, N = 3SE +/- 0.25, N = 3SE +/- 3.61, N = 9SE +/- 4.31, N = 6SE +/- 4.50, N = 6548.2546.8458.7458.2452.7

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software defined (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_Testr3r2br4r1r1a30M60M90M120M150MSE +/- 600925.21, N = 3SE +/- 366666.67, N = 3SE +/- 233333.33, N = 3SE +/- 611010.09, N = 3SE +/- 240370.09, N = 31208333331207333331206666671203000001201333331. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Testr1ar4r1r3r2b4080120160200SE +/- 0.36, N = 3SE +/- 0.58, N = 3SE +/- 1.15, N = 3SE +/- 2.42, N = 3SE +/- 1.23, N = 3184.2183.7183.4181.6181.61. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Testr4r1ar1r3r2b20406080100SE +/- 0.62, N = 3SE +/- 1.16, N = 3SE +/- 0.76, N = 3SE +/- 1.14, N = 3SE +/- 0.38, N = 378.377.376.976.175.01. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

MariaDB

This is a MariaDB MySQL database server benchmark making use of mysqlslap. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 1r3r2b7001400210028003500SE +/- 61.33, N = 12SE +/- 73.97, N = 15345833361. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 4r2br330060090012001500SE +/- 16.07, N = 3SE +/- 7.20, N = 3161415801. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 8r3r2b30060090012001500SE +/- 3.56, N = 3SE +/- 10.97, N = 3142014131. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 16r2br330060090012001500SE +/- 1.85, N = 3SE +/- 3.49, N = 3126412621. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 32r3r2b2004006008001000SE +/- 1.83, N = 3SE +/- 0.26, N = 38878851. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 64r3r2b90180270360450SE +/- 0.16, N = 3SE +/- 0.62, N = 34044031. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 128r2br34080120160200SE +/- 0.65, N = 3SE +/- 0.35, N = 31921891. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 256r2b4080120160200SE +/- 0.22, N = 31601. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 512r2b4080120160200SE +/- 0.87, N = 31661. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Timer1ar1r4r2br3100020003000400050004642.84642.14525.74524.54504.51. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Intel Memory Latency Checker

Intel Memory Latency Checker (MLC) is a binary-only system memory bandwidth and latency benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns, Fewer Is BetterIntel Memory Latency CheckerTest: Idle Latencyr2ar1ar1r2r3r4r51530456075SE +/- 0.28, N = 8SE +/- 0.39, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.09, N = 332.533.035.167.567.667.868.1

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57r1r1ar4r3r2b700M1400M2100M2800M3500MSE +/- 5206513.02, N = 3SE +/- 2150193.79, N = 3SE +/- 12876378.03, N = 3SE +/- 14893734.70, N = 3SE +/- 17049079.48, N = 3326713333332637000003245666667323270000032274333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

188 Results Shown

toyBrot Fractal Generator:
  TBB
  OpenMP
Intel Memory Latency Checker:
  Peak Injection Bandwidth - 1:1 Reads-Writes
  Peak Injection Bandwidth - Stream-Triad Like
  Peak Injection Bandwidth - 3:1 Reads-Writes
  Peak Injection Bandwidth - 2:1 Reads-Writes
  Max Bandwidth - Stream-Triad Like
  Peak Injection Bandwidth - All Reads
  Max Bandwidth - 1:1 Reads-Writes
  Max Bandwidth - 2:1 Reads-Writes
  Max Bandwidth - 3:1 Reads-Writes
toyBrot Fractal Generator:
  C++ Tasks
  C++ Threads
Botan:
  KASUMI
  KASUMI - Decrypt
  AES-256
  AES-256 - Decrypt
  Twofish
  Twofish - Decrypt
  Blowfish
  Blowfish - Decrypt
  CAST-256
  CAST-256 - Decrypt
  ChaCha20Poly1305
  ChaCha20Poly1305 - Decrypt
Basis Universal:
  ETC1S
  UASTC Level 0
  UASTC Level 2
  UASTC Level 3
libjpeg-turbo tjbench
ASTC Encoder:
  Medium
  Thorough
  Exhaustive
Google Draco:
  Lion
  Church Facade
KTX-Software toktx:
  UASTC 3
  Zstd Compression 9
  Zstd Compression 19
  UASTC 3 + Zstd Compression 19
  UASTC 4 + Zstd Compression 19
VOSK Speech Recognition Toolkit
SecureMark
Xmrig:
  Monero - 1M
  Wownero - 1M
Mobile Neural Network:
  SqueezeNetV1.0
  resnet-v2-50
  MobileNetV2_224
  mobilenet-v1-1.0
  inception-v3
oneDNN:
  IP Shapes 1D - f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  IP Shapes 1D - bf16bf16bf16 - CPU
  IP Shapes 3D - bf16bf16bf16 - CPU
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Convolution Batch Shapes Auto - bf16bf16bf16 - CPU
  Deconvolution Batch shapes_1d - bf16bf16bf16 - CPU
  Deconvolution Batch shapes_3d - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPU
Xcompact3d Incompact3d:
  X3D-benchmarking input.i3d
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
CP2K Molecular Dynamics
Stockfish
Timed LLVM Compilation:
  Ninja
  Unix Makefiles
Timed Linux Kernel Compilation
Sysbench:
  RAM / Memory
  CPU
AOM AV1:
  Speed 6 Realtime - Bosphorus 4K
  Speed 6 Two-Pass - Bosphorus 4K
  Speed 8 Realtime - Bosphorus 4K
  Speed 9 Realtime - Bosphorus 4K
Intel Memory Latency Checker
AOM AV1:
  Speed 0 Two-Pass - Bosphorus 4K
  Speed 4 Two-Pass - Bosphorus 4K
  Speed 0 Two-Pass - Bosphorus 1080p
  Speed 4 Two-Pass - Bosphorus 1080p
  Speed 6 Realtime - Bosphorus 1080p
  Speed 6 Two-Pass - Bosphorus 1080p
  Speed 8 Realtime - Bosphorus 1080p
  Speed 9 Realtime - Bosphorus 1080p
SVT-VP9
Helsing
SVT-VP9:
  VMAF Optimized - Bosphorus 1080p
  Visual Quality Optimized - Bosphorus 1080p
SVT-HEVC:
  1 - Bosphorus 1080p
  7 - Bosphorus 1080p
  10 - Bosphorus 1080p
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Fishy Cat - CPU-Only
  Barbershop - CPU-Only
  Pabellon Barcelona - CPU-Only
libavif avifenc:
  0
  2
  6
  10
HammerDB - MariaDB
libavif avifenc
HammerDB - MariaDB:
  16 - 250
  8 - 500
  16 - 250
  8 - 250
  8 - 500
  32 - 250
  32 - 500
libavif avifenc
LuxCoreRender:
  DLSC - CPU
  Danish Mood - CPU
  Orange Juice - CPU
  LuxCore Benchmark - CPU
  Rainbow Colors and Prism - CPU
Timed Erlang/OTP Compilation
Timed Mesa Compilation
Timed Node.js Compilation
HammerDB - MariaDB:
  16 - 500
  8 - 250
  32 - 250
  16 - 500
Timed Wasmer Compilation
ViennaCL
HammerDB - MariaDB:
  64 - 500
  128 - 500
  128 - 500
  128 - 250
  128 - 250
  64 - 500
ViennaCL
HammerDB - MariaDB:
  64 - 250:
    Transactions Per Minute
    New Orders Per Minute
ViennaCL:
  CPU BLAS - sDOT
  CPU BLAS - dCOPY
  CPU BLAS - dAXPY
  CPU BLAS - dDOT
  CPU BLAS - dGEMV-N
  CPU BLAS - dGEMV-T
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-TN
  CPU BLAS - dGEMM-TT
GNU Radio:
  Five Back to Back FIR Filters
  Signal Source (Cosine)
  FIR Filter
  IIR Filter
  FM Deemphasis Filter
  Hilbert Transform
Liquid-DSP:
  1 - 256 - 57
  2 - 256 - 57
  4 - 256 - 57
  8 - 256 - 57
  16 - 256 - 57
  32 - 256 - 57
  128 - 256 - 57
  160 - 256 - 57
LuaRadio:
  Five Back to Back FIR Filters
  FM Deemphasis Filter
  Hilbert Transform
  Complex Phase
srsLTE:
  OFDM_Test
  PHY_DL_Test
  PHY_DL_Test
MariaDB:
  1
  4
  8
  16
  32
  64
  128
  256
  512
GNU GMP GMPbench
Intel Memory Latency Checker
Liquid-DSP