xeon-platinum-8380-2p-smoke-run

2 x Intel Xeon Platinum 8380 testing with a Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2105012-IB-XEONPLATI04&grr&sro&rro.

xeon-platinum-8380-2p-smoke-run ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolutionr1r1ar2r2ar2br3r4r52 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)Intel Device 099816 x 32 GB DDR4-3200MT/s Hynix HMA84GR7CJR4N-XN2 x 7682GB INTEL SSDPF2KX076TZ + 2 x 800GB INTEL SSDPF21Q800GB + 3841GB Micron_9300_MTFDHAL3T8TDP + 960GB INTEL SSDSC2KG96ASPEEDVE2282 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFPUbuntu 20.045.11.0-051100-generic (x86_64)GNOME Shell 3.36.4X Server 1.20.8GCC 9.3.0ext41920x10801024x768OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- r1: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270- r1a: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270- r2: Scaling Governor: intel_pstate performance - CPU Microcode: 0xd000270- r2a: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r2b: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r3: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r4: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270- r5: Scaling Governor: intel_pstate powersave - CPU Microcode: 0xd000270Python Details- Python 2.7.18 + Python 3.8.5Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

xeon-platinum-8380-2p-smoke-run hammerdb-mariadb: 128 - 250hammerdb-mariadb: 128 - 250hammerdb-mariadb: 128 - 500hammerdb-mariadb: 128 - 500hammerdb-mariadb: 64 - 250hammerdb-mariadb: 64 - 250hammerdb-mariadb: 32 - 250hammerdb-mariadb: 32 - 250hammerdb-mariadb: 16 - 500hammerdb-mariadb: 16 - 500hammerdb-mariadb: 32 - 500hammerdb-mariadb: 32 - 500mysqlslap: 256mysqlslap: 512mysqlslap: 128hammerdb-mariadb: 64 - 500hammerdb-mariadb: 64 - 500incompact3d: X3D-benchmarking input.i3dgnuradio: Hilbert Transformgnuradio: FM Deemphasis Filtergnuradio: IIR Filtergnuradio: FIR Filtergnuradio: Signal Source (Cosine)gnuradio: Five Back to Back FIR Filtersmysqlslap: 64aom-av1: Speed 4 Two-Pass - Bosphorus 4Kcp2k: Fayalite-FISThammerdb-mariadb: 16 - 250hammerdb-mariadb: 16 - 250luaradio: Complex Phaseluaradio: Hilbert Transformluaradio: FM Deemphasis Filterluaradio: Five Back to Back FIR Filtershammerdb-mariadb: 8 - 250hammerdb-mariadb: 8 - 250hammerdb-mariadb: 8 - 500hammerdb-mariadb: 8 - 500aom-av1: Speed 6 Two-Pass - Bosphorus 4Kmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0mysqlslap: 1securemark: SecureMark-TLSaom-av1: Speed 0 Two-Pass - Bosphorus 4Kmysqlslap: 32build-llvm: Unix Makefilesaom-av1: Speed 4 Two-Pass - Bosphorus 1080pluxcorerender: Orange Juice - CPUmysqlslap: 16build-erlang: Time To Compileluxcorerender: DLSC - CPUintel-mlc: Max Bandwidth - Stream-Triad Likeintel-mlc: Max Bandwidth - 1:1 Reads-Writesintel-mlc: Max Bandwidth - 2:1 Reads-Writesintel-mlc: Max Bandwidth - 3:1 Reads-Writesintel-mlc: Max Bandwidth - All Readsmysqlslap: 8build-llvm: Ninjaaom-av1: Speed 6 Realtime - Bosphorus 4Konednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUmysqlslap: 4aom-av1: Speed 8 Realtime - Bosphorus 4Kgmpbench: Total Timeblender: Barbershop - CPU-Onlybuild-nodejs: Time To Compileviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYxmrig: Monero - 1Maom-av1: Speed 6 Two-Pass - Bosphorus 1080pbuild-linux-kernel: Time To Compilesysbench: CPUblender: Pabellon Barcelona - CPU-Onlybuild-wasmer: Time To Compileonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUaom-av1: Speed 9 Realtime - Bosphorus 4Kblender: Classroom - CPU-Onlytoktx: UASTC 4 + Zstd Compression 19onednn: Deconvolution Batch shapes_1d - f32 - CPUluxcorerender: Danish Mood - CPUluxcorerender: LuxCore Benchmark - CPUavifenc: 0aom-av1: Speed 0 Two-Pass - Bosphorus 1080pluxcorerender: Rainbow Colors and Prism - CPUaom-av1: Speed 6 Realtime - Bosphorus 1080pblender: Fishy Cat - CPU-Onlyonednn: IP Shapes 1D - bf16bf16bf16 - CPUvosk: stockfish: Total Timeavifenc: 6, Losslesssrslte: PHY_DL_Testsrslte: PHY_DL_Testavifenc: 6srslte: OFDM_Testsysbench: RAM / Memoryavifenc: 2botan: AES-256 - Decryptbotan: AES-256basis: ETC1Sbasis: UASTC Level 0avifenc: 10, Losslessaom-av1: Speed 9 Realtime - Bosphorus 1080pblender: BMW27 - CPU-Onlybotan: ChaCha20Poly1305 - Decryptbotan: ChaCha20Poly1305botan: Blowfish - Decryptbotan: Blowfishbotan: Twofish - Decryptbotan: Twofishbotan: CAST-256 - Decryptbotan: CAST-256botan: KASUMI - Decryptbotan: KASUMItoybrot: TBBxmrig: Wownero - 1Mintel-mlc: Peak Injection Bandwidth - Stream-Triad Likeintel-mlc: Peak Injection Bandwidth - 1:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - 2:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - 3:1 Reads-Writesintel-mlc: Peak Injection Bandwidth - All Readsonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUhelsing: 14 digittjbench: Decompression Throughputastcenc: Thoroughastcenc: Exhaustiveavifenc: 10astcenc: Mediumbuild-mesa: Time To Compileonednn: Deconvolution Batch shapes_1d - bf16bf16bf16 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUsvt-hevc: 1 - Bosphorus 1080paom-av1: Speed 8 Realtime - Bosphorus 1080pliquid-dsp: 160 - 256 - 57liquid-dsp: 128 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 8 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 1 - 256 - 57toktx: Zstd Compression 19basis: UASTC Level 3onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUsvt-vp9: VMAF Optimized - Bosphorus 1080ptoktx: UASTC 3onednn: IP Shapes 3D - f32 - CPUintel-mlc: Idle Latencyonednn: IP Shapes 1D - f32 - CPUbasis: UASTC Level 2incompact3d: input.i3d 193 Cells Per Directiontoktx: UASTC 3 + Zstd Compression 19onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - bf16bf16bf16 - CPUincompact3d: input.i3d 129 Cells Per Directiontoktx: Zstd Compression 9onednn: Deconvolution Batch shapes_3d - bf16bf16bf16 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUtoybrot: C++ Tasksdraco: Church Facadeonednn: IP Shapes 3D - bf16bf16bf16 - CPUdraco: Liontoybrot: OpenMPtoybrot: C++ Threadssvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ponednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - bf16bf16bf16 - CPUsvt-hevc: 10 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080pr1r1ar2r2ar2br3r4r516780955415173288571901913976327920925469054195258644772084196881819468464298313.920451459.3734.0610.6603.02183.51024.319291363757546.880.3410.01094.829008295768285984943797.37225412216.32314.36114.5509.70325766.94439496.74459455.38426148.96357285.28145.71715.09804.39229.204642.1101.10176.376.075.673.571972.372010588436201003183419299.524.38262.160801.409792.8311.21594447.971445.144445.51933.077.494677.427.8457.97517.042.9613535.91818164481932.11376.9183.413.24712030000031.5395663.0555669.7008.852619.458623.494363.255363.038292.736289.126116.074115.97274.32077.287685048051.5324377.2442422.3459038.6425933.7356476.20.3383270.21511577.872161.6346195.47720.9523.530260.39828236.913144800000341593333332671333331735100000885320000441953333217643333110713333577920000.239989386.291.2480935.10.91856811.35860220.2109190.5930422.743709963.572470.86416478791.8004673187018327.87401.291.109910.8778152.07944499.23290.671732285724218876162311311.960785459.1727.4609.5604.82175.31015.24.17548.280.3409.61094.57.552253660.19215.7606.8914.26113.8009.61325184.58441408.09456629.89424612.62358364.56145.55015.19793.36328.994642.8100.44677.277.476.872.331963.637139233527737050419452.021.2524.36061.930804.323791.9271.22278447.308446.936447.43632.517.500597.558.0457.7100.5113.3428.662.9685735.00918626355231.62477.3184.213.32812013333331.4795663.6125670.8098.812125.25619.538623.198363.326363.615292.374288.852116.069115.97074.28877.310696450166.1323924.2442843.2456260.3424096.6358385.50.3416630.21364378.159156.9690165.50520.3793.543670.39558837.34103.9231620666673352733333326370000017368000008902733330.240122393.461.2526733.00.91227911.27271140.2107280.5956612.738590963.576620.86321477241.7988173086980329.53408.241.122240.8791372.08532493.51288.9967.51374.663325260.41442460.05456545.88424818.83358456.09323826.9442144.2456408.6424077.3358269.732.5160166192307.622108357.4645.8498.2470.01684.4111.24032.01458.778.2370.1804.53.2253.0733.2134.07848.7327.17433362253430.14885226.4403.3014.281264191.7469.27325409.99441732.77459226.53425997.22357774.431413148.4845.97791.695161412.034524.5110.02110.93054.762.359.861.9389.962.3447.65507.1422.234947469119311.17.4527.997214210.8388.5771.928808.289789.8361.23796446.389447.287447.70114.3071.7856.66028.40235.735.8464.9710.3213.4210.3946.383.0046436.42418155421838.39575.0181.616.06512073333312510.5638.3725662.7635606.96734.23711.25110.28243.2629.56612.438615.806363.196362.926292.396288.562116.080114.66374.27576.286698449908.3324209.8440454.7459309.8425925.6357742.90.3418930.21680678.33160.2625599.290716.36216.6567.188721.5753.531210.40340927.8036.2031318666673400066667322743333316993333338628900004281000002132033331101733335623033319.78117.1630.243026182.265.6641.253130.94362413.97911.561715810.0110.2103240.6021223.022819923.4703.642320.874080805070011.81774612674127149164.32182.171.118740.8699782.11712234.51158.16189386.390001408.0621.0487.4502.01723.9580.54042.05458.278.2370.3662.83.2034582252910.15887226.1993.3613.891262192.2459.24325218.50440939.22457141.24424925.84358268.001420147.1635.97793.916158011.944504.5111.79061.766.968.966.464764.3713.471024.2913532862113520652.97.3828.01871.130796.689793.0801.24508450.648447.144446.91714.0628.18155.655.9265.9600.3316.4710.393.0092935.58118921449938.59076.1181.616.61512083333338.3135662.3425593.36610.08843.42612.149616.501363.314359.452292.827286.180115.723114.51774.30976.407700349813.4324227.4449554.1457190.5424904.5358463.70.3419550.21658678.079159.1870386.59721.3693.562240.40687728.2236.063143300000341100000032327000001704500000865410000432170000215343333111510000571976670.243308185.531.2417667.60.93694114.59829650.2183490.6023143.565927743.640330.87496880481.8433974397203164.51181.521.145780.9018232.10841234.39157.83389.698280373.8622.0487.7515.61619.2487.92.10452.778.4368.0706.13.2352.2273.3624.10048.0417.1702227470.14224.2903.3613.94193.8399.25325314.62440315.41458790.96425848.09357925.98146.9096.00811.94112.104525.7109.96111.67363.767.672.470.864770.27651158936535855116720574.67.4328.094214241.3488.6870.758792.296792.0491.24116446.536448.906447.95814.7372.2956.77028.46135.685.8765.8880.3314.7910.5446.733.0090735.50318601326138.50778.3183.716.21112066666712553.4437.7965650.1395611.99534.42011.22610.20842.3729.69615.975619.638363.279359.573292.610286.004116.070114.64674.29276.403701649937.3324112.8446396.0458941.9425822.1358110.50.3402430.21508578.539159.2377529.309116.37296.7467.147221.3133.547830.40291928.0136.3531402666673398800000324566666716975000008600466674320133332167733331094300005525166720.08217.1850.242450184.075.5621.2422267.80.94071414.15914.657748910.0290.2179410.6020383.572781533.6973.643190.876227803770821.81913617074297141162.21179.131.118110.8754212.10837233.96156.26325312.30440205.22458756.46425467.51357550.82324234.5448800.1458830.6425508.1357722.768.1OpenBenchmarking.org

HammerDB - MariaDB

Virtual Users: 128 - Warehouses: 250

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 250r140K80K120K160K200KSE +/- 2616.54, N = 91678091. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 128 - Warehouses: 250

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 250r112K24K36K48K60KSE +/- 857.30, N = 9554151. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 128 - Warehouses: 500

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 500r1ar140K80K120K160K200KSE +/- 1389.03, N = 9SE +/- 2691.06, N = 91732281732881. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 128 - Warehouses: 500

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 128 - Warehouses: 500r1ar112K24K36K48K60KSE +/- 484.29, N = 9SE +/- 891.59, N = 957242571901. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 64 - Warehouses: 250

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 250r140K80K120K160K200KSE +/- 2831.11, N = 91913971. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 64 - Warehouses: 250

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 250r114K28K42K56K70KSE +/- 937.55, N = 9632791. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 32 - Warehouses: 250

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 250r140K80K120K160K200KSE +/- 3390.81, N = 92092541. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 32 - Warehouses: 250

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 250r115K30K45K60K75KSE +/- 1078.76, N = 9690541. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 16 - Warehouses: 500

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 500r140K80K120K160K200KSE +/- 3159.46, N = 91952581. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 16 - Warehouses: 500

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 500r114K28K42K56K70KSE +/- 1031.07, N = 9644771. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 32 - Warehouses: 500

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 500r140K80K120K160K200KSE +/- 2885.40, N = 92084191. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 32 - Warehouses: 500

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 32 - Warehouses: 500r115K30K45K60K75KSE +/- 921.11, N = 9688181. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

MariaDB

Clients: 256

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 256r2b4080120160200SE +/- 0.22, N = 31601. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

MariaDB

Clients: 512

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 512r2b4080120160200SE +/- 0.87, N = 31661. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

MariaDB

Clients: 128

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 128r3r2b4080120160200SE +/- 0.35, N = 3SE +/- 0.65, N = 31891921. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 64 - Warehouses: 500

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 500r1ar140K80K120K160K200KSE +/- 2084.32, N = 9SE +/- 2149.33, N = 31887611946841. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 64 - Warehouses: 500

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 64 - Warehouses: 500r1ar114K28K42K56K70KSE +/- 730.55, N = 9SE +/- 620.04, N = 362311642981. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3dr4r3r2br1ar180160240320400SE +/- 3.91, N = 9SE +/- 4.39, N = 9SE +/- 2.73, N = 9SE +/- 0.12, N = 3SE +/- 0.46, N = 3389.70386.39307.62311.96313.921. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transformr4r3r2br1ar1100200300400500SE +/- 24.71, N = 9SE +/- 17.46, N = 9SE +/- 47.90, N = 3SE +/- 1.66, N = 3SE +/- 2.02, N = 3373.8408.0357.4459.1459.31. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filterr4r3r2br1ar1160320480640800SE +/- 32.02, N = 9SE +/- 31.57, N = 9SE +/- 53.33, N = 3SE +/- 1.04, N = 3SE +/- 1.94, N = 3622.0621.0645.8727.4734.01. 3.8.1.0

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filterr4r3r2br1ar1130260390520650SE +/- 25.67, N = 9SE +/- 26.49, N = 9SE +/- 45.07, N = 3SE +/- 0.46, N = 3SE +/- 0.38, N = 3487.7487.4498.2609.5610.61. 3.8.1.0

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filterr4r3r2br1ar1130260390520650SE +/- 11.25, N = 9SE +/- 16.19, N = 9SE +/- 44.41, N = 3SE +/- 0.20, N = 3SE +/- 1.45, N = 3515.6502.0470.0604.8603.01. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)r4r3r2br1ar15001000150020002500SE +/- 82.03, N = 9SE +/- 72.44, N = 9SE +/- 168.17, N = 3SE +/- 2.24, N = 3SE +/- 0.93, N = 31619.21723.91684.42175.32183.51. 3.8.1.0

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filtersr4r3r2br1ar12004006008001000SE +/- 48.36, N = 9SE +/- 39.63, N = 9SE +/- 1.12, N = 3SE +/- 2.30, N = 3SE +/- 2.54, N = 3487.9580.5111.21015.21024.31. 3.8.1.0

MariaDB

Clients: 64

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 64r3r2b90180270360450SE +/- 0.16, N = 3SE +/- 0.62, N = 34044031. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4Kr4r3r2br1a0.93831.87662.81493.75324.6915SE +/- 0.01, N = 3SE +/- 0.02, N = 9SE +/- 0.03, N = 3SE +/- 0.03, N = 32.102.052.014.171. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

CP2K Molecular Dynamics

Input: Fayalite-FIST

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Input: Fayalite-FISTr2a300600900120015001374.66

HammerDB - MariaDB

Virtual Users: 16 - Warehouses: 250

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 250r140K80K120K160K200KSE +/- 2649.02, N = 31929131. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 16 - Warehouses: 250

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 16 - Warehouses: 250r114K28K42K56K70KSE +/- 880.35, N = 3637571. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phaser4r3r2br1ar1120240360480600SE +/- 4.50, N = 6SE +/- 4.31, N = 6SE +/- 3.61, N = 9SE +/- 0.71, N = 3SE +/- 0.25, N = 3452.7458.2458.7548.2546.8

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transformr4r3r2br1ar120406080100SE +/- 0.61, N = 6SE +/- 0.47, N = 6SE +/- 0.41, N = 9SE +/- 0.00, N = 3SE +/- 0.00, N = 378.478.278.280.380.3

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filterr4r3r2br1ar190180270360450SE +/- 1.19, N = 6SE +/- 4.83, N = 6SE +/- 5.30, N = 9SE +/- 1.40, N = 3SE +/- 0.21, N = 3368.0370.3370.1409.6410.0

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filtersr4r3r2br1ar12004006008001000SE +/- 73.21, N = 6SE +/- 74.31, N = 6SE +/- 22.87, N = 9SE +/- 0.62, N = 3SE +/- 2.24, N = 3706.1662.8804.51094.51094.8

HammerDB - MariaDB

Virtual Users: 8 - Warehouses: 250

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 250r160K120K180K240K300KSE +/- 2006.72, N = 32900821. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 8 - Warehouses: 250

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 250r120K40K60K80K100KSE +/- 675.05, N = 3957681. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 8 - Warehouses: 500

OpenBenchmarking.orgTransactions Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 500r160K120K180K240K300KSE +/- 2338.98, N = 32859841. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

HammerDB - MariaDB

Virtual Users: 8 - Warehouses: 500

OpenBenchmarking.orgNew Orders Per Minute, More Is BetterHammerDB - MariaDB 10.5.9Virtual Users: 8 - Warehouses: 500r120K40K60K80K100KSE +/- 693.36, N = 3943791. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lbz2 -lsnappy -ldl -lz -lrt

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4Kr4r3r2br1ar1246810SE +/- 0.03, N = 5SE +/- 0.04, N = 3SE +/- 0.03, N = 9SE +/- 0.06, N = 3SE +/- 0.09, N = 153.233.203.227.557.371. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: inception-v3r4r2b1224364860SE +/- 0.75, N = 12SE +/- 1.54, N = 352.2353.07MIN: 47.47 / MAX: 94.69MIN: 49.59 / MAX: 69.621. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: mobilenet-v1-1.0r4r2b0.75651.5132.26953.0263.7825SE +/- 0.021, N = 12SE +/- 0.089, N = 33.3623.213MIN: 2.98 / MAX: 6.66MIN: 2.8 / MAX: 6.71. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: MobileNetV2_224r4r2b0.92251.8452.76753.694.6125SE +/- 0.135, N = 12SE +/- 0.333, N = 34.1004.078MIN: 2.97 / MAX: 12.98MIN: 2.9 / MAX: 13.171. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: resnet-v2-50r4r2b1122334455SE +/- 1.07, N = 12SE +/- 2.59, N = 348.0448.73MIN: 42.13 / MAX: 145.2MIN: 43.19 / MAX: 69.591. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.3Model: SqueezeNetV1.0r4r2b246810SE +/- 0.078, N = 12SE +/- 0.002, N = 37.1707.174MIN: 6.38 / MAX: 9.97MIN: 6.95 / MAX: 7.881. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

MariaDB

Clients: 1

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 1r3r2b7001400210028003500SE +/- 61.33, N = 12SE +/- 73.97, N = 15345833361. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSr4r3r2br1ar150K100K150K200K250KSE +/- 2769.20, N = 3SE +/- 267.95, N = 3SE +/- 84.15, N = 3SE +/- 236.12, N = 3SE +/- 234.37, N = 32227472252912253432253662254121. (CC) gcc options: -pedantic -O3

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4Kr4r3r2br1a0.04280.08560.12840.17120.214SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 12SE +/- 0.00, N = 50.140.150.140.191. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

MariaDB

Clients: 32

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 32r3r2b2004006008001000SE +/- 1.83, N = 3SE +/- 0.26, N = 38878851. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 12.0Build System: Unix Makefilesr4r3r2br1ar150100150200250SE +/- 0.43, N = 3SE +/- 1.24, N = 3SE +/- 0.77, N = 3SE +/- 0.80, N = 3SE +/- 0.91, N = 3224.29226.20226.44215.76216.32

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080pr4r3r2br1a246810SE +/- 0.01, N = 3SE +/- 0.04, N = 5SE +/- 0.03, N = 3SE +/- 0.02, N = 33.363.363.306.891. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Orange Juice - Acceleration: CPUr4r3r2br1ar148121620SE +/- 0.13, N = 15SE +/- 0.12, N = 15SE +/- 0.18, N = 3SE +/- 0.21, N = 3SE +/- 0.13, N = 313.9413.8914.2814.2614.36MIN: 11.06 / MAX: 17.84MIN: 11.08 / MAX: 17.77MIN: 11.93 / MAX: 17.73MIN: 11.6 / MAX: 19.3MIN: 11.58 / MAX: 19.44

MariaDB

Clients: 16

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 16r3r2b30060090012001500SE +/- 3.49, N = 3SE +/- 1.85, N = 3126212641. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compiler4r3r2br1ar14080120160200SE +/- 1.56, N = 3SE +/- 0.31, N = 3SE +/- 1.08, N = 3SE +/- 0.37, N = 3SE +/- 0.18, N = 3193.84192.25191.75113.80114.55

LuxCoreRender

Scene: DLSC - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: DLSC - Acceleration: CPUr4r3r2br1ar13691215SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 15SE +/- 0.09, N = 15SE +/- 0.09, N = 39.259.249.279.619.70MIN: 8.59 / MAX: 11.4MIN: 8.74 / MAX: 11.37MIN: 8.31 / MAX: 11.98MIN: 8 / MAX: 12.27MIN: 8.98 / MAX: 12.22

Intel Memory Latency Checker

Test: Max Bandwidth - Stream-Triad Like

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - Stream-Triad Liker5r4r3r2br2ar1ar170K140K210K280K350KSE +/- 22.58, N = 3SE +/- 7.71, N = 3SE +/- 50.80, N = 3SE +/- 50.20, N = 3SE +/- 53.08, N = 3SE +/- 11.61, N = 3SE +/- 25.05, N = 3325312.30325314.62325218.50325409.99325260.41325184.58325766.94

Intel Memory Latency Checker

Test: Max Bandwidth - 1:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - 1:1 Reads-Writesr5r4r3r2br2ar1ar190K180K270K360K450KSE +/- 1051.98, N = 3SE +/- 2322.32, N = 3SE +/- 276.68, N = 3SE +/- 3117.58, N = 3SE +/- 1844.14, N = 3SE +/- 1093.30, N = 3SE +/- 821.19, N = 3440205.22440315.41440939.22441732.77442460.05441408.09439496.74

Intel Memory Latency Checker

Test: Max Bandwidth - 2:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - 2:1 Reads-Writesr5r4r3r2br2ar1ar1100K200K300K400K500KSE +/- 53.22, N = 3SE +/- 8.60, N = 3SE +/- 89.89, N = 3SE +/- 51.02, N = 3SE +/- 54.98, N = 3SE +/- 129.26, N = 3SE +/- 33.49, N = 3458756.46458790.96457141.24459226.53456545.88456629.89459455.38

Intel Memory Latency Checker

Test: Max Bandwidth - 3:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - 3:1 Reads-Writesr5r4r3r2br2ar1ar190K180K270K360K450KSE +/- 133.64, N = 3SE +/- 67.02, N = 3SE +/- 109.66, N = 3SE +/- 71.38, N = 3SE +/- 392.90, N = 3SE +/- 465.24, N = 3SE +/- 105.41, N = 3425467.51425848.09424925.84425997.22424818.83424612.62426148.96

Intel Memory Latency Checker

Test: Max Bandwidth - All Reads

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Max Bandwidth - All Readsr5r4r3r2br2ar1ar180K160K240K320K400KSE +/- 46.23, N = 3SE +/- 83.70, N = 3SE +/- 59.61, N = 3SE +/- 83.63, N = 3SE +/- 107.35, N = 3SE +/- 142.76, N = 3SE +/- 67.01, N = 3357550.82357925.98358268.00357774.43358456.09358364.56357285.28

MariaDB

Clients: 8

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 8r3r2b30060090012001500SE +/- 3.56, N = 3SE +/- 10.97, N = 3142014131. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 12.0Build System: Ninjar4r3r2br1ar1306090120150SE +/- 0.56, N = 3SE +/- 0.32, N = 3SE +/- 1.12, N = 3SE +/- 0.75, N = 3SE +/- 0.52, N = 3146.91147.16148.48145.55145.72

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4Kr4r3r2br1ar148121620SE +/- 0.01, N = 3SE +/- 0.07, N = 12SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 36.005.975.9715.1915.091. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar12004006008001000SE +/- 16.86, N = 14SE +/- 0.83, N = 3SE +/- 0.61, N = 3SE +/- 1.56, N = 3SE +/- 7.01, N = 3811.94793.92791.70793.36804.39MIN: 761.61MIN: 769MIN: 769.61MIN: 765.14MIN: 763.491. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

MariaDB

Clients: 4

OpenBenchmarking.orgQueries Per Second, More Is BetterMariaDB 10.5.2Clients: 4r3r2b30060090012001500SE +/- 7.20, N = 3SE +/- 16.07, N = 3158016141. (CXX) g++ options: -fPIC -pie -fstack-protector -O2 -shared -lpthread -lsnappy -ldl -lz -lrt

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4Kr4r3r2br1ar1714212835SE +/- 0.17, N = 3SE +/- 0.12, N = 15SE +/- 0.08, N = 15SE +/- 0.29, N = 5SE +/- 0.19, N = 312.1011.9412.0328.9929.201. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Timer4r3r2br1ar1100020003000400050004525.74504.54524.54642.84642.11. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CPU-Onlyr4r2b20406080100SE +/- 0.59, N = 3SE +/- 0.18, N = 3109.96110.02

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compiler4r3r2br1ar1306090120150SE +/- 0.78, N = 3SE +/- 0.68, N = 3SE +/- 0.50, N = 3SE +/- 0.29, N = 3SE +/- 0.27, N = 3111.67111.79110.93100.45101.10

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTr4r3r2br1ar120406080100SE +/- 2.94, N = 15SE +/- 2.33, N = 15SE +/- 1.75, N = 15SE +/- 0.90, N = 3SE +/- 1.45, N = 1363.761.754.777.276.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNr4r3r2br1ar120406080100SE +/- 2.43, N = 14SE +/- 1.88, N = 15SE +/- 2.02, N = 15SE +/- 0.69, N = 3SE +/- 1.67, N = 1367.666.962.377.476.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTr4r3r2br1ar120406080100SE +/- 1.98, N = 15SE +/- 1.99, N = 15SE +/- 1.14, N = 15SE +/- 1.01, N = 3SE +/- 1.88, N = 1372.468.959.876.875.61. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNr4r3r2br1ar11632486480SE +/- 1.95, N = 15SE +/- 2.18, N = 15SE +/- 2.06, N = 15SE +/- 3.11, N = 3SE +/- 1.42, N = 1470.866.461.972.373.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-Tr4r3r2br1ar1160320480640800SE +/- 3.20, N = 15SE +/- 2.02, N = 15SE +/- 27.49, N = 15SE +/- 5.04, N = 3SE +/- 2.46, N = 13647.0647.0389.9319.0719.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-Nr4r3r2br1ar11632486480SE +/- 0.25, N = 15SE +/- 3.93, N = 15SE +/- 3.75, N = 15SE +/- 2.90, N = 3SE +/- 0.36, N = 1470.264.362.363.672.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTr4r3r2br1ar1160320480640800SE +/- 2.76, N = 15SE +/- 50.57, N = 15SE +/- 34.40, N = 14SE +/- 34.44, N = 3SE +/- 6.43, N = 14765.00713.47447.65371.00720.001. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYr4r3r2br1ar12004006008001000SE +/- 5.62, N = 15SE +/- 82.34, N = 15SE +/- 40.80, N = 15SE +/- 23.02, N = 3SE +/- 20.63, N = 141158.01024.2507.1392.01058.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYr4r3r2br1ar12004006008001000SE +/- 9.73, N = 15SE +/- 26.97, N = 15SE +/- 35.11, N = 15SE +/- 29.90, N = 3SE +/- 25.47, N = 14936.0913.0422.2335.0843.01. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTr4r3r2br1ar1130260390520650SE +/- 2.45, N = 15SE +/- 2.55, N = 15SE +/- 5.60, N = 15SE +/- 11.67, N = 3SE +/- 2.34, N = 145355323492776201. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYr4r3r2br1ar12004006008001000SE +/- 11.35, N = 15SE +/- 8.11, N = 15SE +/- 10.36, N = 15SE +/- 15.25, N = 3SE +/- 6.62, N = 1485586247437010031. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYr4r3r2br1ar1400800120016002000SE +/- 54.62, N = 15SE +/- 51.32, N = 15SE +/- 22.07, N = 15SE +/- 4.10, N = 3SE +/- 16.63, N = 141167113569150418341. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Monero - Hash Count: 1Mr4r3r2br1ar14K8K12K16K20KSE +/- 243.31, N = 15SE +/- 245.77, N = 3SE +/- 151.73, N = 3SE +/- 20.55, N = 3SE +/- 23.28, N = 320574.620652.919311.119452.019299.51. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080pr4r3r2br1a510152025SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.17, N = 37.437.387.4521.251. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compiler4r3r2br1ar1714212835SE +/- 0.37, N = 14SE +/- 0.41, N = 14SE +/- 0.32, N = 14SE +/- 0.28, N = 4SE +/- 0.30, N = 428.0928.0228.0024.3624.38

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPUr4r2b50K100K150K200K250KSE +/- 269.51, N = 3SE +/- 247.29, N = 3214241.34214210.831. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CPU-Onlyr4r2b20406080100SE +/- 0.28, N = 3SE +/- 0.08, N = 388.6888.57

Timed Wasmer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Wasmer Compilation 1.0.2Time To Compiler4r3r2br1ar11632486480SE +/- 0.51, N = 3SE +/- 0.66, N = 7SE +/- 0.42, N = 3SE +/- 0.62, N = 3SE +/- 0.22, N = 370.7671.1371.9361.9362.161. (CC) gcc options: -m64 -pie -nodefaultlibs -ldl -lrt -lpthread -lgcc_s -lc -lm -lutil

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUr4r3r2br1ar12004006008001000SE +/- 2.67, N = 3SE +/- 1.09, N = 3SE +/- 9.76, N = 3SE +/- 4.49, N = 3SE +/- 7.46, N = 3792.30796.69808.29804.32801.41MIN: 763.96MIN: 771.28MIN: 767.97MIN: 765.37MIN: 767.381. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar12004006008001000SE +/- 1.96, N = 3SE +/- 2.18, N = 3SE +/- 1.48, N = 3SE +/- 3.65, N = 3SE +/- 2.07, N = 3792.05793.08789.84791.93792.83MIN: 765.9MIN: 768.2MIN: 767.03MIN: 765.01MIN: 763.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar10.28010.56020.84031.12041.4005SE +/- 0.00891, N = 15SE +/- 0.01066, N = 15SE +/- 0.01174, N = 15SE +/- 0.01126, N = 15SE +/- 0.01080, N = 151.241161.245081.237961.222781.21594MIN: 0.85MIN: 0.89MIN: 0.87MIN: 0.85MIN: 0.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUr4r3r2br1ar1100200300400500SE +/- 1.10, N = 3SE +/- 2.40, N = 3SE +/- 0.78, N = 3SE +/- 0.90, N = 3SE +/- 0.58, N = 3446.54450.65446.39447.31447.97MIN: 429.71MIN: 432.96MIN: 432.04MIN: 432.33MIN: 433.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar1100200300400500SE +/- 3.51, N = 3SE +/- 1.24, N = 3SE +/- 0.65, N = 3SE +/- 1.79, N = 3SE +/- 0.58, N = 3448.91447.14447.29446.94445.14MIN: 431.33MIN: 432.42MIN: 433.06MIN: 430.47MIN: 431.521. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar1100200300400500SE +/- 2.63, N = 3SE +/- 0.04, N = 3SE +/- 1.13, N = 3SE +/- 2.18, N = 3SE +/- 0.85, N = 3447.96446.92447.70447.44445.52MIN: 429.99MIN: 433.64MIN: 433.04MIN: 429.4MIN: 431.181. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4Kr4r3r2br1ar1816243240SE +/- 0.08, N = 3SE +/- 0.18, N = 4SE +/- 0.15, N = 15SE +/- 0.28, N = 3SE +/- 0.28, N = 314.7314.0614.3032.5133.071. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CPU-Onlyr4r2b1632486480SE +/- 0.13, N = 3SE +/- 0.08, N = 372.2971.78

KTX-Software toktx

Settings: UASTC 4 + Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 4 + Zstd Compression 19r4r2b1326395265SE +/- 0.74, N = 3SE +/- 0.68, N = 456.7756.66

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUr4r3r2br1ar1714212835SE +/- 0.38629, N = 12SE +/- 0.30585, N = 15SE +/- 0.31773, N = 13SE +/- 0.01835, N = 3SE +/- 0.02080, N = 328.4613028.1815028.402307.500597.49467MIN: 14.76MIN: 14.34MIN: 14.66MIN: 6.91MIN: 6.981. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Danish Mood - Acceleration: CPUr4r3r2br1ar1246810SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 35.685.655.737.557.42MIN: 1.26 / MAX: 7.6MIN: 1.24 / MAX: 7.63MIN: 1.3 / MAX: 7.65MIN: 3.28 / MAX: 8.86MIN: 3.2 / MAX: 8.74

LuxCoreRender

Scene: LuxCore Benchmark - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: LuxCore Benchmark - Acceleration: CPUr4r3r2br1ar1246810SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 35.875.925.848.047.84MIN: 1.15 / MAX: 7.95MIN: 1.15 / MAX: 7.98MIN: 1.16 / MAX: 7.97MIN: 3.51 / MAX: 9.33MIN: 3.44 / MAX: 9.2

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0r4r3r2br1ar11530456075SE +/- 0.68, N = 3SE +/- 0.20, N = 3SE +/- 0.22, N = 3SE +/- 0.24, N = 3SE +/- 0.21, N = 365.8965.9664.9757.7157.981. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080pr4r3r2br1a0.11480.22960.34440.45920.574SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.330.330.320.511. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

LuxCoreRender

Scene: Rainbow Colors and Prism - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.5Scene: Rainbow Colors and Prism - Acceleration: CPUr4r3r2br1ar148121620SE +/- 0.79, N = 12SE +/- 1.13, N = 12SE +/- 0.87, N = 13SE +/- 0.47, N = 15SE +/- 1.05, N = 1514.7916.4713.4213.3417.04MIN: 9.85 / MAX: 20.95MIN: 10.39 / MAX: 21.43MIN: 8.28 / MAX: 21.15MIN: 10.32 / MAX: 17.45MIN: 11.27 / MAX: 22.05

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080pr4r3r2br1a714212835SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 310.5410.3910.3928.661. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-Onlyr4r2b1122334455SE +/- 0.25, N = 3SE +/- 0.15, N = 346.7346.38

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar10.67711.35422.03132.70843.3855SE +/- 0.02449, N = 14SE +/- 0.02478, N = 14SE +/- 0.02287, N = 13SE +/- 0.00276, N = 3SE +/- 0.00128, N = 33.009073.009293.004642.968572.96135MIN: 2.84MIN: 2.84MIN: 2.84MIN: 2.84MIN: 2.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

VOSK Speech Recognition Toolkit

OpenBenchmarking.orgSeconds, Fewer Is BetterVOSK Speech Recognition Toolkit 0.3.21r4r3r2br1ar1816243240SE +/- 0.32, N = 3SE +/- 0.43, N = 3SE +/- 0.43, N = 3SE +/- 0.29, N = 8SE +/- 0.32, N = 335.5035.5836.4235.0135.92

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Timer4r3r2br1ar140M80M120M160M200MSE +/- 2183262.34, N = 4SE +/- 1924842.52, N = 3SE +/- 1982639.48, N = 3SE +/- 2404481.41, N = 3SE +/- 1585265.68, N = 151860132611892144991815542181862635521816448191. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Losslessr4r3r2br1ar1918273645SE +/- 0.36, N = 6SE +/- 0.35, N = 3SE +/- 0.24, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 338.5138.5938.4031.6232.111. (CXX) g++ options: -O3 -fPIC -lm

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgUE Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Testr4r3r2br1ar120406080100SE +/- 0.62, N = 3SE +/- 1.14, N = 3SE +/- 0.38, N = 3SE +/- 1.16, N = 3SE +/- 0.76, N = 378.376.175.077.376.91. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

srsLTE

Test: PHY_DL_Test

OpenBenchmarking.orgeNb Mb/s, More Is BettersrsLTE 20.10.1Test: PHY_DL_Testr4r3r2br1ar14080120160200SE +/- 0.58, N = 3SE +/- 2.42, N = 3SE +/- 1.23, N = 3SE +/- 0.36, N = 3SE +/- 1.15, N = 3183.7181.6181.6184.2183.41. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6r4r3r2br1ar148121620SE +/- 0.12, N = 15SE +/- 0.13, N = 15SE +/- 0.23, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 316.2116.6216.0713.3313.251. (CXX) g++ options: -O3 -fPIC -lm

srsLTE

Test: OFDM_Test

OpenBenchmarking.orgSamples / Second, More Is BettersrsLTE 20.10.1Test: OFDM_Testr4r3r2br1ar130M60M90M120M150MSE +/- 233333.33, N = 3SE +/- 600925.21, N = 3SE +/- 366666.67, N = 3SE +/- 240370.09, N = 3SE +/- 611010.09, N = 31206666671208333331207333331201333331203000001. (CXX) g++ options: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -mavx512f -mavx512cd -mavx512bw -mavx512dq -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memoryr4r2b3K6K9K12K15KSE +/- 118.72, N = 15SE +/- 125.16, N = 1512553.4412510.561. (CC) gcc options: -pthread -O2 -funroll-loops -rdynamic -ldl -laio -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2r4r3r2br1ar1918273645SE +/- 0.08, N = 3SE +/- 0.20, N = 3SE +/- 0.40, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 337.8038.3138.3731.4831.541. (CXX) g++ options: -O3 -fPIC -lm

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decryptr4r3r2br1ar112002400360048006000SE +/- 12.66, N = 3SE +/- 1.10, N = 3SE +/- 0.94, N = 3SE +/- 0.12, N = 3SE +/- 1.20, N = 35650.145662.345662.765663.615663.061. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256r4r3r2br1ar112002400360048006000SE +/- 51.03, N = 3SE +/- 42.23, N = 3SE +/- 55.60, N = 3SE +/- 0.28, N = 3SE +/- 0.92, N = 35612.005593.375606.975670.815669.701. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: ETC1Sr4r2b816243240SE +/- 0.42, N = 3SE +/- 0.21, N = 334.4234.241. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 0r4r2b3691215SE +/- 0.08, N = 3SE +/- 0.08, N = 1511.2311.251. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Losslessr4r3r2br1ar13691215SE +/- 0.157, N = 15SE +/- 0.130, N = 15SE +/- 0.154, N = 15SE +/- 0.016, N = 3SE +/- 0.036, N = 310.20810.08810.2828.8128.8521. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080pr4r3r2br1a306090120150SE +/- 0.28, N = 3SE +/- 0.31, N = 15SE +/- 0.49, N = 3SE +/- 0.82, N = 1542.3743.4243.26125.251. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-Onlyr4r2b714212835SE +/- 0.32, N = 3SE +/- 0.08, N = 329.6929.56

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decryptr4r3r2br1ar1130260390520650SE +/- 2.81, N = 3SE +/- 3.74, N = 3SE +/- 3.49, N = 3SE +/- 0.57, N = 3SE +/- 0.40, N = 3615.98612.15612.44619.54619.461. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305r4r3r2br1ar1130260390520650SE +/- 2.98, N = 3SE +/- 3.19, N = 3SE +/- 3.48, N = 3SE +/- 0.17, N = 3SE +/- 0.03, N = 3619.64616.50615.81623.20623.491. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decryptr4r3r2br1ar180160240320400SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3363.28363.31363.20363.33363.261. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfishr4r3r2br1ar180160240320400SE +/- 3.51, N = 3SE +/- 3.73, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.56, N = 3359.57359.45362.93363.62363.041. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decryptr4r3r2br1ar160120180240300SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 3292.61292.83292.40292.37292.741. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofishr4r3r2br1ar160120180240300SE +/- 2.83, N = 3SE +/- 2.66, N = 3SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.14, N = 3286.00286.18288.56288.85289.131. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decryptr4r3r2br1ar1306090120150SE +/- 0.01, N = 3SE +/- 0.35, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3116.07115.72116.08116.07116.071. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256r4r3r2br1ar1306090120150SE +/- 1.17, N = 3SE +/- 1.33, N = 3SE +/- 1.15, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3114.65114.52114.66115.97115.971. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decryptr4r3r2br1ar120406080100SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 374.2974.3174.2874.2974.321. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIr4r3r2br1ar120406080100SE +/- 0.87, N = 3SE +/- 0.77, N = 3SE +/- 1.01, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 376.4076.4176.2977.3177.291. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBr4r3r2br1ar115003000450060007500SE +/- 81.70, N = 15SE +/- 69.20, N = 15SE +/- 73.83, N = 15SE +/- 80.68, N = 3SE +/- 59.06, N = 15701670036984696468501. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.12.1Variant: Wownero - Hash Count: 1Mr4r3r2br1ar111K22K33K44K55KSE +/- 235.04, N = 3SE +/- 358.18, N = 3SE +/- 238.38, N = 3SE +/- 588.34, N = 3SE +/- 425.40, N = 749937.349813.449908.350166.148051.51. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - Stream-Triad Like

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - Stream-Triad Liker5r4r3r2br2ar1ar170K140K210K280K350KSE +/- 55.81, N = 3SE +/- 60.42, N = 3SE +/- 32.03, N = 3SE +/- 12.95, N = 3SE +/- 34.05, N = 3SE +/- 38.10, N = 3SE +/- 177.93, N = 3324234.5324112.8324227.4324209.8323826.9323924.2324377.2

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 1:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - 1:1 Reads-Writesr5r4r3r2br2ar1ar1100K200K300K400K500KSE +/- 847.23, N = 3SE +/- 1601.80, N = 3SE +/- 138.13, N = 3SE +/- 314.54, N = 3SE +/- 212.40, N = 3SE +/- 148.63, N = 3SE +/- 1187.16, N = 3448800.1446396.0449554.1440454.7442144.2442843.2442422.3

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 2:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - 2:1 Reads-Writesr5r4r3r2br2ar1ar1100K200K300K400K500KSE +/- 12.06, N = 3SE +/- 36.24, N = 3SE +/- 73.04, N = 3SE +/- 64.32, N = 3SE +/- 115.55, N = 3SE +/- 130.28, N = 3SE +/- 274.15, N = 3458830.6458941.9457190.5459309.8456408.6456260.3459038.6

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - 3:1 Reads-Writes

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - 3:1 Reads-Writesr5r4r3r2br2ar1ar190K180K270K360K450KSE +/- 23.30, N = 3SE +/- 23.30, N = 3SE +/- 88.34, N = 3SE +/- 25.04, N = 3SE +/- 236.99, N = 3SE +/- 94.95, N = 3SE +/- 163.24, N = 3425508.1425822.1424904.5425925.6424077.3424096.6425933.7

Intel Memory Latency Checker

Test: Peak Injection Bandwidth - All Reads

OpenBenchmarking.orgMB/s, More Is BetterIntel Memory Latency CheckerTest: Peak Injection Bandwidth - All Readsr5r4r3r2br2ar1ar180K160K240K320K400KSE +/- 23.85, N = 3SE +/- 26.62, N = 3SE +/- 24.95, N = 3SE +/- 14.54, N = 3SE +/- 37.47, N = 3SE +/- 14.58, N = 3SE +/- 709.43, N = 3357722.7358110.5358463.7357742.9358269.7358385.5356476.2

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar10.07690.15380.23070.30760.3845SE +/- 0.004121, N = 3SE +/- 0.003372, N = 6SE +/- 0.003448, N = 5SE +/- 0.002562, N = 3SE +/- 0.000853, N = 30.3402430.3419550.3418930.3416630.338327MIN: 0.3MIN: 0.31MIN: 0.3MIN: 0.31MIN: 0.31. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar10.04880.09760.14640.19520.244SE +/- 0.001544, N = 12SE +/- 0.002019, N = 7SE +/- 0.001893, N = 8SE +/- 0.000781, N = 3SE +/- 0.000867, N = 30.2150850.2165860.2168060.2136430.215115MIN: 0.19MIN: 0.19MIN: 0.19MIN: 0.19MIN: 0.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Helsing

Digit Range: 14 digit

OpenBenchmarking.orgSeconds, Fewer Is BetterHelsing 1.0-betaDigit Range: 14 digitr4r3r2br1ar12040608010078.5478.0878.3378.1677.871. (CC) gcc options: -O2 -pthread -lcrypto

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.1.0Test: Decompression Throughputr4r3r2br1ar14080120160200SE +/- 0.47, N = 3SE +/- 1.04, N = 3SE +/- 0.07, N = 3SE +/- 0.39, N = 3SE +/- 0.15, N = 3159.24159.19160.26156.97161.631. (CC) gcc options: -O3 -rdynamic

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Thoroughr4r2b3691215SE +/- 0.0879, N = 7SE +/- 0.0796, N = 89.30919.29071. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Exhaustiver4r2b48121620SE +/- 0.02, N = 3SE +/- 0.00, N = 316.3716.361. (CXX) g++ options: -O3 -flto -pthread

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10r4r3r2br1ar1246810SE +/- 0.130, N = 15SE +/- 0.145, N = 15SE +/- 0.116, N = 15SE +/- 0.014, N = 3SE +/- 0.038, N = 36.7466.5976.6565.5055.4771. (CXX) g++ options: -O3 -fPIC -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: Mediumr4r2b246810SE +/- 0.0290, N = 3SE +/- 0.0906, N = 157.14727.18871. (CXX) g++ options: -O3 -flto -pthread

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compiler4r3r2br1ar1510152025SE +/- 0.11, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 321.3121.3721.5820.3820.95

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar10.80151.6032.40453.2064.0075SE +/- 0.00650, N = 3SE +/- 0.01280, N = 3SE +/- 0.00854, N = 3SE +/- 0.00732, N = 3SE +/- 0.00193, N = 33.547833.562243.531213.543673.53026MIN: 3.37MIN: 3.39MIN: 3.37MIN: 3.38MIN: 3.381. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar10.09150.1830.27450.3660.4575SE +/- 0.002415, N = 14SE +/- 0.003204, N = 10SE +/- 0.004259, N = 4SE +/- 0.001124, N = 3SE +/- 0.001135, N = 30.4029190.4068770.4034090.3955880.398282MIN: 0.36MIN: 0.37MIN: 0.36MIN: 0.36MIN: 0.371. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pr4r3r2br1ar1918273645SE +/- 0.31, N = 3SE +/- 0.14, N = 3SE +/- 0.09, N = 3SE +/- 0.24, N = 3SE +/- 0.29, N = 328.0128.2227.8037.3436.911. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080pr4r3r2br1a20406080100SE +/- 0.27, N = 3SE +/- 0.26, N = 3SE +/- 0.19, N = 3SE +/- 1.01, N = 1536.3536.0636.20103.921. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Liquid-DSP

Threads: 160 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 160 - Buffer Length: 256 - Filter Length: 57r4r3r2br1ar1700M1400M2100M2800M3500MSE +/- 16411005.79, N = 3SE +/- 14901789.60, N = 3SE +/- 14685858.66, N = 3SE +/- 2062630.47, N = 3SE +/- 17047384.94, N = 3314026666731433000003131866667316206666731448000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57r4r3r2br1ar1700M1400M2100M2800M3500MSE +/- 16537936.19, N = 3SE +/- 6896617.53, N = 3SE +/- 14312737.14, N = 3SE +/- 38975091.76, N = 3SE +/- 8088331.79, N = 3339880000034110000003400066667335273333334159333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57r4r3r2br1ar1700M1400M2100M2800M3500MSE +/- 12876378.03, N = 3SE +/- 14893734.70, N = 3SE +/- 17049079.48, N = 3SE +/- 2150193.79, N = 3SE +/- 5206513.02, N = 3324566666732327000003227433333326370000032671333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57r4r3r2br1ar1400M800M1200M1600M2000MSE +/- 6582552.70, N = 3SE +/- 10121648.97, N = 3SE +/- 2515949.13, N = 3SE +/- 3951371.07, N = 3169750000017045000001699333333173680000017351000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57r4r3r2br1ar1200M400M600M800M1000MSE +/- 10609570.10, N = 3SE +/- 859903.10, N = 3SE +/- 3620722.76, N = 3SE +/- 669162.00, N = 3SE +/- 691953.76, N = 38600466678654100008628900008902733338853200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57r4r3r2br190M180M270M360M450MSE +/- 2739929.03, N = 3SE +/- 1240739.03, N = 3SE +/- 2458908.97, N = 3SE +/- 422150.58, N = 34320133334321700004281000004419533331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 57r4r3r2br150M100M150M200M250MSE +/- 1956802.95, N = 3SE +/- 1663583.82, N = 3SE +/- 824809.74, N = 3SE +/- 1090112.12, N = 32167733332153433332132033332176433331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 57r4r3r2br120M40M60M80M100MSE +/- 132035.35, N = 3SE +/- 430348.70, N = 3SE +/- 907677.13, N = 3SE +/- 729984.78, N = 31094300001115100001101733331107133331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57r4r3r2br112M24M36M48M60MSE +/- 534784.17, N = 3SE +/- 550708.74, N = 3SE +/- 613156.95, N = 3SE +/- 173700.89, N = 3552516675719766756230333577920001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

KTX-Software toktx

Settings: Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 19r4r2b510152025SE +/- 0.20, N = 3SE +/- 0.22, N = 320.0819.78

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 3r4r2b48121620SE +/- 0.01, N = 3SE +/- 0.02, N = 317.1917.161. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUr4r3r2br1ar10.05470.10940.16410.21880.2735SE +/- 0.002245, N = 7SE +/- 0.002507, N = 5SE +/- 0.003187, N = 3SE +/- 0.000662, N = 3SE +/- 0.000856, N = 30.2424500.2433080.2430260.2401220.239989MIN: 0.22MIN: 0.22MIN: 0.22MIN: 0.23MIN: 0.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pr4r3r2br1ar190180270360450SE +/- 0.65, N = 3SE +/- 1.57, N = 3SE +/- 4.05, N = 12SE +/- 16.03, N = 12SE +/- 15.40, N = 12184.07185.53182.26393.46386.291. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

KTX-Software toktx

Settings: UASTC 3

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3r4r2b1.27442.54883.82325.09766.372SE +/- 0.008, N = 3SE +/- 0.053, N = 155.5625.664

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUr4r3r2br1ar10.2820.5640.8461.1281.41SE +/- 0.01282, N = 3SE +/- 0.01211, N = 3SE +/- 0.00964, N = 3SE +/- 0.01592, N = 15SE +/- 0.00180, N = 31.242221.241761.253131.252671.24809MIN: 1.19MIN: 1.18MIN: 1.2MIN: 1.19MIN: 1.21. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Intel Memory Latency Checker

Test: Idle Latency

OpenBenchmarking.orgns, Fewer Is BetterIntel Memory Latency CheckerTest: Idle Latencyr5r4r3r2ar2r1ar11530456075SE +/- 0.09, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.28, N = 8SE +/- 0.09, N = 3SE +/- 0.39, N = 3SE +/- 0.10, N = 368.167.867.632.567.533.035.1

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUr4r3r2br1ar10.21230.42460.63690.84921.0615SE +/- 0.008450, N = 3SE +/- 0.007264, N = 3SE +/- 0.011253, N = 3SE +/- 0.002111, N = 3SE +/- 0.002101, N = 30.9407140.9369410.9436240.9122790.918568MIN: 0.86MIN: 0.85MIN: 0.86MIN: 0.86MIN: 0.851. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.13Settings: UASTC Level 2r4r2b48121620SE +/- 0.15, N = 3SE +/- 0.18, N = 314.1613.981. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Directionr4r3r2br1ar148121620SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.02, N = 314.6614.6011.5611.2711.361. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

KTX-Software toktx

Settings: UASTC 3 + Zstd Compression 19

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: UASTC 3 + Zstd Compression 19r4r2b3691215SE +/- 0.11, N = 5SE +/- 0.06, N = 310.0310.01

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar10.04910.09820.14730.19640.2455SE +/- 0.004970, N = 15SE +/- 0.003384, N = 15SE +/- 0.004449, N = 15SE +/- 0.001109, N = 3SE +/- 0.002205, N = 150.2179410.2183490.2103240.2107280.210919MIN: 0.19MIN: 0.19MIN: 0.18MIN: 0.2MIN: 0.191. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar10.13550.2710.40650.5420.6775SE +/- 0.003648, N = 3SE +/- 0.004400, N = 3SE +/- 0.004180, N = 3SE +/- 0.000780, N = 3SE +/- 0.001703, N = 30.6020380.6023140.6021220.5956610.593042MIN: 0.56MIN: 0.56MIN: 0.56MIN: 0.56MIN: 0.561. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Directionr4r3r2br1ar10.80391.60782.41173.21564.0195SE +/- 0.02850005, N = 15SE +/- 0.03072276, N = 15SE +/- 0.02799890, N = 3SE +/- 0.01532048, N = 3SE +/- 0.00774937, N = 33.572781533.565927743.022819922.738590962.743709961. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

KTX-Software toktx

Settings: Zstd Compression 9

OpenBenchmarking.orgSeconds, Fewer Is BetterKTX-Software toktx 4.0Settings: Zstd Compression 9r4r2b0.83181.66362.49543.32724.159SE +/- 0.064, N = 15SE +/- 0.003, N = 33.6973.470

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar10.81971.63942.45913.27884.0985SE +/- 0.05617, N = 14SE +/- 0.05675, N = 14SE +/- 0.05421, N = 14SE +/- 0.00795, N = 3SE +/- 0.00924, N = 33.643193.640333.642323.576623.57247MIN: 3.5MIN: 3.47MIN: 3.51MIN: 3.5MIN: 3.531. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUr4r3r2br1ar10.19720.39440.59160.78880.986SE +/- 0.007461, N = 14SE +/- 0.007890, N = 14SE +/- 0.008361, N = 14SE +/- 0.002055, N = 3SE +/- 0.002419, N = 30.8762270.8749680.8740800.8632140.864164MIN: 0.84MIN: 0.84MIN: 0.83MIN: 0.84MIN: 0.841. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasksr4r3r2br1ar12K4K6K8K10KSE +/- 85.46, N = 4SE +/- 93.55, N = 4SE +/- 102.03, N = 3SE +/- 80.44, N = 4SE +/- 43.45, N = 3803780488050772478791. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

Google Draco

Model: Church Facade

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Church Facader4r2b15003000450060007500SE +/- 3.33, N = 3SE +/- 20.01, N = 3708270011. (CXX) g++ options: -O3

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar10.41480.82961.24441.65922.074SE +/- 0.00968, N = 3SE +/- 0.02043, N = 3SE +/- 0.01382, N = 3SE +/- 0.00121, N = 3SE +/- 0.00580, N = 31.819131.843391.817741.798811.80046MIN: 1.68MIN: 1.67MIN: 1.69MIN: 1.69MIN: 1.681. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

Google Draco

Model: Lion

OpenBenchmarking.orgms, Fewer Is BetterGoogle Draco 1.4.1Model: Lionr4r2b13002600390052006500SE +/- 21.15, N = 3SE +/- 25.21, N = 3617061261. (CXX) g++ options: -O3

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPr4r3r2br1ar116003200480064008000SE +/- 91.12, N = 4SE +/- 85.45, N = 4SE +/- 101.59, N = 3SE +/- 0.88, N = 3SE +/- 5.13, N = 3742974397412730873181. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threadsr4r3r2br1ar115003000450060007500SE +/- 76.94, N = 4SE +/- 98.76, N = 3SE +/- 89.67, N = 3SE +/- 29.96, N = 3SE +/- 49.12, N = 3714172037149698070181. (CXX) g++ options: -O3 -lpthread -lm -lgcc -lgcc_s -lc

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pr4r3r2br1ar170140210280350SE +/- 1.59, N = 3SE +/- 1.63, N = 3SE +/- 1.13, N = 3SE +/- 1.10, N = 3SE +/- 1.20, N = 3162.21164.51164.32329.53327.871. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pr4r3r2br1ar190180270360450SE +/- 0.47, N = 3SE +/- 2.25, N = 3SE +/- 0.90, N = 3SE +/- 0.66, N = 3SE +/- 1.44, N = 3179.13181.52182.17408.24401.291. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUr4r3r2br1ar10.25780.51560.77341.03121.289SE +/- 0.01182, N = 3SE +/- 0.00975, N = 3SE +/- 0.00330, N = 3SE +/- 0.00124, N = 3SE +/- 0.00274, N = 31.118111.145781.118741.122241.10991MIN: 1.02MIN: 1.04MIN: 1.02MIN: 1.02MIN: 1.021. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUr4r3r2br1ar10.20290.40580.60870.81161.0145SE +/- 0.005244, N = 3SE +/- 0.006631, N = 3SE +/- 0.004902, N = 3SE +/- 0.003986, N = 3SE +/- 0.006225, N = 30.8754210.9018230.8699780.8791370.877815MIN: 0.82MIN: 0.84MIN: 0.82MIN: 0.83MIN: 0.821. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPUr4r3r2br1ar10.47640.95281.42921.90562.382SE +/- 0.01801, N = 3SE +/- 0.01943, N = 3SE +/- 0.01980, N = 3SE +/- 0.00168, N = 3SE +/- 0.00138, N = 32.108372.108412.117122.085322.07944MIN: 2.03MIN: 2.03MIN: 2.03MIN: 2.03MIN: 2.031. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pr4r3r2br1ar1110220330440550SE +/- 1.14, N = 3SE +/- 1.80, N = 10SE +/- 2.64, N = 4SE +/- 4.78, N = 3SE +/- 3.80, N = 3233.96234.39234.51493.51499.231. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pr4r3r2br1ar160120180240300SE +/- 1.22, N = 3SE +/- 1.64, N = 3SE +/- 1.76, N = 5SE +/- 1.37, N = 3SE +/- 1.68, N = 3156.26157.83158.16288.99290.671. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.5