EPYC 7702 April 2021

AMD EPYC 7702 64-Core testing with an ASRockRack EPYCD8 (P2.40 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104043-IB-EPYC7702A33&grs&sro.

EPYC 7702 April 2021: system details (runs 1, 2, 3)

Processor: AMD EPYC 7702 64-Core @ 2.00GHz (64 Cores / 128 Threads)
Motherboard: ASRockRack EPYCD8 (P2.40 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Monitor: VE228
Network: 2 x Intel I350
OS: Ubuntu 20.04
Kernel: 5.9.0-050900rc6daily20200921-generic (x86_64) 20200920
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034

Python Details: Python 3.8.2

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7702 April 2021: results overview table for runs 1, 2 and 3 (flattened in this export). The per-test results are listed in the sections that follow.

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 238.14 / 361.00 / 360.74
SE +/- 0.55, N = 3; SE +/- 0.35, N = 3
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
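
The gap between run 1 and the other two runs in the SVT-VP9 VMAF Optimized result is easier to judge as a percentage. A minimal Python sketch, assuming the three values reported for that test (238.14, 361.00, 360.74 frames per second); the helper name `percent_vs_best` is hypothetical and not part of the Phoronix Test Suite:

```python
# Values copied from the SVT-VP9 "VMAF Optimized" result (frames per second).
results_fps = {"1": 238.14, "2": 361.00, "3": 360.74}


def percent_vs_best(results, higher_is_better=True):
    """Return each run's percentage gap to the best run (0.0 for the best run)."""
    values = results.values()
    best = max(values) if higher_is_better else min(values)
    return {run: (value - best) / best * 100.0 for run, value in results.items()}


gaps = percent_vs_best(results_fps)
for run, gap in gaps.items():
    print(f"run {run}: {gap:+.2f}% vs best")
```

By this measure run 1 lands roughly 34% below the best run, while runs 2 and 3 are within 0.1% of each other. Pass `higher_is_better=False` for the "Fewer Is Better" tests, where a positive gap then means a slower run.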

Timed Linux Kernel Compilation

Time To Compile

Timed Linux Kernel Compilation 5.10.20: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 33.55 / 31.16 / 31.16
SE +/- 0.28, N = 10; SE +/- 0.28, N = 10

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 58.37 / 54.47 / 54.32
SE +/- 0.25, N = 3; SE +/- 0.38, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 5.54909182 / 5.96101077 / 5.68558963
SE +/- 0.08892473, N = 3; SE +/- 0.08476458, N = 3
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1: GB/s, More Is Better
Runs 1 / 2 / 3: 636 / 620 / 653
SE +/- 14.00, N = 3; SE +/- 7.86, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 8 - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 2510.0 / 2441.1 / 2386.1
SE +/- 26.18, N = 3; SE +/- 11.70, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

toyBrot Fractal Generator

Implementation: TBB

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Runs 1 / 2 / 3: 7471 / 7244 / 7119
SE +/- 35.23, N = 3; SE +/- 12.41, N = 3
1. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 0.911802 / 0.872769 / 0.893830
MIN (runs 1 / 2 / 3): 0.81 / 0.8 / 0.81
SE +/- 0.011992, N = 4; SE +/- 0.007517, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 6.72 / 6.98 / 6.79
SE +/- 0.01, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 0.433838 / 0.446141 / 0.450310
MIN (runs 1 / 2 / 3): 0.39 / 0.38 / 0.39
SE +/- 0.001158, N = 3; SE +/- 0.001517, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1: GB/s, More Is Better
Runs 1 / 2 / 3: 1320 / 1283 / 1327
SE +/- 41.77, N = 3; SE +/- 3.33, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

Test: Five Back to Back FIR Filters

GNU Radio: MiB/s, More Is Better
Runs 1 / 2 / 3: 331.7 / 323.0 / 322.6
SE +/- 3.91, N = 3; SE +/- 3.89, N = 3
1. 3.8.1.0

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 22.22 / 22.18 / 21.67
SE +/- 0.20, N = 3; SE +/- 0.30, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 67.34 / 65.79 / 66.10
SE +/- 0.53, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 279.10 / 273.59 / 277.87
SE +/- 2.47, N = 3; SE +/- 3.78, N = 3
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 2081.53 / 2085.38 / 2122.66
MIN (runs 1 / 2 / 3): 2065.05 / 2066.13 / 2083.97
SE +/- 3.09, N = 3; SE +/- 10.59, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

GNU Radio

Test: FM Deemphasis Filter

GNU Radio: MiB/s, More Is Better
Runs 1 / 2 / 3: 744.0 / 734.1 / 730.1
SE +/- 2.41, N = 3; SE +/- 5.82, N = 3
1. 3.8.1.0

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 0.998801 / 0.980323 / 0.986806
MIN (runs 1 / 2 / 3): 0.94 / 0.94 / 0.9
SE +/- 0.001130, N = 3; SE +/- 0.001469, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1: GFLOPs/s, More Is Better
Runs 1 / 2 / 3: 93.0 / 91.4 / 92.5
SE +/- 0.35, N = 3; SE +/- 0.17, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 19 - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 83.5 / 82.1 / 82.3
SE +/- 0.44, N = 3; SE +/- 1.26, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1: GB/s, More Is Better
Runs 1 / 2 / 3: 1390 / 1367 / 1383
SE +/- 14.53, N = 3; SE +/- 8.82, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

libavif avifenc

Encoder Speed: 10

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 3.712 / 3.754 / 3.773
SE +/- 0.007, N = 3; SE +/- 0.001, N = 3
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 1.71993 / 1.74774 / 1.73568
MIN (runs 1 / 2 / 3): 1.64 / 1.65 / 1.65
SE +/- 0.00765, N = 3; SE +/- 0.00100, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

GNU Radio

Test: Signal Source (Cosine)

GNU Radio: MiB/s, More Is Better
Runs 1 / 2 / 3: 2913.8 / 2868.2 / 2879.4
SE +/- 4.36, N = 3; SE +/- 27.86, N = 3
1. 3.8.1.0

Stockfish

Total Time

Stockfish 13: Nodes Per Second, More Is Better
Runs 1 / 2 / 3: 134641813 / 133169802 / 132630158
SE +/- 1363379.57, N = 8; SE +/- 1858448.97, N = 4
1. (CXX) g++ options: -fprofile-use -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 410.5 / 404.5 / 408.0
SE +/- 6.05, N = 3; SE +/- 3.33, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1: GB/s, More Is Better
Runs 1 / 2 / 3: 757 / 764 / 753
SE +/- 11.85, N = 3; SE +/- 5.84, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 277.91 / 276.07 / 279.74
SE +/- 0.37, N = 3; SE +/- 1.54, N = 3
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1: GB/s, More Is Better
Runs 1 / 2 / 3: 687 / 679 / 688
SE +/- 6.89, N = 3; SE +/- 0.58, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

Xcompact3d Incompact3d 2021-03-11: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 25.88 / 25.55 / 25.61
SE +/- 0.22, N = 3; SE +/- 0.28, N = 3
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 11.89 / 11.74 / 11.74
SE +/- 0.04, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Runs 1 / 2 / 3: 7848 / 7941 / 7921
SE +/- 18.59, N = 3; SE +/- 13.93, N = 3
1. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 16.22 / 16.09 / 16.04
SE +/- 0.03, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 1.48314 / 1.47048 / 1.46707
MIN (runs 1 / 2 / 3): 1.24 / 1.21 / 1.21
SE +/- 0.00614, N = 3; SE +/- 0.00090, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 25.94 / 25.71 / 25.99
SE +/- 0.21, N = 3; SE +/- 0.12, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Runs 1 / 2 / 3: 7207 / 7278 / 7258
SE +/- 11.93, N = 3; SE +/- 30.64, N = 3
1. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 41.5 / 41.1 / 41.2
SE +/- 0.09, N = 3; SE +/- 0.23, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 1.27985 / 1.27544 / 1.28764
MIN (runs 1 / 2 / 3): 1.22 / 1.22 / 1.22
SE +/- 0.00139, N = 3; SE +/- 0.01102, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1: GB/s, More Is Better
Runs 1 / 2 / 3: 896 / 904 / 904
SE +/- 6.84, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1: GFLOPs/s, More Is Better
Runs 1 / 2 / 3: 90.5 / 89.7 / 89.8
SE +/- 0.15, N = 3; SE +/- 0.15, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 2.92: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 38.74 / 38.88 / 39.04
SE +/- 0.11, N = 3; SE +/- 0.24, N = 3

Zstd Compression

Compression Level: 3 - Compression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 5279.4 / 5243.6 / 5250.3
SE +/- 2.38, N = 3; SE +/- 3.83, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 6.767 / 6.796 / 6.809
SE +/- 0.010, N = 3; SE +/- 0.017, N = 3
1. (CXX) g++ options: -O3 -fPIC -lm

GNU Radio

Test: IIR Filter

GNU Radio: MiB/s, More Is Better
Runs 1 / 2 / 3: 486.7 / 489.7 / 488.3
SE +/- 1.77, N = 3; SE +/- 0.90, N = 3
1. 3.8.1.0

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 362.92 / 361.43 / 363.64
SE +/- 1.33, N = 3; SE +/- 3.53, N = 3
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2: GB/s, More Is Better
Runs 1 / 2 / 3: 3.55 / 3.53 / 3.53
SE +/- 0.01, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2: GB/s, More Is Better
Runs 1 / 2 / 3: 3.59 / 3.61 / 3.59
SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -march=native -pthread

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 2.92: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 100.03 / 100.58 / 100.08
SE +/- 0.60, N = 3; SE +/- 0.04, N = 3

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 1.22494 / 1.22224 / 1.21841
MIN (runs 1 / 2 / 3): 1.15 / 1.13 / 1.12
SE +/- 0.00091, N = 3; SE +/- 0.00239, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 2082.99 / 2085.08 / 2094.12
MIN (runs 1 / 2 / 3): 2067.76 / 2067.79 / 2069.05
SE +/- 1.02, N = 3; SE +/- 6.39, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

Blender 2.92: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 116.52 / 117.14 / 116.66
SE +/- 0.36, N = 3; SE +/- 0.22, N = 3

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1: GFLOPs/s, More Is Better
Runs 1 / 2 / 3: 95.1 / 94.6 / 94.7
SE +/- 0.03, N = 3; SE +/- 0.15, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU Radio

Test: FIR Filter

GNU Radio: MiB/s, More Is Better
Runs 1 / 2 / 3: 534.5 / 537.3 / 535.1
SE +/- 0.95, N = 3; SE +/- 0.73, N = 3
1. 3.8.1.0

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 34.41 / 34.59 / 34.45
SE +/- 0.08, N = 3; SE +/- 0.05, N = 3
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 5.79 / 5.77 / 5.76
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

simdjson

Throughput Test: Kostya

simdjson 0.8.2: GB/s, More Is Better
Runs 1 / 2 / 3: 2.12 / 2.13 / 2.13
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -march=native -pthread

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 462.61 / 460.86 / 462.91
SE +/- 2.49, N = 3; SE +/- 3.83, N = 3
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 10.44 / 10.47 / 10.49
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -fPIC -lm

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1: GFLOPs/s, More Is Better
Runs 1 / 2 / 3: 92.9 / 92.9 / 92.5
SE +/- 0.20, N = 2; SE +/- 0.30, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 2089.27 / 2098.23 / 2095.26
MIN (runs 1 / 2 / 3): 2067.67 / 2065.31 / 2072.7
SE +/- 11.83, N = 3; SE +/- 3.90, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31: samples/s, More Is Better
Runs 1 / 2 / 3: 2715700000 / 2718266667 / 2727266667
SE +/- 4836091.17, N = 3; SE +/- 1003881.36, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

toyBrot Fractal Generator

Implementation: C++ Tasks

toyBrot Fractal Generator 2020-11-18: ms, Fewer Is Better
Runs 1 / 2 / 3: 7599 / 7579 / 7611
SE +/- 110.25, N = 3; SE +/- 31.63, N = 3
1. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

Blender

Blend File: Fishy Cat - Compute: CPU-Only

Blender 2.92: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 55.00 / 54.80 / 55.02
SE +/- 0.05, N = 3; SE +/- 0.03, N = 3

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.9.0: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 29.44 / 29.50 / 29.56
SE +/- 0.01, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 18.24 / 18.22 / 18.17
SE +/- 0.01, N = 3; SE +/- 0.05, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Sysbench

Test: RAM / Memory

Sysbench 1.0.20: MiB/sec, More Is Better
Runs 1 / 2 / 3: 6128.19 / 6104.81 / 6121.17
SE +/- 9.71, N = 3; SE +/- 9.05, N = 3
1. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 730.78 / 732.81 / 733.51
MIN (runs 1 / 2 / 3): 719.75 / 720.04 / 720.15
SE +/- 1.60, N = 3; SE +/- 1.21, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 1.14965 / 1.15166 / 1.15391
MIN (runs 1 / 2 / 3): 1.09 / 1.08 / 1.09
SE +/- 0.00070, N = 3; SE +/- 0.00189, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19 - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 2559.5 / 2564.8 / 2555.8
SE +/- 5.02, N = 3; SE +/- 7.03, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: ChaCha20Poly1305

Botan 2.17.3: MiB/s, More Is Better
Runs 1 / 2 / 3: 633.17 / 635.01 / 635.34
SE +/- 0.48, N = 3; SE +/- 0.61, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3: MiB/s, More Is Better
Runs 1 / 2 / 3: 632.66 / 631.09 / 630.51
SE +/- 0.23, N = 3; SE +/- 0.53, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 2.92: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 142.58 / 142.94 / 142.46
SE +/- 0.05, N = 3; SE +/- 0.10, N = 3

LuaRadio

Test: Complex Phase

LuaRadio 0.9.1: MiB/s, More Is Better
Runs 1 / 2 / 3: 515.5 / 513.8 / 514.5
SE +/- 0.22, N = 3; SE +/- 0.64, N = 3

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31: samples/s, More Is Better
Runs 1 / 2 / 3: 3129700000 / 3124233333 / 3119400000
SE +/- 2649108.86, N = 3; SE +/- 503322.30, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 23.30 / 23.23 / 23.23
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3

GNU Radio

Test: Hilbert Transform

GNU Radio: MiB/s, More Is Better
Runs 1 / 2 / 3: 346.5 / 346.1 / 347.1
SE +/- 0.32, N = 3; SE +/- 2.03, N = 3
1. 3.8.1.0

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 2892.4 / 2888.0 / 2884.6
SE +/- 0.59, N = 3; SE +/- 3.16, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0: Frames Per Second, More Is Better
Runs 1 / 2 / 3: 3.82 / 3.82 / 3.83
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.1.2: ms, Fewer Is Better
Runs 1 / 2 / 3: 734.17 / 732.29 / 732.48
MIN (runs 1 / 2 / 3): 722.64 / 720.4 / 720.58
SE +/- 0.35, N = 3; SE +/- 0.14, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 8 - Decompression Speed

Zstd Compression 1.4.9: MB/s, More Is Better
Runs 1 / 2 / 3: 2763.5 / 2770.2 / 2768.5
SE +/- 1.83, N = 3; SE +/- 5.72, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Timed Erlang/OTP Compilation

Time To Compile

Timed Erlang/OTP Compilation 23.2: Seconds, Fewer Is Better
Runs 1 / 2 / 3: 157.68 / 157.48 / 157.30
SE +/- 0.43, N = 3; SE +/- 0.50, N = 3

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
Results (1 / 2 / 3): 3.45400 / 3.46141 / 3.45314
MIN (1 / 2 / 3): 3.4 / 3.38 / 3.38
SE +/- 0.00371, N = 3; SE +/- 0.00642, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 1 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Results (1 / 2 / 3): 59654000 / 59569333 / 59520333
SE +/- 43364.09, N = 3; SE +/- 66338.36, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

LuaRadio

Test: Five Back to Back FIR Filters

LuaRadio 0.9.1 - Test: Five Back to Back FIR Filters
MiB/s, More Is Better
Results (1 / 2 / 3): 451.2 / 452.0 / 452.2
SE +/- 0.38, N = 3; SE +/- 1.34, N = 3

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 15.11 - Time To Compile
Seconds, Fewer Is Better
Results (1 / 2 / 3): 128.18 / 128.47 / 128.38
SE +/- 0.31, N = 3; SE +/- 0.03, N = 3

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Decompression Speed
MB/s, More Is Better
Results (1 / 2 / 3): 2986.2 / 2992.6 / 2989.6
SE +/- 5.14, N = 3; SE +/- 2.45, N = 15
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d
Seconds, Fewer Is Better
Results (1 / 2 / 3): 689.60 / 690.48 / 691.06
SE +/- 0.64, N = 3; SE +/- 0.84, N = 3
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

LuaRadio

Test: FM Deemphasis Filter

LuaRadio 0.9.1 - Test: FM Deemphasis Filter
MiB/s, More Is Better
Results (1 / 2 / 3): 339.2 / 339.8 / 339.9
SE +/- 0.19, N = 3; SE +/- 0.03, N = 3

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Results (1 / 2 / 3): 937750000 / 936776667 / 936146667
SE +/- 935420.29, N = 3; SE +/- 1134097.78, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.9.0 - Encoder Speed: 0
Seconds, Fewer Is Better
Results (1 / 2 / 3): 53.31 / 53.22 / 53.29
SE +/- 0.00, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -fPIC -lm

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 8 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Results (1 / 2 / 3): 475650000 / 476116667 / 476420000
SE +/- 493637.29, N = 3; SE +/- 230289.67, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Results (1 / 2 / 3): 5.04307 / 5.04320 / 5.05103
MIN (1 / 2 / 3): 4.9 / 4.85 / 4.87
SE +/- 0.00358, N = 3; SE +/- 0.00299, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 32 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Results (1 / 2 / 3): 1690600000 / 1693066667 / 1691266667
SE +/- 88191.71, N = 3; SE +/- 218581.28, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 2 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Results (1 / 2 / 3): 118920000 / 119036667 / 119086667
SE +/- 28480.01, N = 3; SE +/- 60092.52, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 - Threads: 4 - Buffer Length: 256 - Filter Length: 57
samples/s, More Is Better
Results (1 / 2 / 3): 237970000 / 238236667 / 238053333
SE +/- 92616.29, N = 3; SE +/- 116237.31, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Botan

Test: AES-256 - Decrypt

Botan 2.17.3 - Test: AES-256 - Decrypt
MiB/s, More Is Better
Results (1 / 2 / 3): 4560.40 / 4559.09 / 4555.31
SE +/- 1.20, N = 3; SE +/- 5.30, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3 - Test: Blowfish - Decrypt
MiB/s, More Is Better
Results (1 / 2 / 3): 367.42 / 367.80 / 367.80
SE +/- 0.08, N = 3; SE +/- 0.08, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3 - Test: AES-256
MiB/s, More Is Better
Results (1 / 2 / 3): 4560.38 / 4558.63 / 4555.63
SE +/- 0.67, N = 3; SE +/- 4.16, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

GNU GMP GMPbench

Total Time

GNU GMP GMPbench 6.2.1 - Total Time
GMPbench Score, More Is Better
Results (1 / 2 / 3): 4542.0 / 4546.4 / 4542.4
1. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Botan

Test: Blowfish

Botan 2.17.3 - Test: Blowfish
MiB/s, More Is Better
Results (1 / 2 / 3): 369.32 / 369.22 / 368.99
SE +/- 0.12, N = 3; SE +/- 0.14, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.9.0 - Encoder Speed: 2
Seconds, Fewer Is Better
Results (1 / 2 / 3): 28.41 / 28.39 / 28.41
SE +/- 0.03, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Results (1 / 2 / 3): 732.05 / 732.63 / 732.38
MIN (1 / 2 / 3): 720.72 / 720.33 / 718.93
SE +/- 1.26, N = 3; SE +/- 1.01, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Botan

Test: Twofish

Botan 2.17.3 - Test: Twofish
MiB/s, More Is Better
Results (1 / 2 / 3): 302.44 / 302.49 / 302.29
SE +/- 0.02, N = 3; SE +/- 0.11, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Results (1 / 2 / 3): 2.67016 / 2.67036 / 2.66877
MIN (1 / 2 / 3): 2.51 / 2.5 / 2.48
SE +/- 0.00818, N = 3; SE +/- 0.00016, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

Zstd Compression 1.4.9 - Compression Level: 19, Long Mode - Decompression Speed
MB/s, More Is Better
Results (1 / 2 / 3): 2552.2 / 2553.7 / 2553.0
SE +/- 3.23, N = 3; SE +/- 1.71, N = 3
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: Twofish - Decrypt

Botan 2.17.3 - Test: Twofish - Decrypt
MiB/s, More Is Better
Results (1 / 2 / 3): 302.23 / 302.34 / 302.18
SE +/- 0.03, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Sysbench

Test: CPU

Sysbench 1.0.20 - Test: CPU
Events Per Second, More Is Better
Results (1 / 2 / 3): 96671.09 / 96717.84 / 96694.41
SE +/- 9.51, N = 3; SE +/- 20.57, N = 3
1. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3 - Test: CAST-256 - Decrypt
MiB/s, More Is Better
Results (1 / 2 / 3): 120.07 / 120.06 / 120.10
SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3 - Test: KASUMI - Decrypt
MiB/s, More Is Better
Results (1 / 2 / 3): 75.09 / 75.10 / 75.10
SE +/- 0.01, N = 3; SE +/- 0.03, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

Botan 2.17.3 - Test: KASUMI
MiB/s, More Is Better
Results (1 / 2 / 3): 77.65 / 77.66 / 77.66
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3 - Test: CAST-256
MiB/s, More Is Better
Results (1 / 2 / 3): 120.06 / 120.07 / 120.06
SE +/- 0.02, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
Results (1 / 2 / 3): 1.00901 / 1.00899 / 1.00894
MIN (1 / 2 / 3): 0.97 / 0.97 / 0.97
SE +/- 0.00095, N = 3; SE +/- 0.00167, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

LuaRadio

Test: Hilbert Transform

LuaRadio 0.9.1 - Test: Hilbert Transform
MiB/s, More Is Better
Results (1 / 2 / 3): 82.7 / 82.7 / 82.7
SE +/- 0.03, N = 3; SE +/- 0.00, N = 3

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2 - Throughput Test: LargeRandom
GB/s, More Is Better
Results (1 / 2 / 3): 0.66 / 0.66 / 0.66
SE +/- 0.00, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -march=native -pthread

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-N
GB/s, More Is Better
Results (1 / 2 / 3): 30.9 / 28.5 / 27.3
SE +/- 0.47, N = 3; SE +/- 1.05, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1 - Test: CPU BLAS - sDOT
GB/s, More Is Better
Results (1 / 2 / 3): 461 / 507 / 479
SE +/- 27.14, N = 3; SE +/- 13.58, N = 3
1. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

Zstd Compression 1.4.9 - Compression Level: 8, Long Mode - Compression Speed
MB/s, More Is Better
Results (1 / 2 / 3): 444.1 / 444.0 / 473.1
SE +/- 0.80, N = 3; SE +/- 8.81, N = 15
1. (CC) gcc options: -O3 -march=native -pthread -lz -llzma


Phoronix Test Suite v10.8.4