EPYC 7702 April 2021

AMD EPYC 7702 64-Core testing with a ASRockRack EPYCD8 (P2.40 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104043-IB-EPYC7702A33&grr.

EPYC 7702 April 2021ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen Resolution123AMD EPYC 7702 64-Core @ 2.00GHz (64 Cores / 128 Threads)ASRockRack EPYCD8 (P2.40 BIOS)AMD Starship/Matisse126GB3841GB Micron_9300_MTFDHAL3T8TDPASPEEDVE2282 x Intel I350Ubuntu 20.045.9.0-050900rc6daily20200921-generic (x86_64) 20200920GNOME Shell 3.36.4X Server 1.20.8GCC 9.3.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0x8301034 Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

EPYC 7702 April 2021incompact3d: X3D-benchmarking input.i3dgnuradio: Hilbert Transformgnuradio: FM Deemphasis Filtergnuradio: IIR Filtergnuradio: FIR Filtergnuradio: Signal Source (Cosine)gnuradio: Five Back to Back FIR Filtersluaradio: Complex Phaseluaradio: Hilbert Transformluaradio: FM Deemphasis Filterluaradio: Five Back to Back FIR Filtersaom-av1: Speed 4 Two-Pass - Bosphorus 4Kgmpbench: Total Timebuild-erlang: Time To Compileblender: Barbershop - CPU-Onlybuild-nodejs: Time To Compileaom-av1: Speed 4 Two-Pass - Bosphorus 1080pblender: Pabellon Barcelona - CPU-Onlyaom-av1: Speed 6 Two-Pass - Bosphorus 4Kblender: Classroom - CPU-Onlycompress-zstd: 8, Long Mode - Decompression Speedcompress-zstd: 8, Long Mode - Compression Speedbuild-linux-kernel: Time To Compilesysbench: CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUsimdjson: DistinctUserIDsimdjson: PartialTweetssimdjson: Kostyasimdjson: LargeRandblender: Fishy Cat - CPU-Onlyaom-av1: Speed 6 Two-Pass - Bosphorus 1080pavifenc: 0aom-av1: Speed 6 Realtime - Bosphorus 4Kstockfish: Total Timecompress-zstd: 19 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedblender: BMW27 - CPU-Onlycompress-zstd: 3, Long Mode - Decompression Speedcompress-zstd: 3, Long Mode - Compression Speedcompress-zstd: 3 - Compression Speedcompress-zstd: 8 - Decompression Speedcompress-zstd: 8 - Compression Speedbotan: AES-256 - Decryptbotan: AES-256aom-av1: Speed 6 Realtime - Bosphorus 1080pbotan: ChaCha20Poly1305 - Decryptbotan: ChaCha20Poly1305botan: Blowfish - Decryptbotan: Blowfishbotan: Twofish - Decryptbotan: Twofishbotan: CAST-256 - Decryptbotan: CAST-256botan: KASUMI - Decryptbotan: KASUMIavifenc: 6, Losslessaom-av1: Speed 8 Realtime - Bosphorus 4Kavifenc: 2incompact3d: input.i3d 193 Cells Per Directionaom-av1: Speed 9 Realtime - Bosphorus 4Konednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUbuild-mesa: Time To Compileviennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYonednn: Deconvolution Batch shapes_1d - f32 - CPUliquid-dsp: 8 - 256 - 57liquid-dsp: 128 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 16 - 256 - 57liquid-dsp: 4 - 256 - 57liquid-dsp: 2 - 256 - 57liquid-dsp: 1 - 256 - 57svt-hevc: 1 - Bosphorus 1080psysbench: RAM / Memoryonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUaom-av1: Speed 8 Realtime - Bosphorus 1080pavifenc: 6aom-av1: Speed 9 Realtime - Bosphorus 1080ponednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUtoybrot: OpenMPtoybrot: C++ Taskstoybrot: TBBtoybrot: C++ Threadsavifenc: 10, Losslessincompact3d: input.i3d 129 Cells Per Directiononednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUavifenc: 10svt-vp9: Visual Quality Optimized - Bosphorus 1080ponednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080p123689.604065346.5744486.7534.52913.8331.7515.582.7339.2451.23.824542157.679142.58128.1845.79116.526.72100.032986.2444.133.5596671.092081.532089.272082.99730.784734.173732.053.593.552.120.665516.2253.30611.891346418132559.583.52552.241.538.742892.4410.55279.42763.525104560.3954560.37618.24632.656633.174367.42369.321302.234302.438120.066120.05575.09177.65329.43822.2228.41225.883249325.940.91180223.30492.995.190.59363630.9896132013904616877571.719934756500003129700000271570000016906000009377500002379700001189200005965400034.416128.191.279851.483140.4338381.1496558.3710.44267.345.043070.99880178487599747172076.7675.549091821.009013.4543.712279.12.670161.22494238.14362.92277.91462.61690.480672346.1734.1489.7537.32868.2323.0513.882.7339.8452.03.824546.4157.479142.94128.4685.77117.146.98100.582992.6444.031.15596717.842085.382098.232085.08732.810732.294732.6253.613.532.130.6654.8016.0953.21611.741331698022564.882.12553.741.138.882888.0404.55243.62770.22441.14559.0884558.62618.22631.089635.007367.797369.219302.337302.492120.058120.06575.10177.66129.50022.1828.39125.552420925.710.87276923.23292.994.689.791.462028.5904128313675076797641.747744761166673124233333271826666716930666679367766672382366671190366675956933334.596104.811.275441.470480.4461411.1516654.4710.46865.795.043200.98032379417579724472786.7965.961010771.008993.461413.754273.592.670361.22224361.00361.43276.07460.86691.061747347.1730.1488.3535.12879.4322.6514.582.7339.9452.23.834542.4157.302142.46128.3785.76116.666.79100.082989.6473.131.16296694.412122.662095.262094.12733.512732.483732.3823.593.532.130.6655.0216.0453.28711.741326301582555.882.32553.041.239.042884.6408.05250.32768.52386.14555.3114555.62518.17630.505635.338367.804368.993302.180302.285120.096120.05575.10377.66329.55621.6728.41425.611750325.990.89383023.23292.594.789.892.565327.3904132713834796887531.735684764200003119400000272726666716912666679361466672380533331190866675952033334.456121.171.287641.467070.4503101.1539154.3210.48866.105.051030.98680679217611711972586.8095.685589631.008943.453143.773277.872.668771.21841360.74363.64279.74462.91OpenBenchmarking.org

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d123150300450600750SE +/- 0.64, N = 3SE +/- 0.84, N = 3689.60690.48691.061. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

GNU Radio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Hilbert Transform12380160240320400SE +/- 0.32, N = 3SE +/- 2.03, N = 3346.5346.1347.11. 3.8.1.0

GNU Radio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FM Deemphasis Filter123160320480640800SE +/- 2.41, N = 3SE +/- 5.82, N = 3744.0734.1730.11. 3.8.1.0

GNU Radio

Test: IIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: IIR Filter123110220330440550SE +/- 1.77, N = 3SE +/- 0.90, N = 3486.7489.7488.31. 3.8.1.0

GNU Radio

Test: FIR Filter

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: FIR Filter123120240360480600SE +/- 0.95, N = 3SE +/- 0.73, N = 3534.5537.3535.11. 3.8.1.0

GNU Radio

Test: Signal Source (Cosine)

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Signal Source (Cosine)1236001200180024003000SE +/- 4.36, N = 3SE +/- 27.86, N = 32913.82868.22879.41. 3.8.1.0

GNU Radio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterGNU RadioTest: Five Back to Back FIR Filters12370140210280350SE +/- 3.91, N = 3SE +/- 3.89, N = 3331.7323.0322.61. 3.8.1.0

LuaRadio

Test: Complex Phase

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Complex Phase123110220330440550SE +/- 0.22, N = 3SE +/- 0.64, N = 3515.5513.8514.5

LuaRadio

Test: Hilbert Transform

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Hilbert Transform12320406080100SE +/- 0.03, N = 3SE +/- 0.00, N = 382.782.782.7

LuaRadio

Test: FM Deemphasis Filter

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: FM Deemphasis Filter12370140210280350SE +/- 0.19, N = 3SE +/- 0.03, N = 3339.2339.8339.9

LuaRadio

Test: Five Back to Back FIR Filters

OpenBenchmarking.orgMiB/s, More Is BetterLuaRadio 0.9.1Test: Five Back to Back FIR Filters123100200300400500SE +/- 0.38, N = 3SE +/- 1.34, N = 3451.2452.0452.2

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K1230.86181.72362.58543.44724.309SE +/- 0.01, N = 3SE +/- 0.01, N = 33.823.823.831. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GNU GMP GMPbench

Total Time

OpenBenchmarking.orgGMPbench Score, More Is BetterGNU GMP GMPbench 6.2.1Total Time123100020003000400050004542.04546.44542.41. (CC) gcc options: -O3 -fomit-frame-pointer -lm

Timed Erlang/OTP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Erlang/OTP Compilation 23.2Time To Compile123306090120150SE +/- 0.43, N = 3SE +/- 0.50, N = 3157.68157.48157.30

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Barbershop - Compute: CPU-Only123306090120150SE +/- 0.05, N = 3SE +/- 0.10, N = 3142.58142.94142.46

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 15.11Time To Compile123306090120150SE +/- 0.31, N = 3SE +/- 0.03, N = 3128.18128.47128.38

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p1231.30282.60563.90845.21126.514SE +/- 0.01, N = 3SE +/- 0.01, N = 35.795.775.761. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Pabellon Barcelona - Compute: CPU-Only123306090120150SE +/- 0.36, N = 3SE +/- 0.22, N = 3116.52117.14116.66

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K123246810SE +/- 0.01, N = 3SE +/- 0.04, N = 36.726.986.791. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Classroom - Compute: CPU-Only12320406080100SE +/- 0.60, N = 3SE +/- 0.04, N = 3100.03100.58100.08

Zstd Compression

Compression Level: 8, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Decompression Speed1236001200180024003000SE +/- 5.14, N = 3SE +/- 2.45, N = 152986.22992.62989.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8, Long Mode - Compression Speed123100200300400500SE +/- 0.80, N = 3SE +/- 8.81, N = 15444.1444.0473.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Timed Linux Kernel Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 5.10.20Time To Compile123816243240SE +/- 0.28, N = 10SE +/- 0.28, N = 1033.5531.1631.16

Sysbench

Test: CPU

OpenBenchmarking.orgEvents Per Second, More Is BetterSysbench 1.0.20Test: CPU12320K40K60K80K100KSE +/- 9.51, N = 3SE +/- 20.57, N = 396671.0996717.8496694.411. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU1235001000150020002500SE +/- 3.09, N = 3SE +/- 10.59, N = 32081.532085.382122.66MIN: 2065.05MIN: 2066.13MIN: 2083.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU1235001000150020002500SE +/- 11.83, N = 3SE +/- 3.90, N = 32089.272098.232095.26MIN: 2067.67MIN: 2065.31MIN: 2072.71. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU123400800120016002000SE +/- 1.02, N = 3SE +/- 6.39, N = 32082.992085.082094.12MIN: 2067.76MIN: 2067.79MIN: 2069.051. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU123160320480640800SE +/- 1.60, N = 3SE +/- 1.21, N = 3730.78732.81733.51MIN: 719.75MIN: 720.04MIN: 720.151. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU123160320480640800SE +/- 0.35, N = 3SE +/- 0.14, N = 3734.17732.29732.48MIN: 722.64MIN: 720.4MIN: 720.581. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU123160320480640800SE +/- 1.26, N = 3SE +/- 1.01, N = 3732.05732.63732.38MIN: 720.72MIN: 720.33MIN: 718.931. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserID1230.81231.62462.43693.24924.0615SE +/- 0.00, N = 3SE +/- 0.01, N = 33.593.613.591. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweets1230.79881.59762.39643.19523.994SE +/- 0.01, N = 3SE +/- 0.00, N = 33.553.533.531. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: Kostya1230.47930.95861.43791.91722.3965SE +/- 0.00, N = 3SE +/- 0.00, N = 32.122.132.131. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandom1230.14850.2970.44550.5940.7425SE +/- 0.00, N = 3SE +/- 0.00, N = 30.660.660.661. (CXX) g++ options: -O3 -march=native -pthread

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: Fishy Cat - Compute: CPU-Only1231224364860SE +/- 0.05, N = 3SE +/- 0.03, N = 355.0054.8055.02

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p12348121620SE +/- 0.03, N = 3SE +/- 0.03, N = 316.2216.0916.041. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 01231224364860SE +/- 0.00, N = 3SE +/- 0.01, N = 353.3153.2253.291. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K1233691215SE +/- 0.04, N = 3SE +/- 0.03, N = 311.8911.7411.741. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 13Total Time12330M60M90M120M150MSE +/- 1363379.57, N = 8SE +/- 1858448.97, N = 41346418131331698021326301581. (CXX) g++ options: -fprofile-use -m64 -lpthread -O3 -march=native -fno-exceptions -std=c++17 -pedantic -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto

Zstd Compression

Compression Level: 19 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Decompression Speed1236001200180024003000SE +/- 5.02, N = 3SE +/- 7.03, N = 32559.52564.82555.81. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19 - Compression Speed12320406080100SE +/- 0.44, N = 3SE +/- 1.26, N = 383.582.182.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Decompression Speed1235001000150020002500SE +/- 3.23, N = 3SE +/- 1.71, N = 32552.22553.72553.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 19, Long Mode - Compression Speed123918273645SE +/- 0.09, N = 3SE +/- 0.23, N = 341.541.141.21. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 2.92Blend File: BMW27 - Compute: CPU-Only123918273645SE +/- 0.11, N = 3SE +/- 0.24, N = 338.7438.8839.04

Zstd Compression

Compression Level: 3, Long Mode - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Decompression Speed1236001200180024003000SE +/- 0.59, N = 3SE +/- 3.16, N = 32892.42888.02884.61. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3, Long Mode - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3, Long Mode - Compression Speed12390180270360450SE +/- 6.05, N = 3SE +/- 3.33, N = 3410.5404.5408.01. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 3 - Compression Speed12311002200330044005500SE +/- 2.38, N = 3SE +/- 3.83, N = 35279.45243.65250.31. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Decompression Speed1236001200180024003000SE +/- 1.83, N = 3SE +/- 5.72, N = 32763.52770.22768.51. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Zstd Compression

Compression Level: 8 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.9Compression Level: 8 - Compression Speed1235001000150020002500SE +/- 26.18, N = 3SE +/- 11.70, N = 32510.02441.12386.11. (CC) gcc options: -O3 -march=native -pthread -lz -llzma

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - Decrypt12310002000300040005000SE +/- 1.20, N = 3SE +/- 5.30, N = 34560.404559.094555.311. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-25612310002000300040005000SE +/- 0.67, N = 3SE +/- 4.16, N = 34560.384558.634555.631. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p12348121620SE +/- 0.01, N = 3SE +/- 0.05, N = 318.2418.2218.171. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - Decrypt123140280420560700SE +/- 0.23, N = 3SE +/- 0.53, N = 3632.66631.09630.511. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305123140280420560700SE +/- 0.48, N = 3SE +/- 0.61, N = 3633.17635.01635.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - Decrypt12380160240320400SE +/- 0.08, N = 3SE +/- 0.08, N = 3367.42367.80367.801. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish12380160240320400SE +/- 0.12, N = 3SE +/- 0.14, N = 3369.32369.22368.991. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - Decrypt12370140210280350SE +/- 0.03, N = 3SE +/- 0.04, N = 3302.23302.34302.181. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish12370140210280350SE +/- 0.02, N = 3SE +/- 0.11, N = 3302.44302.49302.291. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - Decrypt123306090120150SE +/- 0.01, N = 3SE +/- 0.02, N = 3120.07120.06120.101. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256123306090120150SE +/- 0.02, N = 3SE +/- 0.01, N = 3120.06120.07120.061. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - Decrypt12320406080100SE +/- 0.01, N = 3SE +/- 0.03, N = 375.0975.1075.101. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI12320406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 377.6577.6677.661. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, Lossless123714212835SE +/- 0.01, N = 3SE +/- 0.03, N = 329.4429.5029.561. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K123510152025SE +/- 0.20, N = 3SE +/- 0.30, N = 322.2222.1821.671. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2123714212835SE +/- 0.03, N = 3SE +/- 0.02, N = 328.4128.3928.411. (CXX) g++ options: -O3 -fPIC -lm

Xcompact3d Incompact3d

Input: input.i3d 193 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 193 Cells Per Direction123612182430SE +/- 0.22, N = 3SE +/- 0.28, N = 325.8825.5525.611. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K123612182430SE +/- 0.21, N = 3SE +/- 0.12, N = 325.9425.7125.991. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU1230.20520.41040.61560.82081.026SE +/- 0.011992, N = 4SE +/- 0.007517, N = 30.9118020.8727690.893830MIN: 0.81MIN: 0.8MIN: 0.811. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile123612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 323.3023.2323.23

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TT12320406080100SE +/- 0.20, N = 2SE +/- 0.30, N = 392.992.992.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TN12320406080100SE +/- 0.03, N = 3SE +/- 0.15, N = 395.194.694.71. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NT12320406080100SE +/- 0.15, N = 3SE +/- 0.15, N = 390.589.789.81. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NN12320406080100SE +/- 0.35, N = 3SE +/- 0.17, N = 393.091.492.51. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-T123140280420560700SE +/- 14.00, N = 3SE +/- 7.86, N = 36366206531. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-N123714212835SE +/- 0.47, N = 3SE +/- 1.05, N = 330.928.527.31. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOT1232004006008001000SE +/- 6.84, N = 38969049041. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPY12330060090012001500SE +/- 41.77, N = 3SE +/- 3.33, N = 31320128313271. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPY12330060090012001500SE +/- 14.53, N = 3SE +/- 8.82, N = 31390136713831. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOT123110220330440550SE +/- 27.14, N = 3SE +/- 13.58, N = 34615074791. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPY123150300450600750SE +/- 6.89, N = 3SE +/- 0.58, N = 36876796881. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPY123160320480640800SE +/- 11.85, N = 3SE +/- 5.84, N = 37577647531. (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU1230.39320.78641.17961.57281.966SE +/- 0.00765, N = 3SE +/- 0.00100, N = 31.719931.747741.73568MIN: 1.64MIN: 1.65MIN: 1.651. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 8 - Buffer Length: 256 - Filter Length: 57123100M200M300M400M500MSE +/- 493637.29, N = 3SE +/- 230289.67, N = 34756500004761166674764200001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57123700M1400M2100M2800M3500MSE +/- 2649108.86, N = 3SE +/- 503322.30, N = 33129700000312423333331194000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57123600M1200M1800M2400M3000MSE +/- 4836091.17, N = 3SE +/- 1003881.36, N = 32715700000271826666727272666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57123400M800M1200M1600M2000MSE +/- 88191.71, N = 3SE +/- 218581.28, N = 31690600000169306666716912666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 16 - Buffer Length: 256 - Filter Length: 57123200M400M600M800M1000MSE +/- 935420.29, N = 3SE +/- 1134097.78, N = 39377500009367766679361466671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 4 - Buffer Length: 256 - Filter Length: 5712350M100M150M200M250MSE +/- 92616.29, N = 3SE +/- 116237.31, N = 32379700002382366672380533331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 2 - Buffer Length: 256 - Filter Length: 5712330M60M90M120M150MSE +/- 28480.01, N = 3SE +/- 60092.52, N = 31189200001190366671190866671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 5712313M26M39M52M65MSE +/- 43364.09, N = 3SE +/- 66338.36, N = 35965400059569333595203331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080p123816243240SE +/- 0.08, N = 3SE +/- 0.05, N = 334.4134.5934.451. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Sysbench

Test: RAM / Memory

OpenBenchmarking.orgMiB/sec, More Is BetterSysbench 1.0.20Test: RAM / Memory12313002600390052006500SE +/- 9.71, N = 3SE +/- 9.05, N = 36128.196104.816121.171. (CC) gcc options: -pthread -O2 -funroll-loops -O3 -march=native -rdynamic -ldl -laio -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU1230.28970.57940.86911.15881.4485SE +/- 0.00139, N = 3SE +/- 0.01102, N = 31.279851.275441.28764MIN: 1.22MIN: 1.22MIN: 1.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU1230.33370.66741.00111.33481.6685SE +/- 0.00614, N = 3SE +/- 0.00090, N = 31.483141.470481.46707MIN: 1.24MIN: 1.21MIN: 1.211. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU1230.10130.20260.30390.40520.5065SE +/- 0.001158, N = 3SE +/- 0.001517, N = 30.4338380.4461410.450310MIN: 0.39MIN: 0.38MIN: 0.391. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU1230.25960.51920.77881.03841.298SE +/- 0.00070, N = 3SE +/- 0.00189, N = 31.149651.151661.15391MIN: 1.09MIN: 1.08MIN: 1.091. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p1231326395265SE +/- 0.25, N = 3SE +/- 0.38, N = 358.3754.4754.321. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 61233691215SE +/- 0.01, N = 3SE +/- 0.01, N = 310.4410.4710.491. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p1231530456075SE +/- 0.53, N = 3SE +/- 0.03, N = 367.3465.7966.101. (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU1231.13652.2733.40954.5465.6825SE +/- 0.00358, N = 3SE +/- 0.00299, N = 35.043075.043205.05103MIN: 4.9MIN: 4.85MIN: 4.871. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU1230.22470.44940.67410.89881.1235SE +/- 0.001130, N = 3SE +/- 0.001469, N = 30.9988010.9803230.986806MIN: 0.94MIN: 0.94MIN: 0.91. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMP1232K4K6K8K10KSE +/- 18.59, N = 3SE +/- 13.93, N = 37848794179211. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Tasks12316003200480064008000SE +/- 110.25, N = 3SE +/- 31.63, N = 37599757976111. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBB12316003200480064008000SE +/- 35.23, N = 3SE +/- 12.41, N = 37471724471191. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ Threads12316003200480064008000SE +/- 11.93, N = 3SE +/- 30.64, N = 37207727872581. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, Lossless123246810SE +/- 0.010, N = 3SE +/- 0.017, N = 36.7676.7966.8091. (CXX) g++ options: -O3 -fPIC -lm

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: input.i3d 129 Cells Per Direction1231.34122.68244.02365.36486.706SE +/- 0.08892473, N = 3SE +/- 0.08476458, N = 35.549091825.961010775.685589631. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU1230.2270.4540.6810.9081.135SE +/- 0.00095, N = 3SE +/- 0.00167, N = 31.009011.008991.00894MIN: 0.97MIN: 0.97MIN: 0.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU1230.77881.55762.33643.11523.894SE +/- 0.00371, N = 3SE +/- 0.00642, N = 33.454003.461413.45314MIN: 3.4MIN: 3.38MIN: 3.381. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 101230.84891.69782.54673.39564.2445SE +/- 0.007, N = 3SE +/- 0.001, N = 33.7123.7543.7731. (CXX) g++ options: -O3 -fPIC -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080p12360120180240300SE +/- 2.47, N = 3SE +/- 3.78, N = 3279.10273.59277.871. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU1230.60081.20161.80242.40323.004SE +/- 0.00818, N = 3SE +/- 0.00016, N = 32.670162.670362.66877MIN: 2.51MIN: 2.5MIN: 2.481. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU1230.27560.55120.82681.10241.378SE +/- 0.00091, N = 3SE +/- 0.00239, N = 31.224941.222241.21841MIN: 1.15MIN: 1.13MIN: 1.121. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp=libomp -msse4.1 -fPIC -pie -lpthread -ldl

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080p12380160240320400SE +/- 0.55, N = 3SE +/- 0.35, N = 3238.14361.00360.741. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p12380160240320400SE +/- 1.33, N = 3SE +/- 3.53, N = 3362.92361.43363.641. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080p12360120180240300SE +/- 0.37, N = 3SE +/- 1.54, N = 3277.91276.07279.741. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080p123100200300400500SE +/- 2.49, N = 3SE +/- 3.83, N = 3462.61460.86462.911. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt


Phoronix Test Suite v10.8.5