EPYC 7763 LLVM Clang Compiler Tests

AMD EPYC 7763 64-Core testing with a Supermicro H12SSL-i v1.01 (2.0 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104140-IB-EPYC7763L31&sor&grs.

EPYC 7763 LLVM Clang Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 12.0Clang 11.0Clang 12.0 LTOGCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.0AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)Supermicro H12SSL-i v1.01 (2.0 BIOS)AMD Starship/Matisse126GB3841GB Micron_9300_MTFDHAL3T8TDPASPEED2 x Broadcom NetXtreme BCM5720 2-port PCIeUbuntu 20.045.12.0-051200rc6daily20210408-generic (x86_64) 20210407GNOME Shell 3.36.4X Server 1.20.8Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73ext41024x768Clang 11.0.0-2~ubuntu20.04.1Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73GCC 9.3.0GCC 10.3.0GCC 11.0.1 20210413Clang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- Clang 12.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 11.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12.0 LTO: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 9.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 10.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.0.1: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- AMD AOCC 3.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa001119Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affectedCompiler Details- GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - GCC 10.3: --disable-multilib --enable-checking=release- GCC 11.0.1: --disable-multilib --enable-checking=release- AMD AOCC 3.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)

EPYC 7763 LLVM Clang Compiler Testsonednn: Deconvolution Batch shapes_1d - f32 - CPUviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYetcpak: DXT1viennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-TNdav1d: Chimera 1080p 10-bitgraphics-magick: Resizingbotan: ChaCha20Poly1305 - Decryptc-ray: Total Time - 4K, 16 Rays Per Pixelbotan: ChaCha20Poly1305onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUlibraw: Post-Processing Benchmarkfinancebench: Bonds OpenMPonednn: Convolution Batch Shapes Auto - f32 - CPUviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dDOTsvt-av1: Enc Mode 0 - 1080ptoybrot: C++ Threadsetcpak: ETC1toybrot: TBBtoybrot: OpenMPtoybrot: C++ Tasksviennacl: CPU BLAS - dGEMM-TTscimark2: Sparse Matrix Multiplybotan: Blowfishgraphics-magick: Sharpenonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUgraphics-magick: HWB Color Spaceonednn: IP Shapes 3D - u8s8f32 - CPUfinancebench: Repo OpenMPsvt-av1: Enc Mode 4 - 1080ponednn: Convolution Batch Shapes Auto - u8s8f32 - CPUviennacl: CPU BLAS - dGEMV-Tsvt-av1: Enc Mode 8 - 1080pcoremark: CoreMark Size 666 - Iterations Per Secondastcenc: Mediumonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUfftw: Float + SSE - 1D FFT Size 2048liquid-dsp: 128 - 256 - 57onednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUscimark2: Jacobi Successive Over-Relaxationgraphics-magick: Noise-Gaussianonnx: shufflenet-v2-10 - OpenMP CPUbotan: Blowfish - Decryptetcpak: ETC2botan: AES-256astcenc: Thoroughencode-flac: WAV To FLACbotan: AES-256 - Decryptencode-mp3: WAV To MP3tscp: AI Chess Performancegraphics-magick: Enhancedngspice: C2670simdjson: PartialTweetsquantlib: simdjson: DistinctUserIDsimdjson: LargeRandonnx: yolov4 - OpenMP CPUavifenc: 6, Losslessfftw: Float + SSE - 1D FFT Size 4096onednn: IP Shapes 1D - u8s8f32 - CPUfftw: Stock - 1D FFT Size 32botan: Twofishfftw: Float + SSE - 1D FFT Size 32onednn: IP Shapes 1D - f32 - CPUwebp: Quality 100, Highest Compressiononnx: fcn-resnet101-11 - OpenMP CPUgraphics-magick: Swirlliquid-dsp: 1 - 256 - 57botan: Twofish - Decryptonednn: IP Shapes 3D - f32 - CPUfftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024securemark: SecureMark-TLSaom-av1: Speed 9 Realtime - Bosphorus 1080pwebp2: Quality 100, Compression Effort 5fftw: Stock - 1D FFT Size 1024pgbench: 100 - 100 - Read Write - Average Latencypgbench: 100 - 100 - Read Writefftw: Stock - 1D FFT Size 2048avifenc: 2liquid-dsp: 32 - 256 - 57fftw: Float + SSE - 2D FFT Size 4096scimark2: Fast Fourier Transformaom-av1: Speed 8 Realtime - Bosphorus 1080pavifenc: 6onednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUavifenc: 0avifenc: 10aom-av1: Speed 6 Realtime - Bosphorus 1080pwebp2: Quality 100, Lossless Compressionwebp2: Quality 95, Compression Effort 7onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUwebp2: Quality 75, Compression Effort 7compress-lz4: 9 - Compression Speedfftw: Stock - 2D FFT Size 2048mrbayes: Primate Phylogeny Analysisgraphics-magick: Rotatesvt-hevc: 10 - Bosphorus 1080pngspice: C7552aom-av1: Speed 4 Two-Pass - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080pbotan: KASUMIpovray: Trace Timefftw: Float + SSE - 1D FFT Size 1024avifenc: 10, Losslesssvt-hevc: 1 - Bosphorus 1080ppgbench: 100 - 250 - Read Writejpegxl: PNG - 7pgbench: 100 - 250 - Read Write - Average Latencyjpegxl: PNG - 5scimark2: Monte Carloaom-av1: Speed 6 Realtime - Bosphorus 4Kwebp2: Defaultaom-av1: Speed 9 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kx265: Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 0 Two-Pass - Bosphorus 1080ptachyon: Total Timecompress-lz4: 3 - Compression Speedsvt-vp9: Visual Quality Optimized - Bosphorus 1080pliquid-dsp: 64 - 256 - 57pgbench: 100 - 1 - Read Onlywebp: Quality 100, Losslesssvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ppgbench: 100 - 1 - Read Only - Average Latencyaom-av1: Speed 0 Two-Pass - Bosphorus 4Kbotan: KASUMI - Decryptwebp: Defaultscimark2: Dense LU Matrix Factorizationdav1d: Chimera 1080pbotan: CAST-256 - Decryptbotan: CAST-256scimark2: Compositegcrypt: fftw: Stock - 2D FFT Size 4096astcenc: Exhaustivewebp: Quality 100, Lossless, Highest Compressionaom-av1: Speed 4 Two-Pass - Bosphorus 4Kwebp: Quality 100fftw: Float + SSE - 2D FFT Size 2048simdjson: Kostyaaom-av1: Speed 6 Two-Pass - Bosphorus 1080pjpegxl: JPEG - 8pgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 100 - Read Onlypgbench: 100 - 1 - Read Writex265: Bosphorus 1080ppgbench: 100 - 1 - Read Write - Average Latencycompress-lz4: 9 - Decompression Speedcompress-lz4: 3 - Decompression Speedencode-opus: WAV To Opus Encodejpegxl: PNG - 8dav1d: Summer Nature 4Kpgbench: 100 - 250 - Read Onlypgbench: 100 - 250 - Read Only - Average Latencydav1d: Summer Nature 1080ponednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUfftw: Float + SSE - 2D FFT Size 1024jpegxl: JPEG - 5jpegxl: JPEG - 7onnx: super-resolution-10 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYClang 12.0Clang 11.0Clang 12.0 LTOGCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.01.444256048782718.52548.651.9308.322136843.40415.870850.4961.1725841.7851596.8671871.2213265.78190.1837220284.64267807507743773.04280.22380.0546142.367970.4919406050.71012433246.83723911.4742.03606626118.0671785466.2839694.00580.313689512543643766667593.972590.182597.4811785.504579904351.284202.0854659.3386.76477.8544682.4558.25615709661076118.8704.602653.84.620.8433325.220454281.0750713333315.409156491.077016.309112199355663000321.1903.285079862.09088.3265204103.176.690108051.607623191046725.175156483333322797363.8588.789.5101307.491302.7047.8843.36126.85374.035207.0081305.10109.52548.507789.989.116712643.5895.9567.10345.3082.6449.296503505.74641.095668412.154.43174.27675.1317.222.73938.118.9930.3233.390.5316.046852.07372.4930706333332431019.016487.43488.230.0410.2184.2291.3318848.401198.22133.048132.8203190.62236.9246744.118.993638.4494.872.199319352.7522.1328.130.0941069022328174.000.30513926.513911.57.5670.82541.5610712090.2341244.110.7797763623966.6666.38445649869.14343574711.45757187710431872.75983.688.3184.192034840.63715.599848.2361.1514038.7151900.4348960.84116979.39330.1816395205.06562477029683684.04590.37319.2346132.318590.4892786160.59472933178.49869811.8211.60540677117.3921790837.0100003.98370.315522500843596533333563.200562.970563.2471785.424639797351.075168.8194901.1276.76747.9794895.5588.25016382651068103.8264.412640.24.410.8134626.034466761.0757713324299.214145901.080116.243108191556307000302.4053.527879438.68809.6260119100.557.366105641.6266161610004.225.472157840000022913399.1686.099.5361277.621276.0447.8943.42926.61392.849203.6341271.91109.63649.017878.588.620665652.7490.5277.20346.8979.1499.408507405.87941.015448812.014.60378.41674.8617.132.74337.289.1429.9433.140.5316.409952.35373.9930513666672494318.573481.05482.020.0400.2180.2211.3369146.881190.41127.740128.5863319.34240.2056823.819.025537.7274.952.240317412.6822.0027.240.0941069367331273.360.30213927.913840.37.3920.8543.4310655060.2351251.250.7791013618165.5865.43452347151.24624124952719.9857143284.76370857367202.1012657.848.4793.63350.9313698.713715.07.19213158715211082.36598.5100.9305.361238611.9779.158616.0960.71778260.2076805.5807290.86930895.311330.1295142269.67351075451541497.93765.88412.8468062.997590.5991407850.65401042399.8077579.3251.6626079892.9842086609.9780104.87450.376992527493012066667658.660659.191657.8762149.155479419412.072174.8125484.6767.85378.5345391.9907.01114463721217101.5353.932338.93.980.9435129.080520991.1743414399337.355165901.174867.053116212961404000339.0693.67278105489798.6238935106.556.753116891.688593641105327.784172190000025068384.0391.9710.3991358.561357.2952.2173.65924.84388.946220.9441356.91118.44751.978408.589.163709605.5089.0916.69322.4284.8649.968532756.13138.41538254.657668.1016.292.77839.129.5728.9134.560.515.683753.83354.2129404666672389519.298463.12464.570.0420.284.1301.3979178.971145.50127.343127.2983229.22232.5727007.319.479439.0724.782.273313412.7521.420.0951057125329872.140.30313895.313793.47.504530.8210674860.2351228.630.78676236321518349565.063681312177.236861461.22158.41114.60398.7104316.141208476.1759.029485.0190.78819258.9051770.5091140.87078494.41056.420.1695383281.14651815524561098.53820.77422.1388073.003410.6021557720.64625234979.29427111.2301.64268741.4109.6972110880.4279784.86990.377733534973005033333659.265658.277658.0382038.1554410197420.853173.2265525.7107.83708.5675529.4027.23114671791039103.5984.022392.64.130.935126.911521301.1974712576341.847166501.178947.078115211262467333325.3893.61144101799247.3242700107.466.934113191.701588941071127.386171800000023774388.9893.0510.4171379.511382.4151.4543.64326.49406.027215.5651375.71116.65552.368134.593.656689615.6290.4326.87330.5379.1159.570520546.10739.03530194.731682.8717.032.91839.329.1028.6035.260.5216.146852.87364.1229428666672484518.883472.61477.670.0400.2181.4531.3729248.891171.04127.775127.7413235.94231.2386974.019.458338.5484.842.225320612.7721.640.0931076357336972.600.29713806.613906.17.469536.7110897310.2301245.110.78247635973555950556.2592.971350.01065.6015992359100.5104334.3511889.22757.2451376.81640695.011530.17699.33462.6680977134199.60026011.905794110.7022176407.6659294.81605471030557666672148.845507.69898.7097.47314942501082103.00527.0575139112765165907.003216160886333102059238.4243861111.27110441.777563691067527.103167980000024888388.8894.4010.29151.0343.60727.0151.178231.189.432694611.7390.2646.95329.32517066.14938.86531024.722647.8217.3739.719.4128.7935.260.5215.498951.32366.3929894000002366118.314472.32478.160.0420.211.3869263.551180.443182.35233.5146948.219.618937.9484.842.2743166222.110.0921090824338371.790.29613857.413882.27.381538.2810901600.2301249.743571863.9649149612101.37059194410172654.72184.090.0192.001866838.08915.649845.1411.1704441.6451885.5195310.83392178.811650.1837144211.73369457477718984.44594.27319.7876172.287550.4597246140.55423133146.02864611.6901.59597783116.4931720060.4413073.88110.301885444123606466667544.099544.306544.6001785.4546611325355.059178.8524891.0726.64099.2804887.5738.14216978461057103.9294.332725.74.470.8238625.783455211.0448413192304.996161461.038996.578122192957411333303.8063.415839603.28902.12646377.403106691022725.598160963333323111398.969.7251259.591267.1848.1273.543382.985205.0341268.08109.81150.327784.886.742660638.1091.986343.8582.8279.494496855.94840.9511.3779.23690.942.81630.4416.058153.77373.89310040000019.126476.95478.6282.9491.3519021.831188.43128.008127.7683298.29240.4056875.318.912738.3382.262310132.7327.2973.5113561.513562.50.81541.581251.910.7732333610065.5765.68438345955.2477326531OpenBenchmarking.org

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 12.0Clang 11.0GCC 9.3GCC 10.3246810SE +/- 0.00485, N = 3SE +/- 0.00123, N = 3SE +/- 0.00568, N = 3SE +/- 0.02683, N = 3SE +/- 0.03687, N = 31.370591.444251.457577.192137.23686-fopenmp=libomp - MIN: 1.28-fopenmp=libomp - MIN: 1.34-fopenmp=libomp - MIN: 1.35-fopenmp - MIN: 6.14-fopenmp - MIN: 6.181. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYAMD AOCC 3.0Clang 11.0GCC 11.0.1GCC 9.3GCC 10.3Clang 12.0400800120016002000SE +/- 9.88, N = 12SE +/- 8.32, N = 15SE +/- 2.67, N = 15SE +/- 9.19, N = 15SE +/- 131.59, N = 12SE +/- 15.32, N = 111944.01877.01599.01587.01461.2604.0-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYGCC 11.0.1GCC 10.3GCC 9.3Clang 11.0AMD AOCC 3.0Clang 12.05001000150020002500SE +/- 2.74, N = 15SE +/- 194.35, N = 12SE +/- 2.06, N = 15SE +/- 1.59, N = 15SE +/- 3.59, N = 12SE +/- 20.06, N = 122359.02158.41521.01043.01017.0878.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1Clang 12.0 LTOClang 12.0AMD AOCC 3.0Clang 11.0GCC 10.3GCC 9.36001200180024003000SE +/- 6.09, N = 3SE +/- 2.64, N = 3SE +/- 8.09, N = 3SE +/- 1.69, N = 3SE +/- 0.48, N = 3SE +/- 0.16, N = 32719.992718.532654.721872.761114.601082.371. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNGCC 11.0.1GCC 10.3GCC 9.3AMD AOCC 3.0Clang 11.0Clang 12.020406080100SE +/- 0.29, N = 15SE +/- 1.05, N = 12SE +/- 0.16, N = 15SE +/- 0.04, N = 12SE +/- 0.06, N = 15SE +/- 0.05, N = 12100.598.798.584.083.648.6-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNGCC 11.0.1GCC 10.3GCC 9.3AMD AOCC 3.0Clang 11.0Clang 12.020406080100SE +/- 0.62, N = 12SE +/- 0.08, N = 15SE +/- 0.05, N = 12SE +/- 0.02, N = 15SE +/- 0.09, N = 12104.0104.0100.990.088.351.9-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bitGCC 11.0.1GCC 10.3Clang 12.0GCC 9.3AMD AOCC 3.0Clang 11.070140210280350SE +/- 1.11, N = 3SE +/- 0.21, N = 3SE +/- 0.93, N = 3SE +/- 0.71, N = 3SE +/- 0.39, N = 3SE +/- 0.48, N = 3334.35316.14308.32305.36192.00184.19-lm - MIN: 234.24 / MAX: 544.9-lm - MIN: 218.19 / MAX: 515.85MIN: 220.53 / MAX: 490.51-lm - MIN: 210.86 / MAX: 493.21-lm - MIN: 118.57 / MAX: 324.98-lm - MIN: 114.52 / MAX: 310.51. (CC) gcc options: -O3 -march=native -pthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingClang 12.0Clang 11.0AMD AOCC 3.0GCC 9.3GCC 10.3GCC 11.0.15001000150020002500SE +/- 41.63, N = 12SE +/- 27.29, N = 3SE +/- 52.84, N = 15SE +/- 18.77, N = 3SE +/- 14.93, N = 3SE +/- 17.34, N = 32136203418661238120811881. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptClang 12.0Clang 11.0AMD AOCC 3.0GCC 9.3GCC 10.32004006008001000SE +/- 4.64, N = 3SE +/- 0.16, N = 3SE +/- 3.17, N = 3SE +/- 0.40, N = 3SE +/- 0.02, N = 3843.40840.64838.09611.98476.181. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 10.3GCC 9.3GCC 11.0.1Clang 11.0AMD AOCC 3.0Clang 12.048121620SE +/- 0.014, N = 3SE +/- 0.014, N = 3SE +/- 0.027, N = 3SE +/- 0.009, N = 3SE +/- 0.063, N = 3SE +/- 0.023, N = 39.0299.1589.22715.59915.64915.8701. (CC) gcc options: -lm -lpthread -O3 -march=native

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305Clang 12.0Clang 11.0AMD AOCC 3.0GCC 9.3GCC 10.32004006008001000SE +/- 4.85, N = 3SE +/- 0.62, N = 3SE +/- 3.15, N = 3SE +/- 0.13, N = 3SE +/- 0.28, N = 3850.50848.24845.14616.10485.021. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 11.0AMD AOCC 3.0Clang 12.00.26380.52760.79141.05521.319SE +/- 0.003430, N = 3SE +/- 0.005622, N = 3SE +/- 0.006530, N = 3SE +/- 0.004625, N = 3SE +/- 0.004576, N = 30.7177820.7881921.1514001.1704401.172580-fopenmp - MIN: 0.67-fopenmp - MIN: 0.74-fopenmp=libomp - MIN: 1.09-fopenmp=libomp - MIN: 1.11-fopenmp=libomp - MIN: 1.121. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkGCC 9.3GCC 10.3GCC 11.0.1Clang 12.0AMD AOCC 3.0Clang 11.01326395265SE +/- 0.19, N = 3SE +/- 0.23, N = 3SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.33, N = 360.2058.9057.2441.7841.6438.711. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPGCC 11.0.1Clang 12.0GCC 10.3AMD AOCC 3.0Clang 11.0GCC 9.316K32K48K64K80KSE +/- 42.62, N = 3SE +/- 10.95, N = 3SE +/- 23.55, N = 3SE +/- 242.64, N = 3SE +/- 4.51, N = 3SE +/- 971.24, N = 351376.8251596.8751770.5151885.5251900.4376805.581. (CXX) g++ options: -O3 -march=native -fopenmp

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0GCC 9.3GCC 10.3Clang 12.00.27480.54960.82441.09921.374SE +/- 0.000645, N = 3SE +/- 0.000480, N = 3SE +/- 0.001032, N = 3SE +/- 0.001247, N = 3SE +/- 0.018279, N = 40.8339210.8411690.8693080.8707841.221320-fopenmp=libomp - MIN: 0.81-fopenmp=libomp - MIN: 0.82-fopenmp - MIN: 0.84-fopenmp - MIN: 0.84-fopenmp=libomp - MIN: 1.131. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTGCC 9.3GCC 11.0.1GCC 10.3Clang 11.0AMD AOCC 3.0Clang 12.020406080100SE +/- 0.07, N = 15SE +/- 0.08, N = 15SE +/- 0.59, N = 12SE +/- 0.03, N = 15SE +/- 0.07, N = 12SE +/- 0.56, N = 1295.395.094.479.378.865.7-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTAMD AOCC 3.0GCC 11.0.1GCC 9.3GCC 10.3Clang 11.0Clang 12.030060090012001500SE +/- 2.61, N = 12SE +/- 1.87, N = 15SE +/- 1.59, N = 15SE +/- 95.41, N = 12SE +/- 1.49, N = 15SE +/- 17.06, N = 121165.001153.001133.001056.42933.00819.00-fopenmp=libomp-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pAMD AOCC 3.0Clang 12.0Clang 11.0GCC 11.0.1GCC 10.3GCC 9.30.04120.08240.12360.16480.206SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1830.1830.1810.1760.1690.1291. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ ThreadsGCC 9.3GCC 10.3Clang 11.0Clang 12.0 LTOAMD AOCC 3.0Clang 12.015003000450060007500SE +/- 8.33, N = 3SE +/- 6.12, N = 3SE +/- 25.04, N = 3SE +/- 15.06, N = 3SE +/- 24.26, N = 3SE +/- 30.90, N = 3514253836395714371447220-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1Clang 12.0 LTOClang 12.0GCC 10.3GCC 9.3AMD AOCC 3.0Clang 11.060120180240300SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3284.76284.64281.15269.67211.73205.071. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBGCC 9.3GCC 10.3Clang 11.0Clang 12.0AMD AOCC 3.0Clang 12.0 LTO15003000450060007500SE +/- 74.84, N = 3SE +/- 67.68, N = 3SE +/- 67.11, N = 7SE +/- 87.21, N = 3SE +/- 52.54, N = 3SE +/- 86.43, N = 3510751816247678069457085-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPGCC 9.3GCC 10.3Clang 11.0AMD AOCC 3.0Clang 12.016003200480064008000SE +/- 3.18, N = 3SE +/- 2.60, N = 3SE +/- 20.42, N = 3SE +/- 22.73, N = 3SE +/- 14.89, N = 3545155247029747775071. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ TasksGCC 9.3GCC 10.3Clang 11.0AMD AOCC 3.0Clang 12.0 LTOClang 12.016003200480064008000SE +/- 49.08, N = 3SE +/- 31.52, N = 3SE +/- 7.31, N = 3SE +/- 41.46, N = 3SE +/- 17.21, N = 3SE +/- 33.67, N = 3541456106836718973677437-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTGCC 11.0.1GCC 10.3GCC 9.3AMD AOCC 3.0Clang 11.0Clang 12.020406080100SE +/- 0.05, N = 15SE +/- 0.60, N = 12SE +/- 0.05, N = 15SE +/- 0.08, N = 12SE +/- 0.02, N = 14SE +/- 0.07, N = 1299.398.597.984.484.073.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3GCC 11.0.110002000300040005000SE +/- 5.98, N = 3SE +/- 3.87, N = 3SE +/- 10.41, N = 3SE +/- 0.86, N = 3SE +/- 1.69, N = 3SE +/- 0.39, N = 34594.274590.374280.223820.773765.883462.661. (CC) gcc options: -O3 -march=native -lm

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishGCC 10.3GCC 9.3Clang 12.0AMD AOCC 3.0Clang 11.090180270360450SE +/- 0.11, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 1.14, N = 3SE +/- 1.73, N = 3422.14412.85380.05319.79319.231. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenGCC 11.0.1GCC 10.3GCC 9.3AMD AOCC 3.0Clang 12.0Clang 11.02004006008001000SE +/- 0.58, N = 3SE +/- 2.03, N = 38098078066176146131. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.30.67581.35162.02742.70323.379SE +/- 0.00564, N = 3SE +/- 0.02389, N = 3SE +/- 0.02100, N = 3SE +/- 0.00845, N = 3SE +/- 0.00883, N = 32.287552.318592.367972.997593.00341-fopenmp=libomp - MIN: 1.91-fopenmp=libomp - MIN: 1.92-fopenmp=libomp - MIN: 2.01-fopenmp - MIN: 2.24-fopenmp - MIN: 2.351. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.30.13550.2710.40650.5420.6775SE +/- 0.000365, N = 3SE +/- 0.001652, N = 3SE +/- 0.002843, N = 3SE +/- 0.001469, N = 3SE +/- 0.001964, N = 30.4597240.4892780.4919400.5991400.602155-fopenmp=libomp - MIN: 0.44-fopenmp=libomp - MIN: 0.46-fopenmp=libomp - MIN: 0.47-fopenmp - MIN: 0.56-fopenmp - MIN: 0.571. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceGCC 9.3GCC 10.3GCC 11.0.1Clang 11.0AMD AOCC 3.0Clang 12.02004006008001000SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 1.20, N = 3SE +/- 0.67, N = 37857727716166146051. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0GCC 10.3GCC 9.3Clang 12.00.15980.31960.47940.63920.799SE +/- 0.000764, N = 3SE +/- 0.008914, N = 3SE +/- 0.003112, N = 3SE +/- 0.003317, N = 3SE +/- 0.011383, N = 30.5542310.5947290.6462520.6540100.710124-fopenmp=libomp - MIN: 0.5-fopenmp=libomp - MIN: 0.53-fopenmp - MIN: 0.6-fopenmp - MIN: 0.59-fopenmp=libomp - MIN: 0.641. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPAMD AOCC 3.0Clang 11.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.39K18K27K36K45KSE +/- 9.32, N = 3SE +/- 0.81, N = 3SE +/- 64.93, N = 3SE +/- 3.94, N = 3SE +/- 102.23, N = 3SE +/- 453.41, N = 1433146.0333178.5033246.8434199.6034979.2942399.811. (CXX) g++ options: -O3 -march=native -fopenmp

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pGCC 11.0.1Clang 11.0AMD AOCC 3.0Clang 12.0GCC 10.3GCC 9.33691215SE +/- 0.139, N = 3SE +/- 0.164, N = 4SE +/- 0.189, N = 3SE +/- 0.170, N = 3SE +/- 0.111, N = 9SE +/- 0.086, N = 311.90511.82111.69011.47411.2309.3251. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0GCC 10.3GCC 9.3Clang 12.00.45810.91621.37431.83242.2905SE +/- 0.00195, N = 3SE +/- 0.00118, N = 3SE +/- 0.00384, N = 3SE +/- 0.01150, N = 3SE +/- 0.01922, N = 121.595971.605401.642681.662602.03606-fopenmp=libomp - MIN: 1.54-fopenmp=libomp - MIN: 1.55-fopenmp - MIN: 1.58-fopenmp - MIN: 1.59-fopenmp=libomp - MIN: 1.811. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TGCC 9.3GCC 11.0.1AMD AOCC 3.0GCC 10.3Clang 11.0Clang 12.02004006008001000SE +/- 2.10, N = 14SE +/- 2.88, N = 15SE +/- 1.94, N = 12SE +/- 66.49, N = 12SE +/- 1.41, N = 14SE +/- 4.04, N = 12798.0794.0783.0741.4677.0626.0-fopenmp-fopenmp-fopenmp=libomp-fopenmp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pClang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.3GCC 9.3306090120150SE +/- 0.10, N = 3SE +/- 0.46, N = 3SE +/- 0.33, N = 3SE +/- 0.18, N = 3SE +/- 1.05, N = 3SE +/- 0.83, N = 3118.07117.39116.49110.70109.7092.981. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 11.0.1GCC 10.3GCC 9.3Clang 11.0Clang 12.0AMD AOCC 3.0500K1000K1500K2000K2500KSE +/- 5755.65, N = 3SE +/- 2170.85, N = 3SE +/- 4791.32, N = 3SE +/- 971.31, N = 3SE +/- 984.68, N = 3SE +/- 3670.84, N = 32176407.672110880.432086609.981790837.011785466.281720060.441. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: MediumAMD AOCC 3.0Clang 11.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.31.09682.19363.29044.38725.484SE +/- 0.0042, N = 3SE +/- 0.0013, N = 3SE +/- 0.0116, N = 3SE +/- 0.0099, N = 3SE +/- 0.0047, N = 3SE +/- 0.0035, N = 33.88113.98374.00584.81604.86994.87451. (CXX) g++ options: -O3 -march=native -flto -pthread

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 12.0Clang 11.0GCC 9.3GCC 10.30.0850.170.2550.340.425SE +/- 0.000492, N = 3SE +/- 0.000321, N = 3SE +/- 0.000247, N = 3SE +/- 0.000576, N = 3SE +/- 0.004341, N = 30.3018850.3136890.3155220.3769920.377733-fopenmp=libomp - MIN: 0.29-fopenmp=libomp - MIN: 0.3-fopenmp=libomp - MIN: 0.3-fopenmp - MIN: 0.36-fopenmp - MIN: 0.361. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048GCC 11.0.1GCC 10.3GCC 9.3Clang 12.0Clang 11.0AMD AOCC 3.012K24K36K48K60KSE +/- 156.75, N = 3SE +/- 743.81, N = 3SE +/- 725.00, N = 3SE +/- 439.50, N = 3SE +/- 582.34, N = 3SE +/- 756.91, N = 35471053497527495125450084444121. (CC) gcc options: -pthread -O3 -march=native -lm

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57Clang 12.0AMD AOCC 3.0Clang 11.0GCC 11.0.1GCC 9.3GCC 10.3800M1600M2400M3200M4000MSE +/- 883804.91, N = 3SE +/- 1543084.93, N = 3SE +/- 1559202.08, N = 3SE +/- 6016181.88, N = 3SE +/- 3384441.53, N = 3SE +/- 1679616.36, N = 33643766667360646666735965333333055766667301206666730050333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.3140280420560700SE +/- 0.53, N = 3SE +/- 0.83, N = 3SE +/- 9.50, N = 3SE +/- 0.64, N = 3SE +/- 0.61, N = 3544.10563.20593.97658.66659.27-fopenmp=libomp - MIN: 532.32-fopenmp=libomp - MIN: 550.23-fopenmp=libomp - MIN: 570.44-fopenmp - MIN: 639.86-fopenmp - MIN: 642.671. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3140280420560700SE +/- 0.90, N = 3SE +/- 0.25, N = 3SE +/- 1.89, N = 3SE +/- 0.83, N = 3SE +/- 1.25, N = 3544.31562.97590.18658.28659.19-fopenmp=libomp - MIN: 531.9-fopenmp=libomp - MIN: 551.49-fopenmp=libomp - MIN: 575.41-fopenmp - MIN: 639.78-fopenmp - MIN: 642.051. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.3140280420560700SE +/- 0.62, N = 3SE +/- 0.10, N = 3SE +/- 3.02, N = 3SE +/- 0.52, N = 3SE +/- 1.86, N = 3544.60563.25597.48657.88658.04-fopenmp=libomp - MIN: 532.91-fopenmp=libomp - MIN: 551.31-fopenmp=libomp - MIN: 580.8-fopenmp - MIN: 638.35-fopenmp - MIN: 635.781. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0AMD AOCC 3.0Clang 11.05001000150020002500SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.12, N = 32149.152148.842038.151785.501785.451785.421. (CC) gcc options: -O3 -march=native -lm

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianGCC 11.0.1GCC 9.3GCC 10.3AMD AOCC 3.0Clang 11.0Clang 12.0120240360480600SE +/- 1.00, N = 3SE +/- 0.33, N = 3SE +/- 1.00, N = 35505475444664634571. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUAMD AOCC 3.0GCC 10.3Clang 12.0Clang 11.0GCC 9.32K4K6K8K10KSE +/- 171.77, N = 3SE +/- 7.52, N = 3SE +/- 88.25, N = 12SE +/- 102.76, N = 8SE +/- 138.76, N = 31132510197990497979419-fopenmp=libomp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptGCC 10.3GCC 9.3AMD AOCC 3.0Clang 12.0Clang 11.090180270360450SE +/- 0.95, N = 3SE +/- 0.12, N = 3SE +/- 1.17, N = 3SE +/- 0.04, N = 3SE +/- 2.03, N = 3420.85412.07355.06351.28351.081. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2Clang 12.0 LTOClang 12.0AMD AOCC 3.0GCC 9.3GCC 10.3Clang 11.04080120160200SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3202.10202.09178.85174.81173.23168.821. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256GCC 10.3GCC 9.3Clang 11.0AMD AOCC 3.0Clang 12.012002400360048006000SE +/- 4.47, N = 3SE +/- 42.69, N = 3SE +/- 2.16, N = 3SE +/- 0.05, N = 3SE +/- 2.14, N = 35525.715484.684901.134891.074659.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ThoroughAMD AOCC 3.0Clang 12.0Clang 11.0GCC 11.0.1GCC 10.3GCC 9.3246810SE +/- 0.0015, N = 3SE +/- 0.0028, N = 3SE +/- 0.0026, N = 3SE +/- 0.0034, N = 3SE +/- 0.0011, N = 3SE +/- 0.0029, N = 36.64096.76476.76747.69897.83707.85371. (CXX) g++ options: -O3 -march=native -flto -pthread

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACClang 12.0Clang 11.0GCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.03691215SE +/- 0.007, N = 5SE +/- 0.006, N = 5SE +/- 0.011, N = 5SE +/- 0.008, N = 5SE +/- 0.006, N = 5SE +/- 0.006, N = 57.8547.9798.5348.5678.7099.280-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptGCC 10.3GCC 9.3Clang 11.0AMD AOCC 3.0Clang 12.012002400360048006000SE +/- 5.42, N = 3SE +/- 11.31, N = 3SE +/- 1.35, N = 3SE +/- 3.70, N = 3SE +/- 4.78, N = 35529.405391.994895.564887.574682.461. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.0Clang 11.0Clang 12.0246810SE +/- 0.019, N = 3SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.008, N = 3SE +/- 0.021, N = 3SE +/- 0.003, N = 37.0117.2317.4738.1428.2508.256-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceAMD AOCC 3.0Clang 11.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.3400K800K1200K1600K2000KSE +/- 2098.00, N = 5SE +/- 2852.59, N = 5SE +/- 1798.40, N = 5SE +/- 1626.80, N = 5SE +/- 956.77, N = 5SE +/- 760.80, N = 51697846163826515709661494250146717914463721. (CC) gcc options: -O3 -march=native

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedGCC 9.3GCC 11.0.1Clang 12.0Clang 11.0AMD AOCC 3.0GCC 10.330060090012001500SE +/- 1.53, N = 3SE +/- 0.33, N = 3SE +/- 1.86, N = 3SE +/- 1.53, N = 3SE +/- 0.88, N = 31217108210761068105710391. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670GCC 9.3GCC 11.0.1GCC 10.3Clang 11.0AMD AOCC 3.0Clang 12.0306090120150SE +/- 1.32, N = 3SE +/- 1.53, N = 3SE +/- 0.48, N = 3SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.53, N = 3101.54103.01103.60103.83103.93118.87-lstdc++-lstdc++-lstdc++-lstdc++1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweetsClang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 9.31.0352.073.1054.145.175SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.604.414.334.023.931. (CXX) g++ options: -O3 -march=native -pthread

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21AMD AOCC 3.0Clang 12.0 LTOClang 12.0Clang 11.0GCC 10.3GCC 9.36001200180024003000SE +/- 2.28, N = 3SE +/- 1.62, N = 3SE +/- 1.92, N = 3SE +/- 1.01, N = 3SE +/- 2.06, N = 3SE +/- 4.53, N = 32725.72657.82653.82640.22392.62338.91. (CXX) g++ options: -O3 -march=native -rdynamic

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserIDClang 12.0AMD AOCC 3.0Clang 11.0GCC 10.3GCC 9.31.03952.0793.11854.1585.1975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.624.474.414.133.981. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandomGCC 9.3GCC 10.3Clang 12.0AMD AOCC 3.0Clang 11.00.21150.4230.63450.8461.0575SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.940.900.840.820.811. (CXX) g++ options: -O3 -march=native -pthread

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUAMD AOCC 3.0GCC 10.3GCC 9.3Clang 11.0Clang 12.080160240320400SE +/- 2.50, N = 3SE +/- 0.17, N = 3SE +/- 0.50, N = 3SE +/- 1.42, N = 3SE +/- 4.15, N = 4386351351346333-fopenmp=libomp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessClang 12.0AMD AOCC 3.0Clang 11.0GCC 10.3GCC 11.0.1GCC 9.3714212835SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 325.2225.7826.0326.9127.0629.081. (CXX) g++ options: -O3 -fPIC -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096GCC 10.3GCC 9.3GCC 11.0.1Clang 11.0AMD AOCC 3.0Clang 12.011K22K33K44K55KSE +/- 844.19, N = 3SE +/- 228.68, N = 3SE +/- 227.13, N = 3SE +/- 413.24, N = 15SE +/- 542.47, N = 15SE +/- 671.66, N = 155213052099513914667645521454281. (CC) gcc options: -pthread -O3 -march=native -lm

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 12.0Clang 11.0GCC 9.3GCC 10.30.26940.53880.80821.07761.347SE +/- 0.00668, N = 3SE +/- 0.00286, N = 3SE +/- 0.00395, N = 3SE +/- 0.00597, N = 3SE +/- 0.00438, N = 31.044841.075071.075771.174341.19747-fopenmp=libomp - MIN: 0.83-fopenmp=libomp - MIN: 0.87-fopenmp=libomp - MIN: 0.86-fopenmp - MIN: 0.96-fopenmp - MIN: 0.981. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32GCC 9.3Clang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.33K6K9K12K15KSE +/- 67.28, N = 3SE +/- 24.25, N = 3SE +/- 20.33, N = 3SE +/- 41.35, N = 3SE +/- 45.16, N = 3SE +/- 16.05, N = 31439913333133241319212765125761. (CC) gcc options: -pthread -O3 -march=native -lm

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishGCC 10.3GCC 9.3Clang 12.0AMD AOCC 3.0Clang 11.070140210280350SE +/- 0.52, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3341.85337.36315.41305.00299.211. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32GCC 10.3GCC 11.0.1GCC 9.3AMD AOCC 3.0Clang 12.0Clang 11.04K8K12K16K20KSE +/- 108.41, N = 3SE +/- 168.99, N = 3SE +/- 170.19, N = 8SE +/- 5.33, N = 3SE +/- 48.79, N = 3SE +/- 129.55, N = 31665016590165901614615649145901. (CC) gcc options: -pthread -O3 -march=native -lm

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 12.0Clang 11.0GCC 9.3GCC 10.30.26530.53060.79591.06121.3265SE +/- 0.00160, N = 3SE +/- 0.00199, N = 3SE +/- 0.00127, N = 3SE +/- 0.00349, N = 3SE +/- 0.00296, N = 31.038991.077011.080111.174861.17894-fopenmp=libomp - MIN: 0.99-fopenmp=libomp - MIN: 1.04-fopenmp=libomp - MIN: 1.03-fopenmp - MIN: 1.12-fopenmp - MIN: 1.121. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionClang 11.0Clang 12.0AMD AOCC 3.0GCC 11.0.1GCC 9.3GCC 10.3246810SE +/- 0.018, N = 3SE +/- 0.004, N = 3SE +/- 0.009, N = 3SE +/- 0.021, N = 3SE +/- 0.009, N = 3SE +/- 0.006, N = 36.2436.3096.5787.0037.0537.078-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUAMD AOCC 3.0GCC 9.3GCC 10.3Clang 12.0Clang 11.0306090120150SE +/- 0.50, N = 3SE +/- 0.44, N = 3SE +/- 0.17, N = 3SE +/- 0.50, N = 3SE +/- 0.29, N = 3122116115112108-fopenmp=libomp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlGCC 11.0.1GCC 9.3GCC 10.3Clang 12.0AMD AOCC 3.0Clang 11.05001000150020002500SE +/- 4.81, N = 3SE +/- 1.20, N = 3SE +/- 1.20, N = 3SE +/- 6.57, N = 3SE +/- 4.63, N = 3SE +/- 12.41, N = 32161212921121993192919151. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57GCC 10.3GCC 9.3GCC 11.0.1AMD AOCC 3.0Clang 11.0Clang 12.013M26M39M52M65MSE +/- 6887.99, N = 3SE +/- 870702.21, N = 3SE +/- 318169.94, N = 3SE +/- 47026.00, N = 3SE +/- 40360.87, N = 3SE +/- 790005.27, N = 36246733361404000608863335741133356307000556630001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptGCC 9.3GCC 10.3Clang 12.0AMD AOCC 3.0Clang 11.070140210280350SE +/- 0.04, N = 3SE +/- 0.44, N = 3SE +/- 0.16, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3339.07325.39321.19303.81302.411. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUClang 12.0AMD AOCC 3.0Clang 11.0GCC 10.3GCC 9.30.82641.65282.47923.30564.132SE +/- 0.01639, N = 3SE +/- 0.02018, N = 3SE +/- 0.04735, N = 3SE +/- 0.02637, N = 3SE +/- 0.03246, N = 33.285073.415833.527873.611443.67278-fopenmp=libomp - MIN: 3.15-fopenmp=libomp - MIN: 3.24-fopenmp=libomp - MIN: 3.29-fopenmp - MIN: 3.37-fopenmp - MIN: 3.391. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0AMD AOCC 3.0Clang 11.02K4K6K8K10KSE +/- 20.21, N = 3SE +/- 48.56, N = 3SE +/- 57.26, N = 3SE +/- 101.36, N = 3SE +/- 43.38, N = 3SE +/- 15.16, N = 310548.010205.010179.09862.09603.29438.61. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024GCC 9.3GCC 10.3GCC 11.0.1Clang 12.0AMD AOCC 3.0Clang 11.02K4K6K8K10KSE +/- 19.46, N = 3SE +/- 41.68, N = 3SE +/- 25.87, N = 3SE +/- 48.25, N = 3SE +/- 14.28, N = 3SE +/- 45.95, N = 39798.69247.39238.49088.38902.18809.61. (CC) gcc options: -pthread -O3 -march=native -lm

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSClang 12.0AMD AOCC 3.0Clang 11.0GCC 11.0.1GCC 10.3GCC 9.360K120K180K240K300KSE +/- 1778.47, N = 3SE +/- 251.99, N = 3SE +/- 407.86, N = 3SE +/- 675.55, N = 3SE +/- 1024.96, N = 3SE +/- 537.86, N = 32652042646372601192438612427002389351. (CC) gcc options: -pedantic -O3

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080pGCC 11.0.1GCC 10.3GCC 9.3Clang 12.0Clang 11.020406080100SE +/- 1.15, N = 8SE +/- 1.76, N = 3SE +/- 1.10, N = 8SE +/- 0.31, N = 3SE +/- 0.53, N = 3111.27107.46106.55103.17100.551. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5Clang 12.0GCC 9.3GCC 10.3Clang 11.0AMD AOCC 3.0246810SE +/- 0.006, N = 3SE +/- 0.017, N = 3SE +/- 0.017, N = 3SE +/- 0.022, N = 3SE +/- 0.028, N = 36.6906.7536.9347.3667.4031. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024GCC 9.3GCC 10.3GCC 11.0.1Clang 12.0AMD AOCC 3.0Clang 11.03K6K9K12K15KSE +/- 44.20, N = 3SE +/- 32.26, N = 3SE +/- 189.35, N = 3SE +/- 27.10, N = 3SE +/- 34.64, N = 3SE +/- 35.53, N = 31168911319110441080510669105641. (CC) gcc options: -pthread -O3 -march=native -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average LatencyClang 12.0Clang 11.0GCC 9.3GCC 10.3GCC 11.0.10.39980.79961.19941.59921.999SE +/- 0.004, N = 3SE +/- 0.011, N = 3SE +/- 0.028, N = 3SE +/- 0.013, N = 3SE +/- 0.029, N = 31.6071.6261.6881.7011.7771. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read WriteClang 12.0Clang 11.0GCC 9.3GCC 10.3GCC 11.0.113K26K39K52K65KSE +/- 162.92, N = 3SE +/- 400.92, N = 3SE +/- 994.44, N = 3SE +/- 469.64, N = 3SE +/- 899.82, N = 362319616165936458894563691. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048GCC 9.3GCC 10.3GCC 11.0.1Clang 12.0AMD AOCC 3.0Clang 11.02K4K6K8K10KSE +/- 37.69, N = 3SE +/- 14.75, N = 3SE +/- 55.19, N = 3SE +/- 7.75, N = 3SE +/- 39.89, N = 3SE +/- 28.76, N = 311053.010711.010675.010467.010227.010004.21. (CC) gcc options: -pthread -O3 -march=native -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2Clang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.3GCC 9.3714212835SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 325.1825.4725.6027.1027.3927.781. (CXX) g++ options: -O3 -fPIC -lm

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57GCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.0Clang 11.0Clang 12.0400M800M1200M1600M2000MSE +/- 4864497.23, N = 3SE +/- 15763988.50, N = 3SE +/- 17297784.06, N = 3SE +/- 2130988.29, N = 3SE +/- 1331665.62, N = 3SE +/- 2255610.29, N = 31721900000171800000016798000001609633333157840000015648333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096GCC 9.3GCC 11.0.1GCC 10.3AMD AOCC 3.0Clang 11.0Clang 12.05K10K15K20K25KSE +/- 106.49, N = 3SE +/- 160.97, N = 3SE +/- 538.47, N = 9SE +/- 349.17, N = 9SE +/- 220.77, N = 3SE +/- 348.10, N = 92506824888237742311122913227971. (CC) gcc options: -pthread -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformClang 11.0AMD AOCC 3.0GCC 10.3GCC 11.0.1GCC 9.3Clang 12.090180270360450SE +/- 0.67, N = 3SE +/- 0.70, N = 3SE +/- 0.25, N = 3SE +/- 1.03, N = 3SE +/- 0.66, N = 3SE +/- 0.46, N = 3399.16398.96388.98388.88384.03363.851. (CC) gcc options: -O3 -march=native -lm

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080pGCC 11.0.1GCC 10.3GCC 9.3Clang 12.0Clang 11.020406080100SE +/- 0.47, N = 3SE +/- 0.65, N = 3SE +/- 0.89, N = 3SE +/- 1.07, N = 3SE +/- 0.51, N = 394.4093.0591.9788.7886.091. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6Clang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 9.3GCC 10.33691215SE +/- 0.014, N = 3SE +/- 0.022, N = 3SE +/- 0.016, N = 3SE +/- 0.052, N = 3SE +/- 0.031, N = 3SE +/- 0.032, N = 39.5109.5369.72510.29110.39910.4171. (CXX) g++ options: -O3 -fPIC -lm

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.330060090012001500SE +/- 1.97, N = 3SE +/- 7.11, N = 3SE +/- 3.61, N = 3SE +/- 3.05, N = 3SE +/- 1.75, N = 31259.591277.621307.491358.561379.51-fopenmp=libomp - MIN: 1247.29-fopenmp=libomp - MIN: 1252.39-fopenmp=libomp - MIN: 1293.38-fopenmp - MIN: 1337.17-fopenmp - MIN: 1361.61. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.330060090012001500SE +/- 5.94, N = 3SE +/- 9.46, N = 3SE +/- 3.92, N = 3SE +/- 4.44, N = 3SE +/- 3.72, N = 31267.181276.041302.701357.291382.41-fopenmp=libomp - MIN: 1248.35-fopenmp=libomp - MIN: 1249.65-fopenmp=libomp - MIN: 1289.86-fopenmp - MIN: 1335.63-fopenmp - MIN: 1360.581. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0Clang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.3GCC 9.31224364860SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 347.8847.8948.1351.0351.4552.221. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10Clang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.3GCC 9.30.82331.64662.46993.29324.1165SE +/- 0.014, N = 3SE +/- 0.010, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 3SE +/- 0.022, N = 3SE +/- 0.016, N = 33.3613.4293.5433.6073.6433.6591. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080pGCC 11.0.1Clang 12.0Clang 11.0GCC 10.3GCC 9.3612182430SE +/- 0.28, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 3SE +/- 0.25, N = 3SE +/- 0.13, N = 327.0126.8526.6126.4924.841. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionClang 12.0AMD AOCC 3.0GCC 9.3Clang 11.0GCC 10.390180270360450SE +/- 0.49, N = 3SE +/- 0.39, N = 3SE +/- 1.92, N = 3SE +/- 0.17, N = 3SE +/- 3.10, N = 3374.04382.99388.95392.85406.031. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7Clang 11.0AMD AOCC 3.0Clang 12.0GCC 10.3GCC 9.350100150200250SE +/- 0.66, N = 3SE +/- 0.17, N = 3SE +/- 0.07, N = 3SE +/- 0.46, N = 3SE +/- 1.32, N = 3203.63205.03207.01215.57220.941. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 10.330060090012001500SE +/- 0.58, N = 3SE +/- 9.75, N = 3SE +/- 1.78, N = 3SE +/- 4.57, N = 3SE +/- 2.65, N = 31268.081271.911305.101356.911375.71-fopenmp=libomp - MIN: 1257.35-fopenmp=libomp - MIN: 1252.33-fopenmp=libomp - MIN: 1294.76-fopenmp - MIN: 1335.04-fopenmp - MIN: 1355.681. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7Clang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 9.3306090120150SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 3109.53109.64109.81116.66118.451. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedGCC 10.3GCC 9.3GCC 11.0.1AMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTO1224364860SE +/- 0.72, N = 4SE +/- 0.65, N = 5SE +/- 0.65, N = 3SE +/- 0.26, N = 3SE +/- 0.46, N = 3SE +/- 0.42, N = 3SE +/- 0.74, N = 352.3651.9751.1750.3249.0148.5048.471. (CC) gcc options: -O3

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048GCC 9.3GCC 11.0.1GCC 10.3Clang 11.0Clang 12.0AMD AOCC 3.02K4K6K8K10KSE +/- 50.36, N = 3SE +/- 36.00, N = 3SE +/- 56.49, N = 3SE +/- 27.38, N = 3SE +/- 65.76, N = 3SE +/- 19.99, N = 38408.58231.18134.57878.57789.97784.81. (CC) gcc options: -pthread -O3 -march=native -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisAMD AOCC 3.0Clang 11.0Clang 12.0GCC 9.3GCC 11.0.1Clang 12.0 LTOGCC 10.320406080100SE +/- 0.26, N = 3SE +/- 0.98, N = 3SE +/- 0.98, N = 3SE +/- 0.16, N = 3SE +/- 0.33, N = 3SE +/- 1.09, N = 3SE +/- 1.29, N = 486.7488.6289.1289.1689.4393.6393.66-mabm-mabm-flto-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateClang 12.0GCC 9.3GCC 11.0.1GCC 10.3Clang 11.0AMD AOCC 3.0150300450600750SE +/- 2.60, N = 3SE +/- 6.43, N = 3SE +/- 5.21, N = 3SE +/- 1.33, N = 37127096946896656601. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pClang 11.0Clang 12.0AMD AOCC 3.0GCC 10.3GCC 11.0.1GCC 9.3140280420560700SE +/- 5.55, N = 3SE +/- 3.01, N = 3SE +/- 3.03, N = 3SE +/- 2.42, N = 3SE +/- 5.75, N = 3SE +/- 3.83, N = 3652.74643.58638.10615.62611.73605.501. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552GCC 9.3GCC 11.0.1GCC 10.3Clang 11.0AMD AOCC 3.0Clang 12.020406080100SE +/- 0.60, N = 3SE +/- 0.43, N = 3SE +/- 0.12, N = 3SE +/- 1.37, N = 3SE +/- 0.12, N = 3SE +/- 1.11, N = 689.0990.2690.4390.5391.9995.96-lstdc++-lstdc++-lstdc++-lstdc++1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.3246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 37.207.106.956.876.691. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pClang 11.0Clang 12.0AMD AOCC 3.0GCC 10.3GCC 11.0.1GCC 9.380160240320400SE +/- 3.43, N = 3SE +/- 1.56, N = 3SE +/- 1.09, N = 3SE +/- 1.54, N = 3SE +/- 1.51, N = 3SE +/- 1.20, N = 3346.89345.30343.85330.53329.32322.421. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIGCC 9.3AMD AOCC 3.0Clang 12.0Clang 11.0GCC 10.320406080100SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 384.8682.8382.6479.1579.121. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeClang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 9.33691215SE +/- 0.041, N = 3SE +/- 0.032, N = 3SE +/- 0.026, N = 3SE +/- 0.049, N = 3SE +/- 0.053, N = 39.2969.4089.4949.5709.9681. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024GCC 9.3GCC 10.3GCC 11.0.1Clang 11.0Clang 12.0AMD AOCC 3.011K22K33K44K55KSE +/- 788.42, N = 3SE +/- 439.64, N = 15SE +/- 568.96, N = 3SE +/- 585.78, N = 3SE +/- 952.64, N = 12SE +/- 621.84, N = 155327552054517065074050350496851. (CC) gcc options: -pthread -O3 -march=native -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessClang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 9.3GCC 11.0.1246810SE +/- 0.013, N = 3SE +/- 0.011, N = 3SE +/- 0.022, N = 3SE +/- 0.007, N = 3SE +/- 0.022, N = 3SE +/- 0.017, N = 35.7465.8795.9486.1076.1316.1491. (CXX) g++ options: -O3 -fPIC -lm

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pClang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 11.0.1GCC 9.3918273645SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 341.0941.0140.9539.0338.8638.411. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteClang 12.0Clang 11.0GCC 9.3GCC 11.0.1GCC 10.312K24K36K48K60KSE +/- 702.52, N = 15SE +/- 883.12, N = 3SE +/- 396.40, N = 3SE +/- 211.73, N = 3SE +/- 591.89, N = 756684544885382553102530191. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7Clang 12.0Clang 11.0AMD AOCC 3.03691215SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 312.1512.0111.371. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyClang 12.0Clang 11.0GCC 9.3GCC 11.0.1GCC 10.31.06452.1293.19354.2585.3225SE +/- 0.054, N = 15SE +/- 0.074, N = 3SE +/- 0.034, N = 3SE +/- 0.021, N = 3SE +/- 0.052, N = 74.4314.6034.6574.7224.7311. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 5AMD AOCC 3.0Clang 11.0Clang 12.020406080100SE +/- 0.41, N = 3SE +/- 0.24, N = 3SE +/- 0.17, N = 379.2378.4174.271. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloAMD AOCC 3.0GCC 10.3Clang 12.0Clang 11.0GCC 9.3GCC 11.0.1150300450600750SE +/- 0.18, N = 3SE +/- 1.71, N = 3SE +/- 0.40, N = 3SE +/- 0.40, N = 3SE +/- 0.14, N = 3SE +/- 0.29, N = 3690.94682.87675.13674.86668.10647.821. (CC) gcc options: -O3 -march=native -lm

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KGCC 11.0.1Clang 12.0Clang 11.0GCC 10.3GCC 9.348121620SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 317.3717.2217.1317.0316.291. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: DefaultClang 12.0Clang 11.0GCC 9.3AMD AOCC 3.0GCC 10.30.65661.31321.96982.62643.283SE +/- 0.027, N = 3SE +/- 0.031, N = 3SE +/- 0.038, N = 3SE +/- 0.010, N = 3SE +/- 0.032, N = 72.7392.7432.7782.8162.9181. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KGCC 11.0.1GCC 10.3GCC 9.3Clang 12.0Clang 11.0918273645SE +/- 0.29, N = 3SE +/- 0.19, N = 3SE +/- 0.38, N = 3SE +/- 0.43, N = 3SE +/- 0.31, N = 339.7139.3239.1238.1137.281. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KGCC 9.3GCC 11.0.1Clang 11.0GCC 10.3Clang 12.03691215SE +/- 0.11, N = 6SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 39.579.419.149.108.991. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KAMD AOCC 3.0Clang 12.0Clang 11.0GCC 9.3GCC 11.0.1GCC 10.3714212835SE +/- 0.13, N = 3SE +/- 0.23, N = 3SE +/- 0.25, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 330.4430.3229.9428.9128.7928.601. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KGCC 11.0.1GCC 10.3GCC 9.3Clang 12.0Clang 11.0816243240SE +/- 0.47, N = 3SE +/- 0.19, N = 3SE +/- 0.12, N = 3SE +/- 0.48, N = 3SE +/- 0.22, N = 335.2635.2634.5633.3933.141. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.30.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.530.530.520.520.501. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeGCC 11.0.1GCC 9.3Clang 12.0AMD AOCC 3.0GCC 10.3Clang 11.048121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 315.5015.6816.0516.0616.1516.411. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedGCC 9.3AMD AOCC 3.0GCC 10.3Clang 11.0Clang 12.0GCC 11.0.1Clang 12.0 LTO1224364860SE +/- 0.77, N = 4SE +/- 0.48, N = 3SE +/- 0.01, N = 3SE +/- 0.33, N = 3SE +/- 0.80, N = 3SE +/- 0.73, N = 4SE +/- 0.02, N = 353.8353.7752.8752.3552.0751.3250.931. (CC) gcc options: -O3

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pClang 11.0AMD AOCC 3.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.380160240320400SE +/- 1.91, N = 3SE +/- 2.72, N = 3SE +/- 1.11, N = 3SE +/- 0.70, N = 3SE +/- 0.47, N = 3SE +/- 3.83, N = 3373.99373.89372.49366.39364.12354.211. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.0Clang 12.0Clang 11.0GCC 11.0.1GCC 10.3GCC 9.3700M1400M2100M2800M3500MSE +/- 1234233.91, N = 3SE +/- 6045475.81, N = 3SE +/- 2452436.43, N = 3SE +/- 1154700.54, N = 3SE +/- 4643753.27, N = 3SE +/- 2961043.36, N = 33100400000307063333330513666672989400000294286666729404666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read OnlyClang 11.0GCC 10.3Clang 12.0GCC 9.3GCC 11.0.15K10K15K20K25KSE +/- 289.16, N = 3SE +/- 118.05, N = 3SE +/- 303.43, N = 3SE +/- 41.57, N = 3SE +/- 281.76, N = 1524943248452431023895236611. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessGCC 11.0.1Clang 11.0GCC 10.3Clang 12.0AMD AOCC 3.0GCC 9.3510152025SE +/- 0.04, N = 3SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.13, N = 318.3118.5718.8819.0219.1319.30-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pClang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 11.0.1GCC 9.3110220330440550SE +/- 1.37, N = 3SE +/- 0.23, N = 3SE +/- 2.67, N = 3SE +/- 0.24, N = 3SE +/- 1.15, N = 3SE +/- 0.82, N = 3487.43481.05476.95472.61472.32463.121. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pClang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.3GCC 9.3110220330440550SE +/- 0.73, N = 3SE +/- 1.76, N = 3SE +/- 1.94, N = 3SE +/- 1.13, N = 3SE +/- 2.08, N = 3SE +/- 0.32, N = 3488.23482.02478.62478.16477.67464.571. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average LatencyClang 11.0GCC 10.3Clang 12.0GCC 9.3GCC 11.0.10.00950.0190.02850.0380.0475SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 150.0400.0400.0410.0420.0421. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4KGCC 11.0.1GCC 10.3Clang 11.0Clang 12.0GCC 9.30.04730.09460.14190.18920.2365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.210.210.210.210.201. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptClang 12.0GCC 9.3AMD AOCC 3.0GCC 10.3Clang 11.020406080100SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 384.2384.1382.9581.4580.221. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultClang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 11.0.1GCC 9.30.31430.62860.94291.25721.5715SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 31.3311.3361.3511.3721.3861.397-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 11.0.1GCC 10.3GCC 9.3Clang 11.0AMD AOCC 3.0Clang 12.02K4K6K8K10KSE +/- 25.06, N = 3SE +/- 33.93, N = 3SE +/- 28.39, N = 3SE +/- 77.81, N = 3SE +/- 0.22, N = 3SE +/- 7.16, N = 39263.559248.899178.979146.889021.838848.401. (CC) gcc options: -O3 -march=native -lm

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080pClang 12.0Clang 11.0AMD AOCC 3.0GCC 11.0.1GCC 10.3GCC 9.330060090012001500SE +/- 2.95, N = 3SE +/- 6.69, N = 3SE +/- 0.97, N = 3SE +/- 1.75, N = 3SE +/- 3.74, N = 3SE +/- 5.12, N = 31198.221190.411188.431180.441171.041145.50MIN: 700.24 / MAX: 1494.16-lm - MIN: 685.16 / MAX: 1496.36-lm - MIN: 703.73 / MAX: 1484.94-lm - MIN: 680.31 / MAX: 1485.74-lm - MIN: 683.28 / MAX: 1473.51-lm - MIN: 664.19 / MAX: 1441.541. (CC) gcc options: -O3 -march=native -pthread

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptClang 12.0AMD AOCC 3.0GCC 10.3Clang 11.0GCC 9.3306090120150SE +/- 0.01, N = 3SE +/- 0.08, N = 3SE +/- 0.32, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3133.05128.01127.78127.74127.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256Clang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 9.3306090120150SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.33, N = 3SE +/- 0.09, N = 3132.82128.59127.77127.74127.301. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeClang 11.0AMD AOCC 3.0GCC 10.3GCC 9.3Clang 12.0GCC 11.0.17001400210028003500SE +/- 15.12, N = 3SE +/- 1.29, N = 3SE +/- 6.50, N = 3SE +/- 5.86, N = 3SE +/- 1.11, N = 3SE +/- 5.19, N = 33319.343298.293235.943229.223190.623182.351. (CC) gcc options: -O3 -march=native -lm

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9GCC 10.3GCC 9.3GCC 11.0.1Clang 12.0Clang 11.0AMD AOCC 3.050100150200250SE +/- 0.32, N = 3SE +/- 0.54, N = 3SE +/- 0.18, N = 3SE +/- 0.44, N = 3SE +/- 0.28, N = 3SE +/- 0.82, N = 3231.24232.57233.51236.92240.21240.411. (CC) gcc options: -O3 -march=native -fvisibility=hidden

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.0Clang 11.0Clang 12.015003000450060007500SE +/- 30.40, N = 3SE +/- 25.90, N = 3SE +/- 23.67, N = 3SE +/- 65.81, N = 3SE +/- 60.67, N = 3SE +/- 35.20, N = 37007.36974.06948.26875.36823.86744.11. (CC) gcc options: -pthread -O3 -march=native -lm

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ExhaustiveAMD AOCC 3.0Clang 12.0Clang 11.0GCC 10.3GCC 9.3GCC 11.0.1510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 318.9118.9919.0319.4619.4819.621. (CXX) g++ options: -O3 -march=native -flto -pthread

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionClang 11.0GCC 11.0.1AMD AOCC 3.0Clang 12.0GCC 10.3GCC 9.3918273645SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 337.7337.9538.3438.4538.5539.07-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.31.11382.22763.34144.45525.569SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 34.954.874.844.844.781. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100Clang 12.0GCC 10.3Clang 11.0AMD AOCC 3.0GCC 9.3GCC 11.0.10.51171.02341.53512.04682.5585SE +/- 0.001, N = 3SE +/- 0.010, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 3SE +/- 0.007, N = 32.1992.2252.2402.2622.2732.274-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048GCC 10.3Clang 12.0Clang 11.0GCC 11.0.1GCC 9.3AMD AOCC 3.07K14K21K28K35KSE +/- 14.99, N = 3SE +/- 77.17, N = 3SE +/- 146.10, N = 3SE +/- 209.56, N = 3SE +/- 37.37, N = 3SE +/- 378.89, N = 63206131935317413166231341310131. (CC) gcc options: -pthread -O3 -march=native -lm

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: KostyaGCC 10.3GCC 9.3Clang 12.0AMD AOCC 3.0Clang 11.00.62331.24661.86992.49323.1165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.772.752.752.732.681. (CXX) g++ options: -O3 -march=native -pthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080pClang 12.0GCC 11.0.1Clang 11.0GCC 10.3GCC 9.3510152025SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.09, N = 3SE +/- 0.11, N = 322.1322.1122.0021.6421.421. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8Clang 12.0AMD AOCC 3.0Clang 11.0714212835SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 328.1327.2927.241. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average LatencyGCC 11.0.1GCC 10.3Clang 12.0Clang 11.0GCC 9.30.02140.04280.06420.08560.107SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0920.0930.0940.0940.0951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read OnlyGCC 11.0.1GCC 10.3Clang 11.0Clang 12.0GCC 9.3200K400K600K800K1000KSE +/- 1514.63, N = 3SE +/- 183.22, N = 3SE +/- 1740.88, N = 3SE +/- 720.87, N = 3SE +/- 1623.23, N = 3109082410763571069367106902210571251. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read WriteGCC 11.0.1GCC 10.3Clang 11.0GCC 9.3Clang 12.07001400210028003500SE +/- 28.00, N = 3SE +/- 11.40, N = 3SE +/- 14.62, N = 3SE +/- 4.79, N = 3SE +/- 3.48, N = 3338333693312329832811. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pClang 12.0AMD AOCC 3.0Clang 11.0GCC 10.3GCC 9.3GCC 11.0.11632486480SE +/- 0.49, N = 3SE +/- 0.63, N = 3SE +/- 0.49, N = 3SE +/- 0.32, N = 3SE +/- 0.26, N = 3SE +/- 0.56, N = 374.0073.5173.3672.6072.1471.791. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average LatencyGCC 11.0.1GCC 10.3Clang 11.0GCC 9.3Clang 12.00.06860.13720.20580.27440.343SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.2960.2970.3020.3030.3051. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedClang 11.0Clang 12.0GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0 LTOAMD AOCC 3.03K6K9K12K15KSE +/- 23.21, N = 3SE +/- 65.90, N = 3SE +/- 17.75, N = 5SE +/- 62.74, N = 3SE +/- 6.60, N = 4SE +/- 46.50, N = 3SE +/- 33.89, N = 313927.913926.513895.313857.413806.613698.713561.51. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedClang 12.0GCC 10.3GCC 11.0.1Clang 11.0GCC 9.3Clang 12.0 LTOAMD AOCC 3.03K6K9K12K15KSE +/- 71.01, N = 3SE +/- 42.32, N = 3SE +/- 34.44, N = 4SE +/- 15.91, N = 3SE +/- 37.19, N = 4SE +/- 60.82, N = 3SE +/- 73.30, N = 313911.513906.113882.213840.313793.413715.013562.51. (CC) gcc options: -O3

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeGCC 11.0.1Clang 11.0GCC 10.3GCC 9.3Clang 12.0246810SE +/- 0.002, N = 5SE +/- 0.002, N = 5SE +/- 0.003, N = 5SE +/- 0.002, N = 5SE +/- 0.013, N = 57.3817.3927.4697.5047.567-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 8Clang 12.0AMD AOCC 3.0Clang 11.00.18450.3690.55350.7380.9225SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.820.810.801. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KClang 11.0AMD AOCC 3.0Clang 12.0GCC 11.0.1GCC 10.3GCC 9.3120240360480600SE +/- 1.43, N = 3SE +/- 1.13, N = 3SE +/- 1.79, N = 3SE +/- 2.51, N = 3SE +/- 0.67, N = 3SE +/- 1.35, N = 3543.43541.58541.56538.28536.71530.82-lm - MIN: 256.75 / MAX: 593.99-lm - MIN: 259.4 / MAX: 585.8MIN: 252.01 / MAX: 587.53-lm - MIN: 251.6 / MAX: 584.38-lm - MIN: 256.44 / MAX: 577.82-lm - MIN: 248.84 / MAX: 574.281. (CC) gcc options: -O3 -march=native -pthread

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyGCC 11.0.1GCC 10.3Clang 12.0GCC 9.3Clang 11.0200K400K600K800K1000KSE +/- 8885.95, N = 3SE +/- 8859.63, N = 3SE +/- 6289.60, N = 3SE +/- 8843.08, N = 3SE +/- 13844.42, N = 3109016010897311071209106748610655061. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyGCC 10.3GCC 11.0.1Clang 12.0Clang 11.0GCC 9.30.05290.10580.15870.21160.2645SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 30.2300.2300.2340.2350.2351. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pAMD AOCC 3.0Clang 11.0GCC 11.0.1GCC 10.3Clang 12.0GCC 9.330060090012001500SE +/- 4.95, N = 3SE +/- 2.13, N = 3SE +/- 1.96, N = 3SE +/- 8.15, N = 3SE +/- 7.87, N = 3SE +/- 2.25, N = 31251.911251.251249.741245.111244.111228.63-lm - MIN: 543.89 / MAX: 1394.16-lm - MIN: 556.46 / MAX: 1394.06-lm - MIN: 559.74 / MAX: 1387.11-lm - MIN: 539.07 / MAX: 1398.87MIN: 549.81 / MAX: 1390.03-lm - MIN: 555.28 / MAX: 1361.681. (CC) gcc options: -O3 -march=native -pthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.1770.3540.5310.7080.885SE +/- 0.001713, N = 3SE +/- 0.001200, N = 3SE +/- 0.004246, N = 3SE +/- 0.002532, N = 3SE +/- 0.002405, N = 30.7732330.7791010.7797760.7824760.786762-fopenmp=libomp - MIN: 0.72-fopenmp=libomp - MIN: 0.73-fopenmp=libomp - MIN: 0.73-fopenmp - MIN: 0.73-fopenmp - MIN: 0.751. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024GCC 9.3Clang 12.0Clang 11.0AMD AOCC 3.0GCC 10.3GCC 11.0.18K16K24K32K40KSE +/- 79.87, N = 3SE +/- 165.99, N = 3SE +/- 530.09, N = 4SE +/- 455.21, N = 12SE +/- 301.69, N = 3SE +/- 442.82, N = 33632136239361813610035973357181. (CC) gcc options: -pthread -O3 -march=native -lm

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 5Clang 12.0Clang 11.0AMD AOCC 3.01530456075SE +/- 0.14, N = 3SE +/- 0.20, N = 3SE +/- 0.17, N = 366.6665.5865.571. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 7Clang 12.0AMD AOCC 3.0Clang 11.01530456075SE +/- 0.16, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 366.3865.6865.431. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUGCC 10.3GCC 9.3Clang 11.0Clang 12.0AMD AOCC 3.012002400360048006000SE +/- 17.50, N = 3SE +/- 2.40, N = 3SE +/- 169.87, N = 9SE +/- 126.29, N = 12SE +/- 174.98, N = 1255595183452344564383-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUGCC 10.3Clang 12.0GCC 9.3Clang 11.0AMD AOCC 3.0110220330440550SE +/- 0.87, N = 3SE +/- 10.30, N = 12SE +/- 4.64, N = 12SE +/- 5.55, N = 3SE +/- 10.39, N = 12505498495471459-fopenmp-fopenmp=libomp-fopenmp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NClang 12.0GCC 9.3GCC 11.0.1GCC 10.3AMD AOCC 3.0Clang 11.01530456075SE +/- 2.22, N = 12SE +/- 4.17, N = 15SE +/- 3.83, N = 15SE +/- 5.30, N = 12SE +/- 3.44, N = 12SE +/- 3.65, N = 1569.165.063.956.255.251.2-fopenmp=libomp-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTGCC 11.0.1GCC 9.3GCC 10.3AMD AOCC 3.0Clang 11.0Clang 12.0140280420560700SE +/- 2.60, N = 15SE +/- 0.80, N = 15SE +/- 53.43, N = 12SE +/- 37.59, N = 12SE +/- 38.96, N = 15SE +/- 35.24, N = 12649.00636.00592.97477.00462.00434.00-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYGCC 11.0.1GCC 10.3GCC 9.3Clang 11.0Clang 12.0AMD AOCC 3.030060090012001500SE +/- 62.40, N = 15SE +/- 132.58, N = 12SE +/- 2.85, N = 15SE +/- 34.43, N = 15SE +/- 15.69, N = 12SE +/- 26.90, N = 121496.01350.0813.0412.0357.0326.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYGCC 9.3GCC 11.0.1GCC 10.3AMD AOCC 3.0Clang 11.0Clang 12.030060090012001500SE +/- 25.85, N = 15SE +/- 25.34, N = 15SE +/- 101.07, N = 12SE +/- 32.29, N = 12SE +/- 36.50, N = 15SE +/- 15.30, N = 121217.001210.001065.60531.00495.00471.00-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL


Phoronix Test Suite v10.8.5