EPYC 7763 LLVM Clang Compiler Tests

AMD EPYC 7763 64-Core testing with a Supermicro H12SSL-i v1.01 (2.0 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104140-IB-EPYC7763L31&grr&sro.

EPYC 7763 LLVM Clang Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 12.0Clang 11.0Clang 12.0 LTOGCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.0AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)Supermicro H12SSL-i v1.01 (2.0 BIOS)AMD Starship/Matisse126GB3841GB Micron_9300_MTFDHAL3T8TDPASPEED2 x Broadcom NetXtreme BCM5720 2-port PCIeUbuntu 20.045.12.0-051200rc6daily20210408-generic (x86_64) 20210407GNOME Shell 3.36.4X Server 1.20.8Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73ext41024x768Clang 11.0.0-2~ubuntu20.04.1Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73GCC 9.3.0GCC 10.3.0GCC 11.0.1 20210413Clang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- Clang 12.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 11.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12.0 LTO: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 9.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 10.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.0.1: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- AMD AOCC 3.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa001119Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affectedCompiler Details- GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - GCC 10.3: --disable-multilib --enable-checking=release- GCC 11.0.1: --disable-multilib --enable-checking=release- AMD AOCC 3.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)

EPYC 7763 LLVM Clang Compiler Testsfftw: Float + SSE - 2D FFT Size 4096webp2: Quality 100, Lossless Compressionjpegxl: PNG - 8onnx: bertsquad-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUsecuremark: SecureMark-TLSgcrypt: onnx: shufflenet-v2-10 - OpenMP CPUwebp2: Quality 95, Compression Effort 7fftw: Stock - 2D FFT Size 4096aom-av1: Speed 4 Two-Pass - Bosphorus 4Kgraphics-magick: Resizingonnx: yolov4 - OpenMP CPUsvt-av1: Enc Mode 0 - 1080ponnx: fcn-resnet101-11 - OpenMP CPUwebp2: Quality 75, Compression Effort 7ngspice: C7552ngspice: C2670aom-av1: Speed 6 Two-Pass - Bosphorus 4Kfftw: Float + SSE - 2D FFT Size 2048viennacl: CPU BLAS - dGEMM-TTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sCOPYaom-av1: Speed 4 Two-Pass - Bosphorus 1080paom-av1: Speed 0 Two-Pass - Bosphorus 4Kmrbayes: Primate Phylogeny Analysisjpegxl: PNG - 7onednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUcompress-lz4: 9 - Decompression Speedcompress-lz4: 9 - Compression Speedfinancebench: Repo OpenMPsimdjson: DistinctUserIDsimdjson: PartialTweetscompress-lz4: 3 - Decompression Speedcompress-lz4: 3 - Compression Speedpgbench: 100 - 250 - Read Write - Average Latencypgbench: 100 - 250 - Read Writefinancebench: Bonds OpenMPsimdjson: Kostyagraphics-magick: Noise-Gaussiangraphics-magick: Sharpengraphics-magick: Rotategraphics-magick: HWB Color Spacegraphics-magick: Enhancedgraphics-magick: Swirlpgbench: 100 - 1 - Read Only - Average Latencypgbench: 100 - 1 - Read Onlysimdjson: LargeRanddav1d: Chimera 1080p 10-bitavifenc: 0fftw: Stock - 2D FFT Size 2048aom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 0 Two-Pass - Bosphorus 1080pfftw: Float + SSE - 2D FFT Size 1024webp: Quality 100, Lossless, Highest Compressionaom-av1: Speed 6 Realtime - Bosphorus 4Kbotan: AES-256 - Decryptbotan: AES-256pgbench: 100 - 100 - Read Write - Average Latencypgbench: 100 - 100 - Read Writepgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 250 - Read Onlypgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 100 - Read Onlypgbench: 100 - 1 - Read Write - Average Latencypgbench: 100 - 1 - Read Writequantlib: botan: ChaCha20Poly1305 - Decryptbotan: ChaCha20Poly1305botan: Blowfish - Decryptbotan: Blowfishcoremark: CoreMark Size 666 - Iterations Per Secondbotan: Twofish - Decryptbotan: Twofishbotan: CAST-256 - Decryptbotan: CAST-256botan: KASUMI - Decryptbotan: KASUMIscimark2: Compositeavifenc: 6, Losslessetcpak: ETC2avifenc: 2libraw: Post-Processing Benchmarkastcenc: Exhaustiveaom-av1: Speed 6 Realtime - Bosphorus 1080pfftw: Float + SSE - 1D FFT Size 4096onednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUx265: Bosphorus 4Kjpegxl: JPEG - 5liquid-dsp: 128 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 1 - 256 - 57etcpak: ETC1webp: Quality 100, Losslessaom-av1: Speed 8 Realtime - Bosphorus 4Kjpegxl: PNG - 5svt-av1: Enc Mode 4 - 1080pjpegxl: JPEG - 7aom-av1: Speed 9 Realtime - Bosphorus 4Ktachyon: Total Timesvt-hevc: 1 - Bosphorus 1080ponednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUfftw: Float + SSE - 1D FFT Size 1024encode-flac: WAV To FLACc-ray: Total Time - 4K, 16 Rays Per Pixelencode-opus: WAV To Opus Encodeonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUpovray: Trace Timefftw: Stock - 2D FFT Size 1024jpegxl: JPEG - 8onednn: Convolution Batch Shapes Auto - u8s8f32 - CPUavifenc: 6aom-av1: Speed 9 Realtime - Bosphorus 1080pdav1d: Chimera 1080ponednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUx265: Bosphorus 1080pencode-mp3: WAV To MP3svt-av1: Enc Mode 8 - 1080ptoybrot: TBBastcenc: Thoroughwebp2: Quality 100, Compression Effort 5dav1d: Summer Nature 4Kwebp: Quality 100, Highest Compressionaom-av1: Speed 8 Realtime - Bosphorus 1080ponednn: Convolution Batch Shapes Auto - f32 - CPUtoybrot: C++ Taskstoybrot: OpenMPtoybrot: C++ Threadsfftw: Float + SSE - 1D FFT Size 2048avifenc: 10, Losslessfftw: Stock - 1D FFT Size 4096fftw: Stock - 1D FFT Size 1024astcenc: Mediumfftw: Stock - 1D FFT Size 2048webp2: Defaultavifenc: 10fftw: Float + SSE - 1D FFT Size 32fftw: Stock - 1D FFT Size 32etcpak: DXT1onednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUdav1d: Summer Nature 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080pwebp: Quality 100svt-hevc: 7 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080ptscp: AI Chess Performancewebp: Defaultsvt-hevc: 10 - Bosphorus 1080pscimark2: Jacobi Successive Over-Relaxationscimark2: Dense LU Matrix Factorizationscimark2: Sparse Matrix Multiplyscimark2: Fast Fourier Transformscimark2: Monte CarloClang 12.0Clang 11.0Clang 12.0 LTOGCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.022797374.0350.824984456265204236.9249904207.0086744.14.8721363330.183112109.52595.956118.8708.993193573.051.965.748.662669.18198786044343574717.100.2189.11612.151307.491305.101302.70590.182593.972597.48113926.548.5033246.8372394.624.6013911.552.074.4315668451596.8671872.75457614712605107619930.041243100.84308.3247.8847789.922.130.533623938.44917.224682.4554659.3381.607623190.23410712090.09410690220.30532812653.8843.404850.496351.284380.0541785466.283969321.190315.409133.048132.82084.22982.6443190.6225.220202.08525.17541.7818.993626.85454281.444250.49194030.3266.6636437666673070633333156483333355663000284.64219.01633.3974.2711.47466.3838.1116.046841.091.077011.07507503507.85415.8707.5670.3136891.172589.2969088.328.132.036069.510103.171198.223.285070.71012474.008.256118.06767806.76476.690541.566.30988.781.22132743775077220512545.7469862.0108054.0058104672.7393.36115649133332718.5252.367970.7797761244.11372.492.199345.30487.43488.2315709661.331643.581785.508848.404280.22363.85675.1322913392.8490.84714523260119240.2059797203.6346823.84.9520343460.181108109.63690.527103.8269.143174184.088.379.383.667751.2933104318774624124957.200.2188.62012.011277.621271.911276.04562.970563.200563.24713927.949.0133178.4986984.414.4113840.352.354.6035448851900.4348962.68463613665616106819150.040249430.81184.1947.8947878.522.000.533618137.72717.134895.5584901.1271.626616160.23510655060.09410693670.30233122640.2840.637848.236351.075319.2341790837.010000302.405299.214127.740128.58680.22179.1493319.3426.034168.81925.47238.7119.025526.61466761.457570.48927829.9465.5835965333333051366667157840000056307000205.06518.57333.1478.4111.82165.4337.2816.409941.011.080111.07577507407.97915.5997.3920.3155221.151409.4088809.627.241.605409.536100.551190.413.527870.59472973.368.250117.39262476.76747.366543.436.24386.090.841169683670296395500845.8799438.6105643.983710004.22.7433.42914590133241872.7592.318590.7791011251.25373.992.240346.89481.05482.0216382651.336652.741785.429146.884590.37399.16674.8693.63313698.748.4713715.050.932657.8202.101284.7637085736771432719.98525068388.9464955183238935232.5729419220.9447007.34.7812383510.129116118.44789.091101.5359.573134197.9100.995.398.579865.011331521158763681312176.690.289.1631358.561356.911357.29659.191658.660657.87613895.351.9742399.8077573.983.9313793.453.834.6575382576805.5807292.75547806709785121721290.042238950.94305.3652.2178408.521.420.53632139.07216.295391.9905484.6761.688593640.23510674860.09510571250.30332982338.9611.977616.096412.072412.8462086609.978010339.069337.355127.343127.29884.13084.8643229.2229.080174.81227.78460.2019.479424.84520997.192130.59914028.9130120666672940466667172190000061404000269.67319.29834.569.32539.1215.683738.411.174861.17434532758.5349.1587.5040.3769920.7177829.9689798.61.6626010.399106.551145.503.672780.65401072.147.01192.98451077.85376.753530.827.05391.970.869308541454515142527496.13110548116894.8745110532.7783.65916590143991082.3652.997590.7867621228.63354.212.273322.42463.12464.5714463721.397605.502149.159178.973765.88384.03668.1023774406.0275055559242700231.23810197215.5656974.04.8412083510.169115116.65590.432103.5989.103206198.510494.498.7741.456.21056.422158.41461.2592.971350.01065.606.870.2193.6561379.511375.711382.41658.277659.265658.03813806.652.3634979.2942714.134.0213906.152.874.7315301951770.5091142.77544807689772103921120.040248450.9316.1451.4548134.521.640.523597338.54817.035529.4025525.7101.701588940.23010897310.09310763570.29733692392.6476.175485.019420.853422.1382110880.427978325.389341.847127.775127.74181.45379.1153235.9426.911173.22627.38658.9019.458326.49521307.236860.60215528.6030050333332942866667171800000062467333281.14618.88335.2611.23039.3216.146839.031.178941.19747520548.5679.0297.4690.3777330.7881929.5709247.31.6426810.417107.461171.043.611440.64625272.607.231109.69751817.83706.934536.717.07893.050.870784561055245383534976.10710179113194.8699107112.9183.64316650125761114.6033.003410.7824761245.11364.122.225330.53472.61477.6714671791.372615.622038.159248.893820.77388.98682.8724888243861233.5146948.24.8411880.17690.264103.0059.413166299.310495.0100.579463.9115323591599649149612106.950.2189.43213857.451.1734199.60026013882.251.324.7225310251376.816406550809694771108221610.04223661334.3551.0348231.122.110.523571837.94817.371.777563690.23010901600.09210908240.29633832176407.6659293182.3527.05727.10357.2419.618927.015139128.793055766667298940000016798000006088633318.31435.2611.90539.7115.498938.86517068.7099.2277.3819238.410.291111.271180.4471.797.473110.7027.6989538.287.00394.40547106.14910205110444.8160106753.60716590127651249.74366.392.274329.32472.32478.1614942501.386611.732148.849263.553462.66388.88647.8223111382.9850.814594383264637240.40511325205.0346875.318663860.183122109.81191.986103.9293101384.490.078.884.078355.211651017194447732653186.74211.371259.591268.081267.18544.306544.099544.60013561.550.3233146.0286464.474.3313562.553.7751885.5195312.73466617660614105719290.82192.0048.1277784.83610038.3384887.5734891.0722725.7838.089845.141355.059319.7871720060.441307303.806304.996128.008127.76882.94982.8273298.2925.783178.85225.59841.6418.9127455211.370590.45972430.4465.5736064666673100400000160963333357411333211.73319.12679.2311.69065.6816.058140.951.038991.04484496859.28015.6490.3018851.170449.4948902.127.291.595979.7251188.433.415830.55423173.518.142116.49369456.64097.403541.586.5780.833921718974777144444125.9489603.2106693.8811102272.8163.54316146131922654.7212.287550.7732331251.91373.892.262343.85476.95478.6216978461.351638.101785.459021.834594.27398.96690.94OpenBenchmarking.org

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.35K10K15K20K25KSE +/- 349.17, N = 9SE +/- 220.77, N = 3SE +/- 348.10, N = 9SE +/- 538.47, N = 9SE +/- 160.97, N = 3SE +/- 106.49, N = 32311122913227972377424888250681. (CC) gcc options: -pthread -O3 -march=native -lm

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.390180270360450SE +/- 0.39, N = 3SE +/- 0.17, N = 3SE +/- 0.49, N = 3SE +/- 3.10, N = 3SE +/- 1.92, N = 3382.99392.85374.04406.03388.951. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 8AMD AOCC 3.0Clang 11.0Clang 12.00.18450.3690.55350.7380.9225SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.810.800.821. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3110220330440550SE +/- 10.39, N = 12SE +/- 5.55, N = 3SE +/- 10.30, N = 12SE +/- 0.87, N = 3SE +/- 4.64, N = 12459471498505495-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.312002400360048006000SE +/- 174.98, N = 12SE +/- 169.87, N = 9SE +/- 126.29, N = 12SE +/- 17.50, N = 3SE +/- 2.40, N = 343834523445655595183-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.360K120K180K240K300KSE +/- 251.99, N = 3SE +/- 407.86, N = 3SE +/- 1778.47, N = 3SE +/- 1024.96, N = 3SE +/- 675.55, N = 3SE +/- 537.86, N = 32646372601192652042427002438612389351. (CC) gcc options: -pedantic -O3

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.350100150200250SE +/- 0.82, N = 3SE +/- 0.28, N = 3SE +/- 0.44, N = 3SE +/- 0.32, N = 3SE +/- 0.18, N = 3SE +/- 0.54, N = 3240.41240.21236.92231.24233.51232.571. (CC) gcc options: -O3 -march=native -fvisibility=hidden

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.32K4K6K8K10KSE +/- 171.77, N = 3SE +/- 102.76, N = 8SE +/- 88.25, N = 12SE +/- 7.52, N = 3SE +/- 138.76, N = 31132597979904101979419-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.350100150200250SE +/- 0.17, N = 3SE +/- 0.66, N = 3SE +/- 0.07, N = 3SE +/- 0.46, N = 3SE +/- 1.32, N = 3205.03203.63207.01215.57220.941. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.315003000450060007500SE +/- 65.81, N = 3SE +/- 60.67, N = 3SE +/- 35.20, N = 3SE +/- 25.90, N = 3SE +/- 23.67, N = 3SE +/- 30.40, N = 36875.36823.86744.16974.06948.27007.31. (CC) gcc options: -pthread -O3 -march=native -lm

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31.11382.22763.34144.45525.569SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 34.954.874.844.844.781. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.35001000150020002500SE +/- 52.84, N = 15SE +/- 27.29, N = 3SE +/- 41.63, N = 12SE +/- 14.93, N = 3SE +/- 17.34, N = 3SE +/- 18.77, N = 31866203421361208118812381. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.380160240320400SE +/- 2.50, N = 3SE +/- 1.42, N = 3SE +/- 4.15, N = 4SE +/- 0.17, N = 3SE +/- 0.50, N = 3386346333351351-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.04120.08240.12360.16480.206SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1830.1810.1830.1690.1760.1291. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3306090120150SE +/- 0.50, N = 3SE +/- 0.29, N = 3SE +/- 0.50, N = 3SE +/- 0.17, N = 3SE +/- 0.44, N = 3122108112115116-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3306090120150SE +/- 0.09, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 3109.81109.64109.53116.66118.451. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.12, N = 3SE +/- 1.37, N = 3SE +/- 1.11, N = 6SE +/- 0.12, N = 3SE +/- 0.43, N = 3SE +/- 0.60, N = 391.9990.5395.9690.4390.2689.09-lstdc++-lstdc++-lstdc++-lstdc++1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3306090120150SE +/- 0.22, N = 3SE +/- 0.06, N = 3SE +/- 0.53, N = 3SE +/- 0.48, N = 3SE +/- 1.53, N = 3SE +/- 1.32, N = 3103.93103.83118.87103.60103.01101.54-lstdc++-lstdc++-lstdc++-lstdc++1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.33691215SE +/- 0.03, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.11, N = 69.148.999.109.419.571. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.37K14K21K28K35KSE +/- 378.89, N = 6SE +/- 146.10, N = 3SE +/- 77.17, N = 3SE +/- 14.99, N = 3SE +/- 209.56, N = 3SE +/- 37.37, N = 33101331741319353206131662313411. (CC) gcc options: -pthread -O3 -march=native -lm

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.08, N = 12SE +/- 0.02, N = 14SE +/- 0.07, N = 12SE +/- 0.60, N = 12SE +/- 0.05, N = 15SE +/- 0.05, N = 1584.484.073.098.599.397.9-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.05, N = 12SE +/- 0.02, N = 15SE +/- 0.09, N = 12SE +/- 0.62, N = 12SE +/- 0.08, N = 1590.088.351.9104.0104.0100.9-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.07, N = 12SE +/- 0.03, N = 15SE +/- 0.56, N = 12SE +/- 0.59, N = 12SE +/- 0.08, N = 15SE +/- 0.07, N = 1578.879.365.794.495.095.3-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.04, N = 12SE +/- 0.06, N = 15SE +/- 0.05, N = 12SE +/- 1.05, N = 12SE +/- 0.29, N = 15SE +/- 0.16, N = 1584.083.648.698.7100.598.5-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32004006008001000SE +/- 1.94, N = 12SE +/- 1.41, N = 14SE +/- 4.04, N = 12SE +/- 66.49, N = 12SE +/- 2.88, N = 15SE +/- 2.10, N = 14783.0677.0626.0741.4794.0798.0-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31530456075SE +/- 3.44, N = 12SE +/- 3.65, N = 15SE +/- 2.22, N = 12SE +/- 5.30, N = 12SE +/- 3.83, N = 15SE +/- 4.17, N = 1555.251.269.156.263.965.0-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.330060090012001500SE +/- 2.61, N = 12SE +/- 1.49, N = 15SE +/- 17.06, N = 12SE +/- 95.41, N = 12SE +/- 1.87, N = 15SE +/- 1.59, N = 151165.00933.00819.001056.421153.001133.00-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.35001000150020002500SE +/- 3.59, N = 12SE +/- 1.59, N = 15SE +/- 20.06, N = 12SE +/- 194.35, N = 12SE +/- 2.74, N = 15SE +/- 2.06, N = 151017.01043.0878.02158.42359.01521.0-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3400800120016002000SE +/- 9.88, N = 12SE +/- 8.32, N = 15SE +/- 15.32, N = 11SE +/- 131.59, N = 12SE +/- 2.67, N = 15SE +/- 9.19, N = 151944.01877.0604.01461.21599.01587.0-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3140280420560700SE +/- 37.59, N = 12SE +/- 38.96, N = 15SE +/- 35.24, N = 12SE +/- 53.43, N = 12SE +/- 2.60, N = 15SE +/- 0.80, N = 15477.00462.00434.00592.97649.00636.00-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.330060090012001500SE +/- 26.90, N = 12SE +/- 34.43, N = 15SE +/- 15.69, N = 12SE +/- 132.58, N = 12SE +/- 62.40, N = 15SE +/- 2.85, N = 15326.0412.0357.01350.01496.0813.0-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.330060090012001500SE +/- 32.29, N = 12SE +/- 36.50, N = 15SE +/- 15.30, N = 12SE +/- 101.07, N = 12SE +/- 25.34, N = 15SE +/- 25.85, N = 15531.00495.00471.001065.601210.001217.00-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp-fopenmp-fopenmp-fopenmp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3246810SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 37.207.106.876.956.691. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.04730.09460.14190.18920.2365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.210.210.210.210.201. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.26, N = 3SE +/- 0.98, N = 3SE +/- 0.98, N = 3SE +/- 1.09, N = 3SE +/- 1.29, N = 4SE +/- 0.33, N = 3SE +/- 0.16, N = 386.7488.6289.1293.6393.6689.4389.16-flto-mabm-mabm-mabm1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7AMD AOCC 3.0Clang 11.0Clang 12.03691215SE +/- 0.08, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 311.3712.0112.151. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.330060090012001500SE +/- 1.97, N = 3SE +/- 7.11, N = 3SE +/- 3.61, N = 3SE +/- 1.75, N = 3SE +/- 3.05, N = 31259.591277.621307.491379.511358.56-fopenmp=libomp - MIN: 1247.29-fopenmp=libomp - MIN: 1252.39-fopenmp=libomp - MIN: 1293.38-fopenmp - MIN: 1361.6-fopenmp - MIN: 1337.171. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.330060090012001500SE +/- 0.58, N = 3SE +/- 9.75, N = 3SE +/- 1.78, N = 3SE +/- 2.65, N = 3SE +/- 4.57, N = 31268.081271.911305.101375.711356.91-fopenmp=libomp - MIN: 1257.35-fopenmp=libomp - MIN: 1252.33-fopenmp=libomp - MIN: 1294.76-fopenmp - MIN: 1355.68-fopenmp - MIN: 1335.041. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.330060090012001500SE +/- 5.94, N = 3SE +/- 9.46, N = 3SE +/- 3.92, N = 3SE +/- 3.72, N = 3SE +/- 4.44, N = 31267.181276.041302.701382.411357.29-fopenmp=libomp - MIN: 1248.35-fopenmp=libomp - MIN: 1249.65-fopenmp=libomp - MIN: 1289.86-fopenmp - MIN: 1360.58-fopenmp - MIN: 1335.631. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3140280420560700SE +/- 0.90, N = 3SE +/- 0.25, N = 3SE +/- 1.89, N = 3SE +/- 0.83, N = 3SE +/- 1.25, N = 3544.31562.97590.18658.28659.19-fopenmp=libomp - MIN: 531.9-fopenmp=libomp - MIN: 551.49-fopenmp=libomp - MIN: 575.41-fopenmp - MIN: 639.78-fopenmp - MIN: 642.051. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3140280420560700SE +/- 0.53, N = 3SE +/- 0.83, N = 3SE +/- 9.50, N = 3SE +/- 0.61, N = 3SE +/- 0.64, N = 3544.10563.20593.97659.27658.66-fopenmp=libomp - MIN: 532.32-fopenmp=libomp - MIN: 550.23-fopenmp=libomp - MIN: 570.44-fopenmp - MIN: 642.67-fopenmp - MIN: 639.861. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3140280420560700SE +/- 0.62, N = 3SE +/- 0.10, N = 3SE +/- 3.02, N = 3SE +/- 1.86, N = 3SE +/- 0.52, N = 3544.60563.25597.48658.04657.88-fopenmp=libomp - MIN: 532.91-fopenmp=libomp - MIN: 551.31-fopenmp=libomp - MIN: 580.8-fopenmp - MIN: 635.78-fopenmp - MIN: 638.351. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 11.0.1GCC 9.33K6K9K12K15KSE +/- 33.89, N = 3SE +/- 23.21, N = 3SE +/- 65.90, N = 3SE +/- 46.50, N = 3SE +/- 6.60, N = 4SE +/- 62.74, N = 3SE +/- 17.75, N = 513561.513927.913926.513698.713806.613857.413895.31. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 11.0.1GCC 9.31224364860SE +/- 0.26, N = 3SE +/- 0.46, N = 3SE +/- 0.42, N = 3SE +/- 0.74, N = 3SE +/- 0.72, N = 4SE +/- 0.65, N = 3SE +/- 0.65, N = 550.3249.0148.5048.4752.3651.1751.971. (CC) gcc options: -O3

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.39K18K27K36K45KSE +/- 9.32, N = 3SE +/- 0.81, N = 3SE +/- 64.93, N = 3SE +/- 102.23, N = 3SE +/- 3.94, N = 3SE +/- 453.41, N = 1433146.0333178.5033246.8434979.2934199.6042399.811. (CXX) g++ options: -O3 -march=native -fopenmp

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserIDAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.31.03952.0793.11854.1585.1975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 34.474.414.624.133.981. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweetsAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.31.0352.073.1054.145.175SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.334.414.604.023.931. (CXX) g++ options: -O3 -march=native -pthread

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 11.0.1GCC 9.33K6K9K12K15KSE +/- 73.30, N = 3SE +/- 15.91, N = 3SE +/- 71.01, N = 3SE +/- 60.82, N = 3SE +/- 42.32, N = 3SE +/- 34.44, N = 4SE +/- 37.19, N = 413562.513840.313911.513715.013906.113882.213793.41. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 11.0.1GCC 9.31224364860SE +/- 0.48, N = 3SE +/- 0.33, N = 3SE +/- 0.80, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.73, N = 4SE +/- 0.77, N = 453.7752.3552.0750.9352.8751.3253.831. (CC) gcc options: -O3

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31.06452.1293.19354.2585.3225SE +/- 0.074, N = 3SE +/- 0.054, N = 15SE +/- 0.052, N = 7SE +/- 0.021, N = 3SE +/- 0.034, N = 34.6034.4314.7314.7224.6571. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.312K24K36K48K60KSE +/- 883.12, N = 3SE +/- 702.52, N = 15SE +/- 591.89, N = 7SE +/- 211.73, N = 3SE +/- 396.40, N = 354488566845301953102538251. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.316K32K48K64K80KSE +/- 242.64, N = 3SE +/- 4.51, N = 3SE +/- 10.95, N = 3SE +/- 23.55, N = 3SE +/- 42.62, N = 3SE +/- 971.24, N = 351885.5251900.4351596.8751770.5151376.8276805.581. (CXX) g++ options: -O3 -march=native -fopenmp

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: KostyaAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.62331.24661.86992.49323.1165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.732.682.752.772.751. (CXX) g++ options: -O3 -march=native -pthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3120240360480600SE +/- 0.33, N = 3SE +/- 1.00, N = 3SE +/- 1.00, N = 34664634575445505471. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32004006008001000SE +/- 0.58, N = 3SE +/- 2.03, N = 36176136148078098061. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3150300450600750SE +/- 1.33, N = 3SE +/- 2.60, N = 3SE +/- 5.21, N = 3SE +/- 6.43, N = 36606657126896947091. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32004006008001000SE +/- 1.20, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 3SE +/- 0.67, N = 36146166057727717851. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.330060090012001500SE +/- 1.53, N = 3SE +/- 1.86, N = 3SE +/- 0.88, N = 3SE +/- 0.33, N = 3SE +/- 1.53, N = 31057106810761039108212171. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.35001000150020002500SE +/- 4.63, N = 3SE +/- 12.41, N = 3SE +/- 6.57, N = 3SE +/- 1.20, N = 3SE +/- 4.81, N = 3SE +/- 1.20, N = 31929191519932112216121291. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average LatencyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.00950.0190.02850.0380.0475SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 15SE +/- 0.000, N = 30.0400.0410.0400.0420.0421. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read OnlyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.35K10K15K20K25KSE +/- 289.16, N = 3SE +/- 303.43, N = 3SE +/- 118.05, N = 3SE +/- 281.76, N = 15SE +/- 41.57, N = 324943243102484523661238951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandomAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.21150.4230.63450.8461.0575SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.820.810.840.900.941. (CXX) g++ options: -O3 -march=native -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bitAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.370140210280350SE +/- 0.39, N = 3SE +/- 0.48, N = 3SE +/- 0.93, N = 3SE +/- 0.21, N = 3SE +/- 1.11, N = 3SE +/- 0.71, N = 3192.00184.19308.32316.14334.35305.36-lm - MIN: 118.57 / MAX: 324.98-lm - MIN: 114.52 / MAX: 310.5MIN: 220.53 / MAX: 490.51-lm - MIN: 218.19 / MAX: 515.85-lm - MIN: 234.24 / MAX: 544.9-lm - MIN: 210.86 / MAX: 493.211. (CC) gcc options: -O3 -march=native -pthread

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31224364860SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 348.1347.8947.8851.4551.0352.221. (CXX) g++ options: -O3 -fPIC -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32K4K6K8K10KSE +/- 19.99, N = 3SE +/- 27.38, N = 3SE +/- 65.76, N = 3SE +/- 56.49, N = 3SE +/- 36.00, N = 3SE +/- 50.36, N = 37784.87878.57789.98134.58231.18408.51. (CC) gcc options: -pthread -O3 -march=native -lm

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3510152025SE +/- 0.15, N = 3SE +/- 0.05, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 322.0022.1321.6422.1121.421. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.530.530.520.520.501. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.38K16K24K32K40KSE +/- 455.21, N = 12SE +/- 530.09, N = 4SE +/- 165.99, N = 3SE +/- 301.69, N = 3SE +/- 442.82, N = 3SE +/- 79.87, N = 33610036181362393597335718363211. (CC) gcc options: -pthread -O3 -march=native -lm

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3918273645SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 338.3437.7338.4538.5537.9539.07-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.348121620SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 317.1317.2217.0317.3716.291. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.312002400360048006000SE +/- 3.70, N = 3SE +/- 1.35, N = 3SE +/- 4.78, N = 3SE +/- 5.42, N = 3SE +/- 11.31, N = 34887.574895.564682.465529.405391.991. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.312002400360048006000SE +/- 0.05, N = 3SE +/- 2.16, N = 3SE +/- 2.14, N = 3SE +/- 4.47, N = 3SE +/- 42.69, N = 34891.074901.134659.345525.715484.681. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average LatencyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.39980.79961.19941.59921.999SE +/- 0.011, N = 3SE +/- 0.004, N = 3SE +/- 0.013, N = 3SE +/- 0.029, N = 3SE +/- 0.028, N = 31.6261.6071.7011.7771.6881. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read WriteClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.313K26K39K52K65KSE +/- 400.92, N = 3SE +/- 162.92, N = 3SE +/- 469.64, N = 3SE +/- 899.82, N = 3SE +/- 994.44, N = 361616623195889456369593641. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.05290.10580.15870.21160.2645SE +/- 0.003, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 30.2350.2340.2300.2300.2351. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3200K400K600K800K1000KSE +/- 13844.42, N = 3SE +/- 6289.60, N = 3SE +/- 8859.63, N = 3SE +/- 8885.95, N = 3SE +/- 8843.08, N = 3106550610712091089731109016010674861. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average LatencyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.02140.04280.06420.08560.107SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0940.0940.0930.0920.0951. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read OnlyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3200K400K600K800K1000KSE +/- 1740.88, N = 3SE +/- 720.87, N = 3SE +/- 183.22, N = 3SE +/- 1514.63, N = 3SE +/- 1623.23, N = 3106936710690221076357109082410571251. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average LatencyClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.06860.13720.20580.27440.343SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 30.3020.3050.2970.2960.3031. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read WriteClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.37001400210028003500SE +/- 14.62, N = 3SE +/- 3.48, N = 3SE +/- 11.40, N = 3SE +/- 28.00, N = 3SE +/- 4.79, N = 3331232813369338332981. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21AMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.36001200180024003000SE +/- 2.28, N = 3SE +/- 1.01, N = 3SE +/- 1.92, N = 3SE +/- 1.62, N = 3SE +/- 2.06, N = 3SE +/- 4.53, N = 32725.72640.22653.82657.82392.62338.91. (CXX) g++ options: -O3 -march=native -rdynamic

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.32004006008001000SE +/- 3.17, N = 3SE +/- 0.16, N = 3SE +/- 4.64, N = 3SE +/- 0.02, N = 3SE +/- 0.40, N = 3838.09840.64843.40476.18611.981. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.32004006008001000SE +/- 3.15, N = 3SE +/- 0.62, N = 3SE +/- 4.85, N = 3SE +/- 0.28, N = 3SE +/- 0.13, N = 3845.14848.24850.50485.02616.101. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.390180270360450SE +/- 1.17, N = 3SE +/- 2.03, N = 3SE +/- 0.04, N = 3SE +/- 0.95, N = 3SE +/- 0.12, N = 3355.06351.08351.28420.85412.071. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.390180270360450SE +/- 1.14, N = 3SE +/- 1.73, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.09, N = 3319.79319.23380.05422.14412.851. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3500K1000K1500K2000K2500KSE +/- 3670.84, N = 3SE +/- 971.31, N = 3SE +/- 984.68, N = 3SE +/- 2170.85, N = 3SE +/- 5755.65, N = 3SE +/- 4791.32, N = 31720060.441790837.011785466.282110880.432176407.672086609.981. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.370140210280350SE +/- 0.06, N = 3SE +/- 0.15, N = 3SE +/- 0.16, N = 3SE +/- 0.44, N = 3SE +/- 0.04, N = 3303.81302.41321.19325.39339.071. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.370140210280350SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.52, N = 3SE +/- 0.04, N = 3305.00299.21315.41341.85337.361. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3306090120150SE +/- 0.08, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.32, N = 3SE +/- 0.08, N = 3128.01127.74133.05127.78127.341. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3306090120150SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.33, N = 3SE +/- 0.09, N = 3127.77128.59132.82127.74127.301. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.320406080100SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 382.9580.2284.2381.4584.131. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.320406080100SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 382.8379.1582.6479.1284.861. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.37001400210028003500SE +/- 1.29, N = 3SE +/- 15.12, N = 3SE +/- 1.11, N = 3SE +/- 6.50, N = 3SE +/- 5.19, N = 3SE +/- 5.86, N = 33298.293319.343190.623235.943182.353229.221. (CC) gcc options: -O3 -march=native -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3714212835SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.06, N = 325.7826.0325.2226.9127.0629.081. (CXX) g++ options: -O3 -fPIC -lm

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2AMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.34080120160200SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.09, N = 3178.85168.82202.09202.10173.23174.811. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3714212835SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 325.6025.4725.1827.3927.1027.781. (CXX) g++ options: -O3 -fPIC -lm

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31326395265SE +/- 0.04, N = 3SE +/- 0.33, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3SE +/- 0.16, N = 3SE +/- 0.19, N = 341.6438.7141.7858.9057.2460.201. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ExhaustiveAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3510152025SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 318.9119.0318.9919.4619.6219.481. (CXX) g++ options: -O3 -march=native -flto -pthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3612182430SE +/- 0.13, N = 3SE +/- 0.27, N = 3SE +/- 0.25, N = 3SE +/- 0.28, N = 3SE +/- 0.13, N = 326.6126.8526.4927.0124.841. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.311K22K33K44K55KSE +/- 542.47, N = 15SE +/- 413.24, N = 15SE +/- 671.66, N = 15SE +/- 844.19, N = 3SE +/- 227.13, N = 3SE +/- 228.68, N = 34552146676454285213051391520991. (CC) gcc options: -pthread -O3 -march=native -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3246810SE +/- 0.00485, N = 3SE +/- 0.00568, N = 3SE +/- 0.00123, N = 3SE +/- 0.03687, N = 3SE +/- 0.02683, N = 31.370591.457571.444257.236867.19213-fopenmp=libomp - MIN: 1.28-fopenmp=libomp - MIN: 1.35-fopenmp=libomp - MIN: 1.34-fopenmp - MIN: 6.18-fopenmp - MIN: 6.141. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.13550.2710.40650.5420.6775SE +/- 0.000365, N = 3SE +/- 0.001652, N = 3SE +/- 0.002843, N = 3SE +/- 0.001964, N = 3SE +/- 0.001469, N = 30.4597240.4892780.4919400.6021550.599140-fopenmp=libomp - MIN: 0.44-fopenmp=libomp - MIN: 0.46-fopenmp=libomp - MIN: 0.47-fopenmp - MIN: 0.57-fopenmp - MIN: 0.561. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3714212835SE +/- 0.13, N = 3SE +/- 0.25, N = 3SE +/- 0.23, N = 3SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 330.4429.9430.3228.6028.7928.911. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 5AMD AOCC 3.0Clang 11.0Clang 12.01530456075SE +/- 0.17, N = 3SE +/- 0.20, N = 3SE +/- 0.14, N = 365.5765.5866.661. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3800M1600M2400M3200M4000MSE +/- 1543084.93, N = 3SE +/- 1559202.08, N = 3SE +/- 883804.91, N = 3SE +/- 1679616.36, N = 3SE +/- 6016181.88, N = 3SE +/- 3384441.53, N = 33606466667359653333336437666673005033333305576666730120666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3700M1400M2100M2800M3500MSE +/- 1234233.91, N = 3SE +/- 2452436.43, N = 3SE +/- 6045475.81, N = 3SE +/- 4643753.27, N = 3SE +/- 1154700.54, N = 3SE +/- 2961043.36, N = 33100400000305136666730706333332942866667298940000029404666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3400M800M1200M1600M2000MSE +/- 2130988.29, N = 3SE +/- 1331665.62, N = 3SE +/- 2255610.29, N = 3SE +/- 15763988.50, N = 3SE +/- 17297784.06, N = 3SE +/- 4864497.23, N = 31609633333157840000015648333331718000000167980000017219000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.313M26M39M52M65MSE +/- 47026.00, N = 3SE +/- 40360.87, N = 3SE +/- 790005.27, N = 3SE +/- 6887.99, N = 3SE +/- 318169.94, N = 3SE +/- 870702.21, N = 35741133356307000556630006246733360886333614040001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1AMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.360120180240300SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3211.73205.07284.64284.76281.15269.671. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3510152025SE +/- 0.03, N = 3SE +/- 0.13, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 319.1318.5719.0218.8818.3119.30-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3816243240SE +/- 0.22, N = 3SE +/- 0.48, N = 3SE +/- 0.19, N = 3SE +/- 0.47, N = 3SE +/- 0.12, N = 333.1433.3935.2635.2634.561. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 5AMD AOCC 3.0Clang 11.0Clang 12.020406080100SE +/- 0.41, N = 3SE +/- 0.24, N = 3SE +/- 0.17, N = 379.2378.4174.271. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.33691215SE +/- 0.189, N = 3SE +/- 0.164, N = 4SE +/- 0.170, N = 3SE +/- 0.111, N = 9SE +/- 0.139, N = 3SE +/- 0.086, N = 311.69011.82111.47411.23011.9059.3251. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 7AMD AOCC 3.0Clang 11.0Clang 12.01530456075SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.16, N = 365.6865.4366.381. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3918273645SE +/- 0.31, N = 3SE +/- 0.43, N = 3SE +/- 0.19, N = 3SE +/- 0.29, N = 3SE +/- 0.38, N = 337.2838.1139.3239.7139.121. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.348121620SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 316.0616.4116.0516.1515.5015.681. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3918273645SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.17, N = 3SE +/- 0.05, N = 3SE +/- 0.17, N = 3SE +/- 0.18, N = 340.9541.0141.0939.0338.8638.411. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.26530.53060.79591.06121.3265SE +/- 0.00160, N = 3SE +/- 0.00127, N = 3SE +/- 0.00199, N = 3SE +/- 0.00296, N = 3SE +/- 0.00349, N = 31.038991.080111.077011.178941.17486-fopenmp=libomp - MIN: 0.99-fopenmp=libomp - MIN: 1.03-fopenmp=libomp - MIN: 1.04-fopenmp - MIN: 1.12-fopenmp - MIN: 1.121. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.26940.53880.80821.07761.347SE +/- 0.00668, N = 3SE +/- 0.00395, N = 3SE +/- 0.00286, N = 3SE +/- 0.00438, N = 3SE +/- 0.00597, N = 31.044841.075771.075071.197471.17434-fopenmp=libomp - MIN: 0.83-fopenmp=libomp - MIN: 0.86-fopenmp=libomp - MIN: 0.87-fopenmp - MIN: 0.98-fopenmp - MIN: 0.961. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.311K22K33K44K55KSE +/- 621.84, N = 15SE +/- 585.78, N = 3SE +/- 952.64, N = 12SE +/- 439.64, N = 15SE +/- 568.96, N = 3SE +/- 788.42, N = 34968550740503505205451706532751. (CC) gcc options: -pthread -O3 -march=native -lm

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.33691215SE +/- 0.006, N = 5SE +/- 0.006, N = 5SE +/- 0.007, N = 5SE +/- 0.008, N = 5SE +/- 0.006, N = 5SE +/- 0.011, N = 59.2807.9797.8548.5678.7098.534-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.348121620SE +/- 0.063, N = 3SE +/- 0.009, N = 3SE +/- 0.023, N = 3SE +/- 0.014, N = 3SE +/- 0.027, N = 3SE +/- 0.014, N = 315.64915.59915.8709.0299.2279.1581. (CC) gcc options: -lm -lpthread -O3 -march=native

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3246810SE +/- 0.002, N = 5SE +/- 0.013, N = 5SE +/- 0.003, N = 5SE +/- 0.002, N = 5SE +/- 0.002, N = 57.3927.5677.4697.3817.504-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.0850.170.2550.340.425SE +/- 0.000492, N = 3SE +/- 0.000247, N = 3SE +/- 0.000321, N = 3SE +/- 0.004341, N = 3SE +/- 0.000576, N = 30.3018850.3155220.3136890.3777330.376992-fopenmp=libomp - MIN: 0.29-fopenmp=libomp - MIN: 0.3-fopenmp=libomp - MIN: 0.3-fopenmp - MIN: 0.36-fopenmp - MIN: 0.361. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.26380.52760.79141.05521.319SE +/- 0.004625, N = 3SE +/- 0.006530, N = 3SE +/- 0.004576, N = 3SE +/- 0.005622, N = 3SE +/- 0.003430, N = 31.1704401.1514001.1725800.7881920.717782-fopenmp=libomp - MIN: 1.11-fopenmp=libomp - MIN: 1.09-fopenmp=libomp - MIN: 1.12-fopenmp - MIN: 0.74-fopenmp - MIN: 0.671. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.33691215SE +/- 0.026, N = 3SE +/- 0.032, N = 3SE +/- 0.041, N = 3SE +/- 0.049, N = 3SE +/- 0.053, N = 39.4949.4089.2969.5709.9681. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32K4K6K8K10KSE +/- 14.28, N = 3SE +/- 45.95, N = 3SE +/- 48.25, N = 3SE +/- 41.68, N = 3SE +/- 25.87, N = 3SE +/- 19.46, N = 38902.18809.69088.39247.39238.49798.61. (CC) gcc options: -pthread -O3 -march=native -lm

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8AMD AOCC 3.0Clang 11.0Clang 12.0714212835SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 327.2927.2428.131. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.45810.91621.37431.83242.2905SE +/- 0.00195, N = 3SE +/- 0.00118, N = 3SE +/- 0.01922, N = 12SE +/- 0.00384, N = 3SE +/- 0.01150, N = 31.595971.605402.036061.642681.66260-fopenmp=libomp - MIN: 1.54-fopenmp=libomp - MIN: 1.55-fopenmp=libomp - MIN: 1.81-fopenmp - MIN: 1.58-fopenmp - MIN: 1.591. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.33691215SE +/- 0.016, N = 3SE +/- 0.022, N = 3SE +/- 0.014, N = 3SE +/- 0.032, N = 3SE +/- 0.052, N = 3SE +/- 0.031, N = 39.7259.5369.51010.41710.29110.3991. (CXX) g++ options: -O3 -fPIC -lm

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.53, N = 3SE +/- 0.31, N = 3SE +/- 1.76, N = 3SE +/- 1.15, N = 8SE +/- 1.10, N = 8100.55103.17107.46111.27106.551. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.330060090012001500SE +/- 0.97, N = 3SE +/- 6.69, N = 3SE +/- 2.95, N = 3SE +/- 3.74, N = 3SE +/- 1.75, N = 3SE +/- 5.12, N = 31188.431190.411198.221171.041180.441145.50-lm - MIN: 703.73 / MAX: 1484.94-lm - MIN: 685.16 / MAX: 1496.36MIN: 700.24 / MAX: 1494.16-lm - MIN: 683.28 / MAX: 1473.51-lm - MIN: 680.31 / MAX: 1485.74-lm - MIN: 664.19 / MAX: 1441.541. (CC) gcc options: -O3 -march=native -pthread

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.82641.65282.47923.30564.132SE +/- 0.02018, N = 3SE +/- 0.04735, N = 3SE +/- 0.01639, N = 3SE +/- 0.02637, N = 3SE +/- 0.03246, N = 33.415833.527873.285073.611443.67278-fopenmp=libomp - MIN: 3.24-fopenmp=libomp - MIN: 3.29-fopenmp=libomp - MIN: 3.15-fopenmp - MIN: 3.37-fopenmp - MIN: 3.391. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.15980.31960.47940.63920.799SE +/- 0.000764, N = 3SE +/- 0.008914, N = 3SE +/- 0.011383, N = 3SE +/- 0.003112, N = 3SE +/- 0.003317, N = 30.5542310.5947290.7101240.6462520.654010-fopenmp=libomp - MIN: 0.5-fopenmp=libomp - MIN: 0.53-fopenmp=libomp - MIN: 0.64-fopenmp - MIN: 0.6-fopenmp - MIN: 0.591. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31632486480SE +/- 0.63, N = 3SE +/- 0.49, N = 3SE +/- 0.49, N = 3SE +/- 0.32, N = 3SE +/- 0.56, N = 3SE +/- 0.26, N = 373.5173.3674.0072.6071.7972.141. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3246810SE +/- 0.008, N = 3SE +/- 0.021, N = 3SE +/- 0.003, N = 3SE +/- 0.006, N = 3SE +/- 0.005, N = 3SE +/- 0.019, N = 38.1428.2508.2567.2317.4737.011-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3306090120150SE +/- 0.33, N = 3SE +/- 0.46, N = 3SE +/- 0.10, N = 3SE +/- 1.05, N = 3SE +/- 0.18, N = 3SE +/- 0.83, N = 3116.49117.39118.07109.70110.7092.981. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.315003000450060007500SE +/- 52.54, N = 3SE +/- 67.11, N = 7SE +/- 87.21, N = 3SE +/- 86.43, N = 3SE +/- 67.68, N = 3SE +/- 74.84, N = 3694562476780708551815107-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ThoroughAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3246810SE +/- 0.0015, N = 3SE +/- 0.0026, N = 3SE +/- 0.0028, N = 3SE +/- 0.0011, N = 3SE +/- 0.0034, N = 3SE +/- 0.0029, N = 36.64096.76746.76477.83707.69897.85371. (CXX) g++ options: -O3 -march=native -flto -pthread

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.3246810SE +/- 0.028, N = 3SE +/- 0.022, N = 3SE +/- 0.006, N = 3SE +/- 0.017, N = 3SE +/- 0.017, N = 37.4037.3666.6906.9346.7531. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3120240360480600SE +/- 1.13, N = 3SE +/- 1.43, N = 3SE +/- 1.79, N = 3SE +/- 0.67, N = 3SE +/- 2.51, N = 3SE +/- 1.35, N = 3541.58543.43541.56536.71538.28530.82-lm - MIN: 259.4 / MAX: 585.8-lm - MIN: 256.75 / MAX: 593.99MIN: 252.01 / MAX: 587.53-lm - MIN: 256.44 / MAX: 577.82-lm - MIN: 251.6 / MAX: 584.38-lm - MIN: 248.84 / MAX: 574.281. (CC) gcc options: -O3 -march=native -pthread

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3246810SE +/- 0.009, N = 3SE +/- 0.018, N = 3SE +/- 0.004, N = 3SE +/- 0.006, N = 3SE +/- 0.021, N = 3SE +/- 0.009, N = 36.5786.2436.3097.0787.0037.053-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080pClang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.320406080100SE +/- 0.51, N = 3SE +/- 1.07, N = 3SE +/- 0.65, N = 3SE +/- 0.47, N = 3SE +/- 0.89, N = 386.0988.7893.0594.4091.971. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.27480.54960.82441.09921.374SE +/- 0.000645, N = 3SE +/- 0.000480, N = 3SE +/- 0.018279, N = 4SE +/- 0.001247, N = 3SE +/- 0.001032, N = 30.8339210.8411691.2213200.8707840.869308-fopenmp=libomp - MIN: 0.81-fopenmp=libomp - MIN: 0.82-fopenmp=libomp - MIN: 1.13-fopenmp - MIN: 0.84-fopenmp - MIN: 0.841. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ TasksAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.316003200480064008000SE +/- 41.46, N = 3SE +/- 7.31, N = 3SE +/- 33.67, N = 3SE +/- 17.21, N = 3SE +/- 31.52, N = 3SE +/- 49.08, N = 3718968367437736756105414-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.316003200480064008000SE +/- 22.73, N = 3SE +/- 20.42, N = 3SE +/- 14.89, N = 3SE +/- 2.60, N = 3SE +/- 3.18, N = 3747770297507552454511. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ ThreadsAMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.315003000450060007500SE +/- 24.26, N = 3SE +/- 25.04, N = 3SE +/- 30.90, N = 3SE +/- 15.06, N = 3SE +/- 6.12, N = 3SE +/- 8.33, N = 3714463957220714353835142-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.312K24K36K48K60KSE +/- 756.91, N = 3SE +/- 582.34, N = 3SE +/- 439.50, N = 3SE +/- 743.81, N = 3SE +/- 156.75, N = 3SE +/- 725.00, N = 34441250084512545349754710527491. (CC) gcc options: -pthread -O3 -march=native -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3246810SE +/- 0.022, N = 3SE +/- 0.011, N = 3SE +/- 0.013, N = 3SE +/- 0.007, N = 3SE +/- 0.017, N = 3SE +/- 0.022, N = 35.9485.8795.7466.1076.1496.1311. (CXX) g++ options: -O3 -fPIC -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32K4K6K8K10KSE +/- 43.38, N = 3SE +/- 15.16, N = 3SE +/- 101.36, N = 3SE +/- 57.26, N = 3SE +/- 48.56, N = 3SE +/- 20.21, N = 39603.29438.69862.010179.010205.010548.01. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.33K6K9K12K15KSE +/- 34.64, N = 3SE +/- 35.53, N = 3SE +/- 27.10, N = 3SE +/- 32.26, N = 3SE +/- 189.35, N = 3SE +/- 44.20, N = 31066910564108051131911044116891. (CC) gcc options: -pthread -O3 -march=native -lm

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: MediumAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.31.09682.19363.29044.38725.484SE +/- 0.0042, N = 3SE +/- 0.0013, N = 3SE +/- 0.0116, N = 3SE +/- 0.0047, N = 3SE +/- 0.0099, N = 3SE +/- 0.0035, N = 33.88113.98374.00584.86994.81604.87451. (CXX) g++ options: -O3 -march=native -flto -pthread

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32K4K6K8K10KSE +/- 39.89, N = 3SE +/- 28.76, N = 3SE +/- 7.75, N = 3SE +/- 14.75, N = 3SE +/- 55.19, N = 3SE +/- 37.69, N = 310227.010004.210467.010711.010675.011053.01. (CC) gcc options: -pthread -O3 -march=native -lm

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: DefaultAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.65661.31321.96982.62643.283SE +/- 0.010, N = 3SE +/- 0.031, N = 3SE +/- 0.027, N = 3SE +/- 0.032, N = 7SE +/- 0.038, N = 32.8162.7432.7392.9182.7781. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.82331.64662.46993.29324.1165SE +/- 0.004, N = 3SE +/- 0.010, N = 3SE +/- 0.014, N = 3SE +/- 0.022, N = 3SE +/- 0.002, N = 3SE +/- 0.016, N = 33.5433.4293.3613.6433.6073.6591. (CXX) g++ options: -O3 -fPIC -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.34K8K12K16K20KSE +/- 5.33, N = 3SE +/- 129.55, N = 3SE +/- 48.79, N = 3SE +/- 108.41, N = 3SE +/- 168.99, N = 3SE +/- 170.19, N = 81614614590156491665016590165901. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.33K6K9K12K15KSE +/- 41.35, N = 3SE +/- 20.33, N = 3SE +/- 24.25, N = 3SE +/- 16.05, N = 3SE +/- 45.16, N = 3SE +/- 67.28, N = 31319213324133331257612765143991. (CC) gcc options: -pthread -O3 -march=native -lm

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1AMD AOCC 3.0Clang 11.0Clang 12.0Clang 12.0 LTOGCC 10.3GCC 9.36001200180024003000SE +/- 8.09, N = 3SE +/- 1.69, N = 3SE +/- 2.64, N = 3SE +/- 6.09, N = 3SE +/- 0.48, N = 3SE +/- 0.16, N = 32654.721872.762718.532719.991114.601082.371. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.67581.35162.02742.70323.379SE +/- 0.00564, N = 3SE +/- 0.02389, N = 3SE +/- 0.02100, N = 3SE +/- 0.00883, N = 3SE +/- 0.00845, N = 32.287552.318592.367973.003412.99759-fopenmp=libomp - MIN: 1.91-fopenmp=libomp - MIN: 1.92-fopenmp=libomp - MIN: 2.01-fopenmp - MIN: 2.35-fopenmp - MIN: 2.241. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 9.30.1770.3540.5310.7080.885SE +/- 0.001713, N = 3SE +/- 0.001200, N = 3SE +/- 0.004246, N = 3SE +/- 0.002532, N = 3SE +/- 0.002405, N = 30.7732330.7791010.7797760.7824760.786762-fopenmp=libomp - MIN: 0.72-fopenmp=libomp - MIN: 0.73-fopenmp=libomp - MIN: 0.73-fopenmp - MIN: 0.73-fopenmp - MIN: 0.751. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.330060090012001500SE +/- 4.95, N = 3SE +/- 2.13, N = 3SE +/- 7.87, N = 3SE +/- 8.15, N = 3SE +/- 1.96, N = 3SE +/- 2.25, N = 31251.911251.251244.111245.111249.741228.63-lm - MIN: 543.89 / MAX: 1394.16-lm - MIN: 556.46 / MAX: 1394.06MIN: 549.81 / MAX: 1390.03-lm - MIN: 539.07 / MAX: 1398.87-lm - MIN: 559.74 / MAX: 1387.11-lm - MIN: 555.28 / MAX: 1361.681. (CC) gcc options: -O3 -march=native -pthread

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.380160240320400SE +/- 2.72, N = 3SE +/- 1.91, N = 3SE +/- 1.11, N = 3SE +/- 0.47, N = 3SE +/- 0.70, N = 3SE +/- 3.83, N = 3373.89373.99372.49364.12366.39354.211. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100AMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.51171.02341.53512.04682.5585SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.010, N = 3SE +/- 0.007, N = 3SE +/- 0.005, N = 32.2622.2402.1992.2252.2742.273-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.380160240320400SE +/- 1.09, N = 3SE +/- 3.43, N = 3SE +/- 1.56, N = 3SE +/- 1.54, N = 3SE +/- 1.51, N = 3SE +/- 1.20, N = 3343.85346.89345.30330.53329.32322.421. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3110220330440550SE +/- 2.67, N = 3SE +/- 0.23, N = 3SE +/- 1.37, N = 3SE +/- 0.24, N = 3SE +/- 1.15, N = 3SE +/- 0.82, N = 3476.95481.05487.43472.61472.32463.121. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3110220330440550SE +/- 1.94, N = 3SE +/- 1.76, N = 3SE +/- 0.73, N = 3SE +/- 2.08, N = 3SE +/- 1.13, N = 3SE +/- 0.32, N = 3478.62482.02488.23477.67478.16464.571. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3400K800K1200K1600K2000KSE +/- 2098.00, N = 5SE +/- 2852.59, N = 5SE +/- 1798.40, N = 5SE +/- 956.77, N = 5SE +/- 1626.80, N = 5SE +/- 760.80, N = 51697846163826515709661467179149425014463721. (CC) gcc options: -O3 -march=native

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.30.31430.62860.94291.25721.5715SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 31.3511.3361.3311.3721.3861.397-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3140280420560700SE +/- 3.03, N = 3SE +/- 5.55, N = 3SE +/- 3.01, N = 3SE +/- 2.42, N = 3SE +/- 5.75, N = 3SE +/- 3.83, N = 3638.10652.74643.58615.62611.73605.501. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.35001000150020002500SE +/- 0.02, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 31785.451785.421785.502038.152148.842149.151. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.32K4K6K8K10KSE +/- 0.22, N = 3SE +/- 77.81, N = 3SE +/- 7.16, N = 3SE +/- 33.93, N = 3SE +/- 25.06, N = 3SE +/- 28.39, N = 39021.839146.888848.409248.899263.559178.971. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.310002000300040005000SE +/- 5.98, N = 3SE +/- 3.87, N = 3SE +/- 10.41, N = 3SE +/- 0.86, N = 3SE +/- 0.39, N = 3SE +/- 1.69, N = 34594.274590.374280.223820.773462.663765.881. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.390180270360450SE +/- 0.70, N = 3SE +/- 0.67, N = 3SE +/- 0.46, N = 3SE +/- 0.25, N = 3SE +/- 1.03, N = 3SE +/- 0.66, N = 3398.96399.16363.85388.98388.88384.031. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloAMD AOCC 3.0Clang 11.0Clang 12.0GCC 10.3GCC 11.0.1GCC 9.3150300450600750SE +/- 0.18, N = 3SE +/- 0.40, N = 3SE +/- 0.40, N = 3SE +/- 1.71, N = 3SE +/- 0.29, N = 3SE +/- 0.14, N = 3690.94674.86675.13682.87647.82668.101. (CC) gcc options: -O3 -march=native -lm


Phoronix Test Suite v10.8.5