EPYC 7763 LLVM Clang Compiler Tests

AMD EPYC 7763 64-Core testing with a Supermicro H12SSL-i v1.01 (2.0 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2104140-IB-EPYC7763L31&gru&sro&rro.

EPYC 7763 LLVM Clang Compiler TestsProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionClang 12.0Clang 11.0Clang 12.0 LTOGCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.0AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)Supermicro H12SSL-i v1.01 (2.0 BIOS)AMD Starship/Matisse126GB3841GB Micron_9300_MTFDHAL3T8TDPASPEED2 x Broadcom NetXtreme BCM5720 2-port PCIeUbuntu 20.045.12.0-051200rc6daily20210408-generic (x86_64) 20210407GNOME Shell 3.36.4X Server 1.20.8Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73ext41024x768Clang 11.0.0-2~ubuntu20.04.1Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73GCC 9.3.0GCC 10.3.0GCC 11.0.1 20210413Clang 12.0.0OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseEnvironment Details- Clang 12.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 11.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- Clang 12.0 LTO: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"- GCC 9.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 10.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- GCC 11.0.1: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"- AMD AOCC 3.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"Processor Details- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa001119Python Details- Python 3.8.2Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affectedCompiler Details- GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - GCC 10.3: --disable-multilib --enable-checking=release- GCC 11.0.1: --disable-multilib --enable-checking=release- AMD AOCC 3.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)

EPYC 7763 LLVM Clang Compiler Testsdav1d: Chimera 1080pdav1d: Summer Nature 4Kdav1d: Summer Nature 1080pdav1d: Chimera 1080p 10-bitaom-av1: Speed 0 Two-Pass - Bosphorus 4Kaom-av1: Speed 4 Two-Pass - Bosphorus 4Kaom-av1: Speed 6 Realtime - Bosphorus 4Kaom-av1: Speed 6 Two-Pass - Bosphorus 4Kaom-av1: Speed 8 Realtime - Bosphorus 4Kaom-av1: Speed 9 Realtime - Bosphorus 4Kaom-av1: Speed 0 Two-Pass - Bosphorus 1080paom-av1: Speed 4 Two-Pass - Bosphorus 1080paom-av1: Speed 6 Realtime - Bosphorus 1080paom-av1: Speed 6 Two-Pass - Bosphorus 1080paom-av1: Speed 8 Realtime - Bosphorus 1080paom-av1: Speed 9 Realtime - Bosphorus 1080psvt-av1: Enc Mode 0 - 1080psvt-av1: Enc Mode 4 - 1080psvt-av1: Enc Mode 8 - 1080psvt-hevc: 1 - Bosphorus 1080psvt-hevc: 7 - Bosphorus 1080psvt-hevc: 10 - Bosphorus 1080psvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080px265: Bosphorus 4Kx265: Bosphorus 1080psimdjson: Kostyasimdjson: LargeRandsimdjson: PartialTweetssimdjson: DistinctUserIDviennacl: CPU BLAS - sCOPYviennacl: CPU BLAS - sAXPYviennacl: CPU BLAS - sDOTviennacl: CPU BLAS - dCOPYviennacl: CPU BLAS - dAXPYviennacl: CPU BLAS - dDOTviennacl: CPU BLAS - dGEMV-Nviennacl: CPU BLAS - dGEMV-Tviennacl: CPU BLAS - dGEMM-NNviennacl: CPU BLAS - dGEMM-NTviennacl: CPU BLAS - dGEMM-TNviennacl: CPU BLAS - dGEMM-TTonnx: yolov4 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: fcn-resnet101-11 - OpenMP CPUonnx: shufflenet-v2-10 - OpenMP CPUonnx: super-resolution-10 - OpenMP CPUgraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacecoremark: CoreMark Size 666 - Iterations Per Secondsecuremark: SecureMark-TLScompress-lz4: 3 - Compression Speedcompress-lz4: 3 - Decompression Speedcompress-lz4: 9 - Compression Speedcompress-lz4: 9 - Decompression Speedquantlib: fftw: Stock - 1D FFT Size 32fftw: Stock - 1D FFT Size 1024fftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 1024fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096fftw: Float + SSE - 1D FFT Size 32fftw: Float + SSE - 1D FFT Size 1024fftw: Float + SSE - 1D FFT Size 2048fftw: Float + SSE - 1D FFT Size 4096fftw: Float + SSE - 2D FFT Size 1024fftw: Float + SSE - 2D FFT Size 2048fftw: Float + SSE - 2D FFT Size 4096scimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationbotan: KASUMIbotan: KASUMI - Decryptbotan: AES-256botan: AES-256 - Decryptbotan: Twofishbotan: Twofish - Decryptbotan: Blowfishbotan: Blowfish - Decryptbotan: CAST-256botan: CAST-256 - Decryptbotan: ChaCha20Poly1305botan: ChaCha20Poly1305 - Decryptjpegxl: PNG - 5jpegxl: PNG - 7jpegxl: PNG - 8jpegxl: JPEG - 5jpegxl: JPEG - 7jpegxl: JPEG - 8libraw: Post-Processing Benchmarketcpak: DXT1etcpak: ETC1etcpak: ETC2tscp: AI Chess Performanceliquid-dsp: 1 - 256 - 57liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 57liquid-dsp: 128 - 256 - 57pgbench: 100 - 1 - Read Onlypgbench: 100 - 1 - Read Writepgbench: 100 - 100 - Read Onlypgbench: 100 - 250 - Read Onlypgbench: 100 - 100 - Read Writepgbench: 100 - 250 - Read Writewebp: Defaultwebp: Quality 100webp: Quality 100, Losslesswebp: Quality 100, Highest Compressionwebp: Quality 100, Lossless, Highest Compressiontoybrot: TBBtoybrot: OpenMPtoybrot: C++ Taskstoybrot: C++ Threadsonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUfinancebench: Repo OpenMPfinancebench: Bonds OpenMPpgbench: 100 - 1 - Read Only - Average Latencypgbench: 100 - 1 - Read Write - Average Latencypgbench: 100 - 100 - Read Only - Average Latencypgbench: 100 - 250 - Read Only - Average Latencypgbench: 100 - 100 - Read Write - Average Latencypgbench: 100 - 250 - Read Write - Average Latencymrbayes: Primate Phylogeny Analysisavifenc: 0avifenc: 2avifenc: 6avifenc: 10avifenc: 6, Losslessavifenc: 10, Losslessc-ray: Total Time - 4K, 16 Rays Per Pixelpovray: Trace Timeencode-flac: WAV To FLACencode-mp3: WAV To MP3encode-opus: WAV To Opus Encodegcrypt: ngspice: C2670ngspice: C7552tachyon: Total Timewebp2: Defaultwebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless Compressionastcenc: Mediumastcenc: Thoroughastcenc: ExhaustiveClang 12.0Clang 11.0Clang 12.0 LTOGCC 9.3GCC 10.3GCC 11.0.1AMD AOCC 3.01198.22541.561244.11308.320.214.8717.228.9933.3938.110.537.1026.8522.1388.78103.170.18311.474118.06741.09345.30643.58487.43488.23372.4930.3274.002.750.844.604.6247135743460487881969.162648.665.751.973.0333498112990444561993712614107621364576051785466.28396926520452.0713911.548.5013926.52653.81333310805104679862.09088.37789.96744.1156495035051254454283623931935227973190.62675.13363.854280.228848.401785.5082.64484.2294659.3384682.455315.409321.190380.054351.284132.820133.048850.496843.40474.2712.150.8266.6666.3828.1341.782718.525284.642202.0851570966556630001564833333307063333336437666672431032811069022107120962319566841.3312.19919.0166.30938.44967807507743772201.077013.285071.075070.7101241.221321.444252.367972.036060.4919400.7797761302.70593.9721307.49590.1820.3136891305.10597.4811.1725833246.83723951596.8671870.0410.3050.0940.2341.6074.43189.11647.88425.1759.5103.36125.2205.74615.8709.2967.8548.2567.567236.924118.87095.95616.04682.739109.525207.0086.690374.0354.00586.764718.99361190.41543.431251.25184.190.214.9517.139.1433.1437.280.537.2026.6122.0086.09100.550.18111.821117.39241.01346.89652.74481.05482.02373.9929.9473.362.680.814.414.414954124621877104393351.267783.679.388.384.0346471108979745231915665613106820344636161790837.01000026011952.3513840.349.0113927.92640.2133241056410004.29438.68809.67878.56823.8145905074050084466763618131741229133319.34674.86399.164590.379146.881785.4279.14980.2214901.1274895.558299.214302.405319.234351.075128.586127.740848.236840.63778.4112.010.865.5865.4327.2438.711872.759205.065168.8191638265563070001578400000305136666735965333332494333121069367106550661616544881.3362.24018.5736.24337.72762477029683663951.080113.527871.075770.5947290.8411691.457572.318591.605400.4892780.7791011276.04563.2001277.62562.9700.3155221271.91563.2471.1514033178.49869851900.4348960.0400.3020.0940.2351.6264.60388.62047.89425.4729.5363.42926.0345.87915.5999.4087.9798.2507.392240.205103.82690.52716.40992.743109.636203.6347.366392.8493.98376.767419.025550.9313715.048.4713698.72657.82719.985284.763202.10170857367714393.6331145.50530.821228.63305.360.24.7816.299.5734.5639.120.56.6924.8421.4291.97106.550.1299.32592.98438.41322.42605.50463.12464.57354.2128.9172.142.750.943.933.98121781363615871521113365.079898.595.3100.997.9351495116941951832129709806121712385477852086609.97801023893553.8313793.451.9713895.32338.9143991168911053105489798.68408.57007.3165905327552749520993632131341250683229.22668.10384.033765.889178.972149.1584.86484.1305484.6765391.990337.355339.069412.846412.072127.298127.343616.096611.97760.201082.365269.673174.8121446372614040001721900000294046666730120666672389532981057125106748659364538251.3972.27319.2987.05339.07251075451541451421.174863.672781.174340.6540100.8693087.192132.997591.662600.5991400.7867621357.29658.6601358.56659.1910.3769921356.91657.8760.71778242399.80775776805.5807290.0420.3030.0950.2351.6884.65789.16352.21727.78410.3993.65929.0806.1319.1589.9688.5347.0117.504232.572101.53589.09115.68372.778118.447220.9446.753388.9464.87457.853719.47941171.04536.711245.11316.140.214.8417.039.1035.2639.320.526.8726.4921.6493.05107.460.16911.230109.69739.03330.53615.62472.61477.67364.1228.6072.602.770.94.024.131065.601350.0592.971461.22158.41056.4256.2741.498.794.410498.53515051151019755592112689807103912085447722110880.42797824270052.8713906.152.3613806.62392.6125761131910711101799247.38134.56974.0166505205453497521303597332061237743235.94682.87388.983820.779248.892038.1579.11581.4535525.7105529.402341.847325.389422.138420.853127.741127.775485.019476.17558.901114.603281.146173.2261467179624673331718000000294286666730050333332484533691076357108973158894530191.3722.22518.8837.07838.54851815524561053831.178943.611441.197470.6462520.8707847.236863.003411.642680.6021550.7824761382.41659.2651379.51658.2770.3777331375.71658.0380.78819234979.29427151770.5091140.0400.2970.0930.2301.7014.73193.65651.45427.38610.4173.64326.9116.1079.0299.5708.5677.2317.469231.238103.59890.43216.14682.918116.655215.5656.934406.0274.86997.837019.45831180.44538.281249.74334.350.214.8417.379.4135.2639.710.526.9527.0122.1194.40111.270.17611.905110.70238.86329.32611.73472.32478.16366.3928.7971.791210149664915992359115363.9794100.595.010499.32161694809108211885507712176407.66592924386151.3213882.251.1713857.4127651104410675102059238.48231.16948.2165905170654710513913571831662248883182.35647.82388.883462.669263.552148.8457.241494250608863331679800000298940000030557666672366133831090824109016056369531021.3862.27418.3147.00337.94834199.60026051376.8164060.0420.2960.0920.2301.7774.72289.43251.03427.10310.2913.60727.0576.1499.2278.7097.4737.381233.514103.00590.26415.49894.81607.698919.61891188.43541.581251.91192.000.18311.690116.49340.95343.85638.10476.95478.62373.8930.4473.512.730.824.334.4753132647719441017116555.278384.078.890.084.43864591221132543831929660617105718664666141720060.44130726463753.7713562.550.3213561.52725.71319210669102279603.28902.17784.86875.3161464968544412455213610031013231113298.29690.94398.964594.279021.831785.4582.82782.9494891.0724887.573304.996303.806319.787355.059127.768128.008845.141838.08979.2311.370.8165.5765.6827.2941.642654.721211.733178.8521697846574113331609633333310040000036064666671.3512.26219.1266.57838.33869457477718971441.038993.415831.044840.5542310.8339211.370592.287551.595970.4597240.7732331267.18544.0991259.59544.3060.3018851268.08544.6001.1704433146.02864651885.51953186.74248.12725.5989.7253.54325.7835.94815.6499.4949.2808.142240.405103.92991.98616.05812.816109.811205.0347.403382.9853.88116.640918.9127OpenBenchmarking.org

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 5.12, N = 3SE +/- 1.75, N = 3SE +/- 3.74, N = 3SE +/- 2.95, N = 3SE +/- 6.69, N = 3SE +/- 0.97, N = 31145.501180.441171.041198.221190.411188.43-lm - MIN: 664.19 / MAX: 1441.54-lm - MIN: 680.31 / MAX: 1485.74-lm - MIN: 683.28 / MAX: 1473.51MIN: 700.24 / MAX: 1494.16-lm - MIN: 685.16 / MAX: 1496.36-lm - MIN: 703.73 / MAX: 1484.941. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0120240360480600SE +/- 1.35, N = 3SE +/- 2.51, N = 3SE +/- 0.67, N = 3SE +/- 1.79, N = 3SE +/- 1.43, N = 3SE +/- 1.13, N = 3530.82538.28536.71541.56543.43541.58-lm - MIN: 248.84 / MAX: 574.28-lm - MIN: 251.6 / MAX: 584.38-lm - MIN: 256.44 / MAX: 577.82MIN: 252.01 / MAX: 587.53-lm - MIN: 256.75 / MAX: 593.99-lm - MIN: 259.4 / MAX: 585.81. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Summer Nature 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 2.25, N = 3SE +/- 1.96, N = 3SE +/- 8.15, N = 3SE +/- 7.87, N = 3SE +/- 2.13, N = 3SE +/- 4.95, N = 31228.631249.741245.111244.111251.251251.91-lm - MIN: 555.28 / MAX: 1361.68-lm - MIN: 559.74 / MAX: 1387.11-lm - MIN: 539.07 / MAX: 1398.87MIN: 549.81 / MAX: 1390.03-lm - MIN: 556.46 / MAX: 1394.06-lm - MIN: 543.89 / MAX: 1394.161. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.2Video Input: Chimera 1080p 10-bitGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.070140210280350SE +/- 0.71, N = 3SE +/- 1.11, N = 3SE +/- 0.21, N = 3SE +/- 0.93, N = 3SE +/- 0.48, N = 3SE +/- 0.39, N = 3305.36334.35316.14308.32184.19192.00-lm - MIN: 210.86 / MAX: 493.21-lm - MIN: 234.24 / MAX: 544.9-lm - MIN: 218.19 / MAX: 515.85MIN: 220.53 / MAX: 490.51-lm - MIN: 114.52 / MAX: 310.5-lm - MIN: 118.57 / MAX: 324.981. (CC) gcc options: -O3 -march=native -pthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.04730.09460.14190.18920.2365SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.200.210.210.210.211. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.01.11382.22763.34144.45525.569SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 34.784.844.844.874.951. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.048121620SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 316.2917.3717.0317.2217.131. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.03691215SE +/- 0.11, N = 6SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.03, N = 39.579.419.108.999.141. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0816243240SE +/- 0.12, N = 3SE +/- 0.47, N = 3SE +/- 0.19, N = 3SE +/- 0.48, N = 3SE +/- 0.22, N = 334.5635.2635.2633.3933.141. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0918273645SE +/- 0.38, N = 3SE +/- 0.29, N = 3SE +/- 0.19, N = 3SE +/- 0.43, N = 3SE +/- 0.31, N = 339.1239.7139.3238.1137.281. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.11930.23860.35790.47720.5965SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.500.520.520.530.531. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0246810SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 36.696.956.877.107.201. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0612182430SE +/- 0.13, N = 3SE +/- 0.28, N = 3SE +/- 0.25, N = 3SE +/- 0.27, N = 3SE +/- 0.13, N = 324.8427.0126.4926.8526.611. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0510152025SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.05, N = 3SE +/- 0.15, N = 321.4222.1121.6422.1322.001. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.020406080100SE +/- 0.89, N = 3SE +/- 0.47, N = 3SE +/- 0.65, N = 3SE +/- 1.07, N = 3SE +/- 0.51, N = 391.9794.4093.0588.7886.091. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterAOM AV1 3.0Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.020406080100SE +/- 1.10, N = 8SE +/- 1.15, N = 8SE +/- 1.76, N = 3SE +/- 0.31, N = 3SE +/- 0.53, N = 3106.55111.27107.46103.17100.551. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 0 - Input: 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.04120.08240.12360.16480.206SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.1290.1760.1690.1830.1810.1831. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 4 - Input: 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.03691215SE +/- 0.086, N = 3SE +/- 0.139, N = 3SE +/- 0.111, N = 9SE +/- 0.170, N = 3SE +/- 0.164, N = 4SE +/- 0.189, N = 39.32511.90511.23011.47411.82111.6901. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.8Encoder Mode: Enc Mode 8 - Input: 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0306090120150SE +/- 0.83, N = 3SE +/- 0.18, N = 3SE +/- 1.05, N = 3SE +/- 0.10, N = 3SE +/- 0.46, N = 3SE +/- 0.33, N = 392.98110.70109.70118.07117.39116.491. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 1 - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0918273645SE +/- 0.18, N = 3SE +/- 0.17, N = 3SE +/- 0.05, N = 3SE +/- 0.17, N = 3SE +/- 0.09, N = 3SE +/- 0.08, N = 338.4138.8639.0341.0941.0140.951. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 7 - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.080160240320400SE +/- 1.20, N = 3SE +/- 1.51, N = 3SE +/- 1.54, N = 3SE +/- 1.56, N = 3SE +/- 3.43, N = 3SE +/- 1.09, N = 3322.42329.32330.53345.30346.89343.851. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 1.5.0Tuning: 10 - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0140280420560700SE +/- 3.83, N = 3SE +/- 5.75, N = 3SE +/- 2.42, N = 3SE +/- 3.01, N = 3SE +/- 5.55, N = 3SE +/- 3.03, N = 3605.50611.73615.62643.58652.74638.101. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0110220330440550SE +/- 0.82, N = 3SE +/- 1.15, N = 3SE +/- 0.24, N = 3SE +/- 1.37, N = 3SE +/- 0.23, N = 3SE +/- 2.67, N = 3463.12472.32472.61487.43481.05476.951. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0110220330440550SE +/- 0.32, N = 3SE +/- 1.13, N = 3SE +/- 2.08, N = 3SE +/- 0.73, N = 3SE +/- 1.76, N = 3SE +/- 1.94, N = 3464.57478.16477.67488.23482.02478.621. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.080160240320400SE +/- 3.83, N = 3SE +/- 0.70, N = 3SE +/- 0.47, N = 3SE +/- 1.11, N = 3SE +/- 1.91, N = 3SE +/- 2.72, N = 3354.21366.39364.12372.49373.99373.891. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 4KGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0714212835SE +/- 0.09, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 3SE +/- 0.23, N = 3SE +/- 0.25, N = 3SE +/- 0.13, N = 328.9128.7928.6030.3229.9430.441. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

x265

Video Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.4Video Input: Bosphorus 1080pGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01632486480SE +/- 0.26, N = 3SE +/- 0.56, N = 3SE +/- 0.32, N = 3SE +/- 0.49, N = 3SE +/- 0.49, N = 3SE +/- 0.63, N = 372.1471.7972.6074.0073.3673.511. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: KostyaGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.62331.24661.86992.49323.1165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 32.752.772.752.682.731. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: LargeRandomGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.21150.4230.63450.8461.0575SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.940.900.840.810.821. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: PartialTweetsGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01.0352.073.1054.145.175SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 33.934.024.604.414.331. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.8.2Throughput Test: DistinctUserIDGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01.03952.0793.11854.1585.1975SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33.984.134.624.414.471. (CXX) g++ options: -O3 -march=native -pthread

ViennaCL

Test: CPU BLAS - sCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sCOPYGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 25.85, N = 15SE +/- 25.34, N = 15SE +/- 101.07, N = 12SE +/- 15.30, N = 12SE +/- 36.50, N = 15SE +/- 32.29, N = 121217.001210.001065.60471.00495.00531.00-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sAXPYGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 2.85, N = 15SE +/- 62.40, N = 15SE +/- 132.58, N = 12SE +/- 15.69, N = 12SE +/- 34.43, N = 15SE +/- 26.90, N = 12813.01496.01350.0357.0412.0326.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - sDOTGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0140280420560700SE +/- 0.80, N = 15SE +/- 2.60, N = 15SE +/- 53.43, N = 12SE +/- 35.24, N = 12SE +/- 38.96, N = 15SE +/- 37.59, N = 12636.00649.00592.97434.00462.00477.00-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dCOPYGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0400800120016002000SE +/- 9.19, N = 15SE +/- 2.67, N = 15SE +/- 131.59, N = 12SE +/- 15.32, N = 11SE +/- 8.32, N = 15SE +/- 9.88, N = 121587.01599.01461.2604.01877.01944.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dAXPYGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.05001000150020002500SE +/- 2.06, N = 15SE +/- 2.74, N = 15SE +/- 194.35, N = 12SE +/- 20.06, N = 12SE +/- 1.59, N = 15SE +/- 3.59, N = 121521.02359.02158.4878.01043.01017.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dDOTGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 1.59, N = 15SE +/- 1.87, N = 15SE +/- 95.41, N = 12SE +/- 17.06, N = 12SE +/- 1.49, N = 15SE +/- 2.61, N = 121133.001153.001056.42819.00933.001165.00-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-NGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01530456075SE +/- 4.17, N = 15SE +/- 3.83, N = 15SE +/- 5.30, N = 12SE +/- 2.22, N = 12SE +/- 3.65, N = 15SE +/- 3.44, N = 1265.063.956.269.151.255.2-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

OpenBenchmarking.orgGB/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMV-TGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02004006008001000SE +/- 2.10, N = 14SE +/- 2.88, N = 15SE +/- 66.49, N = 12SE +/- 4.04, N = 12SE +/- 1.41, N = 14SE +/- 1.94, N = 12798.0794.0741.4626.0677.0783.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NNGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.16, N = 15SE +/- 0.29, N = 15SE +/- 1.05, N = 12SE +/- 0.05, N = 12SE +/- 0.06, N = 15SE +/- 0.04, N = 1298.5100.598.748.683.684.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-NTGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.07, N = 15SE +/- 0.08, N = 15SE +/- 0.59, N = 12SE +/- 0.56, N = 12SE +/- 0.03, N = 15SE +/- 0.07, N = 1295.395.094.465.779.378.8-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TNGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.08, N = 15SE +/- 0.62, N = 12SE +/- 0.09, N = 12SE +/- 0.02, N = 15SE +/- 0.05, N = 12100.9104.0104.051.988.390.0-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

OpenBenchmarking.orgGFLOPs/s, More Is BetterViennaCL 1.7.1Test: CPU BLAS - dGEMM-TTGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.05, N = 15SE +/- 0.05, N = 15SE +/- 0.60, N = 12SE +/- 0.07, N = 12SE +/- 0.02, N = 14SE +/- 0.08, N = 1297.999.398.573.084.084.4-fopenmp-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.080160240320400SE +/- 0.50, N = 3SE +/- 0.17, N = 3SE +/- 4.15, N = 4SE +/- 1.42, N = 3SE +/- 2.50, N = 3351351333346386-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0110220330440550SE +/- 4.64, N = 12SE +/- 0.87, N = 3SE +/- 10.30, N = 12SE +/- 5.55, N = 3SE +/- 10.39, N = 12495505498471459-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0306090120150SE +/- 0.44, N = 3SE +/- 0.17, N = 3SE +/- 0.50, N = 3SE +/- 0.29, N = 3SE +/- 0.50, N = 3116115112108122-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02K4K6K8K10KSE +/- 138.76, N = 3SE +/- 7.52, N = 3SE +/- 88.25, N = 12SE +/- 102.76, N = 8SE +/- 171.77, N = 39419101979904979711325-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.012002400360048006000SE +/- 2.40, N = 3SE +/- 17.50, N = 3SE +/- 126.29, N = 12SE +/- 169.87, N = 9SE +/- 174.98, N = 1251835559445645234383-fopenmp-fopenmp-fopenmp=libomp-fopenmp=libomp-fopenmp=libomp1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SwirlGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.05001000150020002500SE +/- 1.20, N = 3SE +/- 4.81, N = 3SE +/- 1.20, N = 3SE +/- 6.57, N = 3SE +/- 12.41, N = 3SE +/- 4.63, N = 32129216121121993191519291. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: RotateGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0150300450600750SE +/- 6.43, N = 3SE +/- 5.21, N = 3SE +/- 2.60, N = 3SE +/- 1.33, N = 37096946897126656601. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: SharpenGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02004006008001000SE +/- 2.03, N = 3SE +/- 0.58, N = 38068098076146136171. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: EnhancedGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 1.53, N = 3SE +/- 0.33, N = 3SE +/- 0.88, N = 3SE +/- 1.86, N = 3SE +/- 1.53, N = 31217108210391076106810571. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: ResizingGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.05001000150020002500SE +/- 18.77, N = 3SE +/- 17.34, N = 3SE +/- 14.93, N = 3SE +/- 41.63, N = 12SE +/- 27.29, N = 3SE +/- 52.84, N = 151238118812082136203418661. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: Noise-GaussianGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0120240360480600SE +/- 1.00, N = 3SE +/- 1.00, N = 3SE +/- 0.33, N = 35475505444574634661. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.33Operation: HWB Color SpaceGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02004006008001000SE +/- 0.67, N = 3SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 1.20, N = 37857717726056166141. (CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0500K1000K1500K2000K2500KSE +/- 4791.32, N = 3SE +/- 5755.65, N = 3SE +/- 2170.85, N = 3SE +/- 984.68, N = 3SE +/- 971.31, N = 3SE +/- 3670.84, N = 32086609.982176407.672110880.431785466.281790837.011720060.441. (CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

SecureMark

Benchmark: SecureMark-TLS

OpenBenchmarking.orgmarks, More Is BetterSecureMark 1.0.4Benchmark: SecureMark-TLSGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.060K120K180K240K300KSE +/- 537.86, N = 3SE +/- 675.55, N = 3SE +/- 1024.96, N = 3SE +/- 1778.47, N = 3SE +/- 407.86, N = 3SE +/- 251.99, N = 32389352438612427002652042601192646371. (CC) gcc options: -pedantic -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression SpeedGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.01224364860SE +/- 0.77, N = 4SE +/- 0.73, N = 4SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.80, N = 3SE +/- 0.33, N = 3SE +/- 0.48, N = 353.8351.3252.8750.9352.0752.3553.771. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression SpeedGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.03K6K9K12K15KSE +/- 37.19, N = 4SE +/- 34.44, N = 4SE +/- 42.32, N = 3SE +/- 60.82, N = 3SE +/- 71.01, N = 3SE +/- 15.91, N = 3SE +/- 73.30, N = 313793.413882.213906.113715.013911.513840.313562.51. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression SpeedGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.01224364860SE +/- 0.65, N = 5SE +/- 0.65, N = 3SE +/- 0.72, N = 4SE +/- 0.74, N = 3SE +/- 0.42, N = 3SE +/- 0.46, N = 3SE +/- 0.26, N = 351.9751.1752.3648.4748.5049.0150.321. (CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression SpeedGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.03K6K9K12K15KSE +/- 17.75, N = 5SE +/- 62.74, N = 3SE +/- 6.60, N = 4SE +/- 46.50, N = 3SE +/- 65.90, N = 3SE +/- 23.21, N = 3SE +/- 33.89, N = 313895.313857.413806.613698.713926.513927.913561.51. (CC) gcc options: -O3

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21GCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.06001200180024003000SE +/- 4.53, N = 3SE +/- 2.06, N = 3SE +/- 1.62, N = 3SE +/- 1.92, N = 3SE +/- 1.01, N = 3SE +/- 2.28, N = 32338.92392.62657.82653.82640.22725.71. (CXX) g++ options: -O3 -march=native -rdynamic

FFTW

Build: Stock - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 32GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.03K6K9K12K15KSE +/- 67.28, N = 3SE +/- 45.16, N = 3SE +/- 16.05, N = 3SE +/- 24.25, N = 3SE +/- 20.33, N = 3SE +/- 41.35, N = 31439912765125761333313324131921. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 1024GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.03K6K9K12K15KSE +/- 44.20, N = 3SE +/- 189.35, N = 3SE +/- 32.26, N = 3SE +/- 27.10, N = 3SE +/- 35.53, N = 3SE +/- 34.64, N = 31168911044113191080510564106691. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02K4K6K8K10KSE +/- 37.69, N = 3SE +/- 55.19, N = 3SE +/- 14.75, N = 3SE +/- 7.75, N = 3SE +/- 28.76, N = 3SE +/- 39.89, N = 311053.010675.010711.010467.010004.210227.01. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02K4K6K8K10KSE +/- 20.21, N = 3SE +/- 48.56, N = 3SE +/- 57.26, N = 3SE +/- 101.36, N = 3SE +/- 15.16, N = 3SE +/- 43.38, N = 310548.010205.010179.09862.09438.69603.21. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 1024GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02K4K6K8K10KSE +/- 19.46, N = 3SE +/- 25.87, N = 3SE +/- 41.68, N = 3SE +/- 48.25, N = 3SE +/- 45.95, N = 3SE +/- 14.28, N = 39798.69238.49247.39088.38809.68902.11. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02K4K6K8K10KSE +/- 50.36, N = 3SE +/- 36.00, N = 3SE +/- 56.49, N = 3SE +/- 65.76, N = 3SE +/- 27.38, N = 3SE +/- 19.99, N = 38408.58231.18134.57789.97878.57784.81. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.015003000450060007500SE +/- 30.40, N = 3SE +/- 23.67, N = 3SE +/- 25.90, N = 3SE +/- 35.20, N = 3SE +/- 60.67, N = 3SE +/- 65.81, N = 37007.36948.26974.06744.16823.86875.31. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 32GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.04K8K12K16K20KSE +/- 170.19, N = 8SE +/- 168.99, N = 3SE +/- 108.41, N = 3SE +/- 48.79, N = 3SE +/- 129.55, N = 3SE +/- 5.33, N = 31659016590166501564914590161461. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 1024GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.011K22K33K44K55KSE +/- 788.42, N = 3SE +/- 568.96, N = 3SE +/- 439.64, N = 15SE +/- 952.64, N = 12SE +/- 585.78, N = 3SE +/- 621.84, N = 155327551706520545035050740496851. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 2048GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.012K24K36K48K60KSE +/- 725.00, N = 3SE +/- 156.75, N = 3SE +/- 743.81, N = 3SE +/- 439.50, N = 3SE +/- 582.34, N = 3SE +/- 756.91, N = 35274954710534975125450084444121. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 1D FFT Size 4096GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.011K22K33K44K55KSE +/- 228.68, N = 3SE +/- 227.13, N = 3SE +/- 844.19, N = 3SE +/- 671.66, N = 15SE +/- 413.24, N = 15SE +/- 542.47, N = 155209951391521304542846676455211. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 1024GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.08K16K24K32K40KSE +/- 79.87, N = 3SE +/- 442.82, N = 3SE +/- 301.69, N = 3SE +/- 165.99, N = 3SE +/- 530.09, N = 4SE +/- 455.21, N = 123632135718359733623936181361001. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 2048GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.07K14K21K28K35KSE +/- 37.37, N = 3SE +/- 209.56, N = 3SE +/- 14.99, N = 3SE +/- 77.17, N = 3SE +/- 146.10, N = 3SE +/- 378.89, N = 63134131662320613193531741310131. (CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Float + SSE - Size: 2D FFT Size 4096GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.05K10K15K20K25KSE +/- 106.49, N = 3SE +/- 160.97, N = 3SE +/- 538.47, N = 9SE +/- 348.10, N = 9SE +/- 220.77, N = 3SE +/- 349.17, N = 92506824888237742279722913231111. (CC) gcc options: -pthread -O3 -march=native -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.07001400210028003500SE +/- 5.86, N = 3SE +/- 5.19, N = 3SE +/- 6.50, N = 3SE +/- 1.11, N = 3SE +/- 15.12, N = 3SE +/- 1.29, N = 33229.223182.353235.943190.623319.343298.291. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0150300450600750SE +/- 0.14, N = 3SE +/- 0.29, N = 3SE +/- 1.71, N = 3SE +/- 0.40, N = 3SE +/- 0.40, N = 3SE +/- 0.18, N = 3668.10647.82682.87675.13674.86690.941. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.090180270360450SE +/- 0.66, N = 3SE +/- 1.03, N = 3SE +/- 0.25, N = 3SE +/- 0.46, N = 3SE +/- 0.67, N = 3SE +/- 0.70, N = 3384.03388.88388.98363.85399.16398.961. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.010002000300040005000SE +/- 1.69, N = 3SE +/- 0.39, N = 3SE +/- 0.86, N = 3SE +/- 10.41, N = 3SE +/- 3.87, N = 3SE +/- 5.98, N = 33765.883462.663820.774280.224590.374594.271. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02K4K6K8K10KSE +/- 28.39, N = 3SE +/- 25.06, N = 3SE +/- 33.93, N = 3SE +/- 7.16, N = 3SE +/- 77.81, N = 3SE +/- 0.22, N = 39178.979263.559248.898848.409146.889021.831. (CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.05001000150020002500SE +/- 0.12, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 32149.152148.842038.151785.501785.421785.451. (CC) gcc options: -O3 -march=native -lm

Botan

Test: KASUMI

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMIGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 384.8679.1282.6479.1582.831. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: KASUMI - DecryptGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 384.1381.4584.2380.2282.951. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256GCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.012002400360048006000SE +/- 42.69, N = 3SE +/- 4.47, N = 3SE +/- 2.14, N = 3SE +/- 2.16, N = 3SE +/- 0.05, N = 35484.685525.714659.344901.134891.071. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: AES-256 - DecryptGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.012002400360048006000SE +/- 11.31, N = 3SE +/- 5.42, N = 3SE +/- 4.78, N = 3SE +/- 1.35, N = 3SE +/- 3.70, N = 35391.995529.404682.464895.564887.571. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: TwofishGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.070140210280350SE +/- 0.04, N = 3SE +/- 0.52, N = 3SE +/- 0.13, N = 3SE +/- 0.09, N = 3SE +/- 0.03, N = 3337.36341.85315.41299.21305.001. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Twofish - DecryptGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.070140210280350SE +/- 0.04, N = 3SE +/- 0.44, N = 3SE +/- 0.16, N = 3SE +/- 0.15, N = 3SE +/- 0.06, N = 3339.07325.39321.19302.41303.811. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: BlowfishGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.090180270360450SE +/- 0.09, N = 3SE +/- 0.11, N = 3SE +/- 0.05, N = 3SE +/- 1.73, N = 3SE +/- 1.14, N = 3412.85422.14380.05319.23319.791. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: Blowfish - DecryptGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.090180270360450SE +/- 0.12, N = 3SE +/- 0.95, N = 3SE +/- 0.04, N = 3SE +/- 2.03, N = 3SE +/- 1.17, N = 3412.07420.85351.28351.08355.061. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256GCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0306090120150SE +/- 0.09, N = 3SE +/- 0.33, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3127.30127.74132.82128.59127.771. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: CAST-256 - DecryptGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0306090120150SE +/- 0.08, N = 3SE +/- 0.32, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.08, N = 3127.34127.78133.05127.74128.011. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305GCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02004006008001000SE +/- 0.13, N = 3SE +/- 0.28, N = 3SE +/- 4.85, N = 3SE +/- 0.62, N = 3SE +/- 3.15, N = 3616.10485.02850.50848.24845.141. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

OpenBenchmarking.orgMiB/s, More Is BetterBotan 2.17.3Test: ChaCha20Poly1305 - DecryptGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.02004006008001000SE +/- 0.40, N = 3SE +/- 0.02, N = 3SE +/- 4.64, N = 3SE +/- 0.16, N = 3SE +/- 3.17, N = 3611.98476.18843.40840.64838.091. (CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

JPEG XL

Input: PNG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 5Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.17, N = 3SE +/- 0.24, N = 3SE +/- 0.41, N = 374.2778.4179.231. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 7Clang 12.0Clang 11.0AMD AOCC 3.03691215SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.08, N = 312.1512.0111.371. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: PNG - Encode Speed: 8Clang 12.0Clang 11.0AMD AOCC 3.00.18450.3690.55350.7380.9225SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.820.800.811. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 5

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 5Clang 12.0Clang 11.0AMD AOCC 3.01530456075SE +/- 0.14, N = 3SE +/- 0.20, N = 3SE +/- 0.17, N = 366.6665.5865.571. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 7Clang 12.0Clang 11.0AMD AOCC 3.01530456075SE +/- 0.16, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 366.3865.4365.681. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 8

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.3Input: JPEG - Encode Speed: 8Clang 12.0Clang 11.0AMD AOCC 3.0714212835SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.06, N = 328.1327.2427.291. (CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

LibRaw

Post-Processing Benchmark

OpenBenchmarking.orgMpix/sec, More Is BetterLibRaw 0.20Post-Processing BenchmarkGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01326395265SE +/- 0.19, N = 3SE +/- 0.16, N = 3SE +/- 0.23, N = 3SE +/- 0.12, N = 3SE +/- 0.33, N = 3SE +/- 0.04, N = 360.2057.2458.9041.7838.7141.641. (CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT1GCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.06001200180024003000SE +/- 0.16, N = 3SE +/- 0.48, N = 3SE +/- 6.09, N = 3SE +/- 2.64, N = 3SE +/- 1.69, N = 3SE +/- 8.09, N = 31082.371114.602719.992718.531872.762654.721. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1GCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.060120180240300SE +/- 0.00, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 3SE +/- 0.11, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3269.67281.15284.76284.64205.07211.731. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2GCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.04080120160200SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 3174.81173.23202.10202.09168.82178.851. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0400K800K1200K1600K2000KSE +/- 760.80, N = 5SE +/- 1626.80, N = 5SE +/- 956.77, N = 5SE +/- 1798.40, N = 5SE +/- 2852.59, N = 5SE +/- 2098.00, N = 51446372149425014671791570966163826516978461. (CC) gcc options: -O3 -march=native

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 1 - Buffer Length: 256 - Filter Length: 57GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.013M26M39M52M65MSE +/- 870702.21, N = 3SE +/- 318169.94, N = 3SE +/- 6887.99, N = 3SE +/- 790005.27, N = 3SE +/- 40360.87, N = 3SE +/- 47026.00, N = 36140400060886333624673335566300056307000574113331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 32 - Buffer Length: 256 - Filter Length: 57GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0400M800M1200M1600M2000MSE +/- 4864497.23, N = 3SE +/- 17297784.06, N = 3SE +/- 15763988.50, N = 3SE +/- 2255610.29, N = 3SE +/- 1331665.62, N = 3SE +/- 2130988.29, N = 31721900000167980000017180000001564833333157840000016096333331. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 64 - Buffer Length: 256 - Filter Length: 57GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0700M1400M2100M2800M3500MSE +/- 2961043.36, N = 3SE +/- 1154700.54, N = 3SE +/- 4643753.27, N = 3SE +/- 6045475.81, N = 3SE +/- 2452436.43, N = 3SE +/- 1234233.91, N = 32940466667298940000029428666673070633333305136666731004000001. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 128 - Buffer Length: 256 - Filter Length: 57GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0800M1600M2400M3200M4000MSE +/- 3384441.53, N = 3SE +/- 6016181.88, N = 3SE +/- 1679616.36, N = 3SE +/- 883804.91, N = 3SE +/- 1559202.08, N = 3SE +/- 1543084.93, N = 33012066667305576666730050333333643766667359653333336064666671. (CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read OnlyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.05K10K15K20K25KSE +/- 41.57, N = 3SE +/- 281.76, N = 15SE +/- 118.05, N = 3SE +/- 303.43, N = 3SE +/- 289.16, N = 323895236612484524310249431. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read WriteGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.07001400210028003500SE +/- 4.79, N = 3SE +/- 28.00, N = 3SE +/- 11.40, N = 3SE +/- 3.48, N = 3SE +/- 14.62, N = 3329833833369328133121. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read OnlyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0200K400K600K800K1000KSE +/- 1623.23, N = 3SE +/- 1514.63, N = 3SE +/- 183.22, N = 3SE +/- 720.87, N = 3SE +/- 1740.88, N = 3105712510908241076357106902210693671. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read OnlyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0200K400K600K800K1000KSE +/- 8843.08, N = 3SE +/- 8885.95, N = 3SE +/- 8859.63, N = 3SE +/- 6289.60, N = 3SE +/- 13844.42, N = 3106748610901601089731107120910655061. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read WriteGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.013K26K39K52K65KSE +/- 994.44, N = 3SE +/- 899.82, N = 3SE +/- 469.64, N = 3SE +/- 162.92, N = 3SE +/- 400.92, N = 359364563695889462319616161. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read WriteGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.012K24K36K48K60KSE +/- 396.40, N = 3SE +/- 211.73, N = 3SE +/- 591.89, N = 7SE +/- 702.52, N = 15SE +/- 883.12, N = 353825531025301956684544881. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: DefaultGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.31430.62860.94291.25721.5715SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 31.3971.3861.3721.3311.3361.351-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.51171.02341.53512.04682.5585SE +/- 0.005, N = 3SE +/- 0.007, N = 3SE +/- 0.010, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.001, N = 32.2732.2742.2252.1992.2402.262-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, LosslessGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0510152025SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.13, N = 3SE +/- 0.03, N = 319.3018.3118.8819.0218.5719.13-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Highest CompressionGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0246810SE +/- 0.009, N = 3SE +/- 0.021, N = 3SE +/- 0.006, N = 3SE +/- 0.004, N = 3SE +/- 0.018, N = 3SE +/- 0.009, N = 37.0537.0037.0786.3096.2436.578-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.orgEncode Time - Seconds, Fewer Is BetterWebP Image Encode 1.1Encode Settings: Quality 100, Lossless, Highest CompressionGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0918273645SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.05, N = 339.0737.9538.5538.4537.7338.34-ltiff1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -lpng16 -ljpeg

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: TBBGCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.015003000450060007500SE +/- 74.84, N = 3SE +/- 67.68, N = 3SE +/- 86.43, N = 3SE +/- 87.21, N = 3SE +/- 67.11, N = 7SE +/- 52.54, N = 3510751817085678062476945-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: OpenMPGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.016003200480064008000SE +/- 3.18, N = 3SE +/- 2.60, N = 3SE +/- 14.89, N = 3SE +/- 20.42, N = 3SE +/- 22.73, N = 3545155247507702974771. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ TasksGCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.016003200480064008000SE +/- 49.08, N = 3SE +/- 31.52, N = 3SE +/- 17.21, N = 3SE +/- 33.67, N = 3SE +/- 7.31, N = 3SE +/- 41.46, N = 3541456107367743768367189-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.orgms, Fewer Is BettertoyBrot Fractal Generator 2020-11-18Implementation: C++ ThreadsGCC 9.3GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.015003000450060007500SE +/- 8.33, N = 3SE +/- 6.12, N = 3SE +/- 15.06, N = 3SE +/- 30.90, N = 3SE +/- 25.04, N = 3SE +/- 24.26, N = 3514253837143722063957144-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-flto-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc-lm -lgcc -lgcc_s -lc1. (CXX) g++ options: -O3 -march=native -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.26530.53060.79591.06121.3265SE +/- 0.00349, N = 3SE +/- 0.00296, N = 3SE +/- 0.00199, N = 3SE +/- 0.00127, N = 3SE +/- 0.00160, N = 31.174861.178941.077011.080111.03899-fopenmp - MIN: 1.12-fopenmp - MIN: 1.12-fopenmp=libomp - MIN: 1.04-fopenmp=libomp - MIN: 1.03-fopenmp=libomp - MIN: 0.991. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.82641.65282.47923.30564.132SE +/- 0.03246, N = 3SE +/- 0.02637, N = 3SE +/- 0.01639, N = 3SE +/- 0.04735, N = 3SE +/- 0.02018, N = 33.672783.611443.285073.527873.41583-fopenmp - MIN: 3.39-fopenmp - MIN: 3.37-fopenmp=libomp - MIN: 3.15-fopenmp=libomp - MIN: 3.29-fopenmp=libomp - MIN: 3.241. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.26940.53880.80821.07761.347SE +/- 0.00597, N = 3SE +/- 0.00438, N = 3SE +/- 0.00286, N = 3SE +/- 0.00395, N = 3SE +/- 0.00668, N = 31.174341.197471.075071.075771.04484-fopenmp - MIN: 0.96-fopenmp - MIN: 0.98-fopenmp=libomp - MIN: 0.87-fopenmp=libomp - MIN: 0.86-fopenmp=libomp - MIN: 0.831. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.15980.31960.47940.63920.799SE +/- 0.003317, N = 3SE +/- 0.003112, N = 3SE +/- 0.011383, N = 3SE +/- 0.008914, N = 3SE +/- 0.000764, N = 30.6540100.6462520.7101240.5947290.554231-fopenmp - MIN: 0.59-fopenmp - MIN: 0.6-fopenmp=libomp - MIN: 0.64-fopenmp=libomp - MIN: 0.53-fopenmp=libomp - MIN: 0.51. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.27480.54960.82441.09921.374SE +/- 0.001032, N = 3SE +/- 0.001247, N = 3SE +/- 0.018279, N = 4SE +/- 0.000480, N = 3SE +/- 0.000645, N = 30.8693080.8707841.2213200.8411690.833921-fopenmp - MIN: 0.84-fopenmp - MIN: 0.84-fopenmp=libomp - MIN: 1.13-fopenmp=libomp - MIN: 0.82-fopenmp=libomp - MIN: 0.811. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0246810SE +/- 0.02683, N = 3SE +/- 0.03687, N = 3SE +/- 0.00123, N = 3SE +/- 0.00568, N = 3SE +/- 0.00485, N = 37.192137.236861.444251.457571.37059-fopenmp - MIN: 6.14-fopenmp - MIN: 6.18-fopenmp=libomp - MIN: 1.34-fopenmp=libomp - MIN: 1.35-fopenmp=libomp - MIN: 1.281. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.67581.35162.02742.70323.379SE +/- 0.00845, N = 3SE +/- 0.00883, N = 3SE +/- 0.02100, N = 3SE +/- 0.02389, N = 3SE +/- 0.00564, N = 32.997593.003412.367972.318592.28755-fopenmp - MIN: 2.24-fopenmp - MIN: 2.35-fopenmp=libomp - MIN: 2.01-fopenmp=libomp - MIN: 1.92-fopenmp=libomp - MIN: 1.911. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.45810.91621.37431.83242.2905SE +/- 0.01150, N = 3SE +/- 0.00384, N = 3SE +/- 0.01922, N = 12SE +/- 0.00118, N = 3SE +/- 0.00195, N = 31.662601.642682.036061.605401.59597-fopenmp - MIN: 1.59-fopenmp - MIN: 1.58-fopenmp=libomp - MIN: 1.81-fopenmp=libomp - MIN: 1.55-fopenmp=libomp - MIN: 1.541. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.13550.2710.40650.5420.6775SE +/- 0.001469, N = 3SE +/- 0.001964, N = 3SE +/- 0.002843, N = 3SE +/- 0.001652, N = 3SE +/- 0.000365, N = 30.5991400.6021550.4919400.4892780.459724-fopenmp - MIN: 0.56-fopenmp - MIN: 0.57-fopenmp=libomp - MIN: 0.47-fopenmp=libomp - MIN: 0.46-fopenmp=libomp - MIN: 0.441. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.1770.3540.5310.7080.885SE +/- 0.002405, N = 3SE +/- 0.002532, N = 3SE +/- 0.004246, N = 3SE +/- 0.001200, N = 3SE +/- 0.001713, N = 30.7867620.7824760.7797760.7791010.773233-fopenmp - MIN: 0.75-fopenmp - MIN: 0.73-fopenmp=libomp - MIN: 0.73-fopenmp=libomp - MIN: 0.73-fopenmp=libomp - MIN: 0.721. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 4.44, N = 3SE +/- 3.72, N = 3SE +/- 3.92, N = 3SE +/- 9.46, N = 3SE +/- 5.94, N = 31357.291382.411302.701276.041267.18-fopenmp - MIN: 1335.63-fopenmp - MIN: 1360.58-fopenmp=libomp - MIN: 1289.86-fopenmp=libomp - MIN: 1249.65-fopenmp=libomp - MIN: 1248.351. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0140280420560700SE +/- 0.64, N = 3SE +/- 0.61, N = 3SE +/- 9.50, N = 3SE +/- 0.83, N = 3SE +/- 0.53, N = 3658.66659.27593.97563.20544.10-fopenmp - MIN: 639.86-fopenmp - MIN: 642.67-fopenmp=libomp - MIN: 570.44-fopenmp=libomp - MIN: 550.23-fopenmp=libomp - MIN: 532.321. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 3.05, N = 3SE +/- 1.75, N = 3SE +/- 3.61, N = 3SE +/- 7.11, N = 3SE +/- 1.97, N = 31358.561379.511307.491277.621259.59-fopenmp - MIN: 1337.17-fopenmp - MIN: 1361.6-fopenmp=libomp - MIN: 1293.38-fopenmp=libomp - MIN: 1252.39-fopenmp=libomp - MIN: 1247.291. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0140280420560700SE +/- 1.25, N = 3SE +/- 0.83, N = 3SE +/- 1.89, N = 3SE +/- 0.25, N = 3SE +/- 0.90, N = 3659.19658.28590.18562.97544.31-fopenmp - MIN: 642.05-fopenmp - MIN: 639.78-fopenmp=libomp - MIN: 575.41-fopenmp=libomp - MIN: 551.49-fopenmp=libomp - MIN: 531.91. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.0850.170.2550.340.425SE +/- 0.000576, N = 3SE +/- 0.004341, N = 3SE +/- 0.000321, N = 3SE +/- 0.000247, N = 3SE +/- 0.000492, N = 30.3769920.3777330.3136890.3155220.301885-fopenmp - MIN: 0.36-fopenmp - MIN: 0.36-fopenmp=libomp - MIN: 0.3-fopenmp=libomp - MIN: 0.3-fopenmp=libomp - MIN: 0.291. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.030060090012001500SE +/- 4.57, N = 3SE +/- 2.65, N = 3SE +/- 1.78, N = 3SE +/- 9.75, N = 3SE +/- 0.58, N = 31356.911375.711305.101271.911268.08-fopenmp - MIN: 1335.04-fopenmp - MIN: 1355.68-fopenmp=libomp - MIN: 1294.76-fopenmp=libomp - MIN: 1252.33-fopenmp=libomp - MIN: 1257.351. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0140280420560700SE +/- 0.52, N = 3SE +/- 1.86, N = 3SE +/- 3.02, N = 3SE +/- 0.10, N = 3SE +/- 0.62, N = 3657.88658.04597.48563.25544.60-fopenmp - MIN: 638.35-fopenmp - MIN: 635.78-fopenmp=libomp - MIN: 580.8-fopenmp=libomp - MIN: 551.31-fopenmp=libomp - MIN: 532.911. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPUGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.26380.52760.79141.05521.319SE +/- 0.003430, N = 3SE +/- 0.005622, N = 3SE +/- 0.004576, N = 3SE +/- 0.006530, N = 3SE +/- 0.004625, N = 30.7177820.7881921.1725801.1514001.170440-fopenmp - MIN: 0.67-fopenmp - MIN: 0.74-fopenmp=libomp - MIN: 1.12-fopenmp=libomp - MIN: 1.09-fopenmp=libomp - MIN: 1.111. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.09K18K27K36K45KSE +/- 453.41, N = 14SE +/- 3.94, N = 3SE +/- 102.23, N = 3SE +/- 64.93, N = 3SE +/- 0.81, N = 3SE +/- 9.32, N = 342399.8134199.6034979.2933246.8433178.5033146.031. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.016K32K48K64K80KSE +/- 971.24, N = 3SE +/- 42.62, N = 3SE +/- 23.55, N = 3SE +/- 10.95, N = 3SE +/- 4.51, N = 3SE +/- 242.64, N = 376805.5851376.8251770.5151596.8751900.4351885.521. (CXX) g++ options: -O3 -march=native -fopenmp

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average LatencyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.00950.0190.02850.0380.0475SE +/- 0.000, N = 3SE +/- 0.001, N = 15SE +/- 0.000, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.0420.0420.0400.0410.0401. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average LatencyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.06860.13720.20580.27440.343SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 3SE +/- 0.002, N = 30.3030.2960.2970.3050.3021. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average LatencyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.02140.04280.06420.08560.107SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.0950.0920.0930.0940.0941. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average LatencyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.05290.10580.15870.21160.2645SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.003, N = 30.2350.2300.2300.2340.2351. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average LatencyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.00.39980.79961.19941.59921.999SE +/- 0.028, N = 3SE +/- 0.029, N = 3SE +/- 0.013, N = 3SE +/- 0.004, N = 3SE +/- 0.011, N = 31.6881.7771.7011.6071.6261. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL pgbench 13.0Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average LatencyGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.01.06452.1293.19354.2585.3225SE +/- 0.034, N = 3SE +/- 0.021, N = 3SE +/- 0.052, N = 7SE +/- 0.054, N = 15SE +/- 0.074, N = 34.6574.7224.7314.4314.6031. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MrBayes Analysis 3.2.7Primate Phylogeny AnalysisGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0 LTOClang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.16, N = 3SE +/- 0.33, N = 3SE +/- 1.29, N = 4SE +/- 1.09, N = 3SE +/- 0.98, N = 3SE +/- 0.98, N = 3SE +/- 0.26, N = 389.1689.4393.6693.6389.1288.6286.74-mabm-mabm-mabm-flto1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 0GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01224364860SE +/- 0.08, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 352.2251.0351.4547.8847.8948.131. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 2GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0714212835SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 327.7827.1027.3925.1825.4725.601. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.03691215SE +/- 0.031, N = 3SE +/- 0.052, N = 3SE +/- 0.032, N = 3SE +/- 0.014, N = 3SE +/- 0.022, N = 3SE +/- 0.016, N = 310.39910.29110.4179.5109.5369.7251. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.82331.64662.46993.29324.1165SE +/- 0.016, N = 3SE +/- 0.002, N = 3SE +/- 0.022, N = 3SE +/- 0.014, N = 3SE +/- 0.010, N = 3SE +/- 0.004, N = 33.6593.6073.6433.3613.4293.5431. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 6, LosslessGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0714212835SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 329.0827.0626.9125.2226.0325.781. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.9.0Encoder Speed: 10, LosslessGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0246810SE +/- 0.022, N = 3SE +/- 0.017, N = 3SE +/- 0.007, N = 3SE +/- 0.013, N = 3SE +/- 0.011, N = 3SE +/- 0.022, N = 36.1316.1496.1075.7465.8795.9481. (CXX) g++ options: -O3 -fPIC -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.048121620SE +/- 0.014, N = 3SE +/- 0.027, N = 3SE +/- 0.014, N = 3SE +/- 0.023, N = 3SE +/- 0.009, N = 3SE +/- 0.063, N = 39.1589.2279.02915.87015.59915.6491. (CC) gcc options: -lm -lpthread -O3 -march=native

POV-Ray

Trace Time

OpenBenchmarking.orgSeconds, Fewer Is BetterPOV-Ray 3.7.0.7Trace TimeGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.03691215SE +/- 0.053, N = 3SE +/- 0.049, N = 3SE +/- 0.041, N = 3SE +/- 0.032, N = 3SE +/- 0.026, N = 39.9689.5709.2969.4089.4941. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.03691215SE +/- 0.011, N = 5SE +/- 0.006, N = 5SE +/- 0.008, N = 5SE +/- 0.007, N = 5SE +/- 0.006, N = 5SE +/- 0.006, N = 58.5348.7098.5677.8547.9799.280-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0246810SE +/- 0.019, N = 3SE +/- 0.005, N = 3SE +/- 0.006, N = 3SE +/- 0.003, N = 3SE +/- 0.021, N = 3SE +/- 0.008, N = 37.0117.4737.2318.2568.2508.142-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr-ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus EncodeGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0246810SE +/- 0.002, N = 5SE +/- 0.002, N = 5SE +/- 0.003, N = 5SE +/- 0.013, N = 5SE +/- 0.002, N = 57.5047.3817.4697.5677.392-fvisibility=hidden-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -march=native -logg -lm

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.9GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.050100150200250SE +/- 0.54, N = 3SE +/- 0.18, N = 3SE +/- 0.32, N = 3SE +/- 0.44, N = 3SE +/- 0.28, N = 3SE +/- 0.82, N = 3232.57233.51231.24236.92240.21240.411. (CC) gcc options: -O3 -march=native -fvisibility=hidden

Ngspice

Circuit: C2670

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C2670GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0306090120150SE +/- 1.32, N = 3SE +/- 1.53, N = 3SE +/- 0.48, N = 3SE +/- 0.53, N = 3SE +/- 0.06, N = 3SE +/- 0.22, N = 3101.54103.01103.60118.87103.83103.93-lstdc++-lstdc++-lstdc++-lstdc++1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C7552GCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.020406080100SE +/- 0.60, N = 3SE +/- 0.43, N = 3SE +/- 0.12, N = 3SE +/- 1.11, N = 6SE +/- 1.37, N = 3SE +/- 0.12, N = 389.0990.2690.4395.9690.5391.99-lstdc++-lstdc++-lstdc++-lstdc++1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Tachyon

Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99b6Total TimeGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.048121620SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 315.6815.5016.1516.0516.4116.061. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: DefaultGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.00.65661.31321.96982.62643.283SE +/- 0.038, N = 3SE +/- 0.032, N = 7SE +/- 0.027, N = 3SE +/- 0.031, N = 3SE +/- 0.010, N = 32.7782.9182.7392.7432.8161. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 7GCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0306090120150SE +/- 0.10, N = 3SE +/- 0.16, N = 3SE +/- 0.10, N = 3SE +/- 0.10, N = 3SE +/- 0.09, N = 3118.45116.66109.53109.64109.811. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 7GCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.050100150200250SE +/- 1.32, N = 3SE +/- 0.46, N = 3SE +/- 0.07, N = 3SE +/- 0.66, N = 3SE +/- 0.17, N = 3220.94215.57207.01203.63205.031. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5GCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0246810SE +/- 0.017, N = 3SE +/- 0.017, N = 3SE +/- 0.006, N = 3SE +/- 0.022, N = 3SE +/- 0.028, N = 36.7536.9346.6907.3667.4031. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless CompressionGCC 9.3GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.090180270360450SE +/- 1.92, N = 3SE +/- 3.10, N = 3SE +/- 0.49, N = 3SE +/- 0.17, N = 3SE +/- 0.39, N = 3388.95406.03374.04392.85382.991. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: MediumGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.01.09682.19363.29044.38725.484SE +/- 0.0035, N = 3SE +/- 0.0099, N = 3SE +/- 0.0047, N = 3SE +/- 0.0116, N = 3SE +/- 0.0013, N = 3SE +/- 0.0042, N = 34.87454.81604.86994.00583.98373.88111. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ThoroughGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0246810SE +/- 0.0029, N = 3SE +/- 0.0034, N = 3SE +/- 0.0011, N = 3SE +/- 0.0028, N = 3SE +/- 0.0026, N = 3SE +/- 0.0015, N = 37.85377.69897.83706.76476.76746.64091. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.4Preset: ExhaustiveGCC 9.3GCC 11.0.1GCC 10.3Clang 12.0Clang 11.0AMD AOCC 3.0510152025SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 319.4819.6219.4618.9919.0318.911. (CXX) g++ options: -O3 -march=native -flto -pthread


Phoronix Test Suite v10.8.5