EPYC 7763 LLVM Clang Compiler Tests

AMD EPYC 7763 64-Core testing with a Supermicro H12SSL-i v1.01 (2.0 BIOS) and ASPEED on Ubuntu 20.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2104140-IB-EPYC7763L31
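
As a rough illustration of reproducing one of these runs, the sketch below assembles the command quoted above together with the CFLAGS/CXXFLAGS values listed in this result file's environment details. The helper name `build_invocation` is hypothetical, and the sketch only builds the command and environment; it does not launch anything.

```python
import os
import shlex

# Result identifier and compiler flags as quoted in this result file.
RESULT_ID = "2104140-IB-EPYC7763L31"
BASE_FLAGS = "-O3 -march=native"


def build_invocation(result_id=RESULT_ID, flags=BASE_FLAGS, lto=False):
    """Return (command, environment) for a comparison run.

    Hypothetical helper: it only prepares the pieces and runs nothing.
    """
    if lto:
        # The "Clang 12.0 LTO" run in this file adds -flto to both variables.
        flags = f"{flags} -flto"
    env = dict(os.environ, CFLAGS=flags, CXXFLAGS=flags)
    cmd = ["phoronix-test-suite", "benchmark", result_id]
    return cmd, env


if __name__ == "__main__":
    cmd, env = build_invocation()
    print(f'CFLAGS="{env["CFLAGS"]}" CXXFLAGS="{env["CXXFLAGS"]}" {shlex.join(cmd)}')
    # To actually start the benchmark, one could pass both values to
    # subprocess.run(cmd, env=env, check=True).
```

Selecting which compiler binary is used (for example through CC and CXX) is outside this sketch and would need to be handled separately.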

Run Management

Result Identifier    Date Run         Test Duration
Clang 12.0           April 10 2021    8 Hours, 55 Minutes
Clang 11.0           April 11 2021    7 Hours, 36 Minutes
Clang 12.0 LTO       April 12 2021    23 Minutes
GCC 9.3              April 12 2021    7 Hours, 23 Minutes
GCC 10.3             April 13 2021    7 Hours, 10 Minutes
GCC 11.0.1           April 13 2021    4 Hours, 42 Minutes
AMD AOCC 3.0         April 14 2021    7 Hours, 37 Minutes
Average                               6 Hours, 15 Minutes


HTML result view exported from: https://openbenchmarking.org/result/2104140-IB-EPYC7763L31&gru&sro.

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: ASPEED
Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe
OS: Ubuntu 20.04
Kernel: 5.12.0-051200rc6daily20210408-generic (x86_64) 20210407
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
File-System: ext4
Screen Resolution: 1024x768

Compiler
- Clang 12.0: Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73
- Clang 11.0: Clang 11.0.0-2~ubuntu20.04.1
- Clang 12.0 LTO: Clang 12.0.0-++20210409092622+fa0971b87fb2-1~exp1~20210409193326.73
- GCC 9.3: GCC 9.3.0
- GCC 10.3: GCC 10.3.0
- GCC 11.0.1: GCC 11.0.1 20210413
- AMD AOCC 3.0: Clang 12.0.0

Kernel Details
- Transparent Huge Pages: madvise

Environment Details
- Clang 12.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- Clang 11.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- Clang 12.0 LTO: CXXFLAGS="-O3 -march=native -flto" CFLAGS="-O3 -march=native -flto"
- GCC 9.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- GCC 10.3: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- GCC 11.0.1: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"
- AMD AOCC 3.0: CXXFLAGS="-O3 -march=native" CFLAGS="-O3 -march=native"

Processor Details
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled)
- CPU Microcode: 0xa001119

Python Details
- Python 3.8.2

Security Details
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Compiler Details
- GCC 9.3: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- GCC 10.3: --disable-multilib --enable-checking=release
- GCC 11.0.1: --disable-multilib --enable-checking=release
- AMD AOCC 3.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: (unknown)

Result overview table: one column per run (Clang 12.0, Clang 11.0, Clang 12.0 LTO, GCC 9.3, GCC 10.3, GCC 11.0.1, AMD AOCC 3.0) and one row per test, covering the following test profiles: dav1d, aom-av1, svt-av1, svt-hevc, svt-vp9, x265, simdjson, viennacl, onnx, graphics-magick, coremark, securemark, compress-lz4, quantlib, fftw, scimark2, botan, jpegxl, libraw, etcpak, tscp, liquid-dsp, pgbench, webp, toybrot, onednn, financebench, mrbayes, avifenc, c-ray, povray, encode-flac, encode-mp3, encode-opus, gcrypt, ngspice, tachyon, webp2, astcenc.

dav1d

Video Input: Chimera 1080p

dav1d 0.8.2 (FPS, More Is Better)
- AMD AOCC 3.0: 1188.43 (SE +/- 0.97, N = 3; MIN: 703.73 / MAX: 1484.94; additional option: -lm)
- Clang 11.0: 1190.41 (SE +/- 6.69, N = 3; MIN: 685.16 / MAX: 1496.36; additional option: -lm)
- Clang 12.0: 1198.22 (SE +/- 2.95, N = 3; MIN: 700.24 / MAX: 1494.16)
- GCC 10.3: 1171.04 (SE +/- 3.74, N = 3; MIN: 683.28 / MAX: 1473.51; additional option: -lm)
- GCC 11.0.1: 1180.44 (SE +/- 1.75, N = 3; MIN: 680.31 / MAX: 1485.74; additional option: -lm)
- GCC 9.3: 1145.50 (SE +/- 5.12, N = 3; MIN: 664.19 / MAX: 1441.54; additional option: -lm)
1. (CC) gcc options: -O3 -march=native -pthread
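
Each result is reported as a mean with "SE +/- x, N = y". As a minimal sketch of what such a figure conventionally means, the following computes the standard error of the mean as the sample standard deviation divided by the square root of N. Whether the Phoronix Test Suite uses exactly this estimator is an assumption, and the per-run readings below are hypothetical, since the individual runs are not part of this result file.

```python
import math
import statistics


def mean_and_standard_error(samples):
    """Return (mean, standard error of the mean) for a list of run results."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    mean = statistics.fmean(samples)
    # Sample standard deviation (n - 1 denominator) divided by sqrt(n).
    se = statistics.stdev(samples) / math.sqrt(n)
    return mean, se


# Hypothetical FPS readings for three runs; NOT taken from this result file.
hypothetical_runs = [1186.5, 1189.2, 1189.6]
mean, se = mean_and_standard_error(hypothetical_runs)
print(f"{mean:.2f} FPS, SE +/- {se:.2f}, N = {len(hypothetical_runs)}")
```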

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.2 (FPS, More Is Better)
- AMD AOCC 3.0: 541.58 (SE +/- 1.13, N = 3; MIN: 259.4 / MAX: 585.8; additional option: -lm)
- Clang 11.0: 543.43 (SE +/- 1.43, N = 3; MIN: 256.75 / MAX: 593.99; additional option: -lm)
- Clang 12.0: 541.56 (SE +/- 1.79, N = 3; MIN: 252.01 / MAX: 587.53)
- GCC 10.3: 536.71 (SE +/- 0.67, N = 3; MIN: 256.44 / MAX: 577.82; additional option: -lm)
- GCC 11.0.1: 538.28 (SE +/- 2.51, N = 3; MIN: 251.6 / MAX: 584.38; additional option: -lm)
- GCC 9.3: 530.82 (SE +/- 1.35, N = 3; MIN: 248.84 / MAX: 574.28; additional option: -lm)
1. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.2 (FPS, More Is Better)
- AMD AOCC 3.0: 1251.91 (SE +/- 4.95, N = 3; MIN: 543.89 / MAX: 1394.16; additional option: -lm)
- Clang 11.0: 1251.25 (SE +/- 2.13, N = 3; MIN: 556.46 / MAX: 1394.06; additional option: -lm)
- Clang 12.0: 1244.11 (SE +/- 7.87, N = 3; MIN: 549.81 / MAX: 1390.03)
- GCC 10.3: 1245.11 (SE +/- 8.15, N = 3; MIN: 539.07 / MAX: 1398.87; additional option: -lm)
- GCC 11.0.1: 1249.74 (SE +/- 1.96, N = 3; MIN: 559.74 / MAX: 1387.11; additional option: -lm)
- GCC 9.3: 1228.63 (SE +/- 2.25, N = 3; MIN: 555.28 / MAX: 1361.68; additional option: -lm)
1. (CC) gcc options: -O3 -march=native -pthread

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.2 (FPS, More Is Better)
- AMD AOCC 3.0: 192.00 (SE +/- 0.39, N = 3; MIN: 118.57 / MAX: 324.98; additional option: -lm)
- Clang 11.0: 184.19 (SE +/- 0.48, N = 3; MIN: 114.52 / MAX: 310.5; additional option: -lm)
- Clang 12.0: 308.32 (SE +/- 0.93, N = 3; MIN: 220.53 / MAX: 490.51)
- GCC 10.3: 316.14 (SE +/- 0.21, N = 3; MIN: 218.19 / MAX: 515.85; additional option: -lm)
- GCC 11.0.1: 334.35 (SE +/- 1.11, N = 3; MIN: 234.24 / MAX: 544.9; additional option: -lm)
- GCC 9.3: 305.36 (SE +/- 0.71, N = 3; MIN: 210.86 / MAX: 493.21; additional option: -lm)
1. (CC) gcc options: -O3 -march=native -pthread
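
The Chimera 1080p 10-bit result is the one dav1d test where the compilers differ widely. A small sketch such as the following, using the FPS values reported above, expresses each run as a fraction of the fastest one. The function name `relative_to_best` is hypothetical.

```python
# FPS values for dav1d, Chimera 1080p 10-bit, as reported in this result file.
chimera_10bit_fps = {
    "AMD AOCC 3.0": 192.00,
    "Clang 11.0": 184.19,
    "Clang 12.0": 308.32,
    "GCC 10.3": 316.14,
    "GCC 11.0.1": 334.35,
    "GCC 9.3": 305.36,
}


def relative_to_best(results):
    """Return (name of fastest run, {run: value / fastest value}).

    Assumes a "more is better" metric such as FPS.
    """
    best_name = max(results, key=results.get)
    best_value = results[best_name]
    return best_name, {name: value / best_value for name, value in results.items()}


best_name, relative = relative_to_best(chimera_10bit_fps)
for name, ratio in sorted(relative.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<14}{ratio:7.1%} of {best_name}")
```

For a "fewer is better" metric, `min` would replace `max` and the ratio would be inverted.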

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 0.21 (SE +/- 0.00, N = 3)
- Clang 12.0: 0.21 (SE +/- 0.00, N = 3)
- GCC 10.3: 0.21 (SE +/- 0.00, N = 3)
- GCC 11.0.1: 0.21 (SE +/- 0.00, N = 3)
- GCC 9.3: 0.20 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 4.95 (SE +/- 0.07, N = 3)
- Clang 12.0: 4.87 (SE +/- 0.04, N = 3)
- GCC 10.3: 4.84 (SE +/- 0.04, N = 3)
- GCC 11.0.1: 4.84 (SE +/- 0.06, N = 3)
- GCC 9.3: 4.78 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 17.13 (SE +/- 0.11, N = 3)
- Clang 12.0: 17.22 (SE +/- 0.11, N = 3)
- GCC 10.3: 17.03 (SE +/- 0.05, N = 3)
- GCC 11.0.1: 17.37 (SE +/- 0.08, N = 3)
- GCC 9.3: 16.29 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 9.14 (SE +/- 0.03, N = 3)
- Clang 12.0: 8.99 (SE +/- 0.10, N = 3)
- GCC 10.3: 9.10 (SE +/- 0.04, N = 3)
- GCC 11.0.1: 9.41 (SE +/- 0.01, N = 3)
- GCC 9.3: 9.57 (SE +/- 0.11, N = 6)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 33.14 (SE +/- 0.22, N = 3)
- Clang 12.0: 33.39 (SE +/- 0.48, N = 3)
- GCC 10.3: 35.26 (SE +/- 0.19, N = 3)
- GCC 11.0.1: 35.26 (SE +/- 0.47, N = 3)
- GCC 9.3: 34.56 (SE +/- 0.12, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 37.28 (SE +/- 0.31, N = 3)
- Clang 12.0: 38.11 (SE +/- 0.43, N = 3)
- GCC 10.3: 39.32 (SE +/- 0.19, N = 3)
- GCC 11.0.1: 39.71 (SE +/- 0.29, N = 3)
- GCC 9.3: 39.12 (SE +/- 0.38, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 0.53 (SE +/- 0.00, N = 3)
- Clang 12.0: 0.53 (SE +/- 0.00, N = 3)
- GCC 10.3: 0.52 (SE +/- 0.00, N = 3)
- GCC 11.0.1: 0.52 (SE +/- 0.00, N = 3)
- GCC 9.3: 0.50 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 7.20 (SE +/- 0.01, N = 3)
- Clang 12.0: 7.10 (SE +/- 0.04, N = 3)
- GCC 10.3: 6.87 (SE +/- 0.00, N = 3)
- GCC 11.0.1: 6.95 (SE +/- 0.01, N = 3)
- GCC 9.3: 6.69 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 26.61 (SE +/- 0.13, N = 3)
- Clang 12.0: 26.85 (SE +/- 0.27, N = 3)
- GCC 10.3: 26.49 (SE +/- 0.25, N = 3)
- GCC 11.0.1: 27.01 (SE +/- 0.28, N = 3)
- GCC 9.3: 24.84 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 22.00 (SE +/- 0.15, N = 3)
- Clang 12.0: 22.13 (SE +/- 0.05, N = 3)
- GCC 10.3: 21.64 (SE +/- 0.09, N = 3)
- GCC 11.0.1: 22.11 (SE +/- 0.06, N = 3)
- GCC 9.3: 21.42 (SE +/- 0.11, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 86.09 (SE +/- 0.51, N = 3)
- Clang 12.0: 88.78 (SE +/- 1.07, N = 3)
- GCC 10.3: 93.05 (SE +/- 0.65, N = 3)
- GCC 11.0.1: 94.40 (SE +/- 0.47, N = 3)
- GCC 9.3: 91.97 (SE +/- 0.89, N = 3)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

AOM AV1

Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p

AOM AV1 3.0 (Frames Per Second, More Is Better)
- Clang 11.0: 100.55 (SE +/- 0.53, N = 3)
- Clang 12.0: 103.17 (SE +/- 0.31, N = 3)
- GCC 10.3: 107.46 (SE +/- 1.76, N = 3)
- GCC 11.0.1: 111.27 (SE +/- 1.15, N = 8)
- GCC 9.3: 106.55 (SE +/- 1.10, N = 8)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread
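
The twelve AOM AV1 results span very different frame rates (from about 0.2 to over 100 FPS), so an arithmetic mean would be dominated by the fastest presets. One common way to summarize them per compiler is a geometric mean. The sketch below applies it to the values reported above, listed in the order of the sections (the six 4K results, then the six 1080p results). It is an illustration only, not necessarily how OpenBenchmarking.org computes its own geometric means.

```python
from statistics import geometric_mean

# Frames per second for the twelve AOM AV1 tests, in the order they appear above.
aom_av1_fps = {
    "Clang 11.0": [0.21, 4.95, 17.13, 9.14, 33.14, 37.28,
                   0.53, 7.20, 26.61, 22.00, 86.09, 100.55],
    "Clang 12.0": [0.21, 4.87, 17.22, 8.99, 33.39, 38.11,
                   0.53, 7.10, 26.85, 22.13, 88.78, 103.17],
    "GCC 10.3": [0.21, 4.84, 17.03, 9.10, 35.26, 39.32,
                 0.52, 6.87, 26.49, 21.64, 93.05, 107.46],
    "GCC 11.0.1": [0.21, 4.84, 17.37, 9.41, 35.26, 39.71,
                   0.52, 6.95, 27.01, 22.11, 94.40, 111.27],
    "GCC 9.3": [0.20, 4.78, 16.29, 9.57, 34.56, 39.12,
                0.50, 6.69, 24.84, 21.42, 91.97, 106.55],
}

# One geometric mean per compiler; higher is better because every test is FPS.
geomeans = {name: geometric_mean(values) for name, values in aom_av1_fps.items()}

for name, value in sorted(geomeans.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<12}{value:8.3f}")
```

Because every test uses the same "more is better" unit, the ratio of two compilers' geometric means equals the geometric mean of their per-test ratios, which makes the summary insensitive to the absolute speed of any one preset.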

SVT-AV1

Encoder Mode: Enc Mode 0 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 0.183 (SE +/- 0.000, N = 3)
- Clang 11.0: 0.181 (SE +/- 0.000, N = 3)
- Clang 12.0: 0.183 (SE +/- 0.000, N = 3)
- GCC 10.3: 0.169 (SE +/- 0.000, N = 3)
- GCC 11.0.1: 0.176 (SE +/- 0.000, N = 3)
- GCC 9.3: 0.129 (SE +/- 0.000, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 4 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 11.690 (SE +/- 0.189, N = 3)
- Clang 11.0: 11.821 (SE +/- 0.164, N = 4)
- Clang 12.0: 11.474 (SE +/- 0.170, N = 3)
- GCC 10.3: 11.230 (SE +/- 0.111, N = 9)
- GCC 11.0.1: 11.905 (SE +/- 0.139, N = 3)
- GCC 9.3: 9.325 (SE +/- 0.086, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-AV1

Encoder Mode: Enc Mode 8 - Input: 1080p

SVT-AV1 0.8 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 116.49 (SE +/- 0.33, N = 3)
- Clang 11.0: 117.39 (SE +/- 0.46, N = 3)
- Clang 12.0: 118.07 (SE +/- 0.10, N = 3)
- GCC 10.3: 109.70 (SE +/- 1.05, N = 3)
- GCC 11.0.1: 110.70 (SE +/- 0.18, N = 3)
- GCC 9.3: 92.98 (SE +/- 0.83, N = 3)
1. (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

SVT-HEVC

Tuning: 1 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 40.95 (SE +/- 0.08, N = 3)
- Clang 11.0: 41.01 (SE +/- 0.09, N = 3)
- Clang 12.0: 41.09 (SE +/- 0.17, N = 3)
- GCC 10.3: 39.03 (SE +/- 0.05, N = 3)
- GCC 11.0.1: 38.86 (SE +/- 0.17, N = 3)
- GCC 9.3: 38.41 (SE +/- 0.18, N = 3)
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 7 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 343.85 (SE +/- 1.09, N = 3)
- Clang 11.0: 346.89 (SE +/- 3.43, N = 3)
- Clang 12.0: 345.30 (SE +/- 1.56, N = 3)
- GCC 10.3: 330.53 (SE +/- 1.54, N = 3)
- GCC 11.0.1: 329.32 (SE +/- 1.51, N = 3)
- GCC 9.3: 322.42 (SE +/- 1.20, N = 3)
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-HEVC

Tuning: 10 - Input: Bosphorus 1080p

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 638.10 (SE +/- 3.03, N = 3)
- Clang 11.0: 652.74 (SE +/- 5.55, N = 3)
- Clang 12.0: 643.58 (SE +/- 3.01, N = 3)
- GCC 10.3: 615.62 (SE +/- 2.42, N = 3)
- GCC 11.0.1: 611.73 (SE +/- 5.75, N = 3)
- GCC 9.3: 605.50 (SE +/- 3.83, N = 3)
1. (CC) gcc options: -O3 -march=native -fPIE -fPIC -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

Tuning: VMAF Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 476.95 (SE +/- 2.67, N = 3)
- Clang 11.0: 481.05 (SE +/- 0.23, N = 3)
- Clang 12.0: 487.43 (SE +/- 1.37, N = 3)
- GCC 10.3: 472.61 (SE +/- 0.24, N = 3)
- GCC 11.0.1: 472.32 (SE +/- 1.15, N = 3)
- GCC 9.3: 463.12 (SE +/- 0.82, N = 3)
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 478.62 (SE +/- 1.94, N = 3)
- Clang 11.0: 482.02 (SE +/- 1.76, N = 3)
- Clang 12.0: 488.23 (SE +/- 0.73, N = 3)
- GCC 10.3: 477.67 (SE +/- 2.08, N = 3)
- GCC 11.0.1: 478.16 (SE +/- 1.13, N = 3)
- GCC 9.3: 464.57 (SE +/- 0.32, N = 3)
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-VP9

Tuning: Visual Quality Optimized - Input: Bosphorus 1080p

SVT-VP9 0.3 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 373.89 (SE +/- 2.72, N = 3)
- Clang 11.0: 373.99 (SE +/- 1.91, N = 3)
- Clang 12.0: 372.49 (SE +/- 1.11, N = 3)
- GCC 10.3: 364.12 (SE +/- 0.47, N = 3)
- GCC 11.0.1: 366.39 (SE +/- 0.70, N = 3)
- GCC 9.3: 354.21 (SE +/- 3.83, N = 3)
1. (CC) gcc options: -O3 -fcommon -march=native -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

x265

Video Input: Bosphorus 4K

x265 3.4 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 30.44 (SE +/- 0.13, N = 3)
- Clang 11.0: 29.94 (SE +/- 0.25, N = 3)
- Clang 12.0: 30.32 (SE +/- 0.23, N = 3)
- GCC 10.3: 28.60 (SE +/- 0.09, N = 3)
- GCC 11.0.1: 28.79 (SE +/- 0.07, N = 3)
- GCC 9.3: 28.91 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

x265

Video Input: Bosphorus 1080p

x265 3.4 (Frames Per Second, More Is Better)
- AMD AOCC 3.0: 73.51 (SE +/- 0.63, N = 3)
- Clang 11.0: 73.36 (SE +/- 0.49, N = 3)
- Clang 12.0: 74.00 (SE +/- 0.49, N = 3)
- GCC 10.3: 72.60 (SE +/- 0.32, N = 3)
- GCC 11.0.1: 71.79 (SE +/- 0.56, N = 3)
- GCC 9.3: 72.14 (SE +/- 0.26, N = 3)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lpthread -lrt -ldl -lnuma

simdjson

Throughput Test: Kostya

simdjson 0.8.2 (GB/s, More Is Better)
- AMD AOCC 3.0: 2.73 (SE +/- 0.00, N = 3)
- Clang 11.0: 2.68 (SE +/- 0.00, N = 3)
- Clang 12.0: 2.75 (SE +/- 0.01, N = 3)
- GCC 10.3: 2.77 (SE +/- 0.00, N = 3)
- GCC 9.3: 2.75 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: LargeRandom

simdjson 0.8.2 (GB/s, More Is Better)
- AMD AOCC 3.0: 0.82 (SE +/- 0.00, N = 3)
- Clang 11.0: 0.81 (SE +/- 0.00, N = 3)
- Clang 12.0: 0.84 (SE +/- 0.00, N = 3)
- GCC 10.3: 0.90 (SE +/- 0.00, N = 3)
- GCC 9.3: 0.94 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: PartialTweets

simdjson 0.8.2 (GB/s, More Is Better)
- AMD AOCC 3.0: 4.33 (SE +/- 0.01, N = 3)
- Clang 11.0: 4.41 (SE +/- 0.01, N = 3)
- Clang 12.0: 4.60 (SE +/- 0.01, N = 3)
- GCC 10.3: 4.02 (SE +/- 0.00, N = 3)
- GCC 9.3: 3.93 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread

simdjson

Throughput Test: DistinctUserID

simdjson 0.8.2 (GB/s, More Is Better)
- AMD AOCC 3.0: 4.47 (SE +/- 0.00, N = 3)
- Clang 11.0: 4.41 (SE +/- 0.00, N = 3)
- Clang 12.0: 4.62 (SE +/- 0.00, N = 3)
- GCC 10.3: 4.13 (SE +/- 0.01, N = 3)
- GCC 9.3: 3.98 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread

ViennaCL

Test: CPU BLAS - sCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
- AMD AOCC 3.0: 531.00 (SE +/- 32.29, N = 12; additional option: -fopenmp=libomp)
- Clang 11.0: 495.00 (SE +/- 36.50, N = 15; additional option: -fopenmp=libomp)
- Clang 12.0: 471.00 (SE +/- 15.30, N = 12; additional option: -fopenmp=libomp)
- GCC 10.3: 1065.60 (SE +/- 101.07, N = 12; additional option: -fopenmp)
- GCC 11.0.1: 1210.00 (SE +/- 25.34, N = 15; additional option: -fopenmp)
- GCC 9.3: 1217.00 (SE +/- 25.85, N = 15; additional option: -fopenmp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL
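
Several of the ViennaCL results carry a large standard error relative to their mean, which makes small differences between compilers hard to interpret. A minimal sketch of one way to flag such results is to divide each SE by its mean and compare against a cutoff. The 5% threshold below is an arbitrary assumption chosen for illustration, not a rule used by the Phoronix Test Suite.

```python
# (mean GB/s, standard error) for ViennaCL CPU BLAS - sCOPY, as reported above.
scopy_results = {
    "AMD AOCC 3.0": (531.00, 32.29),
    "Clang 11.0": (495.00, 36.50),
    "Clang 12.0": (471.00, 15.30),
    "GCC 10.3": (1065.60, 101.07),
    "GCC 11.0.1": (1210.00, 25.34),
    "GCC 9.3": (1217.00, 25.85),
}

# Hypothetical cutoff: treat a result as noisy when SE exceeds 5% of the mean.
NOISE_THRESHOLD = 0.05


def relative_standard_error(mean, se):
    """Standard error expressed as a fraction of the mean."""
    return se / mean


noisy = {
    name
    for name, (mean, se) in scopy_results.items()
    if relative_standard_error(mean, se) > NOISE_THRESHOLD
}

for name, (mean, se) in scopy_results.items():
    flag = "noisy" if name in noisy else "ok"
    print(f"{name:<14}{relative_standard_error(mean, se):6.1%}  {flag}")
```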

ViennaCL

Test: CPU BLAS - sAXPY

ViennaCL 1.7.1 (GB/s, More Is Better)
- AMD AOCC 3.0: 326.0 (SE +/- 26.90, N = 12; additional option: -fopenmp=libomp)
- Clang 11.0: 412.0 (SE +/- 34.43, N = 15; additional option: -fopenmp=libomp)
- Clang 12.0: 357.0 (SE +/- 15.69, N = 12; additional option: -fopenmp=libomp)
- GCC 10.3: 1350.0 (SE +/- 132.58, N = 12; additional option: -fopenmp)
- GCC 11.0.1: 1496.0 (SE +/- 62.40, N = 15; additional option: -fopenmp)
- GCC 9.3: 813.0 (SE +/- 2.85, N = 15; additional option: -fopenmp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - sDOT

ViennaCL 1.7.1 (GB/s, More Is Better)
- AMD AOCC 3.0: 477.00 (SE +/- 37.59, N = 12; additional option: -fopenmp=libomp)
- Clang 11.0: 462.00 (SE +/- 38.96, N = 15; additional option: -fopenmp=libomp)
- Clang 12.0: 434.00 (SE +/- 35.24, N = 12; additional option: -fopenmp=libomp)
- GCC 10.3: 592.97 (SE +/- 53.43, N = 12; additional option: -fopenmp)
- GCC 11.0.1: 649.00 (SE +/- 2.60, N = 15; additional option: -fopenmp)
- GCC 9.3: 636.00 (SE +/- 0.80, N = 15; additional option: -fopenmp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dCOPY

ViennaCL 1.7.1 (GB/s, More Is Better)
- AMD AOCC 3.0: 1944.0 (SE +/- 9.88, N = 12; additional option: -fopenmp=libomp)
- Clang 11.0: 1877.0 (SE +/- 8.32, N = 15; additional option: -fopenmp=libomp)
- Clang 12.0: 604.0 (SE +/- 15.32, N = 11; additional option: -fopenmp=libomp)
- GCC 10.3: 1461.2 (SE +/- 131.59, N = 12; additional option: -fopenmp)
- GCC 11.0.1: 1599.0 (SE +/- 2.67, N = 15; additional option: -fopenmp)
- GCC 9.3: 1587.0 (SE +/- 9.19, N = 15; additional option: -fopenmp)
1. (CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dAXPY

ViennaCL 1.7.1, GB/s (More Is Better)
AMD AOCC 3.0: 1017.0 (SE +/- 3.59, N = 12; -fopenmp=libomp)
Clang 11.0: 1043.0 (SE +/- 1.59, N = 15; -fopenmp=libomp)
Clang 12.0: 878.0 (SE +/- 20.06, N = 12; -fopenmp=libomp)
GCC 10.3: 2158.4 (SE +/- 194.35, N = 12; -fopenmp)
GCC 11.0.1: 2359.0 (SE +/- 2.74, N = 15; -fopenmp)
GCC 9.3: 1521.0 (SE +/- 2.06, N = 15; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dDOT

ViennaCL 1.7.1, GB/s (More Is Better)
AMD AOCC 3.0: 1165.00 (SE +/- 2.61, N = 12; -fopenmp=libomp)
Clang 11.0: 933.00 (SE +/- 1.49, N = 15; -fopenmp=libomp)
Clang 12.0: 819.00 (SE +/- 17.06, N = 12; -fopenmp=libomp)
GCC 10.3: 1056.42 (SE +/- 95.41, N = 12; -fopenmp)
GCC 11.0.1: 1153.00 (SE +/- 1.87, N = 15; -fopenmp)
GCC 9.3: 1133.00 (SE +/- 1.59, N = 15; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-N

ViennaCL 1.7.1, GB/s (More Is Better)
AMD AOCC 3.0: 55.2 (SE +/- 3.44, N = 12; -fopenmp=libomp)
Clang 11.0: 51.2 (SE +/- 3.65, N = 15; -fopenmp=libomp)
Clang 12.0: 69.1 (SE +/- 2.22, N = 12; -fopenmp=libomp)
GCC 10.3: 56.2 (SE +/- 5.30, N = 12; -fopenmp)
GCC 11.0.1: 63.9 (SE +/- 3.83, N = 15; -fopenmp)
GCC 9.3: 65.0 (SE +/- 4.17, N = 15; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMV-T

ViennaCL 1.7.1, GB/s (More Is Better)
AMD AOCC 3.0: 783.0 (SE +/- 1.94, N = 12; -fopenmp=libomp)
Clang 11.0: 677.0 (SE +/- 1.41, N = 14; -fopenmp=libomp)
Clang 12.0: 626.0 (SE +/- 4.04, N = 12; -fopenmp=libomp)
GCC 10.3: 741.4 (SE +/- 66.49, N = 12; -fopenmp)
GCC 11.0.1: 794.0 (SE +/- 2.88, N = 15; -fopenmp)
GCC 9.3: 798.0 (SE +/- 2.10, N = 14; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NN

ViennaCL 1.7.1, GFLOPs/s (More Is Better)
AMD AOCC 3.0: 84.0 (SE +/- 0.04, N = 12; -fopenmp=libomp)
Clang 11.0: 83.6 (SE +/- 0.06, N = 15; -fopenmp=libomp)
Clang 12.0: 48.6 (SE +/- 0.05, N = 12; -fopenmp=libomp)
GCC 10.3: 98.7 (SE +/- 1.05, N = 12; -fopenmp)
GCC 11.0.1: 100.5 (SE +/- 0.29, N = 15; -fopenmp)
GCC 9.3: 98.5 (SE +/- 0.16, N = 15; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-NT

ViennaCL 1.7.1, GFLOPs/s (More Is Better)
AMD AOCC 3.0: 78.8 (SE +/- 0.07, N = 12; -fopenmp=libomp)
Clang 11.0: 79.3 (SE +/- 0.03, N = 15; -fopenmp=libomp)
Clang 12.0: 65.7 (SE +/- 0.56, N = 12; -fopenmp=libomp)
GCC 10.3: 94.4 (SE +/- 0.59, N = 12; -fopenmp)
GCC 11.0.1: 95.0 (SE +/- 0.08, N = 15; -fopenmp)
GCC 9.3: 95.3 (SE +/- 0.07, N = 15; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TN

ViennaCL 1.7.1, GFLOPs/s (More Is Better)
AMD AOCC 3.0: 90.0 (-fopenmp=libomp)
Clang 11.0: 88.3 (-fopenmp=libomp)
Clang 12.0: 51.9 (-fopenmp=libomp)
GCC 10.3: 104.0 (-fopenmp)
GCC 11.0.1: 104.0 (-fopenmp)
GCC 9.3: 100.9 (-fopenmp)
Standard errors shown for five of the six results, in chart order: SE +/- 0.05, N = 12; SE +/- 0.02, N = 15; SE +/- 0.09, N = 12; SE +/- 0.62, N = 12; SE +/- 0.08, N = 15
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ViennaCL

Test: CPU BLAS - dGEMM-TT

ViennaCL 1.7.1, GFLOPs/s (More Is Better)
AMD AOCC 3.0: 84.4 (SE +/- 0.08, N = 12; -fopenmp=libomp)
Clang 11.0: 84.0 (SE +/- 0.02, N = 14; -fopenmp=libomp)
Clang 12.0: 73.0 (SE +/- 0.07, N = 12; -fopenmp=libomp)
GCC 10.3: 98.5 (SE +/- 0.60, N = 12; -fopenmp)
GCC 11.0.1: 99.3 (SE +/- 0.05, N = 15; -fopenmp)
GCC 9.3: 97.9 (SE +/- 0.05, N = 15; -fopenmp)
(CXX) g++ options: -O3 -march=native -rdynamic -lOpenCL

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (More Is Better)
AMD AOCC 3.0: 386 (SE +/- 2.50, N = 3; -fopenmp=libomp)
Clang 11.0: 346 (SE +/- 1.42, N = 3; -fopenmp=libomp)
Clang 12.0: 333 (SE +/- 4.15, N = 4; -fopenmp=libomp)
GCC 10.3: 351 (SE +/- 0.17, N = 3; -fopenmp)
GCC 9.3: 351 (SE +/- 0.50, N = 3; -fopenmp)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (More Is Better)
AMD AOCC 3.0: 459 (SE +/- 10.39, N = 12; -fopenmp=libomp)
Clang 11.0: 471 (SE +/- 5.55, N = 3; -fopenmp=libomp)
Clang 12.0: 498 (SE +/- 10.30, N = 12; -fopenmp=libomp)
GCC 10.3: 505 (SE +/- 0.87, N = 3; -fopenmp)
GCC 9.3: 495 (SE +/- 4.64, N = 12; -fopenmp)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (More Is Better)
AMD AOCC 3.0: 122 (SE +/- 0.50, N = 3; -fopenmp=libomp)
Clang 11.0: 108 (SE +/- 0.29, N = 3; -fopenmp=libomp)
Clang 12.0: 112 (SE +/- 0.50, N = 3; -fopenmp=libomp)
GCC 10.3: 115 (SE +/- 0.17, N = 3; -fopenmp)
GCC 9.3: 116 (SE +/- 0.44, N = 3; -fopenmp)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (More Is Better)
AMD AOCC 3.0: 11325 (SE +/- 171.77, N = 3; -fopenmp=libomp)
Clang 11.0: 9797 (SE +/- 102.76, N = 8; -fopenmp=libomp)
Clang 12.0: 9904 (SE +/- 88.25, N = 12; -fopenmp=libomp)
GCC 10.3: 10197 (SE +/- 7.52, N = 3; -fopenmp)
GCC 9.3: 9419 (SE +/- 138.76, N = 3; -fopenmp)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

ONNX Runtime 1.6, Inferences Per Minute (More Is Better)
AMD AOCC 3.0: 4383 (SE +/- 174.98, N = 12; -fopenmp=libomp)
Clang 11.0: 4523 (SE +/- 169.87, N = 9; -fopenmp=libomp)
Clang 12.0: 4456 (SE +/- 126.29, N = 12; -fopenmp=libomp)
GCC 10.3: 5559 (SE +/- 17.50, N = 3; -fopenmp)
GCC 9.3: 5183 (SE +/- 2.40, N = 3; -fopenmp)
(CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -ldl -lrt

GraphicsMagick

Operation: Swirl

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 1929 (SE +/- 4.63, N = 3)
Clang 11.0: 1915 (SE +/- 12.41, N = 3)
Clang 12.0: 1993 (SE +/- 6.57, N = 3)
GCC 10.3: 2112 (SE +/- 1.20, N = 3)
GCC 11.0.1: 2161 (SE +/- 4.81, N = 3)
GCC 9.3: 2129 (SE +/- 1.20, N = 3)
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 660
Clang 11.0: 665
Clang 12.0: 712
GCC 10.3: 689
GCC 11.0.1: 694
GCC 9.3: 709
Standard errors shown for four of the six results, in chart order: SE +/- 1.33, N = 3; SE +/- 2.60, N = 3; SE +/- 5.21, N = 3; SE +/- 6.43, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 617
Clang 11.0: 613
Clang 12.0: 614
GCC 10.3: 807
GCC 11.0.1: 809
GCC 9.3: 806
Standard errors shown for two of the six results, in chart order: SE +/- 0.58, N = 3; SE +/- 2.03, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 1057
Clang 11.0: 1068
Clang 12.0: 1076
GCC 10.3: 1039
GCC 11.0.1: 1082
GCC 9.3: 1217
Standard errors shown for five of the six results, in chart order: SE +/- 1.53, N = 3; SE +/- 1.86, N = 3; SE +/- 0.88, N = 3; SE +/- 0.33, N = 3; SE +/- 1.53, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 1866 (SE +/- 52.84, N = 15)
Clang 11.0: 2034 (SE +/- 27.29, N = 3)
Clang 12.0: 2136 (SE +/- 41.63, N = 12)
GCC 10.3: 1208 (SE +/- 14.93, N = 3)
GCC 11.0.1: 1188 (SE +/- 17.34, N = 3)
GCC 9.3: 1238 (SE +/- 18.77, N = 3)
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 466
Clang 11.0: 463
Clang 12.0: 457
GCC 10.3: 544
GCC 11.0.1: 550
GCC 9.3: 547
Standard errors shown for three of the six results, in chart order: SE +/- 0.33, N = 3; SE +/- 1.00, N = 3; SE +/- 1.00, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

GraphicsMagick 1.3.33, Iterations Per Minute (More Is Better)
AMD AOCC 3.0: 614
Clang 11.0: 616
Clang 12.0: 605
GCC 10.3: 772
GCC 11.0.1: 771
GCC 9.3: 785
Standard errors shown for four of the six results, in chart order: SE +/- 1.20, N = 3; SE +/- 0.88, N = 3; SE +/- 0.67, N = 3; SE +/- 0.67, N = 3
(CC) gcc options: -fopenmp -O3 -march=native -pthread -ljbig -lwebp -lwebpmux -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lxml2 -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0, Iterations/Sec (More Is Better)
AMD AOCC 3.0: 1720060.44 (SE +/- 3670.84, N = 3)
Clang 11.0: 1790837.01 (SE +/- 971.31, N = 3)
Clang 12.0: 1785466.28 (SE +/- 984.68, N = 3)
GCC 10.3: 2110880.43 (SE +/- 2170.85, N = 3)
GCC 11.0.1: 2176407.67 (SE +/- 5755.65, N = 3)
GCC 9.3: 2086609.98 (SE +/- 4791.32, N = 3)
(CC) gcc options: -O2 -O3 -march=native -lrt" -lrt

SecureMark

Benchmark: SecureMark-TLS

SecureMark 1.0.4, marks (More Is Better)
AMD AOCC 3.0: 264637 (SE +/- 251.99, N = 3)
Clang 11.0: 260119 (SE +/- 407.86, N = 3)
Clang 12.0: 265204 (SE +/- 1778.47, N = 3)
GCC 10.3: 242700 (SE +/- 1024.96, N = 3)
GCC 11.0.1: 243861 (SE +/- 675.55, N = 3)
GCC 9.3: 238935 (SE +/- 537.86, N = 3)
(CC) gcc options: -pedantic -O3

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression 1.9.3, MB/s (More Is Better)
AMD AOCC 3.0: 53.77 (SE +/- 0.48, N = 3)
Clang 11.0: 52.35 (SE +/- 0.33, N = 3)
Clang 12.0: 52.07 (SE +/- 0.80, N = 3)
Clang 12.0 LTO: 50.93 (SE +/- 0.02, N = 3)
GCC 10.3: 52.87 (SE +/- 0.01, N = 3)
GCC 11.0.1: 51.32 (SE +/- 0.73, N = 4)
GCC 9.3: 53.83 (SE +/- 0.77, N = 4)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression 1.9.3, MB/s (More Is Better)
AMD AOCC 3.0: 13562.5 (SE +/- 73.30, N = 3)
Clang 11.0: 13840.3 (SE +/- 15.91, N = 3)
Clang 12.0: 13911.5 (SE +/- 71.01, N = 3)
Clang 12.0 LTO: 13715.0 (SE +/- 60.82, N = 3)
GCC 10.3: 13906.1 (SE +/- 42.32, N = 3)
GCC 11.0.1: 13882.2 (SE +/- 34.44, N = 4)
GCC 9.3: 13793.4 (SE +/- 37.19, N = 4)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.3, MB/s (More Is Better)
AMD AOCC 3.0: 50.32 (SE +/- 0.26, N = 3)
Clang 11.0: 49.01 (SE +/- 0.46, N = 3)
Clang 12.0: 48.50 (SE +/- 0.42, N = 3)
Clang 12.0 LTO: 48.47 (SE +/- 0.74, N = 3)
GCC 10.3: 52.36 (SE +/- 0.72, N = 4)
GCC 11.0.1: 51.17 (SE +/- 0.65, N = 3)
GCC 9.3: 51.97 (SE +/- 0.65, N = 5)
(CC) gcc options: -O3

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.3, MB/s (More Is Better)
AMD AOCC 3.0: 13561.5 (SE +/- 33.89, N = 3)
Clang 11.0: 13927.9 (SE +/- 23.21, N = 3)
Clang 12.0: 13926.5 (SE +/- 65.90, N = 3)
Clang 12.0 LTO: 13698.7 (SE +/- 46.50, N = 3)
GCC 10.3: 13806.6 (SE +/- 6.60, N = 4)
GCC 11.0.1: 13857.4 (SE +/- 62.74, N = 3)
GCC 9.3: 13895.3 (SE +/- 17.75, N = 5)
(CC) gcc options: -O3

QuantLib

QuantLib 1.21, MFLOPS (More Is Better)
AMD AOCC 3.0: 2725.7 (SE +/- 2.28, N = 3)
Clang 11.0: 2640.2 (SE +/- 1.01, N = 3)
Clang 12.0: 2653.8 (SE +/- 1.92, N = 3)
Clang 12.0 LTO: 2657.8 (SE +/- 1.62, N = 3)
GCC 10.3: 2392.6 (SE +/- 2.06, N = 3)
GCC 9.3: 2338.9 (SE +/- 4.53, N = 3)
(CXX) g++ options: -O3 -march=native -rdynamic

FFTW

Build: Stock - Size: 1D FFT Size 32

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 13192 (SE +/- 41.35, N = 3)
Clang 11.0: 13324 (SE +/- 20.33, N = 3)
Clang 12.0: 13333 (SE +/- 24.25, N = 3)
GCC 10.3: 12576 (SE +/- 16.05, N = 3)
GCC 11.0.1: 12765 (SE +/- 45.16, N = 3)
GCC 9.3: 14399 (SE +/- 67.28, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 1024

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 10669 (SE +/- 34.64, N = 3)
Clang 11.0: 10564 (SE +/- 35.53, N = 3)
Clang 12.0: 10805 (SE +/- 27.10, N = 3)
GCC 10.3: 11319 (SE +/- 32.26, N = 3)
GCC 11.0.1: 11044 (SE +/- 189.35, N = 3)
GCC 9.3: 11689 (SE +/- 44.20, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 2048

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 10227.0 (SE +/- 39.89, N = 3)
Clang 11.0: 10004.2 (SE +/- 28.76, N = 3)
Clang 12.0: 10467.0 (SE +/- 7.75, N = 3)
GCC 10.3: 10711.0 (SE +/- 14.75, N = 3)
GCC 11.0.1: 10675.0 (SE +/- 55.19, N = 3)
GCC 9.3: 11053.0 (SE +/- 37.69, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 9603.2 (SE +/- 43.38, N = 3)
Clang 11.0: 9438.6 (SE +/- 15.16, N = 3)
Clang 12.0: 9862.0 (SE +/- 101.36, N = 3)
GCC 10.3: 10179.0 (SE +/- 57.26, N = 3)
GCC 11.0.1: 10205.0 (SE +/- 48.56, N = 3)
GCC 9.3: 10548.0 (SE +/- 20.21, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 1024

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 8902.1 (SE +/- 14.28, N = 3)
Clang 11.0: 8809.6 (SE +/- 45.95, N = 3)
Clang 12.0: 9088.3 (SE +/- 48.25, N = 3)
GCC 10.3: 9247.3 (SE +/- 41.68, N = 3)
GCC 11.0.1: 9238.4 (SE +/- 25.87, N = 3)
GCC 9.3: 9798.6 (SE +/- 19.46, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 7784.8 (SE +/- 19.99, N = 3)
Clang 11.0: 7878.5 (SE +/- 27.38, N = 3)
Clang 12.0: 7789.9 (SE +/- 65.76, N = 3)
GCC 10.3: 8134.5 (SE +/- 56.49, N = 3)
GCC 11.0.1: 8231.1 (SE +/- 36.00, N = 3)
GCC 9.3: 8408.5 (SE +/- 50.36, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 6875.3 (SE +/- 65.81, N = 3)
Clang 11.0: 6823.8 (SE +/- 60.67, N = 3)
Clang 12.0: 6744.1 (SE +/- 35.20, N = 3)
GCC 10.3: 6974.0 (SE +/- 25.90, N = 3)
GCC 11.0.1: 6948.2 (SE +/- 23.67, N = 3)
GCC 9.3: 7007.3 (SE +/- 30.40, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 32

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 16146 (SE +/- 5.33, N = 3)
Clang 11.0: 14590 (SE +/- 129.55, N = 3)
Clang 12.0: 15649 (SE +/- 48.79, N = 3)
GCC 10.3: 16650 (SE +/- 108.41, N = 3)
GCC 11.0.1: 16590 (SE +/- 168.99, N = 3)
GCC 9.3: 16590 (SE +/- 170.19, N = 8)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 1024

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 49685 (SE +/- 621.84, N = 15)
Clang 11.0: 50740 (SE +/- 585.78, N = 3)
Clang 12.0: 50350 (SE +/- 952.64, N = 12)
GCC 10.3: 52054 (SE +/- 439.64, N = 15)
GCC 11.0.1: 51706 (SE +/- 568.96, N = 3)
GCC 9.3: 53275 (SE +/- 788.42, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 2048

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 44412 (SE +/- 756.91, N = 3)
Clang 11.0: 50084 (SE +/- 582.34, N = 3)
Clang 12.0: 51254 (SE +/- 439.50, N = 3)
GCC 10.3: 53497 (SE +/- 743.81, N = 3)
GCC 11.0.1: 54710 (SE +/- 156.75, N = 3)
GCC 9.3: 52749 (SE +/- 725.00, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 1D FFT Size 4096

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 45521 (SE +/- 542.47, N = 15)
Clang 11.0: 46676 (SE +/- 413.24, N = 15)
Clang 12.0: 45428 (SE +/- 671.66, N = 15)
GCC 10.3: 52130 (SE +/- 844.19, N = 3)
GCC 11.0.1: 51391 (SE +/- 227.13, N = 3)
GCC 9.3: 52099 (SE +/- 228.68, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 1024

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 36100 (SE +/- 455.21, N = 12)
Clang 11.0: 36181 (SE +/- 530.09, N = 4)
Clang 12.0: 36239 (SE +/- 165.99, N = 3)
GCC 10.3: 35973 (SE +/- 301.69, N = 3)
GCC 11.0.1: 35718 (SE +/- 442.82, N = 3)
GCC 9.3: 36321 (SE +/- 79.87, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 2048

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 31013 (SE +/- 378.89, N = 6)
Clang 11.0: 31741 (SE +/- 146.10, N = 3)
Clang 12.0: 31935 (SE +/- 77.17, N = 3)
GCC 10.3: 32061 (SE +/- 14.99, N = 3)
GCC 11.0.1: 31662 (SE +/- 209.56, N = 3)
GCC 9.3: 31341 (SE +/- 37.37, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

FFTW

Build: Float + SSE - Size: 2D FFT Size 4096

FFTW 3.3.6, Mflops (More Is Better)
AMD AOCC 3.0: 23111 (SE +/- 349.17, N = 9)
Clang 11.0: 22913 (SE +/- 220.77, N = 3)
Clang 12.0: 22797 (SE +/- 348.10, N = 9)
GCC 10.3: 23774 (SE +/- 538.47, N = 9)
GCC 11.0.1: 24888 (SE +/- 160.97, N = 3)
GCC 9.3: 25068 (SE +/- 106.49, N = 3)
(CC) gcc options: -pthread -O3 -march=native -lm

SciMark

Computational Test: Composite

SciMark 2.0, Mflops (More Is Better)
AMD AOCC 3.0: 3298.29 (SE +/- 1.29, N = 3)
Clang 11.0: 3319.34 (SE +/- 15.12, N = 3)
Clang 12.0: 3190.62 (SE +/- 1.11, N = 3)
GCC 10.3: 3235.94 (SE +/- 6.50, N = 3)
GCC 11.0.1: 3182.35 (SE +/- 5.19, N = 3)
GCC 9.3: 3229.22 (SE +/- 5.86, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Monte Carlo

SciMark 2.0, Mflops (More Is Better)
AMD AOCC 3.0: 690.94 (SE +/- 0.18, N = 3)
Clang 11.0: 674.86 (SE +/- 0.40, N = 3)
Clang 12.0: 675.13 (SE +/- 0.40, N = 3)
GCC 10.3: 682.87 (SE +/- 1.71, N = 3)
GCC 11.0.1: 647.82 (SE +/- 0.29, N = 3)
GCC 9.3: 668.10 (SE +/- 0.14, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Fast Fourier Transform

SciMark 2.0, Mflops (More Is Better)
AMD AOCC 3.0: 398.96 (SE +/- 0.70, N = 3)
Clang 11.0: 399.16 (SE +/- 0.67, N = 3)
Clang 12.0: 363.85 (SE +/- 0.46, N = 3)
GCC 10.3: 388.98 (SE +/- 0.25, N = 3)
GCC 11.0.1: 388.88 (SE +/- 1.03, N = 3)
GCC 9.3: 384.03 (SE +/- 0.66, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Sparse Matrix Multiply

SciMark 2.0, Mflops (More Is Better)
AMD AOCC 3.0: 4594.27 (SE +/- 5.98, N = 3)
Clang 11.0: 4590.37 (SE +/- 3.87, N = 3)
Clang 12.0: 4280.22 (SE +/- 10.41, N = 3)
GCC 10.3: 3820.77 (SE +/- 0.86, N = 3)
GCC 11.0.1: 3462.66 (SE +/- 0.39, N = 3)
GCC 9.3: 3765.88 (SE +/- 1.69, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Dense LU Matrix Factorization

SciMark 2.0, Mflops (More Is Better)
AMD AOCC 3.0: 9021.83 (SE +/- 0.22, N = 3)
Clang 11.0: 9146.88 (SE +/- 77.81, N = 3)
Clang 12.0: 8848.40 (SE +/- 7.16, N = 3)
GCC 10.3: 9248.89 (SE +/- 33.93, N = 3)
GCC 11.0.1: 9263.55 (SE +/- 25.06, N = 3)
GCC 9.3: 9178.97 (SE +/- 28.39, N = 3)
(CC) gcc options: -O3 -march=native -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

SciMark 2.0, Mflops (More Is Better)
AMD AOCC 3.0: 1785.45 (SE +/- 0.02, N = 3)
Clang 11.0: 1785.42 (SE +/- 0.12, N = 3)
Clang 12.0: 1785.50 (SE +/- 0.08, N = 3)
GCC 10.3: 2038.15 (SE +/- 0.07, N = 3)
GCC 11.0.1: 2148.84 (SE +/- 0.10, N = 3)
GCC 9.3: 2149.15 (SE +/- 0.12, N = 3)
(CC) gcc options: -O3 -march=native -lm

Botan

Test: KASUMI

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 82.83 (SE +/- 0.02, N = 3)
Clang 11.0: 79.15 (SE +/- 0.06, N = 3)
Clang 12.0: 82.64 (SE +/- 0.01, N = 3)
GCC 10.3: 79.12 (SE +/- 0.01, N = 3)
GCC 9.3: 84.86 (SE +/- 0.02, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: KASUMI - Decrypt

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 82.95 (SE +/- 0.02, N = 3)
Clang 11.0: 80.22 (SE +/- 0.04, N = 3)
Clang 12.0: 84.23 (SE +/- 0.06, N = 3)
GCC 10.3: 81.45 (SE +/- 0.01, N = 3)
GCC 9.3: 84.13 (SE +/- 0.01, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 4891.07 (SE +/- 0.05, N = 3)
Clang 11.0: 4901.13 (SE +/- 2.16, N = 3)
Clang 12.0: 4659.34 (SE +/- 2.14, N = 3)
GCC 10.3: 5525.71 (SE +/- 4.47, N = 3)
GCC 9.3: 5484.68 (SE +/- 42.69, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: AES-256 - Decrypt

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 4887.57 (SE +/- 3.70, N = 3)
Clang 11.0: 4895.56 (SE +/- 1.35, N = 3)
Clang 12.0: 4682.46 (SE +/- 4.78, N = 3)
GCC 10.3: 5529.40 (SE +/- 5.42, N = 3)
GCC 9.3: 5391.99 (SE +/- 11.31, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 305.00 (SE +/- 0.03, N = 3)
Clang 11.0: 299.21 (SE +/- 0.09, N = 3)
Clang 12.0: 315.41 (SE +/- 0.13, N = 3)
GCC 10.3: 341.85 (SE +/- 0.52, N = 3)
GCC 9.3: 337.36 (SE +/- 0.04, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Twofish - Decrypt

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 303.81 (SE +/- 0.06, N = 3)
Clang 11.0: 302.41 (SE +/- 0.15, N = 3)
Clang 12.0: 321.19 (SE +/- 0.16, N = 3)
GCC 10.3: 325.39 (SE +/- 0.44, N = 3)
GCC 9.3: 339.07 (SE +/- 0.04, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 319.79 (SE +/- 1.14, N = 3)
Clang 11.0: 319.23 (SE +/- 1.73, N = 3)
Clang 12.0: 380.05 (SE +/- 0.05, N = 3)
GCC 10.3: 422.14 (SE +/- 0.11, N = 3)
GCC 9.3: 412.85 (SE +/- 0.09, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: Blowfish - Decrypt

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 355.06 (SE +/- 1.17, N = 3)
Clang 11.0: 351.08 (SE +/- 2.03, N = 3)
Clang 12.0: 351.28 (SE +/- 0.04, N = 3)
GCC 10.3: 420.85 (SE +/- 0.95, N = 3)
GCC 9.3: 412.07 (SE +/- 0.12, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 127.77 (SE +/- 0.06, N = 3)
Clang 11.0: 128.59 (SE +/- 0.02, N = 3)
Clang 12.0: 132.82 (SE +/- 0.02, N = 3)
GCC 10.3: 127.74 (SE +/- 0.33, N = 3)
GCC 9.3: 127.30 (SE +/- 0.09, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: CAST-256 - Decrypt

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 128.01 (SE +/- 0.08, N = 3)
Clang 11.0: 127.74 (SE +/- 0.01, N = 3)
Clang 12.0: 133.05 (SE +/- 0.01, N = 3)
GCC 10.3: 127.78 (SE +/- 0.32, N = 3)
GCC 9.3: 127.34 (SE +/- 0.08, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 845.14 (SE +/- 3.15, N = 3)
Clang 11.0: 848.24 (SE +/- 0.62, N = 3)
Clang 12.0: 850.50 (SE +/- 4.85, N = 3)
GCC 10.3: 485.02 (SE +/- 0.28, N = 3)
GCC 9.3: 616.10 (SE +/- 0.13, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

Botan

Test: ChaCha20Poly1305 - Decrypt

Botan 2.17.3, MiB/s (More Is Better)
AMD AOCC 3.0: 838.09 (SE +/- 3.17, N = 3)
Clang 11.0: 840.64 (SE +/- 0.16, N = 3)
Clang 12.0: 843.40 (SE +/- 4.64, N = 3)
GCC 10.3: 476.18 (SE +/- 0.02, N = 3)
GCC 9.3: 611.98 (SE +/- 0.40, N = 3)
(CXX) g++ options: -fstack-protector -m64 -pthread -lbotan-2 -ldl -lrt

JPEG XL

Input: PNG - Encode Speed: 5

JPEG XL 0.3.3, MP/s (More Is Better)
AMD AOCC 3.0: 79.23 (SE +/- 0.41, N = 3)
Clang 11.0: 78.41 (SE +/- 0.24, N = 3)
Clang 12.0: 74.27 (SE +/- 0.17, N = 3)
(CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 7

JPEG XL 0.3.3, MP/s (More Is Better)
AMD AOCC 3.0: 11.37 (SE +/- 0.08, N = 3)
Clang 11.0: 12.01 (SE +/- 0.02, N = 3)
Clang 12.0: 12.15 (SE +/- 0.05, N = 3)
(CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: PNG - Encode Speed: 8

JPEG XL 0.3.3, MP/s (More Is Better)
AMD AOCC 3.0: 0.81 (SE +/- 0.00, N = 3)
Clang 11.0: 0.80 (SE +/- 0.00, N = 3)
Clang 12.0: 0.82 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 5

JPEG XL 0.3.3, MP/s (More Is Better)
AMD AOCC 3.0: 65.57 (SE +/- 0.17, N = 3)
Clang 11.0: 65.58 (SE +/- 0.20, N = 3)
Clang 12.0: 66.66 (SE +/- 0.14, N = 3)
(CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 7

JPEG XL 0.3.3, MP/s (More Is Better)
AMD AOCC 3.0: 65.68 (SE +/- 0.05, N = 3)
Clang 11.0: 65.43 (SE +/- 0.08, N = 3)
Clang 12.0: 66.38 (SE +/- 0.16, N = 3)
(CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

JPEG XL

Input: JPEG - Encode Speed: 8

JPEG XL 0.3.3, MP/s (More Is Better)
AMD AOCC 3.0: 27.29 (SE +/- 0.06, N = 3)
Clang 11.0: 27.24 (SE +/- 0.01, N = 3)
Clang 12.0: 28.13 (SE +/- 0.03, N = 3)
(CXX) g++ options: -O3 -march=native -funwind-tables -Xclang -mrelax-all -O2 -fPIE -pie -pthread -ldl

LibRaw

Post-Processing Benchmark

LibRaw 0.20, Mpix/sec (More Is Better)
AMD AOCC 3.0: 41.64 (SE +/- 0.04, N = 3)
Clang 11.0: 38.71 (SE +/- 0.33, N = 3)
Clang 12.0: 41.78 (SE +/- 0.12, N = 3)
GCC 10.3: 58.90 (SE +/- 0.23, N = 3)
GCC 11.0.1: 57.24 (SE +/- 0.16, N = 3)
GCC 9.3: 60.20 (SE +/- 0.19, N = 3)
(CXX) g++ options: -O3 -march=native -fopenmp -ljpeg -lz -lm

Etcpak

Configuration: DXT1

Etcpak 0.7, Mpx/s (More Is Better)
AMD AOCC 3.0: 2654.72 (SE +/- 8.09, N = 3)
Clang 11.0: 1872.76 (SE +/- 1.69, N = 3)
Clang 12.0: 2718.53 (SE +/- 2.64, N = 3)
Clang 12.0 LTO: 2719.99 (SE +/- 6.09, N = 3)
GCC 10.3: 1114.60 (SE +/- 0.48, N = 3)
GCC 9.3: 1082.37 (SE +/- 0.16, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC1

Etcpak 0.7, Mpx/s (More Is Better)
AMD AOCC 3.0: 211.73 (SE +/- 0.01, N = 3)
Clang 11.0: 205.07 (SE +/- 0.03, N = 3)
Clang 12.0: 284.64 (SE +/- 0.11, N = 3)
Clang 12.0 LTO: 284.76 (SE +/- 0.06, N = 3)
GCC 10.3: 281.15 (SE +/- 0.03, N = 3)
GCC 9.3: 269.67 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Etcpak

Configuration: ETC2

Etcpak 0.7, Mpx/s (More Is Better)
AMD AOCC 3.0: 178.85 (SE +/- 0.03, N = 3)
Clang 11.0: 168.82 (SE +/- 0.02, N = 3)
Clang 12.0: 202.09 (SE +/- 0.02, N = 3)
Clang 12.0 LTO: 202.10 (SE +/- 0.04, N = 3)
GCC 10.3: 173.23 (SE +/- 0.04, N = 3)
GCC 9.3: 174.81 (SE +/- 0.09, N = 3)
(CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

TSCP

AI Chess Performance

TSCP 1.81, Nodes Per Second (More Is Better)
AMD AOCC 3.0: 1697846 (SE +/- 2098.00, N = 5)
Clang 11.0: 1638265 (SE +/- 2852.59, N = 5)
Clang 12.0: 1570966 (SE +/- 1798.40, N = 5)
GCC 10.3: 1467179 (SE +/- 956.77, N = 5)
GCC 11.0.1: 1494250 (SE +/- 1626.80, N = 5)
GCC 9.3: 1446372 (SE +/- 760.80, N = 5)
(CC) gcc options: -O3 -march=native

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31, samples/s (More Is Better)
AMD AOCC 3.0: 57411333 (SE +/- 47026.00, N = 3)
Clang 11.0: 56307000 (SE +/- 40360.87, N = 3)
Clang 12.0: 55663000 (SE +/- 790005.27, N = 3)
GCC 10.3: 62467333 (SE +/- 6887.99, N = 3)
GCC 11.0.1: 60886333 (SE +/- 318169.94, N = 3)
GCC 9.3: 61404000 (SE +/- 870702.21, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31, samples/s (More Is Better)
AMD AOCC 3.0: 1609633333 (SE +/- 2130988.29, N = 3)
Clang 11.0: 1578400000 (SE +/- 1331665.62, N = 3)
Clang 12.0: 1564833333 (SE +/- 2255610.29, N = 3)
GCC 10.3: 1718000000 (SE +/- 15763988.50, N = 3)
GCC 11.0.1: 1679800000 (SE +/- 17297784.06, N = 3)
GCC 9.3: 1721900000 (SE +/- 4864497.23, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31, samples/s (More Is Better)
AMD AOCC 3.0: 3100400000 (SE +/- 1234233.91, N = 3)
Clang 11.0: 3051366667 (SE +/- 2452436.43, N = 3)
Clang 12.0: 3070633333 (SE +/- 6045475.81, N = 3)
GCC 10.3: 2942866667 (SE +/- 4643753.27, N = 3)
GCC 11.0.1: 2989400000 (SE +/- 1154700.54, N = 3)
GCC 9.3: 2940466667 (SE +/- 2961043.36, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 128 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31, samples/s (More Is Better)
AMD AOCC 3.0: 3606466667 (SE +/- 1543084.93, N = 3)
Clang 11.0: 3596533333 (SE +/- 1559202.08, N = 3)
Clang 12.0: 3643766667 (SE +/- 883804.91, N = 3)
GCC 10.3: 3005033333 (SE +/- 1679616.36, N = 3)
GCC 11.0.1: 3055766667 (SE +/- 6016181.88, N = 3)
GCC 9.3: 3012066667 (SE +/- 3384441.53, N = 3)
(CC) gcc options: -O3 -march=native -pthread -lm -lc -lliquid

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only

PostgreSQL pgbench 13.0, TPS (More Is Better)
Clang 11.0: 24943 (SE +/- 289.16, N = 3)
Clang 12.0: 24310 (SE +/- 303.43, N = 3)
GCC 10.3: 24845 (SE +/- 118.05, N = 3)
GCC 11.0.1: 23661 (SE +/- 281.76, N = 15)
GCC 9.3: 23895 (SE +/- 41.57, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write

PostgreSQL pgbench 13.0, TPS (More Is Better)
Clang 11.0: 3312 (SE +/- 14.62, N = 3)
Clang 12.0: 3281 (SE +/- 3.48, N = 3)
GCC 10.3: 3369 (SE +/- 11.40, N = 3)
GCC 11.0.1: 3383 (SE +/- 28.00, N = 3)
GCC 9.3: 3298 (SE +/- 4.79, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - TPS, More Is Better
Clang 11.0: 1069367 (SE +/- 1740.88, N = 3)
Clang 12.0: 1069022 (SE +/- 720.87, N = 3)
GCC 10.3: 1076357 (SE +/- 183.22, N = 3)
GCC 11.0.1: 1090824 (SE +/- 1514.63, N = 3)
GCC 9.3: 1057125 (SE +/- 1623.23, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - TPS, More Is Better
Clang 11.0: 1065506 (SE +/- 13844.42, N = 3)
Clang 12.0: 1071209 (SE +/- 6289.60, N = 3)
GCC 10.3: 1089731 (SE +/- 8859.63, N = 3)
GCC 11.0.1: 1090160 (SE +/- 8885.95, N = 3)
GCC 9.3: 1067486 (SE +/- 8843.08, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - TPS, More Is Better
Clang 11.0: 61616 (SE +/- 400.92, N = 3)
Clang 12.0: 62319 (SE +/- 162.92, N = 3)
GCC 10.3: 58894 (SE +/- 469.64, N = 3)
GCC 11.0.1: 56369 (SE +/- 899.82, N = 3)
GCC 9.3: 59364 (SE +/- 994.44, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - TPS, More Is Better
Clang 11.0: 54488 (SE +/- 883.12, N = 3)
Clang 12.0: 56684 (SE +/- 702.52, N = 15)
GCC 10.3: 53019 (SE +/- 591.89, N = 7)
GCC 11.0.1: 53102 (SE +/- 211.73, N = 3)
GCC 9.3: 53825 (SE +/- 396.40, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

WebP Image Encode

Encode Settings: Default

OpenBenchmarking.org - WebP Image Encode 1.1 - Encode Time - Seconds, Fewer Is Better
AMD AOCC 3.0: 1.351 (SE +/- 0.002, N = 3)
Clang 11.0: 1.336 (SE +/- 0.001, N = 3)
Clang 12.0: 1.331 (SE +/- 0.001, N = 3)
GCC 10.3: 1.372 (SE +/- 0.002, N = 3)
GCC 11.0.1: 1.386 (SE +/- 0.000, N = 3)
GCC 9.3: 1.397 (SE +/- 0.000, N = 3)
Extra option on 1 of the 6 results (attribution lost in extraction): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

WebP Image Encode

Encode Settings: Quality 100

OpenBenchmarking.org - WebP Image Encode 1.1 - Encode Time - Seconds, Fewer Is Better
AMD AOCC 3.0: 2.262 (SE +/- 0.001, N = 3)
Clang 11.0: 2.240 (SE +/- 0.000, N = 3)
Clang 12.0: 2.199 (SE +/- 0.001, N = 3)
GCC 10.3: 2.225 (SE +/- 0.010, N = 3)
GCC 11.0.1: 2.274 (SE +/- 0.007, N = 3)
GCC 9.3: 2.273 (SE +/- 0.005, N = 3)
Extra option on 1 of the 6 results (attribution lost in extraction): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

WebP Image Encode

Encode Settings: Quality 100, Lossless

OpenBenchmarking.org - WebP Image Encode 1.1 - Encode Time - Seconds, Fewer Is Better
AMD AOCC 3.0: 19.13 (SE +/- 0.03, N = 3)
Clang 11.0: 18.57 (SE +/- 0.13, N = 3)
Clang 12.0: 19.02 (SE +/- 0.02, N = 3)
GCC 10.3: 18.88 (SE +/- 0.05, N = 3)
GCC 11.0.1: 18.31 (SE +/- 0.04, N = 3)
GCC 9.3: 19.30 (SE +/- 0.13, N = 3)
Extra option on 1 of the 6 results (attribution lost in extraction): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

WebP Image Encode

Encode Settings: Quality 100, Highest Compression

OpenBenchmarking.org - WebP Image Encode 1.1 - Encode Time - Seconds, Fewer Is Better
AMD AOCC 3.0: 6.578 (SE +/- 0.009, N = 3)
Clang 11.0: 6.243 (SE +/- 0.018, N = 3)
Clang 12.0: 6.309 (SE +/- 0.004, N = 3)
GCC 10.3: 7.078 (SE +/- 0.006, N = 3)
GCC 11.0.1: 7.003 (SE +/- 0.021, N = 3)
GCC 9.3: 7.053 (SE +/- 0.009, N = 3)
Extra option on 1 of the 6 results (attribution lost in extraction): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

WebP Image Encode

Encode Settings: Quality 100, Lossless, Highest Compression

OpenBenchmarking.org - WebP Image Encode 1.1 - Encode Time - Seconds, Fewer Is Better
AMD AOCC 3.0: 38.34 (SE +/- 0.05, N = 3)
Clang 11.0: 37.73 (SE +/- 0.08, N = 3)
Clang 12.0: 38.45 (SE +/- 0.07, N = 3)
GCC 10.3: 38.55 (SE +/- 0.06, N = 3)
GCC 11.0.1: 37.95 (SE +/- 0.07, N = 3)
GCC 9.3: 39.07 (SE +/- 0.03, N = 3)
Extra option on 1 of the 6 results (attribution lost in extraction): -ltiff
1. (CC) gcc options: -fvisibility=hidden -O3 -march=native -pthread -lm -ljpeg -lpng16

toyBrot Fractal Generator

Implementation: TBB

OpenBenchmarking.org - toyBrot Fractal Generator 2020-11-18 - ms, Fewer Is Better
AMD AOCC 3.0: 6945 (SE +/- 52.54, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 11.0: 6247 (SE +/- 67.11, N = 7; -lm -lgcc -lgcc_s -lc)
Clang 12.0: 6780 (SE +/- 87.21, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 12.0 LTO: 7085 (SE +/- 86.43, N = 3; -flto)
GCC 10.3: 5181 (SE +/- 67.68, N = 3; -lm -lgcc -lgcc_s -lc)
GCC 9.3: 5107 (SE +/- 74.84, N = 3; -lm -lgcc -lgcc_s -lc)
1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: OpenMP

OpenBenchmarking.org - toyBrot Fractal Generator 2020-11-18 - ms, Fewer Is Better
AMD AOCC 3.0: 7477 (SE +/- 22.73, N = 3)
Clang 11.0: 7029 (SE +/- 20.42, N = 3)
Clang 12.0: 7507 (SE +/- 14.89, N = 3)
GCC 10.3: 5524 (SE +/- 2.60, N = 3)
GCC 9.3: 5451 (SE +/- 3.18, N = 3)
1. (CXX) g++ options: -O3 -march=native -lpthread -lm -lgcc -lgcc_s -lc

toyBrot Fractal Generator

Implementation: C++ Tasks

OpenBenchmarking.org - toyBrot Fractal Generator 2020-11-18 - ms, Fewer Is Better
AMD AOCC 3.0: 7189 (SE +/- 41.46, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 11.0: 6836 (SE +/- 7.31, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 12.0: 7437 (SE +/- 33.67, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 12.0 LTO: 7367 (SE +/- 17.21, N = 3; -flto)
GCC 10.3: 5610 (SE +/- 31.52, N = 3; -lm -lgcc -lgcc_s -lc)
GCC 9.3: 5414 (SE +/- 49.08, N = 3; -lm -lgcc -lgcc_s -lc)
1. (CXX) g++ options: -O3 -march=native -lpthread

toyBrot Fractal Generator

Implementation: C++ Threads

OpenBenchmarking.org - toyBrot Fractal Generator 2020-11-18 - ms, Fewer Is Better
AMD AOCC 3.0: 7144 (SE +/- 24.26, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 11.0: 6395 (SE +/- 25.04, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 12.0: 7220 (SE +/- 30.90, N = 3; -lm -lgcc -lgcc_s -lc)
Clang 12.0 LTO: 7143 (SE +/- 15.06, N = 3; -flto)
GCC 10.3: 5383 (SE +/- 6.12, N = 3; -lm -lgcc -lgcc_s -lc)
GCC 9.3: 5142 (SE +/- 8.33, N = 3; -lm -lgcc -lgcc_s -lc)
1. (CXX) g++ options: -O3 -march=native -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1.03899 (SE +/- 0.00160, N = 3; MIN: 0.99; -fopenmp=libomp)
Clang 11.0: 1.08011 (SE +/- 0.00127, N = 3; MIN: 1.03; -fopenmp=libomp)
Clang 12.0: 1.07701 (SE +/- 0.00199, N = 3; MIN: 1.04; -fopenmp=libomp)
GCC 10.3: 1.17894 (SE +/- 0.00296, N = 3; MIN: 1.12; -fopenmp)
GCC 9.3: 1.17486 (SE +/- 0.00349, N = 3; MIN: 1.12; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 3.41583 (SE +/- 0.02018, N = 3; MIN: 3.24; -fopenmp=libomp)
Clang 11.0: 3.52787 (SE +/- 0.04735, N = 3; MIN: 3.29; -fopenmp=libomp)
Clang 12.0: 3.28507 (SE +/- 0.01639, N = 3; MIN: 3.15; -fopenmp=libomp)
GCC 10.3: 3.61144 (SE +/- 0.02637, N = 3; MIN: 3.37; -fopenmp)
GCC 9.3: 3.67278 (SE +/- 0.03246, N = 3; MIN: 3.39; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1.04484 (SE +/- 0.00668, N = 3; MIN: 0.83; -fopenmp=libomp)
Clang 11.0: 1.07577 (SE +/- 0.00395, N = 3; MIN: 0.86; -fopenmp=libomp)
Clang 12.0: 1.07507 (SE +/- 0.00286, N = 3; MIN: 0.87; -fopenmp=libomp)
GCC 10.3: 1.19747 (SE +/- 0.00438, N = 3; MIN: 0.98; -fopenmp)
GCC 9.3: 1.17434 (SE +/- 0.00597, N = 3; MIN: 0.96; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 0.554231 (SE +/- 0.000764, N = 3; MIN: 0.5; -fopenmp=libomp)
Clang 11.0: 0.594729 (SE +/- 0.008914, N = 3; MIN: 0.53; -fopenmp=libomp)
Clang 12.0: 0.710124 (SE +/- 0.011383, N = 3; MIN: 0.64; -fopenmp=libomp)
GCC 10.3: 0.646252 (SE +/- 0.003112, N = 3; MIN: 0.6; -fopenmp)
GCC 9.3: 0.654010 (SE +/- 0.003317, N = 3; MIN: 0.59; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 0.833921 (SE +/- 0.000645, N = 3; MIN: 0.81; -fopenmp=libomp)
Clang 11.0: 0.841169 (SE +/- 0.000480, N = 3; MIN: 0.82; -fopenmp=libomp)
Clang 12.0: 1.221320 (SE +/- 0.018279, N = 4; MIN: 1.13; -fopenmp=libomp)
GCC 10.3: 0.870784 (SE +/- 0.001247, N = 3; MIN: 0.84; -fopenmp)
GCC 9.3: 0.869308 (SE +/- 0.001032, N = 3; MIN: 0.84; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1.37059 (SE +/- 0.00485, N = 3; MIN: 1.28; -fopenmp=libomp)
Clang 11.0: 1.45757 (SE +/- 0.00568, N = 3; MIN: 1.35; -fopenmp=libomp)
Clang 12.0: 1.44425 (SE +/- 0.00123, N = 3; MIN: 1.34; -fopenmp=libomp)
GCC 10.3: 7.23686 (SE +/- 0.03687, N = 3; MIN: 6.18; -fopenmp)
GCC 9.3: 7.19213 (SE +/- 0.02683, N = 3; MIN: 6.14; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl
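
To put the spread in this result in perspective, the short Python sketch below divides each compiler's mean time by the fastest one. The times are copied from the Deconvolution Batch shapes_1d f32 result above; the variable names are illustrative only and are not part of the Phoronix Test Suite.

```python
# Mean times in ms (fewer is better), copied from the oneDNN
# "Deconvolution Batch shapes_1d - Data Type: f32" result.
times_ms = {
    "AMD AOCC 3.0": 1.37059,
    "Clang 11.0": 1.45757,
    "Clang 12.0": 1.44425,
    "GCC 10.3": 7.23686,
    "GCC 9.3": 7.19213,
}

# The fastest result is the one with the smallest mean time.
fastest = min(times_ms, key=times_ms.get)

# Ratio of each compiler's time to the fastest time (1.0 = fastest).
slowdown = {name: t / times_ms[fastest] for name, t in times_ms.items()}

for name, ratio in sorted(slowdown.items(), key=lambda kv: kv[1]):
    print(f"{name:<13} {ratio:5.2f}x the time of {fastest}")
```

On these numbers the GCC builds take roughly five times as long as the LLVM-based builds for this harness.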

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 2.28755 (SE +/- 0.00564, N = 3; MIN: 1.91; -fopenmp=libomp)
Clang 11.0: 2.31859 (SE +/- 0.02389, N = 3; MIN: 1.92; -fopenmp=libomp)
Clang 12.0: 2.36797 (SE +/- 0.02100, N = 3; MIN: 2.01; -fopenmp=libomp)
GCC 10.3: 3.00341 (SE +/- 0.00883, N = 3; MIN: 2.35; -fopenmp)
GCC 9.3: 2.99759 (SE +/- 0.00845, N = 3; MIN: 2.24; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1.59597 (SE +/- 0.00195, N = 3; MIN: 1.54; -fopenmp=libomp)
Clang 11.0: 1.60540 (SE +/- 0.00118, N = 3; MIN: 1.55; -fopenmp=libomp)
Clang 12.0: 2.03606 (SE +/- 0.01922, N = 12; MIN: 1.81; -fopenmp=libomp)
GCC 10.3: 1.64268 (SE +/- 0.00384, N = 3; MIN: 1.58; -fopenmp)
GCC 9.3: 1.66260 (SE +/- 0.01150, N = 3; MIN: 1.59; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 0.459724 (SE +/- 0.000365, N = 3; MIN: 0.44; -fopenmp=libomp)
Clang 11.0: 0.489278 (SE +/- 0.001652, N = 3; MIN: 0.46; -fopenmp=libomp)
Clang 12.0: 0.491940 (SE +/- 0.002843, N = 3; MIN: 0.47; -fopenmp=libomp)
GCC 10.3: 0.602155 (SE +/- 0.001964, N = 3; MIN: 0.57; -fopenmp)
GCC 9.3: 0.599140 (SE +/- 0.001469, N = 3; MIN: 0.56; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 0.773233 (SE +/- 0.001713, N = 3; MIN: 0.72; -fopenmp=libomp)
Clang 11.0: 0.779101 (SE +/- 0.001200, N = 3; MIN: 0.73; -fopenmp=libomp)
Clang 12.0: 0.779776 (SE +/- 0.004246, N = 3; MIN: 0.73; -fopenmp=libomp)
GCC 10.3: 0.782476 (SE +/- 0.002532, N = 3; MIN: 0.73; -fopenmp)
GCC 9.3: 0.786762 (SE +/- 0.002405, N = 3; MIN: 0.75; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1267.18 (SE +/- 5.94, N = 3; MIN: 1248.35; -fopenmp=libomp)
Clang 11.0: 1276.04 (SE +/- 9.46, N = 3; MIN: 1249.65; -fopenmp=libomp)
Clang 12.0: 1302.70 (SE +/- 3.92, N = 3; MIN: 1289.86; -fopenmp=libomp)
GCC 10.3: 1382.41 (SE +/- 3.72, N = 3; MIN: 1360.58; -fopenmp)
GCC 9.3: 1357.29 (SE +/- 4.44, N = 3; MIN: 1335.63; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 544.10 (SE +/- 0.53, N = 3; MIN: 532.32; -fopenmp=libomp)
Clang 11.0: 563.20 (SE +/- 0.83, N = 3; MIN: 550.23; -fopenmp=libomp)
Clang 12.0: 593.97 (SE +/- 9.50, N = 3; MIN: 570.44; -fopenmp=libomp)
GCC 10.3: 659.27 (SE +/- 0.61, N = 3; MIN: 642.67; -fopenmp)
GCC 9.3: 658.66 (SE +/- 0.64, N = 3; MIN: 639.86; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1259.59 (SE +/- 1.97, N = 3; MIN: 1247.29; -fopenmp=libomp)
Clang 11.0: 1277.62 (SE +/- 7.11, N = 3; MIN: 1252.39; -fopenmp=libomp)
Clang 12.0: 1307.49 (SE +/- 3.61, N = 3; MIN: 1293.38; -fopenmp=libomp)
GCC 10.3: 1379.51 (SE +/- 1.75, N = 3; MIN: 1361.6; -fopenmp)
GCC 9.3: 1358.56 (SE +/- 3.05, N = 3; MIN: 1337.17; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 544.31 (SE +/- 0.90, N = 3; MIN: 531.9; -fopenmp=libomp)
Clang 11.0: 562.97 (SE +/- 0.25, N = 3; MIN: 551.49; -fopenmp=libomp)
Clang 12.0: 590.18 (SE +/- 1.89, N = 3; MIN: 575.41; -fopenmp=libomp)
GCC 10.3: 658.28 (SE +/- 0.83, N = 3; MIN: 639.78; -fopenmp)
GCC 9.3: 659.19 (SE +/- 1.25, N = 3; MIN: 642.05; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 0.301885 (SE +/- 0.000492, N = 3; MIN: 0.29; -fopenmp=libomp)
Clang 11.0: 0.315522 (SE +/- 0.000247, N = 3; MIN: 0.3; -fopenmp=libomp)
Clang 12.0: 0.313689 (SE +/- 0.000321, N = 3; MIN: 0.3; -fopenmp=libomp)
GCC 10.3: 0.377733 (SE +/- 0.004341, N = 3; MIN: 0.36; -fopenmp)
GCC 9.3: 0.376992 (SE +/- 0.000576, N = 3; MIN: 0.36; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1268.08 (SE +/- 0.58, N = 3; MIN: 1257.35; -fopenmp=libomp)
Clang 11.0: 1271.91 (SE +/- 9.75, N = 3; MIN: 1252.33; -fopenmp=libomp)
Clang 12.0: 1305.10 (SE +/- 1.78, N = 3; MIN: 1294.76; -fopenmp=libomp)
GCC 10.3: 1375.71 (SE +/- 2.65, N = 3; MIN: 1355.68; -fopenmp)
GCC 9.3: 1356.91 (SE +/- 4.57, N = 3; MIN: 1335.04; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 544.60 (SE +/- 0.62, N = 3; MIN: 532.91; -fopenmp=libomp)
Clang 11.0: 563.25 (SE +/- 0.10, N = 3; MIN: 551.31; -fopenmp=libomp)
Clang 12.0: 597.48 (SE +/- 3.02, N = 3; MIN: 580.8; -fopenmp=libomp)
GCC 10.3: 658.04 (SE +/- 1.86, N = 3; MIN: 635.78; -fopenmp)
GCC 9.3: 657.88 (SE +/- 0.52, N = 3; MIN: 638.35; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.org - oneDNN 2.1.2 - ms, Fewer Is Better
AMD AOCC 3.0: 1.170440 (SE +/- 0.004625, N = 3; MIN: 1.11; -fopenmp=libomp)
Clang 11.0: 1.151400 (SE +/- 0.006530, N = 3; MIN: 1.09; -fopenmp=libomp)
Clang 12.0: 1.172580 (SE +/- 0.004576, N = 3; MIN: 1.12; -fopenmp=libomp)
GCC 10.3: 0.788192 (SE +/- 0.005622, N = 3; MIN: 0.74; -fopenmp)
GCC 9.3: 0.717782 (SE +/- 0.003430, N = 3; MIN: 0.67; -fopenmp)
1. (CXX) g++ options: -O3 -march=native -std=c++11 -msse4.1 -fPIC -pie -lpthread -ldl

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.org - FinanceBench 2016-07-25 - ms, Fewer Is Better
AMD AOCC 3.0: 33146.03 (SE +/- 9.32, N = 3)
Clang 11.0: 33178.50 (SE +/- 0.81, N = 3)
Clang 12.0: 33246.84 (SE +/- 64.93, N = 3)
GCC 10.3: 34979.29 (SE +/- 102.23, N = 3)
GCC 11.0.1: 34199.60 (SE +/- 3.94, N = 3)
GCC 9.3: 42399.81 (SE +/- 453.41, N = 14)
1. (CXX) g++ options: -O3 -march=native -fopenmp

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.org - FinanceBench 2016-07-25 - ms, Fewer Is Better
AMD AOCC 3.0: 51885.52 (SE +/- 242.64, N = 3)
Clang 11.0: 51900.43 (SE +/- 4.51, N = 3)
Clang 12.0: 51596.87 (SE +/- 10.95, N = 3)
GCC 10.3: 51770.51 (SE +/- 23.55, N = 3)
GCC 11.0.1: 51376.82 (SE +/- 42.62, N = 3)
GCC 9.3: 76805.58 (SE +/- 971.24, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Only - Average Latency

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - ms, Fewer Is Better
Clang 11.0: 0.040 (SE +/- 0.001, N = 3)
Clang 12.0: 0.041 (SE +/- 0.001, N = 3)
GCC 10.3: 0.040 (SE +/- 0.000, N = 3)
GCC 11.0.1: 0.042 (SE +/- 0.001, N = 15)
GCC 9.3: 0.042 (SE +/- 0.000, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 1 - Mode: Read Write - Average Latency

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - ms, Fewer Is Better
Clang 11.0: 0.302 (SE +/- 0.002, N = 3)
Clang 12.0: 0.305 (SE +/- 0.000, N = 3)
GCC 10.3: 0.297 (SE +/- 0.001, N = 3)
GCC 11.0.1: 0.296 (SE +/- 0.002, N = 3)
GCC 9.3: 0.303 (SE +/- 0.001, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Only - Average Latency

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - ms, Fewer Is Better
Clang 11.0: 0.094 (SE +/- 0.000, N = 3)
Clang 12.0: 0.094 (SE +/- 0.000, N = 3)
GCC 10.3: 0.093 (SE +/- 0.000, N = 3)
GCC 11.0.1: 0.092 (SE +/- 0.000, N = 3)
GCC 9.3: 0.095 (SE +/- 0.000, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Only - Average Latency

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - ms, Fewer Is Better
Clang 11.0: 0.235 (SE +/- 0.003, N = 3)
Clang 12.0: 0.234 (SE +/- 0.001, N = 3)
GCC 10.3: 0.230 (SE +/- 0.002, N = 3)
GCC 11.0.1: 0.230 (SE +/- 0.002, N = 3)
GCC 9.3: 0.235 (SE +/- 0.002, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 100 - Mode: Read Write - Average Latency

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - ms, Fewer Is Better
Clang 11.0: 1.626 (SE +/- 0.011, N = 3)
Clang 12.0: 1.607 (SE +/- 0.004, N = 3)
GCC 10.3: 1.701 (SE +/- 0.013, N = 3)
GCC 11.0.1: 1.777 (SE +/- 0.029, N = 3)
GCC 9.3: 1.688 (SE +/- 0.028, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm

PostgreSQL pgbench

Scaling Factor: 100 - Clients: 250 - Mode: Read Write - Average Latency

OpenBenchmarking.org - PostgreSQL pgbench 13.0 - ms, Fewer Is Better
Clang 11.0: 4.603 (SE +/- 0.074, N = 3)
Clang 12.0: 4.431 (SE +/- 0.054, N = 15)
GCC 10.3: 4.731 (SE +/- 0.052, N = 7)
GCC 11.0.1: 4.722 (SE +/- 0.021, N = 3)
GCC 9.3: 4.657 (SE +/- 0.034, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O3 -march=native -lpgcommon -lpgport -lpq -lpthread -lrt -ldl -lm
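
The average latency figures in this group track the pgbench TPS results reported earlier on this page. As a rough cross-check, and assuming the usual closed-loop relation (every client always has one transaction in flight, no rate limiting), average latency is approximately clients / TPS. The Python sketch below applies that to the Clients: 250 - Mode: Read Write results; the helper name `approx_latency_ms` is hypothetical, and the small gaps to the reported values are expected because the reported numbers are averaged per run.

```python
def approx_latency_ms(clients: int, tps: float) -> float:
    """Estimate average latency in ms, assuming each client always has
    exactly one transaction in flight (closed-loop load)."""
    return 1000.0 * clients / tps


# (TPS, reported average latency in ms), both copied from the
# "Scaling Factor: 100 - Clients: 250 - Mode: Read Write" results.
results = {
    "Clang 11.0": (54488, 4.603),
    "Clang 12.0": (56684, 4.431),
    "GCC 10.3": (53019, 4.731),
    "GCC 11.0.1": (53102, 4.722),
    "GCC 9.3": (53825, 4.657),
}

for name, (tps, reported) in results.items():
    estimate = approx_latency_ms(250, tps)
    print(f"{name:<11} estimated {estimate:.3f} ms, reported {reported:.3f} ms")
```

Because the two metrics are so tightly coupled, the latency graphs mostly restate the TPS ranking rather than adding an independent data point.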

Timed MrBayes Analysis

Primate Phylogeny Analysis

OpenBenchmarking.org - Timed MrBayes Analysis 3.2.7 - Seconds, Fewer Is Better
AMD AOCC 3.0: 86.74 (SE +/- 0.26, N = 3)
Clang 11.0: 88.62 (SE +/- 0.98, N = 3)
Clang 12.0: 89.12 (SE +/- 0.98, N = 3)
Clang 12.0 LTO: 93.63 (SE +/- 1.09, N = 3)
GCC 10.3: 93.66 (SE +/- 1.29, N = 4)
GCC 11.0.1: 89.43 (SE +/- 0.33, N = 3)
GCC 9.3: 89.16 (SE +/- 0.16, N = 3)
Extra options on 4 of the 7 results (attribution lost in extraction): -flto, -mabm, -mabm, -mabm
1. (CC) gcc options: -mmmx -msse -msse2 -msse3 -mssse3 -msse4.1 -msse4.2 -msse4a -msha -maes -mavx -mfma -mavx2 -mrdrnd -mbmi -mbmi2 -madx -O3 -std=c99 -pedantic -march=native -lm

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.org - libavif avifenc 0.9.0 - Seconds, Fewer Is Better
AMD AOCC 3.0: 48.13 (SE +/- 0.06, N = 3)
Clang 11.0: 47.89 (SE +/- 0.07, N = 3)
Clang 12.0: 47.88 (SE +/- 0.04, N = 3)
GCC 10.3: 51.45 (SE +/- 0.05, N = 3)
GCC 11.0.1: 51.03 (SE +/- 0.07, N = 3)
GCC 9.3: 52.22 (SE +/- 0.08, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.org - libavif avifenc 0.9.0 - Seconds, Fewer Is Better
AMD AOCC 3.0: 25.60 (SE +/- 0.01, N = 3)
Clang 11.0: 25.47 (SE +/- 0.06, N = 3)
Clang 12.0: 25.18 (SE +/- 0.06, N = 3)
GCC 10.3: 27.39 (SE +/- 0.07, N = 3)
GCC 11.0.1: 27.10 (SE +/- 0.03, N = 3)
GCC 9.3: 27.78 (SE +/- 0.04, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.org - libavif avifenc 0.9.0 - Seconds, Fewer Is Better
AMD AOCC 3.0: 9.725 (SE +/- 0.016, N = 3)
Clang 11.0: 9.536 (SE +/- 0.022, N = 3)
Clang 12.0: 9.510 (SE +/- 0.014, N = 3)
GCC 10.3: 10.417 (SE +/- 0.032, N = 3)
GCC 11.0.1: 10.291 (SE +/- 0.052, N = 3)
GCC 9.3: 10.399 (SE +/- 0.031, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10

OpenBenchmarking.org - libavif avifenc 0.9.0 - Seconds, Fewer Is Better
AMD AOCC 3.0: 3.543 (SE +/- 0.004, N = 3)
Clang 11.0: 3.429 (SE +/- 0.010, N = 3)
Clang 12.0: 3.361 (SE +/- 0.014, N = 3)
GCC 10.3: 3.643 (SE +/- 0.022, N = 3)
GCC 11.0.1: 3.607 (SE +/- 0.002, N = 3)
GCC 9.3: 3.659 (SE +/- 0.016, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.org - libavif avifenc 0.9.0 - Seconds, Fewer Is Better
AMD AOCC 3.0: 25.78 (SE +/- 0.06, N = 3)
Clang 11.0: 26.03 (SE +/- 0.22, N = 3)
Clang 12.0: 25.22 (SE +/- 0.04, N = 3)
GCC 10.3: 26.91 (SE +/- 0.04, N = 3)
GCC 11.0.1: 27.06 (SE +/- 0.05, N = 3)
GCC 9.3: 29.08 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.org - libavif avifenc 0.9.0 - Seconds, Fewer Is Better
AMD AOCC 3.0: 5.948 (SE +/- 0.022, N = 3)
Clang 11.0: 5.879 (SE +/- 0.011, N = 3)
Clang 12.0: 5.746 (SE +/- 0.013, N = 3)
GCC 10.3: 6.107 (SE +/- 0.007, N = 3)
GCC 11.0.1: 6.149 (SE +/- 0.017, N = 3)
GCC 9.3: 6.131 (SE +/- 0.022, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.org - C-Ray 1.1 - Seconds, Fewer Is Better
AMD AOCC 3.0: 15.649 (SE +/- 0.063, N = 3)
Clang 11.0: 15.599 (SE +/- 0.009, N = 3)
Clang 12.0: 15.870 (SE +/- 0.023, N = 3)
GCC 10.3: 9.029 (SE +/- 0.014, N = 3)
GCC 11.0.1: 9.227 (SE +/- 0.027, N = 3)
GCC 9.3: 9.158 (SE +/- 0.014, N = 3)
1. (CC) gcc options: -lm -lpthread -O3 -march=native

POV-Ray

Trace Time

OpenBenchmarking.org - POV-Ray 3.7.0.7 - Seconds, Fewer Is Better
AMD AOCC 3.0: 9.494 (SE +/- 0.026, N = 3)
Clang 11.0: 9.408 (SE +/- 0.032, N = 3)
Clang 12.0: 9.296 (SE +/- 0.041, N = 3)
GCC 10.3: 9.570 (SE +/- 0.049, N = 3)
GCC 9.3: 9.968 (SE +/- 0.053, N = 3)
1. (CXX) g++ options: -pipe -O3 -ffast-math -march=native -pthread -lSDL -lXpm -lSM -lICE -lX11 -lIlmImf -lImath -lHalf -lIex -lIexMath -lIlmThread -lpthread -ltiff -ljpeg -lpng -lz -lrt -lm -lboost_thread -lboost_system

FLAC Audio Encoding

WAV To FLAC

OpenBenchmarking.org - FLAC Audio Encoding 1.3.2 - Seconds, Fewer Is Better
AMD AOCC 3.0: 9.280 (SE +/- 0.006, N = 5)
Clang 11.0: 7.979 (SE +/- 0.006, N = 5)
Clang 12.0: 7.854 (SE +/- 0.007, N = 5)
GCC 10.3: 8.567 (SE +/- 0.008, N = 5)
GCC 11.0.1: 8.709 (SE +/- 0.006, N = 5)
GCC 9.3: 8.534 (SE +/- 0.011, N = 5)
Extra option on 3 of the 6 results (attribution lost in extraction): -fvisibility=hidden
1. (CXX) g++ options: -O3 -march=native -logg -lm

LAME MP3 Encoding

WAV To MP3

OpenBenchmarking.org - LAME MP3 Encoding 3.100 - Seconds, Fewer Is Better
AMD AOCC 3.0: 8.142 (SE +/- 0.008, N = 3)
Clang 11.0: 8.250 (SE +/- 0.021, N = 3)
Clang 12.0: 8.256 (SE +/- 0.003, N = 3)
GCC 10.3: 7.231 (SE +/- 0.006, N = 3)
GCC 11.0.1: 7.473 (SE +/- 0.005, N = 3)
GCC 9.3: 7.011 (SE +/- 0.019, N = 3)
Extra options on 3 of the 6 results (attribution lost in extraction): -ffast-math -funroll-loops -fschedule-insns2 -fbranch-count-reg -fforce-addr
1. (CC) gcc options: -O3 -pipe -march=native -lncurses -lm

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.org - Opus Codec Encoding 1.3.1 - Seconds, Fewer Is Better
Clang 11.0: 7.392 (SE +/- 0.002, N = 5)
Clang 12.0: 7.567 (SE +/- 0.013, N = 5)
GCC 10.3: 7.469 (SE +/- 0.003, N = 5)
GCC 11.0.1: 7.381 (SE +/- 0.002, N = 5)
GCC 9.3: 7.504 (SE +/- 0.002, N = 5)
Extra option on 3 of the 5 results (attribution lost in extraction): -fvisibility=hidden
1. (CXX) g++ options: -O3 -march=native -logg -lm

Gcrypt Library

OpenBenchmarking.org - Gcrypt Library 1.9 - Seconds, Fewer Is Better
AMD AOCC 3.0: 240.41 (SE +/- 0.82, N = 3)
Clang 11.0: 240.21 (SE +/- 0.28, N = 3)
Clang 12.0: 236.92 (SE +/- 0.44, N = 3)
GCC 10.3: 231.24 (SE +/- 0.32, N = 3)
GCC 11.0.1: 233.51 (SE +/- 0.18, N = 3)
GCC 9.3: 232.57 (SE +/- 0.54, N = 3)
1. (CC) gcc options: -O3 -march=native -fvisibility=hidden

Ngspice

Circuit: C2670

OpenBenchmarking.org - Ngspice 34 - Seconds, Fewer Is Better
AMD AOCC 3.0: 103.93 (SE +/- 0.22, N = 3)
Clang 11.0: 103.83 (SE +/- 0.06, N = 3)
Clang 12.0: 118.87 (SE +/- 0.53, N = 3)
GCC 10.3: 103.60 (SE +/- 0.48, N = 3)
GCC 11.0.1: 103.01 (SE +/- 1.53, N = 3)
GCC 9.3: 101.54 (SE +/- 1.32, N = 3)
Extra option on 4 of the 6 results (attribution lost in extraction): -lstdc++
1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Ngspice

Circuit: C7552

OpenBenchmarking.org - Ngspice 34 - Seconds, Fewer Is Better
AMD AOCC 3.0: 91.99 (SE +/- 0.12, N = 3)
Clang 11.0: 90.53 (SE +/- 1.37, N = 3)
Clang 12.0: 95.96 (SE +/- 1.11, N = 6)
GCC 10.3: 90.43 (SE +/- 0.12, N = 3)
GCC 11.0.1: 90.26 (SE +/- 0.43, N = 3)
GCC 9.3: 89.09 (SE +/- 0.60, N = 3)
Extra option on 4 of the 6 results (attribution lost in extraction): -lstdc++
1. (CC) gcc options: -O3 -march=native -fopenmp -lm -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lSM -lICE

Tachyon

Total Time

OpenBenchmarking.org - Tachyon 0.99b6 - Seconds, Fewer Is Better
AMD AOCC 3.0: 16.06 (SE +/- 0.01, N = 3)
Clang 11.0: 16.41 (SE +/- 0.02, N = 3)
Clang 12.0: 16.05 (SE +/- 0.06, N = 3)
GCC 10.3: 16.15 (SE +/- 0.02, N = 3)
GCC 11.0.1: 15.50 (SE +/- 0.02, N = 3)
GCC 9.3: 15.68 (SE +/- 0.02, N = 3)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.org - WebP2 Image Encode 20210126 - Seconds, Fewer Is Better
AMD AOCC 3.0: 2.816 (SE +/- 0.010, N = 3)
Clang 11.0: 2.743 (SE +/- 0.031, N = 3)
Clang 12.0: 2.739 (SE +/- 0.027, N = 3)
GCC 10.3: 2.918 (SE +/- 0.032, N = 7)
GCC 9.3: 2.778 (SE +/- 0.038, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.org - WebP2 Image Encode 20210126 - Seconds, Fewer Is Better
AMD AOCC 3.0: 109.81 (SE +/- 0.09, N = 3)
Clang 11.0: 109.64 (SE +/- 0.10, N = 3)
Clang 12.0: 109.53 (SE +/- 0.10, N = 3)
GCC 10.3: 116.66 (SE +/- 0.16, N = 3)
GCC 9.3: 118.45 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.org - WebP2 Image Encode 20210126 - Seconds, Fewer Is Better
AMD AOCC 3.0: 205.03 (SE +/- 0.17, N = 3)
Clang 11.0: 203.63 (SE +/- 0.66, N = 3)
Clang 12.0: 207.01 (SE +/- 0.07, N = 3)
GCC 10.3: 215.57 (SE +/- 0.46, N = 3)
GCC 9.3: 220.94 (SE +/- 1.32, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.org - WebP2 Image Encode 20210126 - Seconds, Fewer Is Better
AMD AOCC 3.0: 7.403 (SE +/- 0.028, N = 3)
Clang 11.0: 7.366 (SE +/- 0.022, N = 3)
Clang 12.0: 6.690 (SE +/- 0.006, N = 3)
GCC 10.3: 6.934 (SE +/- 0.017, N = 3)
GCC 9.3: 6.753 (SE +/- 0.017, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.org - WebP2 Image Encode 20210126 - Seconds, Fewer Is Better
AMD AOCC 3.0: 382.99 (SE +/- 0.39, N = 3)
Clang 11.0: 392.85 (SE +/- 0.17, N = 3)
Clang 12.0: 374.04 (SE +/- 0.49, N = 3)
GCC 10.3: 406.03 (SE +/- 3.10, N = 3)
GCC 9.3: 388.95 (SE +/- 1.92, N = 3)
1. (CXX) g++ options: -O3 -march=native -fno-rtti -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ASTC Encoder

Preset: Medium

OpenBenchmarking.org - ASTC Encoder 2.4 - Seconds, Fewer Is Better
AMD AOCC 3.0: 3.8811 (SE +/- 0.0042, N = 3)
Clang 11.0: 3.9837 (SE +/- 0.0013, N = 3)
Clang 12.0: 4.0058 (SE +/- 0.0116, N = 3)
GCC 10.3: 4.8699 (SE +/- 0.0047, N = 3)
GCC 11.0.1: 4.8160 (SE +/- 0.0099, N = 3)
GCC 9.3: 4.8745 (SE +/- 0.0035, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Thorough

OpenBenchmarking.org - ASTC Encoder 2.4 - Seconds, Fewer Is Better
AMD AOCC 3.0: 6.6409 (SE +/- 0.0015, N = 3)
Clang 11.0: 6.7674 (SE +/- 0.0026, N = 3)
Clang 12.0: 6.7647 (SE +/- 0.0028, N = 3)
GCC 10.3: 7.8370 (SE +/- 0.0011, N = 3)
GCC 11.0.1: 7.6989 (SE +/- 0.0034, N = 3)
GCC 9.3: 7.8537 (SE +/- 0.0029, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -pthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.org - ASTC Encoder 2.4 - Seconds, Fewer Is Better
AMD AOCC 3.0: 18.91 (SE +/- 0.00, N = 3)
Clang 11.0: 19.03 (SE +/- 0.01, N = 3)
Clang 12.0: 18.99 (SE +/- 0.01, N = 3)
GCC 10.3: 19.46 (SE +/- 0.01, N = 3)
GCC 11.0.1: 19.62 (SE +/- 0.00, N = 3)
GCC 9.3: 19.48 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3 -march=native -flto -pthread


Phoronix Test Suite v10.8.4