AMD EPYC Rome Compiler Benchmarks

AMD AOCC 2.0, GCC, LLVM Clang compiler benchmarks on EPYC 7742. Tests by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/1908089-AS-AMDEPYCRO53.

AMD EPYC Rome Compiler BenchmarksProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.02 x AMD EPYC 7742 64-Core @ 2.25GHz (128 Cores / 256 Threads)AMD DAYTONA_X (RDY1001C BIOS)AMD Device 1480516096MB280GB INTEL SSDPED1D280GA + 6 x 3841GB Micron_9300_MTFDHAL3T8TDP + 256GB Micron_1100_MTFDASPEEDVE2282 x Mellanox MT27710Ubuntu 19.045.2.0-050200rc7-generic (x86_64) 20190630GNOME Shell 3.32.1X Server 1.20.4modesetting 1.20.4GCC 9.1.0ext41920x1080GCC 10.0.0 20190804Clang 9.0.0-svn364739-1~exp1+0~20190701101552.184~1.gbp124358Clang 8.0.0OpenBenchmarking.orgEnvironment Details- CXXFLAGS=-O3-march=znver2 CFLAGS=-O3-march=znver2Compiler Details- GCC 9.1.0: --disable-multilib --enable-checking=release- GCC 10.0 Git: --disable-multilib --enable-checking=release- AOCC 2.0: Optimized build with assertions; Default target: x86_64-unknown-linux-gnu; Host CPU: znver1Processor Details- Scaling Governor: acpi-cpufreq ondemandPython Details- Python 2.7.16 + Python 3.7.3Security Details- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling

AMD EPYC Rome Compiler Benchmarkssockperf: Throughputsockperf: Latency Ping Pongsockperf: Latency Under Loadhpcg: lzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Libdeflate 1 - Compressionlzbench: Libdeflate 1 - Decompressionfftw: Stock - 1D FFT Size 2048fftw: Stock - 1D FFT Size 4096fftw: Stock - 2D FFT Size 2048fftw: Stock - 2D FFT Size 4096scimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationtscp: AI Chess Performancejohn-the-ripper: Blowfishmkl-dnn: IP Batch 1D - f32mkl-dnn: IP Batch All - f32mkl-dnn: Convolution Batch conv_3d - f32mkl-dnn: Convolution Batch conv_all - f32mkl-dnn: Deconvolution Batch deconv_1d - f32mkl-dnn: Deconvolution Batch deconv_3d - f32mkl-dnn: Convolution Batch conv_alexnet - f32mkl-dnn: Deconvolution Batch deconv_all - f32mkl-dnn: Convolution Batch conv_googlenet_v3 - f32svt-av1: 1080p 8-bit YUV To AV1 Video Encodesvt-hevc: 1080p 8-bit YUV To HEVC Video Encodesvt-vp9: 1080p 8-bit YUV To VP9 Video Encodex264: H.264 Video Encodingx265: H.265 1080p Video Encodinggraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacecoremark: CoreMark Size 666 - Iterations Per Secondbuild-llvm: Time To Compilebuild-php: Time To Compilec-ray: Total Time - 4K, 16 Rays Per Pixelaobench: 2048 x 2048 - Total Timedav1d: Summer Nature 4Kdav1d: Summer Nature 1080ptjbench: Decompression Throughputcpp-perf-bench: Atolcpp-perf-bench: Ctypecpp-perf-bench: Math Librarycpp-perf-bench: Rand Numberscpp-perf-bench: Stepanov Vectorcpp-perf-bench: Function Objectscpp-perf-bench: Stepanov Abstractionapache: Static Web Page Servingapache-siege: 100apache-siege: 200apache-siege: 250GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.04775785.3922.420.3630873649823894461978808946.238808.937381.006327.202834.06612.17203.682741.978811.171801.3110728121483021585.036593.30387.7137344.80818.78588.074136.482085363194.83101.07343.29277.45153.3144.542182042072151022082283868113.7196.2964.625.7236.7711.524.75175.1674.3641.66343.451599.0799.2118.9336.6624915.1926315.8032222.7233869.934804155.3924.120.36308636210113861948768958.538816.737288.806289.302828.01610.98202.712763.508762.751800.111040775148542946.126961.88440.3138147.16955.00500.544171.882070863342.14100.44344.04283.63152.4344.732192022072131022072263825301.4596.8565.065.8235.3311.164.77175.4874.3239.33333.851627.9997.7019.0136.5224096.6026336.6130752.5432367.310.358273.108258.376627.775842.732880.56621.05224.523382.938518.521655.77114937018765216.7690.662.66371.956.082.7442.492590.0923.30102.78337.75274.55153.9545.372162372092151192202343024508.8178.8688.309.1941.7511.564.86174.1374.1036.74331.021892.4685.8918.8233.4424594.6625958.5132213.5933938.130.338007.577503.576462.335593.442730.13605.88178.572736.948473.111656.18109368218714011.6076.802.71370.635.402.7342.781917.8923.21157.1845.571942051861901201952023284059.92146.60230.219.0137.0211.144.77175.9674.3837.81330.411894.0289.7318.9434.1024163.31OpenBenchmarking.org

Sockperf

Test: Throughput

OpenBenchmarking.orgMessages Per Second, More Is BetterSockperf 3.4Test: ThroughputGCC 9.1.0GCC 10.0 Git100K200K300K400K500KSE +/- 3521.00, N = 25SE +/- 4088.70, N = 54775784804151. (CXX) g++ options: --param -O3 -march=znver2 -rdynamic -ldl -lpthread

Sockperf

Test: Latency Ping Pong

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Ping PongGCC 9.1.0GCC 10.0 Git1.21282.42563.63844.85126.064SE +/- 0.02, N = 5SE +/- 0.05, N = 55.395.391. (CXX) g++ options: --param -O3 -march=znver2 -rdynamic -ldl -lpthread

Sockperf

Test: Latency Under Load

OpenBenchmarking.orgusec, Fewer Is BetterSockperf 3.4Test: Latency Under LoadGCC 9.1.0GCC 10.0 Git612182430SE +/- 0.86, N = 25SE +/- 0.50, N = 2022.4224.121. (CXX) g++ options: --param -O3 -march=znver2 -rdynamic -ldl -lpthread

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.0GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.00.0810.1620.2430.3240.405SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 12SE +/- 0.01, N = 120.360.360.350.33

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: CompressionGCC 9.1.0GCC 10.0 Git71421283530301. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: DecompressionGCC 9.1.0GCC 10.0 Git20406080100SE +/- 0.33, N = 387861. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: CompressionGCC 9.1.0GCC 10.0 Git80160240320400SE +/- 0.33, N = 3SE +/- 1.20, N = 33643621. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: DecompressionGCC 9.1.0GCC 10.0 Git200400600800100098210111. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: CompressionGCC 9.1.0GCC 10.0 Git80160240320400SE +/- 1.00, N = 3SE +/- 2.33, N = 33893851. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: DecompressionGCC 9.1.0100200300400500SE +/- 0.33, N = 34461. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: CompressionGCC 9.1.0GCC 10.0 Git4080120160200SE +/- 0.58, N = 31971941. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: Libdeflate 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: DecompressionGCC 9.1.0GCC 10.0 Git2004006008001000SE +/- 4.67, N = 3SE +/- 7.84, N = 38808761. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

FFTW

Build: Stock - Size: 1D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 2048GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.02K4K6K8K10KSE +/- 35.41, N = 3SE +/- 20.56, N = 3SE +/- 7.45, N = 3SE +/- 3.56, N = 38946.238958.538273.108007.571. (CC) gcc options: -pthread -O3 -march=znver2 -lm

FFTW

Build: Stock - Size: 1D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 1D FFT Size 4096GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.02K4K6K8K10KSE +/- 10.91, N = 3SE +/- 4.55, N = 3SE +/- 12.43, N = 3SE +/- 113.30, N = 38808.938816.738258.377503.571. (CC) gcc options: -pthread -O3 -march=znver2 -lm

FFTW

Build: Stock - Size: 2D FFT Size 2048

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 2048GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.016003200480064008000SE +/- 15.60, N = 3SE +/- 74.47, N = 3SE +/- 50.06, N = 3SE +/- 19.37, N = 37381.007288.806627.776462.331. (CC) gcc options: -pthread -O3 -march=znver2 -lm

FFTW

Build: Stock - Size: 2D FFT Size 4096

OpenBenchmarking.orgMflops, More Is BetterFFTW 3.3.6Build: Stock - Size: 2D FFT Size 4096GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.014002800420056007000SE +/- 32.75, N = 3SE +/- 67.18, N = 3SE +/- 41.90, N = 3SE +/- 48.86, N = 126327.206289.305842.735593.441. (CC) gcc options: -pthread -O3 -march=znver2 -lm

SciMark

Computational Test: Composite

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.06001200180024003000SE +/- 10.94, N = 3SE +/- 4.68, N = 3SE +/- 9.28, N = 3SE +/- 4.27, N = 32834.062828.012880.562730.131. (CC) gcc options: -O3 -march=znver2 -lm

SciMark

Computational Test: Monte Carlo

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0130260390520650SE +/- 0.53, N = 3SE +/- 0.05, N = 3SE +/- 0.10, N = 3SE +/- 0.12, N = 3612.17610.98621.05605.881. (CC) gcc options: -O3 -march=znver2 -lm

SciMark

Computational Test: Fast Fourier Transform

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.91, N = 3SE +/- 0.44, N = 3SE +/- 0.10, N = 3SE +/- 0.38, N = 3203.68202.71224.52178.571. (CC) gcc options: -O3 -march=znver2 -lm

SciMark

Computational Test: Sparse Matrix Multiply

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.07001400210028003500SE +/- 69.70, N = 3SE +/- 15.27, N = 3SE +/- 47.14, N = 3SE +/- 14.78, N = 32741.972763.503382.932736.941. (CC) gcc options: -O3 -march=znver2 -lm

SciMark

Computational Test: Dense LU Matrix Factorization

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.02K4K6K8K10KSE +/- 15.96, N = 3SE +/- 28.43, N = 3SE +/- 6.82, N = 3SE +/- 10.49, N = 38811.178762.758518.528473.111. (CC) gcc options: -O3 -march=znver2 -lm

SciMark

Computational Test: Jacobi Successive Over-Relaxation

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0400800120016002000SE +/- 0.11, N = 3SE +/- 0.14, N = 3SE +/- 0.16, N = 3SE +/- 0.43, N = 31801.311800.111655.771656.181. (CC) gcc options: -O3 -march=znver2 -lm

TSCP

AI Chess Performance

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0200K400K600K800K1000KSE +/- 1412.87, N = 5SE +/- 393.00, N = 5SE +/- 479.00, N = 5SE +/- 532.27, N = 510728121040775114937010936821. (CC) gcc options: -O3 -march=znver2 -march=native

John The Ripper

Test: Blowfish

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 1.9.0-jumbo-1Test: BlowfishGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.040K80K120K160K200KSE +/- 794.73, N = 3SE +/- 760.59, N = 3SE +/- 2145.04, N = 3SE +/- 858.63, N = 31483021485421876521871401. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -pthread -lm -lz -ldl -lcrypt -lbz2

MKL-DNN

Harness: IP Batch 1D - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch 1D - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.030060090012001500SE +/- 218.86, N = 12SE +/- 119.82, N = 15SE +/- 0.25, N = 3SE +/- 0.25, N = 151585.03946.1216.7611.60-march=native -mtune=native -fopenmp - MIN: 283.94-march=native -mtune=native -fopenmp - MIN: 276.31. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: IP Batch All - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: IP Batch All - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.015003000450060007500SE +/- 192.76, N = 15SE +/- 320.05, N = 12SE +/- 1.29, N = 4SE +/- 1.10, N = 36593.306961.8890.6676.80-march=native -mtune=native -fopenmp - MIN: 3178.05-march=native -mtune=native -fopenmp - MIN: 3291.221. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_3d - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0100200300400500SE +/- 57.38, N = 15SE +/- 108.49, N = 12SE +/- 0.02, N = 3SE +/- 0.03, N = 15387.71440.312.662.71-march=native -mtune=native -fopenmp - MIN: 159.05-march=native -mtune=native -fopenmp - MIN: 157.551. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_all - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.08K16K24K32K40KSE +/- 601.47, N = 3SE +/- 509.65, N = 5SE +/- 1.44, N = 3SE +/- 0.50, N = 337344.8038147.16371.95370.63-march=native -mtune=native -fopenmp - MIN: 28450.7-march=native -mtune=native -fopenmp - MIN: 29056.71. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_1d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_1d - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.02004006008001000SE +/- 138.13, N = 12SE +/- 140.61, N = 15SE +/- 0.11, N = 12SE +/- 0.07, N = 6818.78955.006.085.40-march=native -mtune=native -fopenmp - MIN: 286.61-march=native -mtune=native -fopenmp - MIN: 287.251. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_3d - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_3d - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0130260390520650SE +/- 105.31, N = 12SE +/- 48.77, N = 15SE +/- 0.02, N = 3SE +/- 0.00, N = 3588.07500.542.742.73-march=native -mtune=native -fopenmp - MIN: 189.7-march=native -mtune=native -fopenmp - MIN: 192.521. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_alexnet - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_alexnet - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.09001800270036004500SE +/- 56.84, N = 15SE +/- 66.32, N = 15SE +/- 0.28, N = 3SE +/- 0.09, N = 34136.484171.8842.4942.78-march=native -mtune=native -fopenmp - MIN: 3325.34-march=native -mtune=native -fopenmp - MIN: 3313.31. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Deconvolution Batch deconv_all - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Deconvolution Batch deconv_all - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.040K80K120K160K200KSE +/- 2544.01, N = 3SE +/- 3229.16, N = 3SE +/- 22.00, N = 3SE +/- 15.15, N = 3208536.00207086.002590.091917.89-march=native -mtune=native -fopenmp - MIN: 153312-march=native -mtune=native -fopenmp - MIN: 1496981. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

MKL-DNN

Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32

OpenBenchmarking.orgms, Fewer Is BetterMKL-DNN 2019-04-16Harness: Convolution Batch conv_googlenet_v3 - Data Type: f32GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.07001400210028003500SE +/- 173.73, N = 12SE +/- 108.79, N = 12SE +/- 0.11, N = 3SE +/- 0.27, N = 63194.833342.1423.3023.21-march=native -mtune=native -fopenmp - MIN: 1649.04-march=native -mtune=native -fopenmp - MIN: 1649.031. (CXX) g++ options: -O3 -march=znver2 -std=c++11 -fPIC -pie -lmklml_intel -ldl

SVT-AV1

1080p 8-bit YUV To AV1 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 0.51080p 8-bit YUV To AV1 Video EncodeGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVN20406080100SE +/- 1.25, N = 5SE +/- 0.16, N = 3SE +/- 0.32, N = 3101.07100.44102.781. (CXX) g++ options: -O3 -march=znver2 -pie -lpthread -lm

SVT-HEVC

1080p 8-bit YUV To HEVC Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-HEVC 2019-02-031080p 8-bit YUV To HEVC Video EncodeGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVN70140210280350SE +/- 4.37, N = 3SE +/- 3.15, N = 10SE +/- 4.13, N = 3343.29344.04337.75-fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native-fPIE -fPIC -O2 -flto -fvisibility=hidden -march=native1. (CC) gcc options: -O3 -march=znver2 -pie -rdynamic -lpthread -lrt

SVT-VP9

1080p 8-bit YUV To VP9 Video Encode

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 2019-02-171080p 8-bit YUV To VP9 Video EncodeGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVN60120180240300SE +/- 4.48, N = 3SE +/- 3.30, N = 15SE +/- 3.16, N = 3277.45283.63274.55-fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx-fPIE -fPIC -O2 -flto -fvisibility=hidden -mavx1. (CC) gcc options: -O3 -march=znver2 -pie -rdynamic -lpthread -lrt -lm

x264

H.264 Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0306090120150SE +/- 0.91, N = 3SE +/- 0.41, N = 3SE +/- 0.48, N = 3SE +/- 2.65, N = 3153.31152.43153.95157.18-mstack-alignment=64-mstack-alignment=641. (CC) gcc options: -ldl -lavformat -lavcodec -lavutil -lswscale -m64 -lm -lpthread -O3 -ffast-math -march=znver2 -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

H.265 1080p Video Encoding

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.01020304050SE +/- 0.11, N = 3SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.15, N = 344.5444.7345.3745.571. (CXX) g++ options: -O3 -march=znver2 -rdynamic -lpthread -lrt -ldl -lnuma

GraphicsMagick

Operation: Swirl

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.33, N = 3SE +/- 0.33, N = 3218219216194-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Rotate

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 1.15, N = 3204202237205-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Sharpen

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.67, N = 3SE +/- 1.00, N = 3207207209186-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Enhanced

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.33, N = 3215213215190-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Resizing

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0306090120150SE +/- 0.58, N = 3SE +/- 1.53, N = 3SE +/- 2.59, N = 12102102119120-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: Noise-Gaussian

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.88, N = 3208207220195-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick

Operation: HWB Color Space

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.33, N = 3SE +/- 2.65, N = 3228226234202-ldl-ldl-lomp-lomp1. (CC) gcc options: -fopenmp -O3 -march=znver2 -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0800K1600K2400K3200K4000KSE +/- 60538.32, N = 3SE +/- 29615.82, N = 3SE +/- 32092.57, N = 3SE +/- 40813.96, N = 33868113.713825301.453024508.813284059.921. (CC) gcc options: -O2 -O3 -march=znver2 -lrt" -lrt

Timed LLVM Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 6.0.1Time To CompileGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.030609012015096.2996.8578.86146.60

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 7.1.9Time To CompileGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.050100150200250SE +/- 0.28, N = 3SE +/- 0.29, N = 3SE +/- 0.24, N = 3SE +/- 0.53, N = 364.6265.0688.30230.211. (CC) gcc options: -O3 -march=znver2 -pedantic -ldl -lz -lm

C-Ray

Total Time - 4K, 16 Rays Per Pixel

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.03691215SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 35.725.829.199.011. (CC) gcc options: -lm -lpthread -O3 -march=znver2

AOBench

Size: 2048 x 2048 - Total Time

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.01020304050SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 336.7735.3341.7537.021. (CC) gcc options: -lm -O3 -march=znver2

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.3Video Input: Summer Nature 4KGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.03691215SE +/- 0.15, N = 3SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.07, N = 311.5211.1611.5611.141. (CC) gcc options: -O3 -march=znver2 -pthread

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.3Video Input: Summer Nature 1080pGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.01.09352.1873.28054.3745.4675SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.04, N = 34.754.774.864.771. (CC) gcc options: -O3 -march=znver2 -pthread

libjpeg-turbo tjbench

Test: Decompression Throughput

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 2.0.2Test: Decompression ThroughputGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.04080120160200SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.02, N = 3175.16175.48174.13175.961. (CC) gcc options: -O3 -march=znver2 -rdynamic

CppPerformanceBenchmarks

Test: Atol

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.020406080100SE +/- 0.23, N = 3SE +/- 0.12, N = 3SE +/- 0.06, N = 3SE +/- 0.28, N = 374.3674.3274.1074.381. (CXX) g++ options: -O3 -march=znver2 -std=c++11

CppPerformanceBenchmarks

Test: Ctype

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.01020304050SE +/- 0.12, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 341.6639.3336.7437.811. (CXX) g++ options: -O3 -march=znver2 -std=c++11

CppPerformanceBenchmarks

Test: Math Library

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Math LibraryGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.070140210280350SE +/- 0.46, N = 3SE +/- 0.12, N = 3SE +/- 0.43, N = 3SE +/- 0.31, N = 3343.45333.85331.02330.411. (CXX) g++ options: -O3 -march=znver2 -std=c++11

CppPerformanceBenchmarks

Test: Random Numbers

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Random NumbersGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0400800120016002000SE +/- 0.06, N = 3SE +/- 0.12, N = 3SE +/- 0.02, N = 3SE +/- 0.26, N = 31599.071627.991892.461894.021. (CXX) g++ options: -O3 -march=znver2 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Vector

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.020406080100SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.11, N = 3SE +/- 0.35, N = 399.2197.7085.8989.731. (CXX) g++ options: -O3 -march=znver2 -std=c++11

CppPerformanceBenchmarks

Test: Function Objects

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0510152025SE +/- 0.02, N = 3SE +/- 0.08, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 318.9319.0118.8218.941. (CXX) g++ options: -O3 -march=znver2 -std=c++11

CppPerformanceBenchmarks

Test: Stepanov Abstraction

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.0816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 336.6636.5233.4434.101. (CXX) g++ options: -O3 -march=znver2 -std=c++11

Apache Benchmark

Static Web Page Serving

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVNAOCC 2.05K10K15K20K25KSE +/- 435.79, N = 15SE +/- 497.81, N = 15SE +/- 400.39, N = 15SE +/- 395.53, N = 1524915.1924096.6024594.6624163.311. (CC) gcc options: -shared -fPIC -pthread -O3 -march=znver2

Apache Siege

Concurrent Users: 100

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 100GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVN6K12K18K24K30KSE +/- 10.57, N = 3SE +/- 17.44, N = 3SE +/- 332.76, N = 426315.8026336.6125958.511. (CC) gcc options: -O3 -march=znver2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 200

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 200GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVN7K14K21K28K35KSE +/- 261.42, N = 15SE +/- 170.01, N = 3SE +/- 428.68, N = 332222.7230752.5432213.591. (CC) gcc options: -O3 -march=znver2 -lpthread -ldl -lssl -lcrypto

Apache Siege

Concurrent Users: 250

OpenBenchmarking.orgTransactions Per Second, More Is BetterApache Siege 2.4.29Concurrent Users: 250GCC 9.1.0GCC 10.0 GitLLVM Clang 9.0 SVN7K14K21K28K35KSE +/- 479.50, N = 15SE +/- 480.75, N = 15SE +/- 377.42, N = 1533869.9332367.3133938.131. (CC) gcc options: -O3 -march=znver2 -lpthread -ldl -lssl -lcrypto


Phoronix Test Suite v10.8.4