POWER9 Talos II Compiler Benchmarks

POWER9 compiler benchmarking for a future article on Phoronix.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1902066-SP-POWER9TAL97
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
GCC 8.2.0
February 05 2019
  2 Hours, 48 Minutes
GCC 9.0.1
February 06 2019
  2 Hours, 44 Minutes
Clang 7.0.1
February 06 2019
  2 Hours, 19 Minutes
Clang 8.0.0-rc
February 06 2019
  2 Hours, 27 Minutes
Invert Behavior (Only Show Selected Data)
  2 Hours, 35 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


POWER9 Talos II Compiler BenchmarksOpenBenchmarking.orgPhoronix Test SuitePOWER9 @ 3.80GHz (44 Cores / 176 Threads)PowerNV T2P9D01 REV 1.0165536MBSamsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1ASPEED Family2 x Broadcom NetXtreme BCM5719 PCIeUbuntu 19.044.18.0-11-generic (ppc64le)GCC 8.2.0 + clang (GCC) 8.2.0GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental)Clang 7.0.1 + LLVM 7.0.1Clang 8.0.0 + LLVM 8.0.0ext41024x768ProcessorMotherboardMemoryDiskGraphicsNetworkOSKernelCompilersFile-SystemScreen ResolutionPOWER9 Talos II Compiler Benchmarks PerformanceSystem Logs- CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=native- GCC 8.2.0: --enable-checking=release- GCC 9.0.1: --enable-checking=release- Clang 7.0.1: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9 - Clang 8.0.0-rc: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9 - Scaling Governor: powernv-cpufreq ondemand

GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rcResult OverviewPhoronix Test Suite100%166%232%298%365%GraphicsMagickC-RayCacheBenchHierarchical INTegrationSciMarkMemcached mcperfAOBenchLAME MP3 EncodingFLAC Audio Encodingdav1dTimed MAFFT AlignmentTSCPHimeno BenchmarkBullet Physics EngineRedisOpenSSLebizzyCppPerformanceBenchmarksZstd Compressiont-test1x264libjpeg-turbo tjbenchApache Benchmark

POWER9 Talos II Compiler Benchmarkst-test1: 1t-test1: 2lzbench: XZ 0 - Compressionlzbench: XZ 0 - Decompressionlzbench: Zstd 1 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Brotli 0 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Libdeflate 1 - Compressionlzbench: Libdeflate 1 - Decompressionmafft: Multiple Sequence Alignmentcachebench: Readcachebench: Writecachebench: Read / Modify / Writescimark2: Compositescimark2: Monte Carloscimark2: Fast Fourier Transformscimark2: Sparse Matrix Multiplyscimark2: Dense LU Matrix Factorizationscimark2: Jacobi Successive Over-Relaxationtscp: AI Chess Performancex264: H.264 Video Encodingx265: H.265 1080p Video Encodinggraphics-magick: Swirlgraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Noise-Gaussiangraphics-magick: HWB Color Spacehimeno: Poisson Pressure Solvercompress-7zip: Compress Speed Testebizzy: c-ray: Total Time - 4K, 16 Rays Per Pixelcompress-pbzip2: 256MB File Compressionaobench: 2048 x 2048 - Total Timebullet: Raytestsbullet: 3000 Fallbullet: 1000 Stackbullet: 1000 Convexbullet: 136 Ragdollsbullet: Prim Trimeshbullet: Convex Trimeshcompress-zstd: Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19dav1d: Summer Nature 4Kdav1d: Summer Nature 1080pencode-flac: WAV To FLACencode-mp3: WAV To MP3m-queens: Time To Solveopenssl: RSA 4096-bit Performancetjbench: Decompression Throughputcpp-perf-bench: Atolcpp-perf-bench: Ctypecpp-perf-bench: Stepanov Vectorcpp-perf-bench: Function Objectscpp-perf-bench: Stepanov Abstractionredis: LPOPredis: SADDredis: LPUSHredis: GETredis: SETxsbench: mcperf: Addmcperf: Getmcperf: Setmcperf: Appendmcperf: Deletemcperf: Prependmcperf: Replacehint: FLOAThint: DOUBLEapache: Static Web Page ServingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc17.716.6521702857903222921153534.5147821192521154125516130711193441124871436853.7511.30164197147154178159183618157870111779317.842.3458.834.916.287.698.224.151.541.9511.3793.2328.9344.3915.7320.36751410689.1775.0214220.0059.9816859351217931791788160418010171315560007497817401549744517567451551972525592206112665358145312111017.806.9922722808133172991153563.7948971114719793117315330811163042124874029352.1511.25165197148154177157185656153692113059217.872.2659.034.916.267.558.454.131.541.9811.5984.5827.6939.9615.7320.35740710689.2474.8614019.7056.8917143271298341820878161822310966715486220378535628538135396665659939442395402274079215528397322100817.926.884.394750444595869115853483051724460594083577852.344315920238915118594115223256.2374.955.386.848.328.734.841.702.1211.04101.0631.3847.7118.61695810989.7566.2513119.5353.041804132137679586723417213461113384495787406149453519817440952563522001744644193777181932120917.837.184.244750447545887115933783051829450994484996651.984315919228815118623107104856.5375.605.416.858.358.664.821.692.1111.33103.1732.1947.8418.86706310789.4966.3413219.5053.2417299361211944836662166685410331155011074547496825223473918530495234317674020737956129921444OpenBenchmarking.org

t-test1

This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 1GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc48121620SE +/- 0.10, N = 3SE +/- 0.11, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 317.7117.8017.9217.831. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native

OpenBenchmarking.orgSeconds, Fewer Is Bettert-test1 2017-01-13Threads: 2GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.02, N = 3SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.04, N = 36.656.996.887.181. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: CompressionGCC 8.2.0GCC 9.0.151015202521221. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: XZ 0 - Process: DecompressionGCC 8.2.0GCC 9.0.1163248648070721. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: CompressionGCC 8.2.0GCC 9.0.1601201802403002862801. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Zstd 1 - Process: DecompressionGCC 8.2.0GCC 9.0.12004006008001000SE +/- 0.33, N = 38008131. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: CompressionGCC 8.2.0GCC 9.0.1701402102803503223171. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Brotli 0 - Process: DecompressionGCC 8.2.0GCC 9.0.170140210280350SE +/- 0.33, N = 32922991. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: CompressionGCC 8.2.0GCC 9.0.13060901201501151151. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 2017-08-08Test: Libdeflate 1 - Process: DecompressionGCC 8.2.0GCC 9.0.1801602403204003533561. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Timed MAFFT Alignment

This test performs an alignment of 100 pyruvate decarboxylase sequences. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.392Multiple Sequence AlignmentGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1.01482.02963.04444.05925.074SE +/- 0.22, N = 12SE +/- 0.25, N = 12SE +/- 0.37, N = 9SE +/- 0.30, N = 94.513.794.394.241. (CC) gcc options: -std=c99 -O3 -lm -lpthread

CacheBench

This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: ReadGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc10002000300040005000SE +/- 0.37, N = 3SE +/- 0.00, N = 3SE +/- 0.05, N = 3SE +/- 0.08, N = 347824897475047501. (CC) gcc options: -lrt

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: WriteGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc10K20K30K40K50KSE +/- 0.52, N = 3SE +/- 14.98, N = 3SE +/- 18.02, N = 3SE +/- 3.02, N = 3119251114744459447541. (CC) gcc options: -lrt

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WriteGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc13K26K39K52K65KSE +/- 0.03, N = 3SE +/- 9.96, N = 3SE +/- 13.10, N = 3SE +/- 25.70, N = 3211541979358691588711. (CC) gcc options: -lrt

SciMark

This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: CompositeGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc30060090012001500SE +/- 0.23, N = 3SE +/- 0.67, N = 3SE +/- 6.10, N = 3SE +/- 0.37, N = 312551173158515931. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Monte CarloGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc80160240320400SE +/- 0.05, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 31611533483781. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Fast Fourier TransformGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc70140210280350SE +/- 0.41, N = 3SE +/- 0.49, N = 3SE +/- 0.93, N = 3SE +/- 0.73, N = 33073083053051. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Sparse Matrix MultiplyGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc400800120016002000SE +/- 0.06, N = 3SE +/- 0.28, N = 3SE +/- 1.61, N = 3SE +/- 0.79, N = 311191116172418291. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Dense LU Matrix FactorizationGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc10002000300040005000SE +/- 0.98, N = 3SE +/- 3.55, N = 3SE +/- 30.81, N = 3SE +/- 0.88, N = 334413042460545091. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

OpenBenchmarking.orgMflops, More Is BetterSciMark 2.0Computational Test: Jacobi Successive Over-RelaxationGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc30060090012001500SE +/- 0.01, N = 3SE +/- 0.06, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3124812489409441. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

TSCP

This is a performance test of TSCP, Tom Kerrigan's Simple Chess Program, which has a built-in performance benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterTSCP 1.81AI Chess PerformanceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000K7143687402938357788499661. (CC) gcc options: -O3 -mtune=native -mcpu=native

x264

This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx264 2018-09-25H.264 Video EncodingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1224364860SE +/- 0.16, N = 3SE +/- 0.57, N = 3SE +/- 0.55, N = 3SE +/- 0.35, N = 353.7552.1552.3451.981. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize

x265

This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterx265 3.0H.265 1080p Video EncodingGCC 8.2.0GCC 9.0.13691215SE +/- 0.09, N = 3SE +/- 0.03, N = 311.3011.251. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SwirlGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 0.67, N = 31641654343-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: RotateGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200197197159159-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: SharpenGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc3060901201501471482019-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: EnhancedGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc3060901201501541542322-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: ResizingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 0.67, N = 3SE +/- 0.58, N = 31781778988-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: Noise-GaussianGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 0.67, N = 31591571515-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.30Operation: HWB Color SpaceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc4080120160200SE +/- 1.20, N = 3SE +/- 0.33, N = 3183185118118-fopenmp -ldl-fopenmp -ldl1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

Himeno Benchmark

The Himeno benchmark is a linear solver of pressure Poisson using a point-Jacobi method. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterHimeno Benchmark 3.0Poisson Pressure SolverGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc140280420560700SE +/- 14.49, N = 12SE +/- 8.80, N = 3SE +/- 7.79, N = 7SE +/- 7.21, N = 126186565946231. (CC) gcc options: -O3 -mtune=native -mcpu=native

7-Zip Compression

This is a test of 7-Zip using p7zip with its integrated benchmark feature or upstream 7-Zip for the Windows x64 build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 16.02Compress Speed TestGCC 8.2.0GCC 9.0.130K60K90K120K150KSE +/- 2707.02, N = 4SE +/- 2561.27, N = 31578701536921. (CXX) g++ options: -pipe -lpthread

ebizzy

This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRecords/s, More Is Betterebizzy 0.3GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000KSE +/- 20488.27, N = 12SE +/- 17339.39, N = 12SE +/- 18117.64, N = 3SE +/- 13415.01, N = 311177931130592115223210710481. (CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native

C-Ray

This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 1.1Total Time - 4K, 16 Rays Per PixelGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1326395265SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 317.8417.8756.2356.531. (CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native

Parallel BZIP2 Compression

This test measures the time needed to compress a file (a .tar package of the Linux kernel source code) using BZIP2 compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterParallel BZIP2 Compression 1.1.12256MB File CompressionGCC 8.2.0GCC 9.0.10.52651.0531.57952.1062.6325SE +/- 0.03, N = 3SE +/- 0.03, N = 52.342.261. (CXX) g++ options: -O2 -pthread -lbz2 -lpthread

AOBench

AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterAOBenchSize: 2048 x 2048 - Total TimeGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 358.8359.0374.9575.601. (CC) gcc options: -lm -O3 -mtune=native -mcpu=native

Bullet Physics Engine

This is a benchmark of the Bullet Physics Engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: RaytestsGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1.21732.43463.65194.86926.0865SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.914.915.385.41-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 3000 FallGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 36.286.266.846.85-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 StackGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 37.697.558.328.35-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 1000 ConvexGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc246810SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 38.228.458.738.66-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: 136 RagdollsGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1.0892.1783.2674.3565.445SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.154.134.844.82-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Prim TrimeshGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc0.38250.7651.14751.531.9125SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.541.541.701.69-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

OpenBenchmarking.orgSeconds, Fewer Is BetterBullet Physics Engine 2.81Test: Convex TrimeshGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc0.4770.9541.4311.9082.385SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.951.982.122.11-lglut -lGL -lGLU-lglut -lGL -lGLU1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

Zstd Compression

This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterZstd Compression 1.3.4Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc3691215SE +/- 0.24, N = 12SE +/- 0.21, N = 3SE +/- 0.16, N = 3SE +/- 0.09, N = 311.3711.5911.0411.331. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma

dav1d

Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 4KGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 1.78, N = 3SE +/- 0.40, N = 3SE +/- 0.92, N = 3SE +/- 0.44, N = 393.2384.58101.06103.171. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread

OpenBenchmarking.orgSeconds, Fewer Is Betterdav1d 0.1Video Input: Summer Nature 1080pGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc714212835SE +/- 0.09, N = 3SE +/- 0.39, N = 12SE +/- 0.19, N = 3SE +/- 0.18, N = 328.9327.6931.3832.191. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread

FLAC Audio Encoding

This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterFLAC Audio Encoding 1.3.2WAV To FLACGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1122334455SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 5SE +/- 0.01, N = 544.3939.9647.7147.84-fvisibility=hidden-fvisibility=hidden1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm

LAME MP3 Encoding

LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterLAME MP3 Encoding 3.100WAV To MP3GCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 315.7315.7318.6118.86-pipe-pipe1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm

m-queens

A solver for the N-queens problem with multi-threading support via the OpenMP library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is Betterm-queens 1.2Time To SolveGCC 8.2.0GCC 9.0.1510152025SE +/- 0.05, N = 3SE +/- 0.01, N = 320.3620.351. (CXX) g++ options: -fopenmp -O3 -mtune=native -mcpu=native -O2

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSigns Per Second, More Is BetterOpenSSL 1.1.1RSA 4096-bit PerformanceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc16003200480064008000SE +/- 44.37, N = 3SE +/- 14.69, N = 3SE +/- 34.93, N = 3SE +/- 12.55, N = 37514740769587063-Qunused-arguments-Qunused-arguments1. (CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMegapixels/sec, More Is Betterlibjpeg-turbo tjbench 1.5.3Test: Decompression ThroughputGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.06, N = 3106106109107-lm-lm-lm1. (CC) gcc options: -O3 -mtune=native -mcpu=native

CppPerformanceBenchmarks

CppPerformanceBenchmarks is a set of C++ compiler performance benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: AtolGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.03, N = 3SE +/- 0.05, N = 389.1789.2489.7589.491. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: CtypeGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc20406080100SE +/- 0.05, N = 3SE +/- 0.06, N = 3SE +/- 0.05, N = 3SE +/- 0.00, N = 375.0274.8666.2566.341. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov VectorGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc306090120150SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31421401311321. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Function ObjectsGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc510152025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 320.0019.7019.5319.501. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

OpenBenchmarking.orgSeconds, Fewer Is BetterCppPerformanceBenchmarks 9Test: Stepanov AbstractionGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc1326395265SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 359.9856.8953.0453.241. (CXX) g++ options: -O3 -mtune=native -mcpu=native -std=c++11

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPOPGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc400K800K1200K1600K2000KSE +/- 21388.79, N = 3SE +/- 5953.84, N = 3SE +/- 12153.33, N = 3SE +/- 18704.28, N = 1016859351714327180413217299361. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SADDGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc300K600K900K1200K1500KSE +/- 23389.59, N = 3SE +/- 13393.71, N = 11SE +/- 3347.83, N = 3SE +/- 13748.37, N = 1212179311298341137679512119441. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: LPUSHGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000KSE +/- 2969.35, N = 3SE +/- 11192.70, N = 3SE +/- 8833.10, N = 3SE +/- 5628.72, N = 37917888208788672348366621. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: GETGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc400K800K1200K1600K2000KSE +/- 24484.83, N = 3SE +/- 18962.05, N = 12SE +/- 12276.86, N = 3SE +/- 12435.42, N = 316041801618223172134616668541. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 4.0.8Test: SETGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc200K400K600K800K1000KSE +/- 9563.35, N = 3SE +/- 16643.13, N = 4SE +/- 18665.14, N = 3SE +/- 15331.74, N = 510171311096671111338410331151. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread

Xsbench

XSBench is a mini-app representing a key computational kernel of the Monte Carlo neutronics application OpenMC. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgLookups/s, More Is BetterXsbench 2017-07-06GCC 8.2.0GCC 9.0.11.2M2.4M3.6M4.8M6MSE +/- 58462.89, N = 3SE +/- 62432.57, N = 12556000754862201. (CC) gcc options: -std=gnu99 -fopenmp -O3 -lm

Memcached mcperf

This is a test of twmperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: AddGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 791.56, N = 3SE +/- 62.05, N = 3SE +/- 165.31, N = 3SE +/- 770.83, N = 4497813785349578501101. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: GetGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc16K32K48K64K80KSE +/- 368.57, N = 3SE +/- 479.89, N = 3SE +/- 103.90, N = 3SE +/- 753.43, N = 11740155628574061745471. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: SetGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 973.53, N = 3SE +/- 57.29, N = 3SE +/- 492.09, N = 3SE +/- 89.60, N = 3497443813549453496821. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: AppendGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 218.98, N = 3SE +/- 131.76, N = 3SE +/- 251.90, N = 3SE +/- 79.39, N = 3517563966651981522341. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: DeleteGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc16K32K48K64K80KSE +/- 838.42, N = 9SE +/- 202.38, N = 3SE +/- 902.55, N = 8SE +/- 113.11, N = 3745155659974409739181. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: PrependGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 241.03, N = 3SE +/- 154.01, N = 3SE +/- 710.17, N = 3SE +/- 1070.25, N = 3519723944252563530491. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

OpenBenchmarking.orgOperations Per Second, More Is BetterMemcached mcperf 1.5.10Method: ReplaceGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc11K22K33K44K55KSE +/- 796.15, N = 3SE +/- 31.59, N = 3SE +/- 246.26, N = 3SE +/- 509.02, N = 3525593954052200523431. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Hierarchical INTegration

This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: FLOATGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc50M100M150M200M250MSE +/- 14738.26, N = 3SE +/- 8234.46, N = 3SE +/- 13561.84, N = 3SE +/- 13328.34, N = 32206112662274079211744644191767402071. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

OpenBenchmarking.orgQUIPs, More Is BetterHierarchical INTegration 1.0Test: DOUBLEGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc120M240M360M480M600MSE +/- 30254.60, N = 3SE +/- 91261.04, N = 3SE +/- 149978.94, N = 3SE +/- 147477.11, N = 35358145315528397323777181933795612991. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm

Apache Benchmark

This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterApache Benchmark 2.4.29Static Web Page ServingGCC 8.2.0GCC 9.0.1Clang 7.0.1Clang 8.0.0-rc5K10K15K20K25KSE +/- 24.13, N = 3SE +/- 41.67, N = 3SE +/- 125.28, N = 3SE +/- 50.11, N = 3211102100821209214441. (CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native