POWER9 compiler benchmarking for a future article on Phoronix.
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
GCC 9.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Clang 7.0.1 OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Clang 8.0.0-rc OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Hierarchical INTegration This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: DOUBLE GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 120M 240M 360M 480M 600M SE +/- 91261.04, N = 3 SE +/- 30254.60, N = 3 SE +/- 147477.11, N = 3 SE +/- 149978.94, N = 3 552839732 535814531 379561299 377718193 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org QUIPs, More Is Better Hierarchical INTegration 1.0 Test: FLOAT GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 50M 100M 150M 200M 250M SE +/- 8234.46, N = 3 SE +/- 14738.26, N = 3 SE +/- 13328.34, N = 3 SE +/- 13561.84, N = 3 227407921 220611266 176740207 174464419 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
CacheBench This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test the memory and cache bandwidth performance Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read / Modify / Write GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 13K 26K 39K 52K 65K SE +/- 9.96, N = 3 SE +/- 0.03, N = 3 SE +/- 25.70, N = 3 SE +/- 13.10, N = 3 19793 21154 58871 58691 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Write GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 10K 20K 30K 40K 50K SE +/- 14.98, N = 3 SE +/- 0.52, N = 3 SE +/- 3.02, N = 3 SE +/- 18.02, N = 3 11147 11925 44754 44459 1. (CC) gcc options: -lrt
OpenBenchmarking.org MB/s, More Is Better CacheBench Test: Read GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 1000 2000 3000 4000 5000 SE +/- 0.00, N = 3 SE +/- 0.37, N = 3 SE +/- 0.08, N = 3 SE +/- 0.05, N = 3 4897 4782 4750 4750 1. (CC) gcc options: -lrt
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.1 Video Input: Summer Nature 4K GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 20 40 60 80 100 SE +/- 0.40, N = 3 SE +/- 1.78, N = 3 SE +/- 0.44, N = 3 SE +/- 0.92, N = 3 84.58 93.23 103.17 101.06 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
FLAC Audio Encoding This test times how long it takes to encode a sample WAV file to FLAC format five times. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better FLAC Audio Encoding 1.3.2 WAV To FLAC GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 11 22 33 44 55 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 SE +/- 0.01, N = 5 39.96 44.39 47.84 47.71 -fvisibility=hidden -fvisibility=hidden 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm
AOBench AOBench is a lightweight ambient occlusion renderer, written in C. The test profile is using a size of 2048 x 2048. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better AOBench Size: 2048 x 2048 - Total Time GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 20 40 60 80 100 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.04, N = 3 SE +/- 0.03, N = 3 59.03 58.83 75.60 74.95 1. (CC) gcc options: -lm -O3 -mtune=native -mcpu=native
GraphicsMagick This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Enhanced GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 30 60 90 120 150 154 154 22 23 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Noise-Gaussian GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 40 80 120 160 200 SE +/- 0.67, N = 3 157 159 15 15 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Sharpen GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 30 60 90 120 150 148 147 19 20 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Swirl GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 40 80 120 160 200 SE +/- 0.67, N = 3 165 164 43 43 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: HWB Color Space GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 40 80 120 160 200 SE +/- 0.33, N = 3 SE +/- 1.20, N = 3 185 183 118 118 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Resizing GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 40 80 120 160 200 SE +/- 0.58, N = 3 SE +/- 0.67, N = 3 177 178 88 89 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
OpenBenchmarking.org Iterations Per Minute, More Is Better GraphicsMagick 1.3.30 Operation: Rotate GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 40 80 120 160 200 197 197 159 159 -fopenmp -ldl -fopenmp -ldl 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread
x265 This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x265 3.0 H.265 1080p Video Encoding GCC 9.0.1 GCC 8.2.0 3 6 9 12 15 SE +/- 0.03, N = 3 SE +/- 0.09, N = 3 11.25 11.30 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: XZ 0 - Process: Compression GCC 9.0.1 GCC 8.2.0 5 10 15 20 25 22 21 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
dav1d Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better dav1d 0.1 Video Input: Summer Nature 1080p GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 7 14 21 28 35 SE +/- 0.39, N = 12 SE +/- 0.09, N = 3 SE +/- 0.18, N = 3 SE +/- 0.19, N = 3 27.69 28.93 32.19 31.38 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
ebizzy This is a test of ebizzy, a program to generate workloads resembling web server workloads. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Records/s, More Is Better ebizzy 0.3 GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 200K 400K 600K 800K 1000K SE +/- 17339.39, N = 12 SE +/- 20488.27, N = 12 SE +/- 13415.01, N = 3 SE +/- 18117.64, N = 3 1130592 1117793 1071048 1152232 1. (CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native
Apache Benchmark This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Apache Benchmark 2.4.29 Static Web Page Serving GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 5K 10K 15K 20K 25K SE +/- 41.67, N = 3 SE +/- 24.13, N = 3 SE +/- 50.11, N = 3 SE +/- 125.28, N = 3 21008 21110 21444 21209 1. (CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native
C-Ray This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 1.1 Total Time - 4K, 16 Rays Per Pixel GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 13 26 39 52 65 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.04, N = 3 SE +/- 0.05, N = 3 17.87 17.84 56.53 56.23 1. (CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Brotli 0 - Process: Compression GCC 9.0.1 GCC 8.2.0 70 140 210 280 350 317 322 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Decompression GCC 9.0.1 GCC 8.2.0 80 160 240 320 400 356 353 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Libdeflate 1 - Process: Compression GCC 9.0.1 GCC 8.2.0 30 60 90 120 150 115 115 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Decompression GCC 9.0.1 GCC 8.2.0 200 400 600 800 1000 SE +/- 0.33, N = 3 813 800 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org MB/s, More Is Better lzbench 2017-08-08 Test: Zstd 1 - Process: Compression GCC 9.0.1 GCC 8.2.0 60 120 180 240 300 280 286 1. (CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Add GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 11K 22K 33K 44K 55K SE +/- 62.05, N = 3 SE +/- 791.56, N = 3 SE +/- 770.83, N = 4 SE +/- 165.31, N = 3 37853 49781 50110 49578 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Composite GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 300 600 900 1200 1500 SE +/- 0.67, N = 3 SE +/- 0.23, N = 3 SE +/- 0.37, N = 3 SE +/- 6.10, N = 3 1173 1255 1593 1585 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Append GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 11K 22K 33K 44K 55K SE +/- 131.76, N = 3 SE +/- 218.98, N = 3 SE +/- 79.39, N = 3 SE +/- 251.90, N = 3 39666 51756 52234 51981 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Replace GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 11K 22K 33K 44K 55K SE +/- 31.59, N = 3 SE +/- 796.15, N = 3 SE +/- 509.02, N = 3 SE +/- 246.26, N = 3 39540 52559 52343 52200 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
OpenBenchmarking.org Operations Per Second, More Is Better Memcached mcperf 1.5.10 Method: Prepend GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 11K 22K 33K 44K 55K SE +/- 154.01, N = 3 SE +/- 241.03, N = 3 SE +/- 1070.25, N = 3 SE +/- 710.17, N = 3 39442 51972 53049 52563 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SADD GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 300K 600K 900K 1200K 1500K SE +/- 13393.71, N = 11 SE +/- 23389.59, N = 3 SE +/- 13748.37, N = 12 SE +/- 3347.83, N = 3 1298341 1217931 1211944 1376795 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenSSL OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Signs Per Second, More Is Better OpenSSL 1.1.1 RSA 4096-bit Performance GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 1600 3200 4800 6400 8000 SE +/- 14.69, N = 3 SE +/- 44.37, N = 3 SE +/- 12.55, N = 3 SE +/- 34.93, N = 3 7407 7514 7063 6958 -Qunused-arguments -Qunused-arguments 1. (CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl
Zstd Compression This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better Zstd Compression 1.3.4 Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 3 6 9 12 15 SE +/- 0.21, N = 3 SE +/- 0.24, N = 12 SE +/- 0.09, N = 3 SE +/- 0.16, N = 3 11.59 11.37 11.33 11.04 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: GET GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 400K 800K 1200K 1600K 2000K SE +/- 18962.05, N = 12 SE +/- 24484.83, N = 3 SE +/- 12435.42, N = 3 SE +/- 12276.86, N = 3 1618223 1604180 1666854 1721346 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 1 GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 4 8 12 16 20 SE +/- 0.11, N = 3 SE +/- 0.10, N = 3 SE +/- 0.09, N = 3 SE +/- 0.06, N = 3 17.80 17.71 17.83 17.92 1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
LAME MP3 Encoding LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better LAME MP3 Encoding 3.100 WAV To MP3 GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 5 10 15 20 25 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 15.73 15.73 18.86 18.61 -pipe -pipe 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPOP GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 400K 800K 1200K 1600K 2000K SE +/- 5953.84, N = 3 SE +/- 21388.79, N = 3 SE +/- 18704.28, N = 10 SE +/- 12153.33, N = 3 1714327 1685935 1729936 1804132 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: SET GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 200K 400K 600K 800K 1000K SE +/- 16643.13, N = 4 SE +/- 9563.35, N = 3 SE +/- 15331.74, N = 5 SE +/- 18665.14, N = 3 1096671 1017131 1033115 1113384 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
x264 This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Frames Per Second, More Is Better x264 2018-09-25 H.264 Video Encoding GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 12 24 36 48 60 SE +/- 0.57, N = 3 SE +/- 0.16, N = 3 SE +/- 0.35, N = 3 SE +/- 0.55, N = 3 52.15 53.75 51.98 52.34 1. (CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Redis Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Requests Per Second, More Is Better Redis 4.0.8 Test: LPUSH GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 200K 400K 600K 800K 1000K SE +/- 11192.70, N = 3 SE +/- 2969.35, N = 3 SE +/- 5628.72, N = 3 SE +/- 8833.10, N = 3 820878 791788 836662 867234 1. (CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Seconds, Fewer Is Better t-test1 2017-01-13 Threads: 2 GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 2 4 6 8 10 SE +/- 0.03, N = 3 SE +/- 0.02, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 6.99 6.65 7.18 6.88 1. (CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: Prim Trimesh GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 0.3825 0.765 1.1475 1.53 1.9125 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 1.54 1.54 1.69 1.70 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 136 Ragdolls GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 1.089 2.178 3.267 4.356 5.445 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 4.13 4.15 4.82 4.84 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Convex GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 8.45 8.22 8.66 8.73 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 1000 Stack GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 2 4 6 8 10 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 7.55 7.69 8.35 8.32 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
OpenBenchmarking.org Seconds, Fewer Is Better Bullet Physics Engine 2.81 Test: 3000 Fall GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 2 4 6 8 10 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 6.26 6.28 6.85 6.84 -lglut -lGL -lGLU -lglut -lGL -lGLU 1. (CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic
SciMark This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Foruier Transform, Jacobi Successive Over-relaxation, Monte Carlo, Sparse Matrix Multiply, and dense LU matrix factorization benchmarks. Learn more via the OpenBenchmarking.org test page.
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Jacobi Successive Over-Relaxation GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 300 600 900 1200 1500 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 SE +/- 0.01, N = 3 1248 1248 944 940 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Dense LU Matrix Factorization GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 1000 2000 3000 4000 5000 SE +/- 3.55, N = 3 SE +/- 0.98, N = 3 SE +/- 0.88, N = 3 SE +/- 30.81, N = 3 3042 3441 4509 4605 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Sparse Matrix Multiply GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 400 800 1200 1600 2000 SE +/- 0.28, N = 3 SE +/- 0.06, N = 3 SE +/- 0.79, N = 3 SE +/- 1.61, N = 3 1116 1119 1829 1724 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Fast Fourier Transform GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 70 140 210 280 350 SE +/- 0.49, N = 3 SE +/- 0.41, N = 3 SE +/- 0.73, N = 3 SE +/- 0.93, N = 3 308 307 305 305 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
OpenBenchmarking.org Mflops, More Is Better SciMark 2.0 Computational Test: Monte Carlo GCC 9.0.1 GCC 8.2.0 Clang 8.0.0-rc Clang 7.0.1 80 160 240 320 400 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 SE +/- 0.00, N = 3 SE +/- 0.01, N = 3 153 161 378 348 1. (CC) gcc options: -O3 -mtune=native -mcpu=native -lm
GCC 8.2.0 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 5 February 2019 20:23 by user phoronix.
GCC 9.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: --enable-checking=releaseProcessor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 05:10 by user phoronix.
Clang 7.0.1 Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 08:23 by user phoronix.
Clang 8.0.0-rc Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS=-O3-mtune=native-mcpu=native CFLAGS=-O3-mtune=native-mcpu=nativeCompiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 12:25 by user phoronix.