POWER9 compiler benchmarking for a future article on Phoronix.
GCC 8.2.0
Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: --enable-checking=release
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
GCC 9.0.1
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Clang 7.0.1
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Clang 8.0.0-rc
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Hierarchical INTegration
This test runs the U.S. Department of Energy's Ames Laboratory Hierarchical INTegration (HINT) benchmark.

Hierarchical INTegration 1.0, Test: DOUBLE (QUIPs, More Is Better)
Clang 7.0.1: 377718193 (SE +/- 149978.94, N = 3)
Clang 8.0.0-rc: 379561299 (SE +/- 147477.11, N = 3)
GCC 8.2.0: 535814531 (SE +/- 30254.60, N = 3)
GCC 9.0.1: 552839732 (SE +/- 91261.04, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm

Hierarchical INTegration 1.0, Test: FLOAT (QUIPs, More Is Better)
Clang 7.0.1: 174464419 (SE +/- 13561.84, N = 3)
Clang 8.0.0-rc: 176740207 (SE +/- 13328.34, N = 3)
GCC 8.2.0: 220611266 (SE +/- 14738.26, N = 3)
GCC 9.0.1: 227407921 (SE +/- 8234.46, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm
CacheBench
This is a performance test of CacheBench, which is part of LLCbench. CacheBench is designed to test memory and cache bandwidth performance.

CacheBench, Test: Read / Modify / Write (MB/s, More Is Better)
GCC 9.0.1: 19793 (SE +/- 9.96, N = 3)
GCC 8.2.0: 21154 (SE +/- 0.03, N = 3)
Clang 7.0.1: 58691 (SE +/- 13.10, N = 3)
Clang 8.0.0-rc: 58871 (SE +/- 25.70, N = 3)
(CC) gcc options: -lrt

CacheBench, Test: Write (MB/s, More Is Better)
GCC 9.0.1: 11147 (SE +/- 14.98, N = 3)
GCC 8.2.0: 11925 (SE +/- 0.52, N = 3)
Clang 7.0.1: 44459 (SE +/- 18.02, N = 3)
Clang 8.0.0-rc: 44754 (SE +/- 3.02, N = 3)
(CC) gcc options: -lrt

CacheBench, Test: Read (MB/s, More Is Better)
Clang 7.0.1: 4750 (SE +/- 0.05, N = 3)
Clang 8.0.0-rc: 4750 (SE +/- 0.08, N = 3)
GCC 8.2.0: 4782 (SE +/- 0.37, N = 3)
GCC 9.0.1: 4897 (SE +/- 0.00, N = 3)
(CC) gcc options: -lrt
dav1d
Dav1d is an open-source, speedy AV1 video decoder. This test profile times how long it takes to decode some sample AV1 video content.

dav1d 0.1, Video Input: Summer Nature 4K (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 103.17 (SE +/- 0.44, N = 3)
Clang 7.0.1: 101.06 (SE +/- 0.92, N = 3)
GCC 8.2.0: 93.23 (SE +/- 1.78, N = 3)
GCC 9.0.1: 84.58 (SE +/- 0.40, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
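Because some results here are "More Is Better" and others "Fewer Is Better", comparing compilers across tests needs a direction-aware ratio. The following is a minimal Python sketch of one way to do that, using the HINT DOUBLE and dav1d Summer Nature 4K figures reported above. The function name `relative_performance` and the choice of GCC 8.2.0 as the baseline are illustrative assumptions, not part of the Phoronix Test Suite.

```python
def relative_performance(results, baseline, higher_is_better):
    """Return each result relative to the baseline (1.0 = same speed, >1.0 = faster).

    Hypothetical helper for illustration; not a Phoronix Test Suite API.
    """
    base = results[baseline]
    if higher_is_better:
        return {name: value / base for name, value in results.items()}
    # For time-based results a smaller value is faster, so the ratio is inverted.
    return {name: base / value for name, value in results.items()}


# Values copied from the HINT DOUBLE result (QUIPs, More Is Better).
hint_double = {
    "Clang 7.0.1": 377718193,
    "Clang 8.0.0-rc": 379561299,
    "GCC 8.2.0": 535814531,
    "GCC 9.0.1": 552839732,
}

# Values copied from the dav1d Summer Nature 4K result (Seconds, Fewer Is Better).
dav1d_4k = {
    "Clang 8.0.0-rc": 103.17,
    "Clang 7.0.1": 101.06,
    "GCC 8.2.0": 93.23,
    "GCC 9.0.1": 84.58,
}

hint_rel = relative_performance(hint_double, "GCC 8.2.0", higher_is_better=True)
dav1d_rel = relative_performance(dav1d_4k, "GCC 8.2.0", higher_is_better=False)

for name in hint_double:
    print(f"{name}: HINT DOUBLE x{hint_rel[name]:.3f}, dav1d 4K x{dav1d_rel[name]:.3f}")
```

With both tests expressed as "higher is faster" ratios, the per-compiler figures can be compared or averaged without mixing units.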
FLAC Audio Encoding
This test times how long it takes to encode a sample WAV file to FLAC format five times.

FLAC Audio Encoding 1.3.2, WAV To FLAC (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 47.84 (SE +/- 0.01, N = 5)
Clang 7.0.1: 47.71 (SE +/- 0.01, N = 5)
GCC 8.2.0: 44.39 (SE +/- 0.01, N = 5)
GCC 9.0.1: 39.96 (SE +/- 0.01, N = 5)
Additional options listed on two results: -fvisibility=hidden
(CXX) g++ options: -O3 -mtune=native -mcpu=native -logg -lm
AOBench
AOBench is a lightweight ambient occlusion renderer, written in C. The test profile uses a size of 2048 x 2048.

AOBench, Size: 2048 x 2048 - Total Time (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 75.60 (SE +/- 0.04, N = 3)
Clang 7.0.1: 74.95 (SE +/- 0.03, N = 3)
GCC 9.0.1: 59.03 (SE +/- 0.01, N = 3)
GCC 8.2.0: 58.83 (SE +/- 0.00, N = 3)
(CC) gcc options: -lm -O3 -mtune=native -mcpu=native
GraphicsMagick
This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests to stress the system's CPU.

All GraphicsMagick results below share these options.
Additional options listed on two results: -fopenmp -ldl
(CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -ljbig -lwebp -lwebpmux -ltiff -ljpeg -lXext -lSM -lICE -lX11 -llzma -lbz2 -lz -lm -lpthread

GraphicsMagick 1.3.30, Operation: Enhanced (Iterations Per Minute, More Is Better)
Clang 8.0.0-rc: 22
Clang 7.0.1: 23
GCC 8.2.0: 154
GCC 9.0.1: 154

GraphicsMagick 1.3.30, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
Clang 7.0.1: 15
Clang 8.0.0-rc: 15
GCC 9.0.1: 157
GCC 8.2.0: 159
SE values shown: +/- 0.67, N = 3

GraphicsMagick 1.3.30, Operation: Sharpen (Iterations Per Minute, More Is Better)
Clang 8.0.0-rc: 19
Clang 7.0.1: 20
GCC 8.2.0: 147
GCC 9.0.1: 148

GraphicsMagick 1.3.30, Operation: Swirl (Iterations Per Minute, More Is Better)
Clang 7.0.1: 43
Clang 8.0.0-rc: 43
GCC 8.2.0: 164
GCC 9.0.1: 165
SE values shown: +/- 0.67, N = 3

GraphicsMagick 1.3.30, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
Clang 7.0.1: 118
Clang 8.0.0-rc: 118
GCC 8.2.0: 183
GCC 9.0.1: 185
SE values shown: +/- 1.20, N = 3; +/- 0.33, N = 3

GraphicsMagick 1.3.30, Operation: Resizing (Iterations Per Minute, More Is Better)
Clang 8.0.0-rc: 88
Clang 7.0.1: 89
GCC 9.0.1: 177
GCC 8.2.0: 178
SE values shown: +/- 0.58, N = 3; +/- 0.67, N = 3

GraphicsMagick 1.3.30, Operation: Rotate (Iterations Per Minute, More Is Better)
Clang 7.0.1: 159
Clang 8.0.0-rc: 159
GCC 8.2.0: 197
GCC 9.0.1: 197
x265
This is a simple test of the x265 encoder run on the CPU with a sample 1080p video file.

x265 3.0, H.265 1080p Video Encoding (Frames Per Second, More Is Better)
GCC 9.0.1: 11.25 (SE +/- 0.03, N = 3)
GCC 8.2.0: 11.30 (SE +/- 0.09, N = 3)
(CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic -lpthread -lrt -ldl -lnuma

lzbench 2017-08-08, Test: XZ 0 - Process: Compression (MB/s, More Is Better)
GCC 8.2.0: 21
GCC 9.0.1: 22
(CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3
dav1d 0.1, Video Input: Summer Nature 1080p (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 32.19 (SE +/- 0.18, N = 3)
Clang 7.0.1: 31.38 (SE +/- 0.19, N = 3)
GCC 8.2.0: 28.93 (SE +/- 0.09, N = 3)
GCC 9.0.1: 27.69 (SE +/- 0.39, N = 12)
(CC) gcc options: -O3 -mtune=native -mcpu=native -pthread
ebizzy
This is a test of ebizzy, a program to generate workloads resembling web server workloads.

ebizzy 0.3 (Records/s, More Is Better)
Clang 8.0.0-rc: 1071048 (SE +/- 13415.01, N = 3)
GCC 8.2.0: 1117793 (SE +/- 20488.27, N = 12)
GCC 9.0.1: 1130592 (SE +/- 17339.39, N = 12)
Clang 7.0.1: 1152232 (SE +/- 18117.64, N = 3)
(CC) gcc options: -pthread -lpthread -O3 -mtune=native -mcpu=native
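Each result in this file is reported as a mean with "SE +/- x, N = n", where N is the number of runs. As a rough illustration of how such a figure can be derived, the Python sketch below computes the standard error of the mean as the sample standard deviation divided by the square root of N. The per-run values are hypothetical (the result file only publishes the mean and SE), and whether the Phoronix Test Suite uses the sample or population standard deviation is an assumption here.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N).

    Assumes the sample (N - 1) standard deviation; the tool that produced
    the published figures may compute this differently.
    """
    return statistics.stdev(samples) / math.sqrt(len(samples))


# Hypothetical per-run Records/s values, NOT taken from the result file.
runs = [1050000.0, 1070000.0, 1093000.0]

mean = statistics.mean(runs)
se = standard_error(runs)

print(f"{mean:.0f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A small SE relative to the mean suggests the runs were consistent; when two compilers differ by less than a couple of standard errors, the ordering between them should be read with caution.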
Apache Benchmark
This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently.

Apache Benchmark 2.4.29, Static Web Page Serving (Requests Per Second, More Is Better)
GCC 9.0.1: 21008 (SE +/- 41.67, N = 3)
GCC 8.2.0: 21110 (SE +/- 24.13, N = 3)
Clang 7.0.1: 21209 (SE +/- 125.28, N = 3)
Clang 8.0.0-rc: 21444 (SE +/- 50.11, N = 3)
(CC) gcc options: -shared -fPIC -pthread -O3 -mtune=native -mcpu=native
C-Ray
This is a test of C-Ray, a simple raytracer designed to test the floating-point CPU performance. This test is multi-threaded (16 threads per core), will shoot 8 rays per pixel for anti-aliasing, and will generate a 1600 x 1200 image.

C-Ray 1.1, Total Time - 4K, 16 Rays Per Pixel (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 56.53 (SE +/- 0.04, N = 3)
Clang 7.0.1: 56.23 (SE +/- 0.05, N = 3)
GCC 9.0.1: 17.87 (SE +/- 0.01, N = 3)
GCC 8.2.0: 17.84 (SE +/- 0.01, N = 3)
(CC) gcc options: -lm -lpthread -O3 -mtune=native -mcpu=native

lzbench 2017-08-08, Test: Brotli 0 - Process: Compression (MB/s, More Is Better)
GCC 9.0.1: 317
GCC 8.2.0: 322
(CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 2017-08-08, Test: Libdeflate 1 - Process: Decompression (MB/s, More Is Better)
GCC 8.2.0: 353
GCC 9.0.1: 356
(CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 2017-08-08, Test: Libdeflate 1 - Process: Compression (MB/s, More Is Better)
GCC 8.2.0: 115
GCC 9.0.1: 115
(CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 2017-08-08, Test: Zstd 1 - Process: Decompression (MB/s, More Is Better)
GCC 8.2.0: 790 (SE +/- 9.00, N = 3)
GCC 9.0.1: 813 (SE +/- 0.33, N = 3)
(CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench 2017-08-08, Test: Zstd 1 - Process: Compression (MB/s, More Is Better)
GCC 9.0.1: 280
GCC 8.2.0: 285
SE values shown: +/- 1.00, N = 3
(CXX) g++ options: -lrt -static -lpthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Memcached mcperf 1.5.10, Method: Add (Operations Per Second, More Is Better)
GCC 9.0.1: 37853 (SE +/- 62.05, N = 3)
Clang 7.0.1: 49578 (SE +/- 165.31, N = 3)
GCC 8.2.0: 49781 (SE +/- 791.56, N = 3)
Clang 8.0.0-rc: 50110 (SE +/- 770.83, N = 4)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
SciMark
This test runs the ANSI C version of SciMark 2.0, which is a benchmark for scientific and numerical computing developed by programmers at the National Institute of Standards and Technology. This test is made up of Fast Fourier Transform, Jacobi Successive Over-Relaxation, Monte Carlo, Sparse Matrix Multiply, and Dense LU Matrix Factorization benchmarks.

SciMark 2.0, Computational Test: Composite (Mflops, More Is Better)
GCC 9.0.1: 1173 (SE +/- 0.67, N = 3)
GCC 8.2.0: 1255 (SE +/- 0.23, N = 3)
Clang 7.0.1: 1585 (SE +/- 6.10, N = 3)
Clang 8.0.0-rc: 1593 (SE +/- 0.37, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm

Memcached mcperf 1.5.10, Method: Append (Operations Per Second, More Is Better)
GCC 9.0.1: 39666 (SE +/- 131.76, N = 3)
GCC 8.2.0: 51756 (SE +/- 218.98, N = 3)
Clang 7.0.1: 51981 (SE +/- 251.90, N = 3)
Clang 8.0.0-rc: 52234 (SE +/- 79.39, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf 1.5.10, Method: Replace (Operations Per Second, More Is Better)
GCC 9.0.1: 39540 (SE +/- 31.59, N = 3)
Clang 7.0.1: 52200 (SE +/- 246.26, N = 3)
Clang 8.0.0-rc: 52343 (SE +/- 509.02, N = 3)
GCC 8.2.0: 52559 (SE +/- 796.15, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic

Memcached mcperf 1.5.10, Method: Prepend (Operations Per Second, More Is Better)
GCC 9.0.1: 39442 (SE +/- 154.01, N = 3)
GCC 8.2.0: 51972 (SE +/- 241.03, N = 3)
Clang 7.0.1: 52563 (SE +/- 710.17, N = 3)
Clang 8.0.0-rc: 53049 (SE +/- 1070.25, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm -rdynamic
Redis
Redis is an open-source data structure server.

Redis 4.0.8, Test: SADD (Requests Per Second, More Is Better)
Clang 8.0.0-rc: 1211944 (SE +/- 13748.37, N = 12)
GCC 8.2.0: 1217931 (SE +/- 23389.59, N = 3)
GCC 9.0.1: 1298341 (SE +/- 13393.71, N = 11)
Clang 7.0.1: 1376795 (SE +/- 3347.83, N = 3)
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
OpenSSL
OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test measures the RSA 4096-bit performance of OpenSSL.

OpenSSL 1.1.1, RSA 4096-bit Performance (Signs Per Second, More Is Better)
Clang 7.0.1: 6958 (SE +/- 34.93, N = 3)
Clang 8.0.0-rc: 7063 (SE +/- 12.55, N = 3)
GCC 9.0.1: 7407 (SE +/- 14.69, N = 3)
GCC 8.2.0: 7514 (SE +/- 44.37, N = 3)
Additional options listed on two results: -Qunused-arguments
(CC) gcc options: -pthread -m64 -O3 -mtune=native -mcpu=native -lssl -lcrypto -ldl
Zstd Compression
This test measures the time needed to compress a sample file (an Ubuntu file-system image) using Zstd compression.

Zstd Compression 1.3.4, Compressing ubuntu-16.04.3-server-i386.img, Compression Level 19 (Seconds, Fewer Is Better)
GCC 9.0.1: 11.59 (SE +/- 0.21, N = 3)
GCC 8.2.0: 11.37 (SE +/- 0.24, N = 12)
Clang 8.0.0-rc: 11.33 (SE +/- 0.09, N = 3)
Clang 7.0.1: 11.04 (SE +/- 0.16, N = 3)
(CC) gcc options: -O3 -mtune=native -mcpu=native -pthread -lz -llzma
Redis 4.0.8, Test: GET (Requests Per Second, More Is Better)
GCC 8.2.0: 1604180 (SE +/- 24484.83, N = 3)
GCC 9.0.1: 1618223 (SE +/- 18962.05, N = 12)
Clang 8.0.0-rc: 1666854 (SE +/- 12435.42, N = 3)
Clang 7.0.1: 1721346 (SE +/- 12276.86, N = 3)
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1
This is a test of t-test1 for basic memory allocator benchmarks. Note this test profile is currently very basic and the overall time does include the warmup time of the custom t-test1 compilation. Improvements welcome.

t-test1 2017-01-13, Threads: 1 (Seconds, Fewer Is Better)
Clang 7.0.1: 17.92 (SE +/- 0.06, N = 3)
Clang 8.0.0-rc: 17.83 (SE +/- 0.09, N = 3)
GCC 9.0.1: 17.80 (SE +/- 0.11, N = 3)
GCC 8.2.0: 17.71 (SE +/- 0.10, N = 3)
(CC) gcc options: -pthread -O3 -mtune=native -mcpu=native
LAME MP3 Encoding
LAME is an MP3 encoder licensed under the LGPL. This test measures the time required to encode a WAV file to MP3 format.

LAME MP3 Encoding 3.100, WAV To MP3 (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 18.86 (SE +/- 0.00, N = 3)
Clang 7.0.1: 18.61 (SE +/- 0.01, N = 3)
GCC 9.0.1: 15.73 (SE +/- 0.01, N = 3)
GCC 8.2.0: 15.73 (SE +/- 0.01, N = 3)
Additional options listed on two results: -pipe
(CC) gcc options: -O3 -mtune=native -mcpu=native -lncurses -lm
Redis 4.0.8, Test: LPOP (Requests Per Second, More Is Better)
GCC 8.2.0: 1685935 (SE +/- 21388.79, N = 3)
GCC 9.0.1: 1714327 (SE +/- 5953.84, N = 3)
Clang 8.0.0-rc: 1729936 (SE +/- 18704.28, N = 10)
Clang 7.0.1: 1804132 (SE +/- 12153.33, N = 3)
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
Redis 4.0.8, Test: SET (Requests Per Second, More Is Better)
GCC 8.2.0: 1017131 (SE +/- 9563.35, N = 3)
Clang 8.0.0-rc: 1033115 (SE +/- 15331.74, N = 5)
GCC 9.0.1: 1096671 (SE +/- 16643.13, N = 4)
Clang 7.0.1: 1113384 (SE +/- 18665.14, N = 3)
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
x264
This is a simple test of the x264 encoder run on the CPU (OpenCL support disabled) with a sample video file.

x264 2018-09-25, H.264 Video Encoding (Frames Per Second, More Is Better)
Clang 8.0.0-rc: 51.98 (SE +/- 0.35, N = 3)
GCC 9.0.1: 52.15 (SE +/- 0.57, N = 3)
Clang 7.0.1: 52.34 (SE +/- 0.55, N = 3)
GCC 8.2.0: 53.75 (SE +/- 0.16, N = 3)
(CC) gcc options: -ldl -lm -lpthread -O3 -ffast-math -mtune=native -mcpu=native -maltivec -mabi=altivec -mvsx -std=gnu99 -fPIC -fomit-frame-pointer -fno-tree-vectorize
Redis 4.0.8, Test: LPUSH (Requests Per Second, More Is Better)
GCC 8.2.0: 791788 (SE +/- 2969.35, N = 3)
GCC 9.0.1: 820878 (SE +/- 11192.70, N = 3)
Clang 8.0.0-rc: 836662 (SE +/- 5628.72, N = 3)
Clang 7.0.1: 867234 (SE +/- 8833.10, N = 3)
(CC) gcc options: -ggdb -rdynamic -lm -ldl -pthread
t-test1 2017-01-13, Threads: 2 (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 7.18 (SE +/- 0.04, N = 3)
GCC 9.0.1: 6.99 (SE +/- 0.03, N = 3)
Clang 7.0.1: 6.88 (SE +/- 0.09, N = 3)
GCC 8.2.0: 6.65 (SE +/- 0.02, N = 3)
(CC) gcc options: -pthread -O3 -mtune=native -mcpu=native

All Bullet Physics Engine results below share these options.
Additional options listed on two results: -lglut -lGL -lGLU
(CXX) g++ options: -O3 -mtune=native -mcpu=native -rdynamic

Bullet Physics Engine 2.81, Test: Prim Trimesh (Seconds, Fewer Is Better)
Clang 7.0.1: 1.70 (SE +/- 0.00, N = 3)
Clang 8.0.0-rc: 1.69 (SE +/- 0.00, N = 3)
GCC 9.0.1: 1.54 (SE +/- 0.00, N = 3)
GCC 8.2.0: 1.54 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 136 Ragdolls (Seconds, Fewer Is Better)
Clang 7.0.1: 4.84 (SE +/- 0.00, N = 3)
Clang 8.0.0-rc: 4.82 (SE +/- 0.00, N = 3)
GCC 8.2.0: 4.15 (SE +/- 0.00, N = 3)
GCC 9.0.1: 4.13 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 1000 Convex (Seconds, Fewer Is Better)
Clang 7.0.1: 8.73 (SE +/- 0.01, N = 3)
Clang 8.0.0-rc: 8.66 (SE +/- 0.01, N = 3)
GCC 9.0.1: 8.45 (SE +/- 0.01, N = 3)
GCC 8.2.0: 8.22 (SE +/- 0.01, N = 3)

Bullet Physics Engine 2.81, Test: 1000 Stack (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 8.35 (SE +/- 0.00, N = 3)
Clang 7.0.1: 8.32 (SE +/- 0.00, N = 3)
GCC 8.2.0: 7.69 (SE +/- 0.01, N = 3)
GCC 9.0.1: 7.55 (SE +/- 0.00, N = 3)

Bullet Physics Engine 2.81, Test: 3000 Fall (Seconds, Fewer Is Better)
Clang 8.0.0-rc: 6.85 (SE +/- 0.01, N = 3)
Clang 7.0.1: 6.84 (SE +/- 0.01, N = 3)
GCC 8.2.0: 6.28 (SE +/- 0.01, N = 3)
GCC 9.0.1: 6.26 (SE +/- 0.01, N = 3)
All SciMark results below share these options.
(CC) gcc options: -O3 -mtune=native -mcpu=native -lm

SciMark 2.0, Computational Test: Jacobi Successive Over-Relaxation (Mflops, More Is Better)
Clang 7.0.1: 940 (SE +/- 0.01, N = 3)
Clang 8.0.0-rc: 944 (SE +/- 0.01, N = 3)
GCC 8.2.0: 1248 (SE +/- 0.01, N = 3)
GCC 9.0.1: 1248 (SE +/- 0.06, N = 3)

SciMark 2.0, Computational Test: Dense LU Matrix Factorization (Mflops, More Is Better)
GCC 9.0.1: 3042 (SE +/- 3.55, N = 3)
GCC 8.2.0: 3441 (SE +/- 0.98, N = 3)
Clang 8.0.0-rc: 4509 (SE +/- 0.88, N = 3)
Clang 7.0.1: 4605 (SE +/- 30.81, N = 3)

SciMark 2.0, Computational Test: Sparse Matrix Multiply (Mflops, More Is Better)
GCC 9.0.1: 1116 (SE +/- 0.28, N = 3)
GCC 8.2.0: 1119 (SE +/- 0.06, N = 3)
Clang 7.0.1: 1724 (SE +/- 1.61, N = 3)
Clang 8.0.0-rc: 1829 (SE +/- 0.79, N = 3)

SciMark 2.0, Computational Test: Fast Fourier Transform (Mflops, More Is Better)
Clang 7.0.1: 305 (SE +/- 0.93, N = 3)
Clang 8.0.0-rc: 305 (SE +/- 0.73, N = 3)
GCC 8.2.0: 307 (SE +/- 0.41, N = 3)
GCC 9.0.1: 308 (SE +/- 0.49, N = 3)

SciMark 2.0, Computational Test: Monte Carlo (Mflops, More Is Better)
GCC 9.0.1: 153 (SE +/- 0.03, N = 3)
GCC 8.2.0: 161 (SE +/- 0.05, N = 3)
Clang 7.0.1: 348 (SE +/- 0.01, N = 3)
Clang 8.0.0-rc: 378 (SE +/- 0.00, N = 3)
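The SciMark Composite figure can be cross-checked against the five kernel results. The Python sketch below assumes the composite is the arithmetic mean of the five kernel scores, which the rounded figures in this file are consistent with. The kernel values are copied from the results above; the dictionary layout and the one-Mflops tolerance (to absorb rounding of the published integers) are illustrative choices.

```python
# Kernel scores in Mflops, copied from the SciMark 2.0 results in this file.
# Order: SOR, Dense LU, Sparse Matrix Multiply, FFT, Monte Carlo.
kernels = {
    "GCC 9.0.1": [1248, 3042, 1116, 308, 153],
    "GCC 8.2.0": [1248, 3441, 1119, 307, 161],
    "Clang 7.0.1": [940, 4605, 1724, 305, 348],
    "Clang 8.0.0-rc": [944, 4509, 1829, 305, 378],
}

# Composite scores as reported in this file.
reported_composite = {
    "GCC 9.0.1": 1173,
    "GCC 8.2.0": 1255,
    "Clang 7.0.1": 1585,
    "Clang 8.0.0-rc": 1593,
}

# Assumption: composite = arithmetic mean of the five kernel scores.
computed = {name: sum(scores) / len(scores) for name, scores in kernels.items()}

for name, value in computed.items():
    delta = value - reported_composite[name]
    print(f"{name}: computed {value:.1f}, reported {reported_composite[name]}, delta {delta:+.1f}")
```

Because the composite is an unweighted mean, the Dense LU kernel (the largest in absolute Mflops) dominates it, which is why Clang leads the composite despite GCC being ahead on Jacobi Successive Over-Relaxation.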
GCC 8.2.0
Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 8.2.0 + clang (GCC) 8.2.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: --enable-checking=release
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 5 February 2019 20:23 by user phoronix.
GCC 9.0.1
Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: GCC 9.0.1 20190203 + clang (GCC) 9.0.1 20190203 (experimental), File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: --enable-checking=release
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 05:10 by user phoronix.
Clang 7.0.1
Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 7.0.1 + LLVM 7.0.1, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 08:23 by user phoronix.
Clang 8.0.0-rc
Processor: POWER9 @ 3.80GHz (44 Cores / 176 Threads), Motherboard: PowerNV T2P9D01 REV 1.01, Memory: 65536MB, Disk: Samsung SSD 960 EVO 500GB + 2000GB Seagate ST2000DM006-2DM1, Graphics: ASPEED Family, Network: 2 x Broadcom NetXtreme BCM5719 PCIe
OS: Ubuntu 19.04, Kernel: 4.18.0-11-generic (ppc64le), Compiler: Clang 8.0.0 + LLVM 8.0.0, File-System: ext4, Screen Resolution: 1024x768
Environment Notes: CXXFLAGS="-O3 -mtune=native -mcpu=native" CFLAGS="-O3 -mtune=native -mcpu=native"
Compiler Notes: Optimized build; Default target: powerpc64le-unknown-linux-gnu; Host CPU: pwr9
Processor Notes: Scaling Governor: powernv-cpufreq ondemand
Testing initiated at 6 February 2019 12:25 by user phoronix.