Clear Linux Cascadelake JCC Microcode

Benchmarks for a future article on the JCC erratum.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 1911113-HU-CLEARLINU55
Runs

Old ucode: November 11 2019, test duration 2 Hours, 20 Minutes
New ucode: November 10 2019, test duration 2 Hours, 40 Minutes
New ucode + Assembler: November 11 2019, test duration 2 Hours, 8 Minutes
Average test duration: 2 Hours, 23 Minutes



Clear Linux Cascadelake JCC Microcode: System Details

Runs: Old ucode, New ucode, New ucode + Assembler

Processor: 2 x Intel Xeon Platinum 8280 @ 4.00GHz (56 Cores / 112 Threads)
Motherboard: GIGABYTE MD61-SC2-00 v01000100 (T15 BIOS)
Chipset: Intel Sky Lake-E DMI3 Registers
Memory: 386048MB
Disk: 280GB INTEL SSDPED1D280GA
Graphics: llvmpipe 377GB
Monitor: VE228
Network: 2 x Intel X722 for 1GbE + 2 x QLogic FastLinQ QL41000 10/25/40/50GbE
OS: Clear Linux OS 31470
Kernel: 5.3.8-854.native (x86_64)
Desktop: GNOME Shell 3.34.1
Display Server: X Server 1.20.5
Display Driver: modesetting 1.20.5
OpenGL: 3.3 Mesa 19.3.0-devel (LLVM 9.0.0 256 bits)
Compiler: GCC 9.2.1 20191101 gcc-9-branch@277702 + Clang 9.0.0 + LLVM 9.0.0
File-System: ext4
Screen Resolution: 1920x1080

Values that differ in a later run:
OS: Clear Linux OS 31480
Compiler: GCC 9.2.1 20191103 gcc-9-branch@277748 + Clang 9.0.0 + LLVM 9.0.0

Environment Details
- CFFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,-sort-common -Wl,--enable-new-dtags"
- FFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -malign-data=abi -fno-semantic-interposition -ftree-vectorize -ftree-loop-vectorize -Wl,--enable-new-dtags"
- CXXFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake -fvisibility-inlines-hidden -Wl,--enable-new-dtags"
- MESA_GLSL_CACHE_DISABLE=0
- CFLAGS="-g -O3 -feliminate-unused-debug-types -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=32 -Wformat -Wformat-security -m64 -fasynchronous-unwind-tables -Wp,-D_REENTRANT -ftree-loop-distribute-patterns -Wl,-z -Wl,now -Wl,-z -Wl,relro -fno-semantic-interposition -ffat-lto-objects -fno-trapping-math -Wl,-sort-common -Wl,--enable-new-dtags -mtune=skylake"
- THEANO_FLAGS="floatX=float32,openmp=true,gcc.cxxflags="-ftree-vectorize -mavx""

Compiler Details
- --build=x86_64-generic-linux --disable-libmpx --disable-libunwind-exceptions --disable-multiarch --disable-vtable-verify --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-clocale=gnu --enable-default-pie --enable-gnu-indirect-function --enable-languages=c,c++,fortran,go --enable-ld=default --enable-libstdcxx-pch --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --exec-prefix=/usr --includedir=/usr/include --target=x86_64-generic-linux --with-arch=westmere --with-gcc-major-version-only --with-glibc-version=2.19 --with-gnu-ld --with-isl --with-ppl=yes --with-tune=haswell

Processor Details
- Scaling Governor: intel_pstate performance

Java Details
- OpenJDK Runtime Environment (build 1.8.0-u232-ga-b00)

Python Details
- Python 3.7.5

Security Details
- l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling

[Figure: Result Overview (Phoronix Test Suite). Relative performance of Old ucode, New ucode, and New ucode + Assembler on a 100% to 109% scale for Apache Benchmark, PHPBench, PyBench, GraphicsMagick, QMCPACK, Redis, libjpeg-turbo tjbench, Facebook RocksDB, SVT-VP9, NAS Parallel Benchmarks, rav1e, Memcached mcperf, Embree, OSPray, GROMACS, and NAMD.]

Clear Linux Cascadelake JCC Microcode: Results (OpenBenchmarking.org)

Test | Old ucode | New ucode | New ucode + Assembler
npb: EP.D | 6537.22 | 6403.29 | 6507.45
namd: ATPase Simulation - 327,506 Atoms | 0.36203 | 0.36580 | 0.36380
qmcpack: | 3879.8 | 3924.6 | 3902.3
graphics-magick: Swirl | 1587 | 1579 | 1569
graphics-magick: Rotate | 719 | 674 | 699
graphics-magick: Sharpen | 587 | 484 | 582
graphics-magick: Enhanced | 896 | 871 | 891
graphics-magick: Resizing | 2095 | 2040 | 2024
graphics-magick: Noise-Gaussian | 664 | 663 | 643
graphics-magick: HWB Color Space | 1152 | 1163 | 1056
ospray: Magnetic Reconnection - SciVis | 76.92 | 76.24 | 76.01
embree: Pathtracer - Asian Dragon | 60.5069 | 59.8837 | 60.1037
embree: Pathtracer ISPC - Asian Dragon | 71.7565 | 70.6862 | 72.0990
rav1e: 1080p To AV1 Video Encode | 0.913 | 0.901 | 0.895
svt-vp9: VMAF Optimized - Bosphorus 1080p | 343.57 | 340.47 | 344.90
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 358.75 | 345.65 | 348.87
tjbench: Decompression Throughput | 181.896175 | 177.373708 | 183.402760
gromacs: Water Benchmark | 5.859 | 5.790 | 5.845
redis: SET | 1880693.73 | 1809059.85 | 1819823.37
rocksdb: Rand Fill | 409287 | 403069 | 405800
rocksdb: Seq Fill | 418776 | 414795 | 415584
rocksdb: Read While Writing | 6196987 | 5873021 | 5735669
mcperf: Add | 62995.2 | 61271.0 | 61713.4
mcperf: Get | 57144.3 | 57139.6 | 56098.0
mcperf: Set | 62591.3 | 61768.3 | 62221.6
pybench: Total For Average Test Times | 1024 | 1082 | 1033
apache: Static Web Page Serving | 28072.82 | 28149.37 | 25832.31
phpbench: PHP Benchmark Suite | 926891 | 853333 | 880797
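Because the tests mix "more is better" and "fewer is better" units, the raw numbers above are easier to compare as a percentage change relative to the Old ucode run. The following is a minimal sketch of that calculation, not how the Phoronix Test Suite itself computes its overview. The function names and the handful of tests chosen are illustrative assumptions; the values are copied from this result file.

```python
# Values copied from this result file; the selection of tests is arbitrary.
# Layout: test -> (old_ucode, new_ucode, new_ucode_assembler, higher_is_better)
RESULTS = {
    "graphics-magick: Sharpen": (587, 484, 582, True),
    "pybench: Total For Average Test Times": (1024, 1082, 1033, False),
    "phpbench: PHP Benchmark Suite": (926891, 853333, 880797, True),
    "apache: Static Web Page Serving": (28072.82, 28149.37, 25832.31, True),
}


def relative_change(baseline, value, higher_is_better):
    """Percent change versus baseline; a positive result means better performance."""
    if higher_is_better:
        return (value / baseline - 1.0) * 100.0
    # For time-like metrics a smaller value is an improvement, so invert the ratio.
    return (baseline / value - 1.0) * 100.0


def summarize(results):
    """Map each test to (new ucode %, new ucode + assembler %) versus old ucode."""
    rows = {}
    for test, (old, new, asm, higher_is_better) in results.items():
        rows[test] = (
            relative_change(old, new, higher_is_better),
            relative_change(old, asm, higher_is_better),
        )
    return rows


if __name__ == "__main__":
    for test, (new_pct, asm_pct) in summarize(RESULTS).items():
        print(f"{test}: new ucode {new_pct:+.1f}%, new ucode + assembler {asm_pct:+.1f}%")
```

Normalizing the sign this way means a negative number is always a regression, regardless of the unit the test reports.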

NAS Parallel Benchmarks

NPB, the NAS Parallel Benchmarks, is a benchmark suite developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB and lets you select among the different NPB tests/problems and problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4, Test / Class: EP.D (Total Mop/s, More Is Better)
Old ucode: 6537.22 (SE +/- 87.31, N = 4; Min: 6435.14 / Avg: 6537.22 / Max: 6798.29)
New ucode: 6403.29 (SE +/- 86.47, N = 3; Min: 6303.11 / Avg: 6403.29 / Max: 6575.46)
New ucode + Assembler: 6507.45 (SE +/- 93.39, N = 3; Min: 6403.02 / Avg: 6507.45 / Max: 6693.77)
1. (F9X) gfortran options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
2. 3.2
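Each result reports a standard error (SE) alongside the mean, which gives a rough way to judge whether a gap between two runs is larger than run-to-run noise. The sketch below divides the difference of two means by their combined standard error. This is a simple heuristic chosen for illustration, not a method used by the result page, and with N as small as 3 it should be read loosely. The function name is hypothetical; the numbers come from this result file.

```python
import math


def difference_in_se_units(mean_a, se_a, mean_b, se_b):
    """Absolute difference of two means divided by the combined standard error."""
    combined_se = math.sqrt(se_a ** 2 + se_b ** 2)
    return abs(mean_a - mean_b) / combined_se


# NPB EP.D, Old ucode vs New ucode: the gap is close to one combined SE.
npb_ratio = difference_in_se_units(6537.22, 87.31, 6403.29, 86.47)

# GraphicsMagick Sharpen, Old ucode vs New ucode: the gap is many combined SEs.
sharpen_ratio = difference_in_se_units(587, 2.03, 484, 1.86)

if __name__ == "__main__":
    print(f"NPB EP.D: {npb_ratio:.2f} combined SEs apart")
    print(f"GraphicsMagick Sharpen: {sharpen_ratio:.2f} combined SEs apart")
```

A ratio near 1 suggests the two runs are hard to tell apart from noise, while a ratio in the tens points to a real difference.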

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.13b1, ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better)
Old ucode: 0.36203 (SE +/- 0.00033, N = 10; Min: 0.36 / Avg: 0.36 / Max: 0.36)
New ucode: 0.36580 (SE +/- 0.00299, N = 3; Min: 0.36 / Avg: 0.37 / Max: 0.37)
New ucode + Assembler: 0.36380 (SE +/- 0.00094, N = 3; Min: 0.36 / Avg: 0.36 / Max: 0.37)

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code. This benchmark uses MPI and runs the H2O example code. Learn more via the OpenBenchmarking.org test page.

QMCPACK 3.8 (Total Execution Time - Seconds, Fewer Is Better)
Old ucode: 3879.8
New ucode: 3924.6
New ucode + Assembler: 3902.3
1. (CXX) g++ options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -fopenmp -fomit-frame-pointer -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -ffast-math -lm

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample 6000x4000 pixel JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.33, Operation: Swirl (Iterations Per Minute, More Is Better)
Old ucode: 1587 (SE +/- 6.17, N = 3; Min: 1580 / Avg: 1586.67 / Max: 1599)
New ucode: 1579 (SE +/- 12.67, N = 3; Min: 1554 / Avg: 1578.67 / Max: 1596)
New ucode + Assembler: 1569 (SE +/- 9.28, N = 3; Min: 1551 / Avg: 1569.33 / Max: 1581)

GraphicsMagick 1.3.33, Operation: Rotate (Iterations Per Minute, More Is Better)
Old ucode: 719 (SE +/- 6.33, N = 3; Min: 707 / Avg: 719.33 / Max: 728)
New ucode: 674 (SE +/- 10.54, N = 3; Min: 662 / Avg: 674 / Max: 695)
New ucode + Assembler: 699 (SE +/- 10.73, N = 3; Min: 678 / Avg: 699.33 / Max: 712)

GraphicsMagick 1.3.33, Operation: Sharpen (Iterations Per Minute, More Is Better)
Old ucode: 587 (SE +/- 2.03, N = 3; Min: 584 / Avg: 587.33 / Max: 591)
New ucode: 484 (SE +/- 1.86, N = 3; Min: 480 / Avg: 483.67 / Max: 486)
New ucode + Assembler: 582 (SE +/- 2.73, N = 3; Min: 577 / Avg: 582.33 / Max: 586)

GraphicsMagick 1.3.33, Operation: Enhanced (Iterations Per Minute, More Is Better)
Old ucode: 896 (SE +/- 1.53, N = 3; Min: 893 / Avg: 896 / Max: 898)
New ucode: 871 (SE +/- 1.00, N = 3; Min: 870 / Avg: 871 / Max: 873)
New ucode + Assembler: 891 (SE +/- 2.60, N = 3; Min: 886 / Avg: 890.67 / Max: 895)

GraphicsMagick 1.3.33, Operation: Resizing (Iterations Per Minute, More Is Better)
Old ucode: 2095 (SE +/- 2.08, N = 3; Min: 2091 / Avg: 2095 / Max: 2098)
New ucode: 2040 (SE +/- 30.93, N = 3; Min: 1981 / Avg: 2039.67 / Max: 2086)
New ucode + Assembler: 2024 (SE +/- 33.31, N = 3; Min: 1960 / Avg: 2024 / Max: 2072)

GraphicsMagick 1.3.33, Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
Old ucode: 664 (SE +/- 0.33, N = 3; Min: 664 / Avg: 664.33 / Max: 665)
New ucode: 663 (SE +/- 8.16, N = 4; Min: 641 / Avg: 662.75 / Max: 680)
New ucode + Assembler: 643 (SE +/- 6.23, N = 3; Min: 631 / Avg: 643.33 / Max: 651)

GraphicsMagick 1.3.33, Operation: HWB Color Space (Iterations Per Minute, More Is Better)
Old ucode: 1152 (SE +/- 9.26, N = 3; Min: 1135 / Avg: 1151.67 / Max: 1167)
New ucode: 1163 (SE +/- 13.89, N = 15; Min: 1085 / Avg: 1162.8 / Max: 1231)
New ucode + Assembler: 1056 (SE +/- 0.67, N = 3; Min: 1055 / Avg: 1056.33 / Max: 1057)

1. (CC) gcc options (all GraphicsMagick operations): -fopenmp -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -pthread -ltiff -lfreetype -ljpeg -lXext -lSM -lICE -lX11 -lbz2 -lxml2 -lz -lm -lpthread

OSPray

Intel OSPray is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPray builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPray 1.8.5, Demo: Magnetic Reconnection - Renderer: SciVis (FPS, More Is Better)
Old ucode: 76.92 (SE +/- 0.00, N = 12; Min: 76.92 / Avg: 76.92 / Max: 76.92 across runs; reported MIN: 52.63)
New ucode: 76.24 (SE +/- 0.69, N = 8; Min: 71.43 / Avg: 76.24 / Max: 76.92 across runs; reported MIN: 52.63 / MAX: 76.92)
New ucode + Assembler: 76.01 (SE +/- 0.92, N = 6; Min: 71.43 / Avg: 76.01 / Max: 76.92 across runs; reported MIN: 55.56 / MAX: 76.92)

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs. Learn more via the OpenBenchmarking.org test page.

Embree 3.6.1, Binary: Pathtracer - Model: Asian Dragon (Frames Per Second, More Is Better)
Old ucode: 60.51 (SE +/- 0.46, N = 3; Min: 59.6 / Avg: 60.51 / Max: 61.07 across runs; reported MIN: 58.32 / MAX: 62.62)
New ucode: 59.88 (SE +/- 0.86, N = 4; Min: 57.63 / Avg: 59.88 / Max: 61.47 across runs; reported MIN: 55.58 / MAX: 62.86)
New ucode + Assembler: 60.10 (SE +/- 0.69, N = 3; Min: 58.73 / Avg: 60.1 / Max: 60.82 across runs; reported MIN: 57.24 / MAX: 62.14)

Embree 3.6.1, Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
Old ucode: 71.76 (SE +/- 1.20, N = 3; Min: 69.36 / Avg: 71.76 / Max: 73.12 across runs; reported MIN: 66.45 / MAX: 74.99)
New ucode: 70.69 (SE +/- 0.71, N = 3; Min: 69.79 / Avg: 70.69 / Max: 72.1 across runs; reported MIN: 67.89 / MAX: 74.49)
New ucode + Assembler: 72.10 (SE +/- 0.21, N = 3; Min: 71.68 / Avg: 72.1 / Max: 72.31 across runs; reported MIN: 68.93 / MAX: 74.37)

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

rav1e 0.1, 1080p To AV1 Video Encode (Frames Per Second, More Is Better)
Old ucode: 0.913 (SE +/- 0.002, N = 3; Min: 0.91 / Avg: 0.91 / Max: 0.92)
New ucode: 0.901 (SE +/- 0.004, N = 3; Min: 0.9 / Avg: 0.9 / Max: 0.91)
New ucode + Assembler: 0.895 (SE +/- 0.002, N = 3; Min: 0.89 / Avg: 0.9 / Max: 0.9)

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.1, Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Old ucode: 343.57 (SE +/- 2.73, N = 14; Min: 314.96 / Avg: 343.57 / Max: 361.66)
New ucode: 340.47 (SE +/- 1.16, N = 3; Min: 338.22 / Avg: 340.47 / Max: 342.08)
New ucode + Assembler: 344.90 (SE +/- 3.42, N = 9; Min: 332.78 / Avg: 344.9 / Max: 362.32)

SVT-VP9 0.1, Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Old ucode: 358.75 (SE +/- 2.60, N = 3; Min: 354.19 / Avg: 358.75 / Max: 363.2)
New ucode: 345.65 (SE +/- 4.06, N = 3; Min: 341.3 / Avg: 345.65 / Max: 353.77)
New ucode + Assembler: 348.87 (SE +/- 2.37, N = 3; Min: 344.43 / Avg: 348.87 / Max: 352.53)

1. (CC) gcc options (both SVT-VP9 tunings): -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

libjpeg-turbo tjbench

tjbench is a JPEG decompression/compression benchmark that is part of libjpeg-turbo. Learn more via the OpenBenchmarking.org test page.

libjpeg-turbo tjbench 2.0.2, Test: Decompression Throughput (Megapixels/sec, More Is Better)
Old ucode: 181.90 (SE +/- 0.32, N = 3; Min: 181.37 / Avg: 181.9 / Max: 182.47)
New ucode: 177.37 (SE +/- 0.39, N = 3; Min: 176.59 / Avg: 177.37 / Max: 177.85)
New ucode + Assembler: 183.40 (SE +/- 0.10, N = 3; Min: 183.21 / Avg: 183.4 / Max: 183.53)
1. (CC) gcc options: -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -rdynamic

GROMACS

This test runs the GROMACS molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2019.4, Water Benchmark (Ns Per Day, More Is Better)
Old ucode: 5.859 (SE +/- 0.026, N = 3; Min: 5.81 / Avg: 5.86 / Max: 5.89)
New ucode: 5.790 (SE +/- 0.020, N = 3; Min: 5.75 / Avg: 5.79 / Max: 5.82)
New ucode + Assembler: 5.845 (SE +/- 0.004, N = 3; Min: 5.84 / Avg: 5.85 / Max: 5.85)
1. (CXX) g++ options: -mavx512f -mfma -pthread -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -std=c++11 -funroll-all-loops -lrt -lpthread -lm

Redis

Redis is an open-source data structure server. Learn more via the OpenBenchmarking.org test page.

Redis 5.0.5, Test: SET (Requests Per Second, More Is Better)
Old ucode: 1880693.73 (SE +/- 19430.82, N = 8; Min: 1776198.88 / Avg: 1880693.73 / Max: 1937984.62)
New ucode: 1809059.85 (SE +/- 35782.39, N = 15; Min: 1618123 / Avg: 1809059.85 / Max: 2004008)
New ucode + Assembler: 1819823.37 (SE +/- 23886.38, N = 15; Min: 1658374.88 / Avg: 1819823.37 / Max: 1930501.88)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake

Facebook RocksDB

This is a benchmark of Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

Facebook RocksDB 6.3.6, Test: Random Fill (Op/s, More Is Better)
Old ucode: 409287 (SE +/- 1592.77, N = 3; Min: 406283 / Avg: 409287 / Max: 411707)
New ucode: 403069 (SE +/- 2131.05, N = 3; Min: 398814 / Avg: 403068.67 / Max: 405414)
New ucode + Assembler: 405800 (SE +/- 2049.92, N = 3; Min: 401702 / Avg: 405799.67 / Max: 407964)

Facebook RocksDB 6.3.6, Test: Sequential Fill (Op/s, More Is Better)
Old ucode: 418776 (SE +/- 550.05, N = 3; Min: 417781 / Avg: 418775.67 / Max: 419680)
New ucode: 414795 (SE +/- 365.04, N = 3; Min: 414065 / Avg: 414794.67 / Max: 415181)
New ucode + Assembler: 415584 (SE +/- 438.89, N = 3; Min: 414814 / Avg: 415583.67 / Max: 416334)

Facebook RocksDB 6.3.6, Test: Read While Writing (Op/s, More Is Better)
Old ucode: 6196987 (SE +/- 64642.57, N = 15; Min: 5734890 / Avg: 6196986.73 / Max: 6740484)
New ucode: 5873021 (SE +/- 103250.30, N = 15; Min: 4890528 / Avg: 5873021.2 / Max: 6518105)
New ucode + Assembler: 5735669 (SE +/- 78853.34, N = 4; Min: 5536878 / Avg: 5735669.25 / Max: 5909029)

1. (CXX) g++ options (all RocksDB tests): -O3 -march=native -std=c++11 -fno-builtin-memcmp -fno-rtti -rdynamic -lpthread

Memcached mcperf

This is a test of twemperf/mcperf with memcached. Learn more via the OpenBenchmarking.org test page.

Memcached mcperf 1.5.10, Method: Add (Operations Per Second, More Is Better)
Old ucode: 62995.2 (SE +/- 137.80, N = 3; Min: 62758.5 / Avg: 62995.17 / Max: 63235.8)
New ucode: 61271.0 (SE +/- 933.56, N = 12; Min: 51038.9 / Avg: 61270.98 / Max: 62602.8)
New ucode + Assembler: 61713.4 (SE +/- 262.38, N = 3; Min: 61242.6 / Avg: 61713.43 / Max: 62149.5)

Memcached mcperf 1.5.10, Method: Get (Operations Per Second, More Is Better)
Old ucode: 57144.3 (SE +/- 599.69, N = 3; Min: 56525.2 / Avg: 57144.33 / Max: 58343.5)
New ucode: 57139.6 (SE +/- 525.50, N = 15; Min: 54363.4 / Avg: 57139.61 / Max: 62870.6)
New ucode + Assembler: 56098.0 (SE +/- 772.61, N = 3; Min: 54591.7 / Avg: 56098 / Max: 57149.6)

Memcached mcperf 1.5.10, Method: Set (Operations Per Second, More Is Better)
Old ucode: 62591.3 (SE +/- 52.60, N = 3; Min: 62500.7 / Avg: 62591.33 / Max: 62682.9)
New ucode: 61768.3 (SE +/- 66.58, N = 3; Min: 61656.2 / Avg: 61768.27 / Max: 61886.6)
New ucode + Assembler: 62221.6 (SE +/- 210.63, N = 3; Min: 61836 / Avg: 62221.6 / Max: 62561.3)

1. (CC) gcc options (all mcperf methods): -O3 -pipe -fexceptions -fstack-protector -m64 -ffat-lto-objects -fno-trapping-math -mtune=skylake -lm -rdynamic

PyBench

This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.

PyBench 2018-02-16, Total For Average Test Times (Milliseconds, Fewer Is Better)
Old ucode: 1024 (SE +/- 3.06, N = 3; Min: 1020 / Avg: 1024 / Max: 1030)
New ucode: 1082 (SE +/- 3.61, N = 3; Min: 1077 / Avg: 1082 / Max: 1089)
New ucode + Assembler: 1033 (SE +/- 1.86, N = 3; Min: 1029 / Avg: 1032.67 / Max: 1035)

Apache Benchmark

This is a test of ab, which is the Apache benchmark program. This test profile measures how many requests per second a given system can sustain when carrying out 1,000,000 requests with 100 requests being carried out concurrently. Learn more via the OpenBenchmarking.org test page.
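The fixed workload described above (1,000,000 requests, 100 in flight) makes it straightforward to translate a requests-per-second figure into wall-clock time and approximate per-request latency. The sketch below is a back-of-the-envelope conversion, assuming that exactly 100 requests stay in flight for the whole run (Little's law); the function names are illustrative and the throughput figures are the three averages from this result file.

```python
TOTAL_REQUESTS = 1_000_000  # from the test description
CONCURRENCY = 100           # concurrent requests, from the test description


def run_seconds(requests_per_second):
    """Approximate wall-clock time needed to complete the whole workload."""
    return TOTAL_REQUESTS / requests_per_second


def mean_latency_ms(requests_per_second):
    """Mean time per request in milliseconds, assuming CONCURRENCY stays in flight."""
    return CONCURRENCY / requests_per_second * 1000.0


# Average requests per second reported for each run in this result file.
THROUGHPUT = {
    "Old ucode": 28072.82,
    "New ucode": 28149.37,
    "New ucode + Assembler": 25832.31,
}

if __name__ == "__main__":
    for run, rps in THROUGHPUT.items():
        print(f"{run}: {run_seconds(rps):.1f} s total, {mean_latency_ms(rps):.2f} ms per request")
```

Seen this way, the drop in the New ucode + Assembler run corresponds to roughly three extra seconds over the full workload.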

Apache Benchmark 2.4.29, Static Web Page Serving (Requests Per Second, More Is Better)
Old ucode: 28072.82 (SE +/- 340.14, N = 3; Min: 27648.1 / Avg: 28072.82 / Max: 28745.39)
New ucode: 28149.37 (SE +/- 120.77, N = 3; Min: 28007.83 / Avg: 28149.37 / Max: 28389.63)
New ucode + Assembler: 25832.31 (SE +/- 359.87, N = 3; Min: 25121.88 / Avg: 25832.31 / Max: 26287.48)
1. (CC) gcc options: -shared -fPIC -pthread -O3 -fstack-protector -m64 -mtune=skylake

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. The number of iterations used is 1,000,000. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1, PHP Benchmark Suite (Score, More Is Better)
Old ucode: 926891 (SE +/- 718.51, N = 3; Min: 925533 / Avg: 926891 / Max: 927977)
New ucode: 853333 (SE +/- 309.51, N = 3; Min: 852791 / Avg: 853332.67 / Max: 853863)
New ucode + Assembler: 880797 (SE +/- 1348.44, N = 3; Min: 879113 / Avg: 880796.67 / Max: 883463)