7980XE Feb 2921

Intel Core i9-7980XE testing with a ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102162-HA-7980XEFEB73
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

C++ Boost Tests 2 Tests
C/C++ Compiler Tests 2 Tests
CPU Massive 6 Tests
Creator Workloads 9 Tests
Cryptography 2 Tests
Finance 2 Tests
Fortran Tests 3 Tests
Game Development 2 Tests
HPC - High Performance Computing 11 Tests
Imaging 3 Tests
Machine Learning 3 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 6 Tests
Multi-Core 8 Tests
NVIDIA GPU Compute 3 Tests
OpenMPI Tests 9 Tests
Python Tests 3 Tests
Scientific Computing 6 Tests
Server CPU Tests 2 Tests
Single-Threaded 3 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
February 14 2021
  6 Hours, 16 Minutes
2
February 14 2021
  7 Hours, 5 Minutes
3
February 15 2021
  6 Hours, 40 Minutes
4
February 15 2021
  7 Hours, 27 Minutes
5
February 15 2021
  6 Hours, 34 Minutes
Invert Hiding All Results Option
  6 Hours, 48 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


7980XE Feb 2921ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution12345Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)ASUS PRIME X299-A (2002 BIOS)Intel Sky Lake-E DMI3 Registers16GBSamsung SSD 970 EVO 500GBGigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.105.8.0-36-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.94.6 Mesa 20.2.6 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08Graphics Details- GLAMORPython Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

12345Result OverviewPhoronix Test Suite100%107%114%121%128%IORRedisMobile Neural NetworkQMCPACKLAMMPS Molecular Dynamics SimulatorNAS Parallel BenchmarksJPEG XL DecodingPennantCloverLeafNgspiceONNX RuntimeEtcpakGoogle SynthMarkParaViewJPEG XLChaos Group V-RAYQuantLibTNNLULESHGcrypt LibraryWebP2 Image EncodeGnuPGGROMACSASKAPrav1elzbenchTimed Godot Game Engine CompilationFinanceBench

7980XE Feb 2921lammps: 20k Atomswebp2: Quality 100, Lossless Compressiononnx: bertsquad-10 - OpenMP CPUjpegxl: PNG - 8onnx: super-resolution-10 - OpenMP CPUwebp2: Quality 95, Compression Effort 7ior: 32MB - Default Test Directoryior: 64MB - Default Test Directoryparaview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1080gcrypt: webp2: Quality 75, Compression Effort 7ior: 8MB - Default Test Directoryngspice: C2670ngspice: C7552quantlib: askap: tConvolve MT - Degriddingaskap: tConvolve MT - Griddingmnn: inception-v3mnn: mobilenet-v1-1.0mnn: MobileNetV2_224mnn: resnet-v2-50mnn: SqueezeNetV1.0jpegxl: PNG - 7onnx: fcn-resnet101-11 - OpenMP CPUonnx: yolov4 - OpenMP CPUior: 16MB - Default Test Directoryonnx: shufflenet-v2-10 - OpenMP CPUgromacs: water_GMX50_barebuild-godot: Time To Compilecloverleaf: Lagrangian-Eulerian Hydrodynamicsjpegxl-decode: 1v-ray: CPUnpb: EP.Dgnupg: 2.7GB Sample File Encryptionfinancebench: Bonds OpenMPior: 4MB - Default Test Directoryrav1e: 5rav1e: 1pennant: sedovbigjpegxl-decode: Allfinancebench: Repo OpenMPrav1e: 6npb: LU.Clzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionqmcpack: simple-H2Opennant: leblancbigrav1e: 10ior: 2MB - Default Test Directoryetcpak: ETC1 + Ditheringsynthmark: VoiceMark_100lzbench: Crush 0 - Decompressionlzbench: Crush 0 - Compressionjpegxl: JPEG - 5etcpak: ETC2jpegxl: PNG - 5tnn: CPU - MobileNet v2lzbench: Zstd 8 - Decompressionlzbench: Zstd 8 - Compressionlzbench: Zstd 1 - Decompressionlzbench: Zstd 1 - Compressionnpb: EP.Ctnn: CPU - SqueezeNet v1.1lzbench: Brotli 2 - Decompressionlzbench: Brotli 2 - Compressionlzbench: Brotli 0 - Decompressionlzbench: Brotli 0 - Compressionjpegxl: JPEG - 7askap: Hogbom Clean OpenMPparaview: Wavelet Contour - 1920 x 1080paraview: Wavelet Contour - 1920 x 1080redis: SADDredis: SETredis: LPUSHetcpak: ETC1redis: LPOPparaview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080lzbench: Libdeflate 1 - Compressionredis: GETlammps: Rhodopsin Proteinaskap: tConvolve OpenMP - Degriddingaskap: tConvolve OpenMP - Griddingjpegxl: JPEG - 8webp2: Quality 100, Compression Effort 5lulesh: etcpak: DXT1webp2: Default1234511.176593.0666300.717531282.047493.82492.85481.6924.80212.796155.740525.44155.596138.6612222.42618.531815.8544.1572.6764.20137.1847.1087.86143553481.3694361.69998.88985.5431.81178282140.5067.39055767.164062887.940.9750.35853.14447175.2239726.1640631.26947323.881154238.34034.298652.677592.90303.570553.64048311148.64183.99552.50361.23517928517674952081.02316.35871118961346948.50376.885864.68282.972153944.421912809.711712504.00315.6162698309.251156.97372.312372539377.429.4833973.972458.0723.199.1956695.04131365.3863.36911.210590.2936490.707249280.636474.62480.71481.8344.81212.057156.550530.83151.978137.5822222.62622.751820.9048.3302.6184.63437.3457.2327.93145555484.0694711.69898.93284.8832.39178722241.4167.17555622.968750466.300.9760.35952.66769178.2339461.8216141.27247734.401154239.04833.689172.675396.56303.514555.29548410949.08182.97753.03360.92318018617674952001.11319.04971118961247049.22377.362864.93783.002168410.901952727.251642807.13313.7031769937.881156.21672.262382415143.759.0843973.972420.9123.259.1106707.85961387.3023.36311.195596.4086580.77080282.575470.37489.25481.4204.80211.787157.528354.01153.061139.5732207.12633.111813.5645.9702.6864.34837.2837.3557.93144557442.4795411.69898.87284.2132.26178422202.8967.12455666.716146467.760.9750.35952.70973177.8839681.5664061.27047778.761144238.48933.558962.694429.36303.105553.55348411048.89183.42453.44361.45417958517724962033.28317.13670918961247249.41377.835864.80482.982159575.211889099.971695317.29314.6621797918.461155.67472.232382431811.589.4083973.972428.7223.439.0756678.48801385.8423.32511.205593.5306590.707457279.573484.50490.66468.4564.67211.895154.880527.45154.214139.0322215.62633.321821.8248.3752.6624.51337.6077.2817.92144555500.3995331.70098.96484.2732.17177422112.5467.15155656.115885493.980.9760.36052.76675178.6139350.2239581.26947482.701154239.79533.401752.679419.08290.726548.55648310948.85182.72253.11361.42518038617714961997.31318.20870818861246948.85378.788865.02783.012158064.171906086.421699000.00314.2711761464.251157.11072.322372442355.929.0993973.972420.6423.379.3096697.31841376.3443.34711.216590.2116520.76939283.331516.94493.41481.3474.80212.963156.753510.05153.099139.0172216.02609.751821.3648.6402.6454.54637.3387.2727.91145555506.0295191.70398.87385.2032.30178492214.0867.10255876.180990478.010.9770.35952.84683178.4739275.5481771.27447360.361144238.62133.935332.676437.60303.890552.79248311049.13183.11853.69361.48818008517704941999.52319.31470918961147049.18378.802864.65482.972179924.51923099.421682254.45313.6931770711.211156.86972.312372443276.758.9013973.972442.8523.379.2166708.23221377.8943.350OpenBenchmarking.org

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms123453691215SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 311.1811.2111.2011.2111.221. (CXX) g++ options: -O3 -pthread -lm

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression12345130260390520650SE +/- 1.66, N = 3SE +/- 0.45, N = 3SE +/- 0.58, N = 3SE +/- 1.92, N = 3SE +/- 0.58, N = 3593.07590.29596.41593.53590.211. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU12345140280420560700SE +/- 5.46, N = 3SE +/- 11.30, N = 12SE +/- 13.29, N = 12SE +/- 13.34, N = 12SE +/- 11.09, N = 126306496586596521. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 8123450.15980.31960.47940.63920.799SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.710.700.700.700.701. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1234516003200480064008000SE +/- 57.46, N = 3SE +/- 133.82, N = 12SE +/- 173.63, N = 9SE +/- 109.17, N = 3SE +/- 212.04, N = 12753172497080745769391. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 71234560120180240300SE +/- 1.07, N = 3SE +/- 0.54, N = 3SE +/- 2.02, N = 3SE +/- 1.22, N = 3SE +/- 2.02, N = 3282.05280.64282.58279.57283.331. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 32MB - Disk Target: Default Test Directory12345110220330440550SE +/- 3.89, N = 3SE +/- 4.72, N = 8SE +/- 5.73, N = 3SE +/- 4.09, N = 12SE +/- 4.67, N = 3493.82474.62470.37484.50516.94MIN: 245.23 / MAX: 1245.29MIN: 202.58 / MAX: 1355.46MIN: 197.76 / MAX: 1370.54MIN: 196.66 / MAX: 1538.21MIN: 413.96 / MAX: 1183.641. (CC) gcc options: -O2 -lm -pthread -lmpi

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 64MB - Disk Target: Default Test Directory12345110220330440550SE +/- 3.92, N = 3SE +/- 2.46, N = 3SE +/- 4.79, N = 3SE +/- 4.68, N = 3SE +/- 6.10, N = 3492.85480.71489.25490.66493.41MIN: 402.07 / MAX: 1345.65MIN: 301.17 / MAX: 1079.47MIN: 374.86 / MAX: 1033.08MIN: 348.43 / MAX: 1036.5MIN: 246.91 / MAX: 1088.061. (CC) gcc options: -O2 -lm -pthread -lmpi

ParaView

This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 108012345100200300400500SE +/- 0.37, N = 3SE +/- 0.39, N = 3SE +/- 0.27, N = 3SE +/- 12.92, N = 11SE +/- 0.34, N = 3481.69481.83481.42468.46481.35

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1080123451.08232.16463.24694.32925.4115SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.13, N = 11SE +/- 0.00, N = 34.804.814.804.674.80

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.91234550100150200250SE +/- 0.63, N = 3SE +/- 0.34, N = 3SE +/- 0.06, N = 3SE +/- 0.49, N = 3SE +/- 0.29, N = 3212.80212.06211.79211.90212.961. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 712345306090120150SE +/- 1.77, N = 3SE +/- 0.15, N = 3SE +/- 0.29, N = 3SE +/- 0.66, N = 3SE +/- 0.73, N = 3155.74156.55157.53154.88156.751. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test Directory12345110220330440550SE +/- 25.39, N = 15SE +/- 5.42, N = 15SE +/- 4.96, N = 15SE +/- 4.21, N = 14SE +/- 6.12, N = 5525.44530.83354.01527.45510.05MIN: 290.23 / MAX: 1447.59MIN: 248.03 / MAX: 1386.96MIN: 189.98 / MAX: 1385.03MIN: 251.24 / MAX: 1378.27MIN: 222.93 / MAX: 1266.181. (CC) gcc options: -O2 -lm -pthread -lmpi

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C267012345306090120150SE +/- 2.33, N = 3SE +/- 1.75, N = 3SE +/- 1.44, N = 3SE +/- 2.23, N = 3SE +/- 1.61, N = 3155.60151.98153.06154.21153.101. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C755212345306090120150SE +/- 0.78, N = 3SE +/- 0.10, N = 3SE +/- 1.23, N = 3SE +/- 1.21, N = 3SE +/- 2.04, N = 3138.66137.58139.57139.03139.021. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21123455001000150020002500SE +/- 17.98, N = 13SE +/- 19.83, N = 12SE +/- 27.77, N = 12SE +/- 21.07, N = 12SE +/- 28.27, N = 122222.42222.62207.12215.62216.01. (CXX) g++ options: -O3 -march=native -rdynamic

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding123456001200180024003000SE +/- 10.95, N = 3SE +/- 4.46, N = 3SE +/- 7.85, N = 3SE +/- 4.51, N = 3SE +/- 11.98, N = 32618.532622.752633.112633.322609.751. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12345400800120016002000SE +/- 2.23, N = 3SE +/- 0.35, N = 3SE +/- 2.62, N = 3SE +/- 0.61, N = 3SE +/- 0.31, N = 31815.851820.901813.561821.821821.361. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v3123451122334455SE +/- 1.80, N = 5SE +/- 0.10, N = 3SE +/- 2.44, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 344.1648.3345.9748.3848.64MIN: 40.29 / MAX: 51.28MIN: 47.9 / MAX: 50.26MIN: 40.58 / MAX: 48.74MIN: 47.91 / MAX: 48.97MIN: 48.25 / MAX: 49.281. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.0123450.60441.20881.81322.41763.022SE +/- 0.029, N = 5SE +/- 0.019, N = 3SE +/- 0.045, N = 3SE +/- 0.010, N = 3SE +/- 0.011, N = 32.6762.6182.6862.6622.645MIN: 2.44 / MAX: 3.19MIN: 2.5 / MAX: 2.92MIN: 2.42 / MAX: 3.11MIN: 2.48 / MAX: 2.98MIN: 2.44 / MAX: 3.041. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_224123451.04272.08543.12814.17085.2135SE +/- 0.129, N = 5SE +/- 0.086, N = 3SE +/- 0.205, N = 3SE +/- 0.041, N = 3SE +/- 0.008, N = 34.2014.6344.3484.5134.546MIN: 3.68 / MAX: 4.8MIN: 4.2 / MAX: 4.99MIN: 3.62 / MAX: 4.88MIN: 4.22 / MAX: 4.96MIN: 4.28 / MAX: 4.911. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-5012345918273645SE +/- 0.32, N = 5SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 337.1837.3537.2837.6137.34MIN: 35.67 / MAX: 38.06MIN: 36.97 / MAX: 37.99MIN: 36.85 / MAX: 38.11MIN: 37.16 / MAX: 38.09MIN: 36.95 / MAX: 37.851. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.012345246810SE +/- 0.091, N = 5SE +/- 0.085, N = 3SE +/- 0.110, N = 3SE +/- 0.070, N = 3SE +/- 0.007, N = 37.1087.2327.3557.2817.272MIN: 6.52 / MAX: 7.68MIN: 6.81 / MAX: 7.85MIN: 6.8 / MAX: 7.87MIN: 6.97 / MAX: 7.74MIN: 7 / MAX: 7.671. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 712345246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 37.867.937.937.927.911. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU12345306090120150SE +/- 0.44, N = 3SE +/- 0.33, N = 3SE +/- 0.87, N = 3SE +/- 0.44, N = 3SE +/- 0.17, N = 31431451441441451. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU12345120240360480600SE +/- 1.09, N = 3SE +/- 0.29, N = 3SE +/- 2.25, N = 3SE +/- 1.64, N = 3SE +/- 0.88, N = 35535555575555551. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 16MB - Disk Target: Default Test Directory12345110220330440550SE +/- 6.27, N = 3SE +/- 5.80, N = 15SE +/- 6.54, N = 3SE +/- 1.65, N = 3SE +/- 7.12, N = 3481.36484.06442.47500.39506.02MIN: 316.02 / MAX: 1504.02MIN: 224.25 / MAX: 1505.62MIN: 217.24 / MAX: 1379.75MIN: 315.3 / MAX: 1210.23MIN: 308.33 / MAX: 1247.461. (CC) gcc options: -O2 -lm -pthread -lmpi

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU123452K4K6K8K10KSE +/- 35.99, N = 3SE +/- 30.21, N = 3SE +/- 36.96, N = 3SE +/- 46.43, N = 3SE +/- 54.82, N = 3943694719541953395191. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare123450.38320.76641.14961.53281.916SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 31.6991.6981.6981.7001.7031. (CXX) g++ options: -O3 -pthread

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile1234520406080100SE +/- 0.06, N = 3SE +/- 0.29, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 398.8998.9398.8798.9698.87

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics1234520406080100SE +/- 0.18, N = 3SE +/- 0.09, N = 3SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 385.5484.8884.2184.2785.201. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: 112345816243240SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 331.8132.3932.2632.1732.30

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU123454K8K12K16K20KSE +/- 47.06, N = 3SE +/- 70.27, N = 3SE +/- 6.44, N = 3SE +/- 54.08, N = 3SE +/- 84.18, N = 31782817872178421774217849

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D123455001000150020002500SE +/- 10.76, N = 3SE +/- 3.15, N = 3SE +/- 34.24, N = 3SE +/- 35.18, N = 3SE +/- 28.36, N = 42140.502241.412202.892112.542214.081. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

GnuPG

This test times how long it takes to encrypt a sample file using GnuPG. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption123451530456075SE +/- 0.36, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.20, N = 3SE +/- 0.15, N = 367.3967.1867.1267.1567.101. (CC) gcc options: -O2

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP1234512K24K36K48K60KSE +/- 86.69, N = 3SE +/- 37.92, N = 3SE +/- 83.09, N = 3SE +/- 86.41, N = 3SE +/- 53.09, N = 355767.1655622.9755666.7255656.1255876.181. (CXX) g++ options: -O3 -march=native -fopenmp

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 4MB - Disk Target: Default Test Directory123452004006008001000SE +/- 1.87, N = 3SE +/- 5.42, N = 15SE +/- 45.69, N = 12SE +/- 6.63, N = 15SE +/- 5.81, N = 6887.94466.30467.76493.98478.01MIN: 629.82 / MAX: 1334.23MIN: 212.95 / MAX: 1329.58MIN: 188.74 / MAX: 1435.44MIN: 240.44 / MAX: 1344.87MIN: 236.85 / MAX: 1266.851. (CC) gcc options: -O2 -lm -pthread -lmpi

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5123450.21980.43960.65940.87921.099SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.9750.9760.9750.9760.977

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1123450.0810.1620.2430.3240.405SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.3580.3590.3590.3600.359

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig123451224364860SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.36, N = 3SE +/- 0.21, N = 3SE +/- 0.26, N = 353.1452.6752.7152.7752.851. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All123454080120160200SE +/- 0.25, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.45, N = 3SE +/- 0.19, N = 3175.22178.23177.88178.61178.47

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP123459K18K27K36K45KSE +/- 210.55, N = 3SE +/- 76.80, N = 3SE +/- 151.55, N = 3SE +/- 69.89, N = 3SE +/- 20.70, N = 339726.1639461.8239681.5739350.2239275.551. (CXX) g++ options: -O3 -march=native -fopenmp

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6123450.28670.57340.86011.14681.4335SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 31.2691.2721.2701.2691.274

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C1234510K20K30K40K50KSE +/- 154.44, N = 3SE +/- 7.96, N = 3SE +/- 139.20, N = 3SE +/- 376.58, N = 3SE +/- 44.69, N = 347323.8847734.4047778.7647482.7047360.361. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression12345306090120150SE +/- 0.33, N = 3SE +/- 0.33, N = 31151151141151141. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression123451020304050SE +/- 0.33, N = 342424242421. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O12345918273645SE +/- 0.09, N = 3SE +/- 0.45, N = 3SE +/- 0.23, N = 3SE +/- 0.68, N = 3SE +/- 0.24, N = 338.3439.0538.4939.8038.621. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig12345816243240SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 334.3033.6933.5633.4033.941. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10123450.60621.21241.81862.42483.031SE +/- 0.006, N = 3SE +/- 0.012, N = 3SE +/- 0.005, N = 3SE +/- 0.005, N = 3SE +/- 0.007, N = 32.6772.6752.6942.6792.676

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test Directory12345130260390520650SE +/- 5.79, N = 3SE +/- 4.65, N = 3SE +/- 7.93, N = 15SE +/- 10.65, N = 12SE +/- 7.25, N = 15592.90396.56429.36419.08437.60MIN: 431.01 / MAX: 1028.65MIN: 214.64 / MAX: 1028.95MIN: 220.22 / MAX: 1149.54MIN: 152.75 / MAX: 1057.83MIN: 162.6 / MAX: 1036.241. (CC) gcc options: -O2 -lm -pthread -lmpi

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering1234570140210280350SE +/- 0.86, N = 3SE +/- 0.32, N = 3SE +/- 0.60, N = 3SE +/- 7.07, N = 15SE +/- 0.09, N = 3303.57303.51303.11290.73303.891. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_10012345120240360480600SE +/- 2.84, N = 3SE +/- 1.36, N = 3SE +/- 3.76, N = 3SE +/- 2.23, N = 3SE +/- 1.67, N = 3553.64555.30553.55548.56552.791. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression12345100200300400500SE +/- 0.67, N = 3SE +/- 1.53, N = 3SE +/- 0.88, N = 3SE +/- 1.15, N = 3SE +/- 1.86, N = 34834844844834831. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression1234520406080100SE +/- 0.58, N = 3SE +/- 1.00, N = 3SE +/- 0.67, N = 31111091101091101. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 5123451122334455SE +/- 0.07, N = 3SE +/- 0.26, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 348.6449.0848.8948.8549.131. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123454080120160200SE +/- 0.08, N = 3SE +/- 0.28, N = 3SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3184.00182.98183.42182.72183.121. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 5123451224364860SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.33, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 352.5053.0353.4453.1153.691. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v21234580160240320400SE +/- 0.37, N = 3SE +/- 0.27, N = 3SE +/- 0.39, N = 3SE +/- 0.38, N = 3SE +/- 0.31, N = 3361.24360.92361.45361.43361.49MIN: 357.28 / MAX: 391.58MIN: 356.72 / MAX: 372.66MIN: 356.32 / MAX: 386.75MIN: 356.24 / MAX: 380.38MIN: 357.11 / MAX: 384.741. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression12345400800120016002000SE +/- 3.33, N = 3SE +/- 1.86, N = 3SE +/- 3.38, N = 3SE +/- 3.71, N = 3179218011795180318001. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression123452040608010085868586851. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression12345400800120016002000SE +/- 6.00, N = 3SE +/- 3.06, N = 3SE +/- 1.76, N = 3SE +/- 1.73, N = 3176717671772177117701. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression12345110220330440550SE +/- 2.19, N = 3SE +/- 2.00, N = 3SE +/- 2.67, N = 3SE +/- 3.67, N = 3SE +/- 3.67, N = 34954954964964941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C12345400800120016002000SE +/- 23.92, N = 15SE +/- 51.51, N = 15SE +/- 51.69, N = 12SE +/- 32.10, N = 15SE +/- 46.79, N = 152081.022001.112033.281997.311999.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.11234570140210280350SE +/- 0.46, N = 3SE +/- 0.57, N = 3SE +/- 0.62, N = 3SE +/- 0.59, N = 3SE +/- 0.43, N = 3316.36319.05317.14318.21319.31MIN: 313.3 / MAX: 320.26MIN: 313 / MAX: 340.64MIN: 314.18 / MAX: 323.54MIN: 316.15 / MAX: 320.81MIN: 316.94 / MAX: 338.611. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression12345150300450600750SE +/- 0.88, N = 37117117097087091. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression123454080120160200SE +/- 0.33, N = 3SE +/- 0.67, N = 31891891891881891. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression12345130260390520650SE +/- 1.20, N = 3SE +/- 2.73, N = 3SE +/- 1.00, N = 3SE +/- 0.67, N = 36136126126126111. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression12345100200300400500SE +/- 0.88, N = 3SE +/- 1.67, N = 3SE +/- 1.20, N = 3SE +/- 1.33, N = 3SE +/- 0.88, N = 34694704724694701. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 7123451122334455SE +/- 0.11, N = 3SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 348.5049.2249.4148.8549.181. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP1234580160240320400SE +/- 0.47, N = 3SE +/- 0.82, N = 3SE +/- 0.48, N = 3SE +/- 0.00, N = 3SE +/- 1.66, N = 3376.89377.36377.84378.79378.801. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ParaView

This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080123452004006008001000SE +/- 0.51, N = 3SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.39, N = 3SE +/- 0.27, N = 3864.68864.94864.80865.03864.65

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10801234520406080100SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 382.9783.0082.9883.0182.97

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD12345500K1000K1500K2000K2500KSE +/- 18441.13, N = 3SE +/- 27569.90, N = 5SE +/- 34602.53, N = 3SE +/- 20189.91, N = 3SE +/- 11360.66, N = 32153944.422168410.902159575.212158064.172179924.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET12345400K800K1200K1600K2000KSE +/- 15512.42, N = 3SE +/- 23318.24, N = 3SE +/- 24702.46, N = 4SE +/- 30239.19, N = 3SE +/- 21792.45, N = 31912809.711952727.251889099.971906086.421923099.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH12345400K800K1200K1600K2000KSE +/- 10755.06, N = 3SE +/- 13174.46, N = 3SE +/- 9034.55, N = 3SE +/- 10458.55, N = 3SE +/- 10827.27, N = 31712504.001642807.131695317.291699000.001682254.451. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC11234570140210280350SE +/- 0.30, N = 3SE +/- 0.85, N = 3SE +/- 1.01, N = 3SE +/- 0.49, N = 3SE +/- 0.49, N = 3315.62313.70314.66314.27313.691. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP12345600K1200K1800K2400K3000KSE +/- 29192.48, N = 3SE +/- 2566.99, N = 3SE +/- 19169.69, N = 3SE +/- 15108.16, N = 3SE +/- 18070.29, N = 32698309.251769937.881797918.461761464.251770711.211. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

ParaView

This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080123452004006008001000SE +/- 0.52, N = 3SE +/- 0.36, N = 3SE +/- 0.26, N = 3SE +/- 0.60, N = 3SE +/- 0.98, N = 31156.971156.221155.671157.111156.87

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080123451632486480SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 372.3172.2672.2372.3272.31

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression1234550100150200250SE +/- 0.58, N = 32372382382372371. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET12345500K1000K1500K2000K2500KSE +/- 22779.82, N = 3SE +/- 31640.05, N = 3SE +/- 17482.31, N = 3SE +/- 18752.77, N = 3SE +/- 23178.69, N = 32539377.422415143.752431811.582442355.922443276.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein123453691215SE +/- 0.370, N = 12SE +/- 0.433, N = 15SE +/- 0.361, N = 12SE +/- 0.334, N = 15SE +/- 0.306, N = 159.4839.0849.4089.0998.9011. (CXX) g++ options: -O3 -pthread -lm

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding123459001800270036004500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33973.973973.973973.973973.973973.971. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding123455001000150020002500SE +/- 19.92, N = 3SE +/- 22.21, N = 3SE +/- 32.33, N = 3SE +/- 12.71, N = 3SE +/- 12.94, N = 32458.072420.912428.722420.642442.851. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 812345612182430SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 323.1923.2523.4323.3723.371. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5123453691215SE +/- 0.105, N = 3SE +/- 0.020, N = 3SE +/- 0.024, N = 3SE +/- 0.141, N = 3SE +/- 0.149, N = 39.1959.1109.0759.3099.2161. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.31234514002800420056007000SE +/- 21.98, N = 3SE +/- 16.05, N = 3SE +/- 43.78, N = 3SE +/- 39.16, N = 3SE +/- 30.71, N = 36695.046707.866678.496697.326708.231. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT11234530060090012001500SE +/- 13.09, N = 3SE +/- 3.63, N = 3SE +/- 4.38, N = 3SE +/- 3.03, N = 3SE +/- 2.52, N = 31365.391387.301385.841376.341377.891. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default123450.7581.5162.2743.0323.79SE +/- 0.034, N = 3SE +/- 0.012, N = 3SE +/- 0.027, N = 3SE +/- 0.049, N = 3SE +/- 0.057, N = 33.3693.3633.3253.3473.3501. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

89 Results Shown

LAMMPS Molecular Dynamics Simulator
WebP2 Image Encode
ONNX Runtime
JPEG XL
ONNX Runtime
WebP2 Image Encode
IOR:
  32MB - Default Test Directory
  64MB - Default Test Directory
ParaView:
  Many Spheres - 1920 x 1080:
    MiPolys / Sec
    Frames / Sec
Gcrypt Library
WebP2 Image Encode
IOR
Ngspice:
  C2670
  C7552
QuantLib
ASKAP:
  tConvolve MT - Degridding
  tConvolve MT - Gridding
Mobile Neural Network:
  inception-v3
  mobilenet-v1-1.0
  MobileNetV2_224
  resnet-v2-50
  SqueezeNetV1.0
JPEG XL
ONNX Runtime:
  fcn-resnet101-11 - OpenMP CPU
  yolov4 - OpenMP CPU
IOR
ONNX Runtime
GROMACS
Timed Godot Game Engine Compilation
CloverLeaf
JPEG XL Decoding
Chaos Group V-RAY
NAS Parallel Benchmarks
GnuPG
FinanceBench
IOR
rav1e:
  5
  1
Pennant
JPEG XL Decoding
FinanceBench
rav1e
NAS Parallel Benchmarks
lzbench:
  XZ 0 - Decompression
  XZ 0 - Compression
QMCPACK
Pennant
rav1e
IOR
Etcpak
Google SynthMark
lzbench:
  Crush 0 - Decompression
  Crush 0 - Compression
JPEG XL
Etcpak
JPEG XL
TNN
lzbench:
  Zstd 8 - Decompression
  Zstd 8 - Compression
  Zstd 1 - Decompression
  Zstd 1 - Compression
NAS Parallel Benchmarks
TNN
lzbench:
  Brotli 2 - Decompression
  Brotli 2 - Compression
  Brotli 0 - Decompression
  Brotli 0 - Compression
JPEG XL
ASKAP
ParaView:
  Wavelet Contour - 1920 x 1080:
    MiPolys / Sec
    Frames / Sec
Redis:
  SADD
  SET
  LPUSH
Etcpak
Redis
ParaView:
  Wavelet Volume - 1920 x 1080:
    MiVoxels / Sec
    Frames / Sec
lzbench
Redis
LAMMPS Molecular Dynamics Simulator
ASKAP:
  tConvolve OpenMP - Degridding
  tConvolve OpenMP - Gridding
JPEG XL
WebP2 Image Encode
LULESH
Etcpak
WebP2 Image Encode