7980XE Feb 2921

Intel Core i9-7980XE testing with a ASUS PRIME X299-A (2002 BIOS) and Gigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB on Ubuntu 20.10 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2102162-HA-7980XEFEB73
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
1
February 14 2021
  6 Hours, 16 Minutes
2
February 14 2021
  7 Hours, 5 Minutes
3
February 15 2021
  6 Hours, 40 Minutes
4
February 15 2021
  7 Hours, 27 Minutes
5
February 15 2021
  6 Hours, 34 Minutes
Invert Behavior (Only Show Selected Data)
  6 Hours, 48 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


7980XE Feb 2921ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerOpenGLVulkanCompilerFile-SystemScreen Resolution12345Intel Core i9-7980XE @ 4.20GHz (18 Cores / 36 Threads)ASUS PRIME X299-A (2002 BIOS)Intel Sky Lake-E DMI3 Registers16GBSamsung SSD 970 EVO 500GBGigabyte AMD Radeon 540/540X/550/550X / RX 540X/550/550X 2GB (1206/1750MHz)Realtek ALC1220G237HLIntel I219-VUbuntu 20.105.8.0-36-generic (x86_64)GNOME Shell 3.38.1X Server 1.20.94.6 Mesa 20.2.6 (LLVM 11.0.0)1.2.131GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Disk Details- NONE / errors=remount-ro,relatime,rw / Block Size: 4096Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0x2006a08Graphics Details- GLAMORPython Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Mitigation of PTE Inversion; VMX: conditional cache flushes SMT vulnerable + mds: Mitigation of Clear buffers; SMT vulnerable + meltdown: Mitigation of PTI + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full generic retpoline IBPB: conditional IBRS_FW STIBP: conditional RSB filling + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT vulnerable

12345Result OverviewPhoronix Test Suite100%107%114%121%128%IORRedisMobile Neural NetworkQMCPACKLAMMPS Molecular Dynamics SimulatorNAS Parallel BenchmarksJPEG XL DecodingPennantCloverLeafNgspiceONNX RuntimeEtcpakGoogle SynthMarkParaViewJPEG XLChaos Group V-RAYQuantLibTNNLULESHGcrypt LibraryWebP2 Image EncodeGnuPGGROMACSASKAPrav1elzbenchTimed Godot Game Engine CompilationFinanceBench

7980XE Feb 2921redis: LPOPior: 16MB - Default Test Directoryior: 32MB - Default Test Directorynpb: EP.Dredis: GETredis: LPUSHqmcpack: simple-H2Omnn: SqueezeNetV1.0redis: SETpennant: leblancbigior: 64MB - Default Test Directorymnn: mobilenet-v1-1.0webp2: Quality 100, Compression Effort 5ngspice: C2670jpegxl: PNG - 5jpegxl-decode: Alljpegxl: JPEG - 7lzbench: Crush 0 - Compressionjpegxl-decode: 1webp2: Quality 75, Compression Effort 7etcpak: DXT1cloverleaf: Lagrangian-Eulerian Hydrodynamicsaskap: tConvolve OpenMP - Griddingngspice: C7552jpegxl: PNG - 8onnx: fcn-resnet101-11 - OpenMP CPUwebp2: Quality 95, Compression Effort 7webp2: Defaultsynthmark: VoiceMark_100redis: SADDlzbench: Zstd 8 - Compressionfinancebench: Repo OpenMPmnn: resnet-v2-50onnx: shufflenet-v2-10 - OpenMP CPUwebp2: Quality 100, Lossless Compressionjpegxl: JPEG - 8jpegxl: JPEG - 5npb: LU.Ctnn: CPU - SqueezeNet v1.1pennant: sedovbigaskap: tConvolve MT - Degriddingjpegxl: PNG - 7lzbench: XZ 0 - Decompressionv-ray: CPUonnx: yolov4 - OpenMP CPUrav1e: 10quantlib: etcpak: ETC2lzbench: Brotli 0 - Compressionlzbench: Zstd 8 - Decompressionetcpak: ETC1rav1e: 1gcrypt: lzbench: Brotli 2 - Compressionaskap: Hogbom Clean OpenMPaskap: tConvolve MT - Griddingfinancebench: Bonds OpenMPlulesh: gnupg: 2.7GB Sample File Encryptionlzbench: Brotli 2 - Decompressionlzbench: Libdeflate 1 - Compressionlzbench: Zstd 1 - Compressionrav1e: 6lammps: 20k Atomslzbench: Brotli 0 - Decompressiongromacs: water_GMX50_barelzbench: Zstd 1 - Decompressionlzbench: Crush 0 - Decompressionrav1e: 5tnn: CPU - MobileNet v2paraview: Wavelet Volume - 1920 x 1080paraview: Wavelet Volume - 1920 x 1080build-godot: Time To Compileparaview: Wavelet Contour - 1920 x 1080paraview: Wavelet Contour - 1920 x 1080askap: tConvolve OpenMP - Degriddinglzbench: XZ 0 - Compressiononnx: super-resolution-10 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUmnn: inception-v3mnn: MobileNetV2_224lammps: Rhodopsin Proteinnpb: EP.Cetcpak: ETC1 + Ditheringparaview: Many Spheres - 1920 x 1080paraview: Many Spheres - 1920 x 1080ior: 8MB - Default Test Directoryior: 4MB - Default Test Directoryior: 2MB - Default Test Directory123452698309.25481.36493.822140.502539377.421712504.0038.3407.1081912809.7134.29865492.852.6769.195155.59652.50175.2248.5011131.81155.7401365.38685.542458.07138.6610.71143282.0473.369553.6402153944.428539726.16406337.1849436593.06623.1948.6447323.88316.35853.144472618.537.86115178285532.6772222.4183.9954691792315.6160.358212.796189376.8851815.8555767.1640626695.041367.3907112374951.26911.1766131.69917674830.975361.23572.311156.97398.88982.97864.6823973.9742753163044.1574.2019.4832081.02303.570481.6924.80525.44887.94592.901769937.88484.06474.622241.412415143.751642807.1339.0487.2321952727.2533.68917480.712.6189.110151.97853.03178.2349.2210932.39156.5501387.30284.882420.91137.5820.70145280.6363.363555.2952168410.908639461.82161437.3459471590.29323.2549.0847734.40319.04952.667692622.757.93115178725552.6752222.6182.9774701801313.7030.359212.057189377.3621820.9055622.9687506707.859667.1757112384951.27211.2106121.69817674840.976360.92372.261156.21698.93283.00864.9373973.9742724964948.3304.6349.0842001.11303.514481.8344.81530.83466.30396.561797918.46442.47470.372202.892431811.581695317.2938.4897.3551889099.9733.55896489.252.6869.075153.06153.44177.8849.4111032.26157.5281385.84284.212428.72139.5730.7144282.5753.325553.5532159575.218539681.56640637.2839541596.40823.4348.8947778.76317.13652.709732633.117.93114178425572.6942207.1183.4244721795314.6620.359211.787189377.8351813.5655666.7161466678.488067.1247092384961.27011.1956121.69817724840.975361.45472.231155.67498.87282.98864.8043973.9742708065845.9704.3489.4082033.28303.105481.4204.80354.01467.76429.361761464.25500.39484.502112.542442355.921699000.0039.7957.2811906086.4233.40175490.662.6629.309154.21453.11178.6148.8510932.17154.8801376.34484.272420.64139.0320.70144279.5733.347548.5562158064.178639350.22395837.6079533593.53023.3748.8547482.70318.20852.766752633.327.92115177425552.6792215.6182.7224691803314.2710.360211.895188378.7881821.8255656.1158856697.318467.1517082374961.26911.2056121.70017714830.976361.42572.321157.11098.96483.01865.0273973.9742745765948.3754.5139.0991997.31290.726468.4564.67527.45493.98419.081770711.21506.02516.942214.082443276.751682254.4538.6217.2721923099.4233.93533493.412.6459.216153.09953.69178.4749.1811032.30156.7531377.89485.202442.85139.0170.7145283.3313.350552.7922179924.58539275.54817737.3389519590.21123.3749.1347360.36319.31452.846832609.757.91114178495552.6762216.0183.1184701800313.6930.359212.963189378.8021821.3655876.1809906708.232267.1027092374941.27411.2166111.70317704830.977361.48872.311156.86998.87382.97864.6543973.9742693965248.6404.5468.9011999.52303.890481.3474.80510.05478.01437.60OpenBenchmarking.org

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP12345600K1200K1800K2400K3000KSE +/- 29192.48, N = 3SE +/- 2566.99, N = 3SE +/- 19169.69, N = 3SE +/- 15108.16, N = 3SE +/- 18070.29, N = 32698309.251769937.881797918.461761464.251770711.211. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 16MB - Disk Target: Default Test Directory12345110220330440550SE +/- 6.27, N = 3SE +/- 5.80, N = 15SE +/- 6.54, N = 3SE +/- 1.65, N = 3SE +/- 7.12, N = 3481.36484.06442.47500.39506.02MIN: 316.02 / MAX: 1504.02MIN: 224.25 / MAX: 1505.62MIN: 217.24 / MAX: 1379.75MIN: 315.3 / MAX: 1210.23MIN: 308.33 / MAX: 1247.461. (CC) gcc options: -O2 -lm -pthread -lmpi

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 32MB - Disk Target: Default Test Directory12345110220330440550SE +/- 3.89, N = 3SE +/- 4.72, N = 8SE +/- 5.73, N = 3SE +/- 4.09, N = 12SE +/- 4.67, N = 3493.82474.62470.37484.50516.94MIN: 245.23 / MAX: 1245.29MIN: 202.58 / MAX: 1355.46MIN: 197.76 / MAX: 1370.54MIN: 196.66 / MAX: 1538.21MIN: 413.96 / MAX: 1183.641. (CC) gcc options: -O2 -lm -pthread -lmpi

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D123455001000150020002500SE +/- 10.76, N = 3SE +/- 3.15, N = 3SE +/- 34.24, N = 3SE +/- 35.18, N = 3SE +/- 28.36, N = 42140.502241.412202.892112.542214.081. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET12345500K1000K1500K2000K2500KSE +/- 22779.82, N = 3SE +/- 31640.05, N = 3SE +/- 17482.31, N = 3SE +/- 18752.77, N = 3SE +/- 23178.69, N = 32539377.422415143.752431811.582442355.922443276.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH12345400K800K1200K1600K2000KSE +/- 10755.06, N = 3SE +/- 13174.46, N = 3SE +/- 9034.55, N = 3SE +/- 10458.55, N = 3SE +/- 10827.27, N = 31712504.001642807.131695317.291699000.001682254.451. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O12345918273645SE +/- 0.09, N = 3SE +/- 0.45, N = 3SE +/- 0.23, N = 3SE +/- 0.68, N = 3SE +/- 0.24, N = 338.3439.0538.4939.8038.621. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.012345246810SE +/- 0.091, N = 5SE +/- 0.085, N = 3SE +/- 0.110, N = 3SE +/- 0.070, N = 3SE +/- 0.007, N = 37.1087.2327.3557.2817.272MIN: 6.52 / MAX: 7.68MIN: 6.81 / MAX: 7.85MIN: 6.8 / MAX: 7.87MIN: 6.97 / MAX: 7.74MIN: 7 / MAX: 7.671. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET12345400K800K1200K1600K2000KSE +/- 15512.42, N = 3SE +/- 23318.24, N = 3SE +/- 24702.46, N = 4SE +/- 30239.19, N = 3SE +/- 21792.45, N = 31912809.711952727.251889099.971906086.421923099.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: leblancbig12345816243240SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.13, N = 3SE +/- 0.17, N = 3SE +/- 0.29, N = 334.3033.6933.5633.4033.941. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 64MB - Disk Target: Default Test Directory12345110220330440550SE +/- 3.92, N = 3SE +/- 2.46, N = 3SE +/- 4.79, N = 3SE +/- 4.68, N = 3SE +/- 6.10, N = 3492.85480.71489.25490.66493.41MIN: 402.07 / MAX: 1345.65MIN: 301.17 / MAX: 1079.47MIN: 374.86 / MAX: 1033.08MIN: 348.43 / MAX: 1036.5MIN: 246.91 / MAX: 1088.061. (CC) gcc options: -O2 -lm -pthread -lmpi

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.0123450.60441.20881.81322.41763.022SE +/- 0.029, N = 5SE +/- 0.019, N = 3SE +/- 0.045, N = 3SE +/- 0.010, N = 3SE +/- 0.011, N = 32.6762.6182.6862.6622.645MIN: 2.44 / MAX: 3.19MIN: 2.5 / MAX: 2.92MIN: 2.42 / MAX: 3.11MIN: 2.48 / MAX: 2.98MIN: 2.44 / MAX: 3.041. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 5123453691215SE +/- 0.105, N = 3SE +/- 0.020, N = 3SE +/- 0.024, N = 3SE +/- 0.141, N = 3SE +/- 0.149, N = 39.1959.1109.0759.3099.2161. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C267012345306090120150SE +/- 2.33, N = 3SE +/- 1.75, N = 3SE +/- 1.44, N = 3SE +/- 2.23, N = 3SE +/- 1.61, N = 3155.60151.98153.06154.21153.101. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 5123451224364860SE +/- 0.14, N = 3SE +/- 0.15, N = 3SE +/- 0.33, N = 3SE +/- 0.16, N = 3SE +/- 0.13, N = 352.5053.0353.4453.1153.691. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: All123454080120160200SE +/- 0.25, N = 3SE +/- 0.25, N = 3SE +/- 0.10, N = 3SE +/- 0.45, N = 3SE +/- 0.19, N = 3175.22178.23177.88178.61178.47

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 7123451122334455SE +/- 0.11, N = 3SE +/- 0.28, N = 3SE +/- 0.07, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 348.5049.2249.4148.8549.181. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression1234520406080100SE +/- 0.58, N = 3SE +/- 1.00, N = 3SE +/- 0.67, N = 31111091101091101. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

JPEG XL Decoding

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is suited for JPEG XL decode performance testing to PNG output file, the pts/jpexl test is for encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL Decoding 0.3.1CPU Threads: 112345816243240SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 331.8132.3932.2632.1732.30

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 712345306090120150SE +/- 1.77, N = 3SE +/- 0.15, N = 3SE +/- 0.29, N = 3SE +/- 0.66, N = 3SE +/- 0.73, N = 3155.74156.55157.53154.88156.751. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT11234530060090012001500SE +/- 13.09, N = 3SE +/- 3.63, N = 3SE +/- 4.38, N = 3SE +/- 3.03, N = 3SE +/- 2.52, N = 31365.391387.301385.841376.341377.891. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

CloverLeaf

CloverLeaf is a Lagrangian-Eulerian hydrodynamics benchmark. This test profile currently makes use of CloverLeaf's OpenMP version and benchmarked with the clover_bm.in input file (Problem 5). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics1234520406080100SE +/- 0.18, N = 3SE +/- 0.09, N = 3SE +/- 0.18, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 385.5484.8884.2184.2785.201. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding123455001000150020002500SE +/- 19.92, N = 3SE +/- 22.21, N = 3SE +/- 32.33, N = 3SE +/- 12.71, N = 3SE +/- 12.94, N = 32458.072420.912428.722420.642442.851. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Ngspice

Ngspice is an open-source SPICE circuit simulator. Ngspice was originally based on the Berkeley SPICE electronic circuit simulator. Ngspice supports basic threading using OpenMP. This test profile is making use of the ISCAS 85 benchmark circuits. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNgspice 34Circuit: C755212345306090120150SE +/- 0.78, N = 3SE +/- 0.10, N = 3SE +/- 1.23, N = 3SE +/- 1.21, N = 3SE +/- 2.04, N = 3138.66137.58139.57139.03139.021. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 8123450.15980.31960.47940.63920.799SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.710.700.700.700.701. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU12345306090120150SE +/- 0.44, N = 3SE +/- 0.33, N = 3SE +/- 0.87, N = 3SE +/- 0.44, N = 3SE +/- 0.17, N = 31431451441441451. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 71234560120180240300SE +/- 1.07, N = 3SE +/- 0.54, N = 3SE +/- 2.02, N = 3SE +/- 1.22, N = 3SE +/- 2.02, N = 3282.05280.64282.58279.57283.331. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default123450.7581.5162.2743.0323.79SE +/- 0.034, N = 3SE +/- 0.012, N = 3SE +/- 0.027, N = 3SE +/- 0.049, N = 3SE +/- 0.057, N = 33.3693.3633.3253.3473.3501. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

Google SynthMark

SynthMark is a cross platform tool for benchmarking CPU performance under a variety of real-time audio workloads. It uses a polyphonic synthesizer model to provide standardized tests for latency, jitter and computational throughput. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_10012345120240360480600SE +/- 2.84, N = 3SE +/- 1.36, N = 3SE +/- 3.76, N = 3SE +/- 2.23, N = 3SE +/- 1.67, N = 3553.64555.30553.55548.56552.791. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD12345500K1000K1500K2000K2500KSE +/- 18441.13, N = 3SE +/- 27569.90, N = 5SE +/- 34602.53, N = 3SE +/- 20189.91, N = 3SE +/- 11360.66, N = 32153944.422168410.902159575.212158064.172179924.501. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression123452040608010085868586851. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP123459K18K27K36K45KSE +/- 210.55, N = 3SE +/- 76.80, N = 3SE +/- 151.55, N = 3SE +/- 69.89, N = 3SE +/- 20.70, N = 339726.1639461.8239681.5739350.2239275.551. (CXX) g++ options: -O3 -march=native -fopenmp

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-5012345918273645SE +/- 0.32, N = 5SE +/- 0.12, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.08, N = 337.1837.3537.2837.6137.34MIN: 35.67 / MAX: 38.06MIN: 36.97 / MAX: 37.99MIN: 36.85 / MAX: 38.11MIN: 37.16 / MAX: 38.09MIN: 36.95 / MAX: 37.851. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU123452K4K6K8K10KSE +/- 35.99, N = 3SE +/- 30.21, N = 3SE +/- 36.96, N = 3SE +/- 46.43, N = 3SE +/- 54.82, N = 3943694719541953395191. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility and using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as ultimately the successor to WebP. WebP2 supports 10-bit HDR, more efficienct lossy compression, improved lossless compression, animation support, and full multi-threading support compared to WebP. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression12345130260390520650SE +/- 1.66, N = 3SE +/- 0.45, N = 3SE +/- 0.58, N = 3SE +/- 1.92, N = 3SE +/- 0.58, N = 3593.07590.29596.41593.53590.211. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg -lgif -lwebp -lwebpdemux

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 812345612182430SE +/- 0.05, N = 3SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.09, N = 3SE +/- 0.14, N = 323.1923.2523.4323.3723.371. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: JPEG - Encode Speed: 5123451122334455SE +/- 0.07, N = 3SE +/- 0.26, N = 3SE +/- 0.03, N = 3SE +/- 0.07, N = 3SE +/- 0.09, N = 348.6449.0848.8948.8549.131. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C1234510K20K30K40K50KSE +/- 154.44, N = 3SE +/- 7.96, N = 3SE +/- 139.20, N = 3SE +/- 376.58, N = 3SE +/- 44.69, N = 347323.8847734.4047778.7647482.7047360.361. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.11234570140210280350SE +/- 0.46, N = 3SE +/- 0.57, N = 3SE +/- 0.62, N = 3SE +/- 0.59, N = 3SE +/- 0.43, N = 3316.36319.05317.14318.21319.31MIN: 313.3 / MAX: 320.26MIN: 313 / MAX: 340.64MIN: 314.18 / MAX: 323.54MIN: 316.15 / MAX: 320.81MIN: 316.94 / MAX: 338.611. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgHydro Cycle Time - Seconds, Fewer Is BetterPennant 1.0.1Test: sedovbig123451224364860SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.36, N = 3SE +/- 0.21, N = 3SE +/- 0.26, N = 353.1452.6752.7152.7752.851. (CXX) g++ options: -fopenmp -pthread -lmpi_cxx -lmpi

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding123456001200180024003000SE +/- 10.95, N = 3SE +/- 4.46, N = 3SE +/- 7.85, N = 3SE +/- 4.51, N = 3SE +/- 11.98, N = 32618.532622.752633.112633.322609.751. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

JPEG XL

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL 0.3.1Input: PNG - Encode Speed: 712345246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 37.867.937.937.927.911. (CXX) g++ options: -funwind-tables -O3 -O2 -fPIE -pie -pthread -ldl

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression12345306090120150SE +/- 0.33, N = 3SE +/- 0.33, N = 31151151141151141. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgvsamples, More Is BetterChaos Group V-RAY 5Mode: CPU123454K8K12K16K20KSE +/- 47.06, N = 3SE +/- 70.27, N = 3SE +/- 6.44, N = 3SE +/- 54.08, N = 3SE +/- 84.18, N = 31782817872178421774217849

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU12345120240360480600SE +/- 1.09, N = 3SE +/- 0.29, N = 3SE +/- 2.25, N = 3SE +/- 1.64, N = 3SE +/- 0.88, N = 35535555575555551. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 10123450.60621.21241.81862.42483.031SE +/- 0.006, N = 3SE +/- 0.012, N = 3SE +/- 0.005, N = 3SE +/- 0.005, N = 3SE +/- 0.007, N = 32.6772.6752.6942.6792.676

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21123455001000150020002500SE +/- 17.98, N = 13SE +/- 19.83, N = 12SE +/- 27.77, N = 12SE +/- 21.07, N = 12SE +/- 28.27, N = 122222.42222.62207.12215.62216.01. (CXX) g++ options: -O3 -march=native -rdynamic

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC2123454080120160200SE +/- 0.08, N = 3SE +/- 0.28, N = 3SE +/- 0.16, N = 3SE +/- 0.12, N = 3SE +/- 0.23, N = 3184.00182.98183.42182.72183.121. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression12345100200300400500SE +/- 0.88, N = 3SE +/- 1.67, N = 3SE +/- 1.20, N = 3SE +/- 1.33, N = 3SE +/- 0.88, N = 34694704724694701. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression12345400800120016002000SE +/- 3.33, N = 3SE +/- 1.86, N = 3SE +/- 3.38, N = 3SE +/- 3.71, N = 3179218011795180318001. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC11234570140210280350SE +/- 0.30, N = 3SE +/- 0.85, N = 3SE +/- 1.01, N = 3SE +/- 0.49, N = 3SE +/- 0.49, N = 3315.62313.70314.66314.27313.691. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1123450.0810.1620.2430.3240.405SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 3SE +/- 0.000, N = 30.3580.3590.3590.3600.359

Gcrypt Library

Libgcrypt is a general purpose cryptographic library developed as part of the GnuPG project. This is a benchmark of libgcrypt's integrated benchmark and is measuring the time to run the benchmark command with a cipher/mac/hash repetition count set for 50 times as simple, high level look at the overall crypto performance of the system under test. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.91234550100150200250SE +/- 0.63, N = 3SE +/- 0.34, N = 3SE +/- 0.06, N = 3SE +/- 0.49, N = 3SE +/- 0.29, N = 3212.80212.06211.79211.90212.961. (CC) gcc options: -O2 -fvisibility=hidden -lgpg-error

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression123454080120160200SE +/- 0.33, N = 3SE +/- 0.67, N = 31891891891881891. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP1234580160240320400SE +/- 0.47, N = 3SE +/- 0.82, N = 3SE +/- 0.48, N = 3SE +/- 0.00, N = 3SE +/- 1.66, N = 3376.89377.36377.84378.79378.801. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding12345400800120016002000SE +/- 2.23, N = 3SE +/- 0.35, N = 3SE +/- 2.62, N = 3SE +/- 0.61, N = 3SE +/- 0.31, N = 31815.851820.901813.561821.821821.361. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP1234512K24K36K48K60KSE +/- 86.69, N = 3SE +/- 37.92, N = 3SE +/- 83.09, N = 3SE +/- 86.41, N = 3SE +/- 53.09, N = 355767.1655622.9755666.7255656.1255876.181. (CXX) g++ options: -O3 -march=native -fopenmp

LULESH

LULESH is the Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.31234514002800420056007000SE +/- 21.98, N = 3SE +/- 16.05, N = 3SE +/- 43.78, N = 3SE +/- 39.16, N = 3SE +/- 30.71, N = 36695.046707.866678.496697.326708.231. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

GnuPG

This test times how long it takes to encrypt a sample file using GnuPG. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterGnuPG 2.2.272.7GB Sample File Encryption123451530456075SE +/- 0.36, N = 3SE +/- 0.12, N = 3SE +/- 0.12, N = 3SE +/- 0.20, N = 3SE +/- 0.15, N = 367.3967.1867.1267.1567.101. (CC) gcc options: -O2

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression12345150300450600750SE +/- 0.88, N = 37117117097087091. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression1234550100150200250SE +/- 0.58, N = 32372382382372371. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression12345110220330440550SE +/- 2.19, N = 3SE +/- 2.00, N = 3SE +/- 2.67, N = 3SE +/- 3.67, N = 3SE +/- 3.67, N = 34954954964964941. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 6123450.28670.57340.86011.14681.4335SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 31.2691.2721.2701.2691.274

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: 20k Atoms123453691215SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.03, N = 311.1811.2111.2011.2111.221. (CXX) g++ options: -O3 -pthread -lm

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression12345130260390520650SE +/- 1.20, N = 3SE +/- 2.73, N = 3SE +/- 1.00, N = 3SE +/- 0.67, N = 36136126126126111. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2021Input: water_GMX50_bare123450.38320.76641.14961.53281.916SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 31.6991.6981.6981.7001.7031. (CXX) g++ options: -O3 -pthread

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression12345400800120016002000SE +/- 6.00, N = 3SE +/- 3.06, N = 3SE +/- 1.76, N = 3SE +/- 1.73, N = 3176717671772177117701. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression12345100200300400500SE +/- 0.67, N = 3SE +/- 1.53, N = 3SE +/- 0.88, N = 3SE +/- 1.15, N = 3SE +/- 1.86, N = 34834844844834831. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

rav1e

Xiph rav1e is a Rust-written AV1 video encoder. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 5123450.21980.43960.65940.87921.099SE +/- 0.002, N = 3SE +/- 0.003, N = 3SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.001, N = 30.9750.9760.9750.9760.977

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v21234580160240320400SE +/- 0.37, N = 3SE +/- 0.27, N = 3SE +/- 0.39, N = 3SE +/- 0.38, N = 3SE +/- 0.31, N = 3361.24360.92361.45361.43361.49MIN: 357.28 / MAX: 391.58MIN: 356.72 / MAX: 372.66MIN: 356.32 / MAX: 386.75MIN: 356.24 / MAX: 380.38MIN: 357.11 / MAX: 384.741. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

ParaView

This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080123451632486480SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 372.3172.2672.2372.3272.31

OpenBenchmarking.orgMiVoxels / Sec, More Is BetterParaView 5.9Test: Wavelet Volume - Resolution: 1920 x 1080123452004006008001000SE +/- 0.52, N = 3SE +/- 0.36, N = 3SE +/- 0.26, N = 3SE +/- 0.60, N = 3SE +/- 0.98, N = 31156.971156.221155.671157.111156.87

Timed Godot Game Engine Compilation

This test times how long it takes to compile the Godot Game Engine. Godot is a popular, open-source, cross-platform 2D/3D game engine and is built using the SCons build system and targeting the X11 platform. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile1234520406080100SE +/- 0.06, N = 3SE +/- 0.29, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 3SE +/- 0.11, N = 398.8998.9398.8798.9698.87

ParaView

This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 10801234520406080100SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 382.9783.0082.9883.0182.97

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Wavelet Contour - Resolution: 1920 x 1080123452004006008001000SE +/- 0.51, N = 3SE +/- 0.16, N = 3SE +/- 0.14, N = 3SE +/- 0.39, N = 3SE +/- 0.27, N = 3864.68864.94864.80865.03864.65

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and Convolutional Resamping Benchmark (tConvolve) as well as some previous ASKAP benchmarks being included as well for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding123459001800270036004500SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 33973.973973.973973.973973.973973.971. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

lzbench

lzbench is an in-memory benchmark of various compressors. The file used for compression is a Linux kernel source tree tarball. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression123451020304050SE +/- 0.33, N = 342424242421. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as a open-source, cross-platform, high performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1234516003200480064008000SE +/- 57.46, N = 3SE +/- 133.82, N = 12SE +/- 173.63, N = 9SE +/- 109.17, N = 3SE +/- 212.04, N = 12753172497080745769391. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU12345140280420560700SE +/- 5.46, N = 3SE +/- 11.30, N = 12SE +/- 13.29, N = 12SE +/- 13.34, N = 12SE +/- 11.09, N = 126306496586596521. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v3123451122334455SE +/- 1.80, N = 5SE +/- 0.10, N = 3SE +/- 2.44, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 344.1648.3345.9748.3848.64MIN: 40.29 / MAX: 51.28MIN: 47.9 / MAX: 50.26MIN: 40.58 / MAX: 48.74MIN: 47.91 / MAX: 48.97MIN: 48.25 / MAX: 49.281. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_224123451.04272.08543.12814.17085.2135SE +/- 0.129, N = 5SE +/- 0.086, N = 3SE +/- 0.205, N = 3SE +/- 0.041, N = 3SE +/- 0.008, N = 34.2014.6344.3484.5134.546MIN: 3.68 / MAX: 4.8MIN: 4.2 / MAX: 4.99MIN: 3.62 / MAX: 4.88MIN: 4.22 / MAX: 4.96MIN: 4.28 / MAX: 4.911. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein123453691215SE +/- 0.370, N = 12SE +/- 0.433, N = 15SE +/- 0.361, N = 12SE +/- 0.334, N = 15SE +/- 0.306, N = 159.4839.0849.4089.0998.9011. (CXX) g++ options: -O3 -pthread -lm

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C12345400800120016002000SE +/- 23.92, N = 15SE +/- 51.51, N = 15SE +/- 51.69, N = 12SE +/- 32.10, N = 15SE +/- 46.79, N = 152081.022001.112033.281997.311999.521. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Etcpak

Etcpack is the self-proclaimed "fastest ETC compressor on the planet" with focused on providing open-source, very fast ETC and S3 texture compression support. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering1234570140210280350SE +/- 0.86, N = 3SE +/- 0.32, N = 3SE +/- 0.60, N = 3SE +/- 7.07, N = 15SE +/- 0.09, N = 3303.57303.51303.11290.73303.891. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

ParaView

This test runs ParaView benchmarks: an open-source data analytics and visualization application. Paraview describes itself as "an open-source, multi-platform data analysis and visualization application. ParaView users can quickly build visualizations to analyze their data using qualitative and quantitative techniques." Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiPolys / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 108012345100200300400500SE +/- 0.37, N = 3SE +/- 0.39, N = 3SE +/- 0.27, N = 3SE +/- 12.92, N = 11SE +/- 0.34, N = 3481.69481.83481.42468.46481.35

OpenBenchmarking.orgFrames / Sec, More Is BetterParaView 5.9Test: Many Spheres - Resolution: 1920 x 1080123451.08232.16463.24694.32925.4115SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.13, N = 11SE +/- 0.00, N = 34.804.814.804.674.80

IOR

IOR is a parallel I/O storage benchmark making use of MPI with a particular focus on HPC (High Performance Computing) systems. IOR is developed at the Lawrence Livermore National Laboratory (LLNL). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 8MB - Disk Target: Default Test Directory12345110220330440550SE +/- 25.39, N = 15SE +/- 5.42, N = 15SE +/- 4.96, N = 15SE +/- 4.21, N = 14SE +/- 6.12, N = 5525.44530.83354.01527.45510.05MIN: 290.23 / MAX: 1447.59MIN: 248.03 / MAX: 1386.96MIN: 189.98 / MAX: 1385.03MIN: 251.24 / MAX: 1378.27MIN: 222.93 / MAX: 1266.181. (CC) gcc options: -O2 -lm -pthread -lmpi

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 4MB - Disk Target: Default Test Directory123452004006008001000SE +/- 1.87, N = 3SE +/- 5.42, N = 15SE +/- 45.69, N = 12SE +/- 6.63, N = 15SE +/- 5.81, N = 6887.94466.30467.76493.98478.01MIN: 629.82 / MAX: 1334.23MIN: 212.95 / MAX: 1329.58MIN: 188.74 / MAX: 1435.44MIN: 240.44 / MAX: 1344.87MIN: 236.85 / MAX: 1266.851. (CC) gcc options: -O2 -lm -pthread -lmpi

OpenBenchmarking.orgMB/s, More Is BetterIOR 3.3.0Block Size: 2MB - Disk Target: Default Test Directory12345130260390520650SE +/- 5.79, N = 3SE +/- 4.65, N = 3SE +/- 7.93, N = 15SE +/- 10.65, N = 12SE +/- 7.25, N = 15592.90396.56429.36419.08437.60MIN: 431.01 / MAX: 1028.65MIN: 214.64 / MAX: 1028.95MIN: 220.22 / MAX: 1149.54MIN: 152.75 / MAX: 1057.83MIN: 162.6 / MAX: 1036.241. (CC) gcc options: -O2 -lm -pthread -lmpi

89 Results Shown

Redis
IOR:
  16MB - Default Test Directory
  32MB - Default Test Directory
NAS Parallel Benchmarks
Redis:
  GET
  LPUSH
QMCPACK
Mobile Neural Network
Redis
Pennant
IOR
Mobile Neural Network
WebP2 Image Encode
Ngspice
JPEG XL
JPEG XL Decoding
JPEG XL
lzbench
JPEG XL Decoding
WebP2 Image Encode
Etcpak
CloverLeaf
ASKAP
Ngspice
JPEG XL
ONNX Runtime
WebP2 Image Encode:
  Quality 95, Compression Effort 7
  Default
Google SynthMark
Redis
lzbench
FinanceBench
Mobile Neural Network
ONNX Runtime
WebP2 Image Encode
JPEG XL:
  JPEG - 8
  JPEG - 5
NAS Parallel Benchmarks
TNN
Pennant
ASKAP
JPEG XL
lzbench
Chaos Group V-RAY
ONNX Runtime
rav1e
QuantLib
Etcpak
lzbench:
  Brotli 0 - Compression
  Zstd 8 - Decompression
Etcpak
rav1e
Gcrypt Library
lzbench
ASKAP:
  Hogbom Clean OpenMP
  tConvolve MT - Gridding
FinanceBench
LULESH
GnuPG
lzbench:
  Brotli 2 - Decompression
  Libdeflate 1 - Compression
  Zstd 1 - Compression
rav1e
LAMMPS Molecular Dynamics Simulator
lzbench
GROMACS
lzbench:
  Zstd 1 - Decompression
  Crush 0 - Decompression
rav1e
TNN
ParaView:
  Wavelet Volume - 1920 x 1080:
    Frames / Sec
    MiVoxels / Sec
Timed Godot Game Engine Compilation
ParaView:
  Wavelet Contour - 1920 x 1080:
    Frames / Sec
    MiPolys / Sec
ASKAP
lzbench
ONNX Runtime:
  super-resolution-10 - OpenMP CPU
  bertsquad-10 - OpenMP CPU
Mobile Neural Network:
  inception-v3
  MobileNetV2_224
LAMMPS Molecular Dynamics Simulator
NAS Parallel Benchmarks
Etcpak
ParaView:
  Many Spheres - 1920 x 1080:
    MiPolys / Sec
    Frames / Sec
IOR:
  8MB - Default Test Directory
  4MB - Default Test Directory
  2MB - Default Test Directory