openSUSE Tumbleweed GCC12 Benchmarks

openSUSE Tumbleweed before/after GCC 12 upgrade. Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2205149-NE-TUMBLEWEE25

Run Management

Result Identifier | Date Run | Test Duration
Tumbleweed 20220507 | May 09 2022 | 8 Hours, 35 Minutes
Tumbleweed 20220510 | May 13 2022 | 10 Hours, 56 Minutes
Average test duration: 9 Hours, 46 Minutes


openSUSE Tumbleweed GCC12 Benchmarks: System Details

Processor: 2 x Intel Xeon Platinum 8380 @ 3.40GHz (80 Cores / 160 Threads)
Motherboard: Intel M50CYP2SB2U (SE5C6200.86B.0022.D08.2103221623 BIOS)
Chipset: Intel Device 0998
Memory: 512GB
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VE228
Network: 2 x Intel X710 for 10GBASE-T + 2 x Intel E810-C for QSFP
File-System: btrfs
Screen Resolution: 1920x1080

Component | Tumbleweed 20220507 | Tumbleweed 20220510
OS | openSUSE Tumbleweed 20220507 | openSUSE 20220510
Kernel | 5.17.4-1-default (x86_64) | 5.17.5-1-default (x86_64)
Compiler | GCC 11.2.1 20220420 [revision 691af15031e00227ba6d5935c1d737026cda4129] | GCC 12.1.0

Kernel Details
- Transparent Huge Pages: always

Compiler Details
- Tumbleweed 20220507: --build=x86_64-suse-linux --disable-libcc1 --disable-libssp --disable-libstdcxx-pch --disable-libvtv --disable-werror --enable-cet=auto --enable-checking=release --enable-gnu-indirect-function --enable-host-shared --enable-languages=c,c++,objc,fortran,obj-c++,ada,go,d,jit --enable-libphobos --enable-libstdcxx-allocator=new --enable-link-mutex --enable-linux-futex --enable-multilib --enable-offload-targets=nvptx-none,amdgcn-amdhsa, --enable-plugin --enable-ssp --enable-version-specific-runtime-libs --host=x86_64-suse-linux --mandir=/usr/share/man --with-arch-32=x86-64 --with-build-config=bootstrap-lto-lean --with-gcc-major-version-only --with-slibdir=/lib64 --with-tune=generic --without-cuda-driver --without-system-libunwind
- Tumbleweed 20220510: --build=x86_64-suse-linux --disable-libcc1 --disable-libssp --disable-libstdcxx-pch --disable-libvtv --disable-werror --enable-cet=auto --enable-checking=release --enable-gnu-indirect-function --enable-host-shared --enable-languages=c,c++,objc,fortran,obj-c++,ada,go,d,jit --enable-libphobos --enable-libstdcxx-allocator=new --enable-link-mutex --enable-linux-futex --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none,amdgcn-amdhsa, --enable-plugin --enable-ssp --enable-version-specific-runtime-libs --host=x86_64-suse-linux --mandir=/usr/share/man --with-arch-32=x86-64 --with-build-config=bootstrap-lto-lean --with-gcc-major-version-only --with-slibdir=/lib64 --with-tune=generic --without-cuda-driver --without-system-libunwind

Processor Details
- Scaling Governor: intel_pstate powersave (EPP: balance_performance)
- CPU Microcode: 0xd000331

Java Details
- Tumbleweed 20220507: OpenJDK Runtime Environment (IcedTea 3.22.0) (build 1.8.0_322-b06 suse-2.1-x86_64)
- Tumbleweed 20220510: OpenJDK Runtime Environment (IcedTea 3.22.0) (build 1.8.0_322-b06 suse-2.2-x86_64)

Python Details
- Python 3.8.13

Security Details
- itlb_multihit: Not affected
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Vulnerable: eIBRS with unprivileged eBPF
- srbds: Not affected
- tsx_async_abort: Not affected

[Chart: Tumbleweed 20220507 vs. Tumbleweed 20220510 Comparison (Phoronix Test Suite). Per-test relative differences from the baseline, ranging from 2% to 17.6%.]

openSUSE Tumbleweed GCC12 Benchmarks: Result Overview

Test | Tumbleweed 20220507 | Tumbleweed 20220510
stress-ng: MMAP | 4664.62 | 4758.63
stress-ng: Crypto | 85026.09 | 88400.87
stress-ng: Malloc | 311705065.96 | 322581943.46
stress-ng: Forking | 86555.94 | 86425.70
stress-ng: SENDFILE | 1308526.97 | 1228020.63
stress-ng: CPU Stress | 136088.24 | 139265.42
stress-ng: Semaphores | 12219151.39 | 12156959.44
stress-ng: Matrix Math | 307118.55 | 330917.14
stress-ng: Vector Math | 324345.37 | 308630.64
stress-ng: Memory Copying | 13248.44 | 13258.18
stress-ng: Context Switching | 20744069.17 | 20458137.33
stress-ng: Glibc Qsort Data Sorting | 934.51 | 937.17
stress-ng: System V Message Passing | 6948990.47 | 6134754.47
amg | 2159126667 | 2158861667
svt-av1: Preset 10 - Bosphorus 4K | 80.048 | 80.182
svt-av1: Preset 12 - Bosphorus 4K | 92.525 | 91.807
svt-hevc: 1 - Bosphorus 1080p | 29.83 | 30.09
svt-hevc: 7 - Bosphorus 1080p | 174.38 | 175.95
svt-hevc: 10 - Bosphorus 1080p | 264.92 | 266.83
svt-vp9: VMAF Optimized - Bosphorus 1080p | 221.22 | 217.89
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 214.71 | 213.09
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 175.65 | 178.91
onnx: GPT-2 - CPU - Standard | 9395 | 9620
onnx: yolov4 - CPU - Standard | 622 | 634
onnx: ArcFace ResNet-100 - CPU - Standard | 1820 | 2132
onnx: super-resolution-10 - CPU - Standard | 12429 | 12510
compress-zstd: 3 - Compression Speed | 6426.1 | 6509.6
compress-zstd: 19 - Compression Speed | 85.8 | 88.1
compress-zstd: 19 - Decompression Speed | 2659.8 | 2650.9
compress-zstd: 3, Long Mode - Compression Speed | 294.9 | 301.7
compress-zstd: 3, Long Mode - Decompression Speed | 3232.9 | 3226.6
compress-zstd: 8, Long Mode - Compression Speed | 323.6 | 337.8
compress-zstd: 19, Long Mode - Compression Speed | 47.7 | 46.8
compress-zstd: 19, Long Mode - Decompression Speed | 2742.1 | 2755.5
compress-zstd: 3 - Compression Speed | 6695.2 | 6645.9
compress-zstd: 3 - Decompression Speed | 3062.5 | 3000.5
compress-zstd: 19 - Compression Speed | 84.3 | 85.4
compress-zstd: 19 - Decompression Speed | 2576.2 | 2515.7
compress-zstd: 3, Long Mode - Compression Speed | 278.6 | 292.3
compress-zstd: 3, Long Mode - Decompression Speed | 3251.9 | 3215.4
compress-zstd: 8, Long Mode - Compression Speed | 314.3 | 318.7
compress-zstd: 8, Long Mode - Decompression Speed | 3248.6 | 3196.6
compress-zstd: 19, Long Mode - Compression Speed | 43.1 | 42.1
compress-zstd: 19, Long Mode - Decompression Speed | 2674.2 | 2626.5
stockfish: Total Time | 184534600 | 182150796
gromacs: MPI CPU - water_GMX50_bare | 9.052 | 9.121
lammps: 20k Atoms | 34.102 | 35.255
lammps: Rhodopsin Protein | 24.607 | 25.850
nginx: 100 | 79426.07 | 79403.07
nginx: 200 | 197968.99 | 196528.56
nginx: 500 | 208849.70 | 206578.82
nginx: 1000 | 214270.04 | 211423.33
apache: 200 | 197075.88 | 194927.31
apache: 500 | 206530.83 | 205683.86
apache: 1000 | 213476.35 | 210703.41
liquid-dsp: 128 - 256 - 57 | 1410166667 | 1595800000
liquid-dsp: 160 - 256 - 57 | 1458400000 | 1715666667
numpy | 392.96 | 386.05
phpbench: PHP Benchmark Suite | 655272 | 650951
pennant: sedovbig | 15.701791 | 15.565481
pybench: Total For Average Test Times | 1206 | 1110
pyperformance: go | 328 | 313
pyperformance: 2to3 | 558 | 525
pyperformance: chaos | 208 | 180
pyperformance: float | 205 | 187
pyperformance: nbody | 219 | 189
pyperformance: pathlib | 31.1 | 28.6
pyperformance: json_loads | 37.5 | 36.9
pyperformance: crypto_pyaes | 194 | 170
pyperformance: regex_compile | 295 | 272
pyperformance: python_startup | 26.5 | 25.5
pyperformance: django_template | 85.7 | 76.5
pyperformance: pickle_pure_python | 790 | 691
renaissance: Rand Forest | 1503.2 | 1507.3
renaissance: ALS Movie Lens | 18753.2 | 18758.7
renaissance: Apache Spark ALS | 2978.4 | 2964.9
renaissance: Apache Spark Bayes | 1130.7 | 1183.5
renaissance: Apache Spark PageRank | 3586.2 | 3455.4
renaissance: Finagle HTTP Requests | 5779.2 | 5708.6
renaissance: In-Memory Database Shootout | 21253.2 | 19686.6
renaissance: Akka Unbalanced Cobwebbed Tree | 34923.6 | 34659.4
renaissance: Genetic Algorithm Using Jenetics + Futures | 6895.9 | 7069.7
onednn: Recurrent Neural Network Training - f32 - CPU | 678.302 | 676.385
onednn: Recurrent Neural Network Inference - f32 - CPU | 430.983 | 435.588
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 688.678 | 658.221
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 431.517 | 436.172
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 705.397 | 695.647
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 424.347 | 437.885
dacapobench: H2 | 13062 | 13335
wireguard | 580.279 | 575.210
incompact3d: X3D-benchmarking input.i3d | 294.412639 | 293.188894
incompact3d: input.i3d 193 Cells Per Direction | 11.4217421 | 11.3398183
avifenc: 0 | 85.511 | 85.034
avifenc: 2 | 47.816 | 47.938
avifenc: 6 | 5.878 | 6.060
avifenc: 6, Lossless | 9.789 | 10.070
avifenc: 10, Lossless | 7.097 | 7.315
tachyon: Total Time | 23.6253 | 24.1966
astcenc: Medium | 5.5042 | 5.1969
astcenc: Thorough | 7.0018 | 7.1589
astcenc: Exhaustive | 8.9358 | 8.6924

Stress-NG

Stress-NG is a Linux stress tool developed by Colin King of Canonical. Learn more via the OpenBenchmarking.org test page.

Stress-NG 0.14 (Bogo Ops/s, More Is Better)

Test | Tumbleweed 20220510 | Tumbleweed 20220507
MMAP | 4758.63 (SE +/- 39.04, N = 3) | 4664.62 (SE +/- 35.45, N = 3)
Crypto | 88400.87 (SE +/- 348.57, N = 3) | 85026.09 (SE +/- 631.01, N = 3)
Malloc | 322581943.46 (SE +/- 630952.97, N = 3) | 311705065.96 (SE +/- 999889.80, N = 3)
Forking | 86425.70 (SE +/- 781.93, N = 3) | 86555.94 (SE +/- 1050.04, N = 3)
SENDFILE | 1228020.63 (SE +/- 7676.13, N = 3) | 1308526.97 (SE +/- 4804.43, N = 3)
CPU Stress | 139265.42 (SE +/- 404.23, N = 3) | 136088.24 (SE +/- 474.84, N = 3)
Semaphores | 12156959.44 (SE +/- 9586.79, N = 3) | 12219151.39 (SE +/- 6493.81, N = 3)
Matrix Math | 330917.14 (SE +/- 500.14, N = 3) | 307118.55 (SE +/- 211.97, N = 3)
Vector Math | 308630.64 (SE +/- 969.86, N = 3) | 324345.37 (SE +/- 1007.62, N = 3)
Memory Copying | 13258.18 (SE +/- 13.12, N = 3) | 13248.44 (SE +/- 7.33, N = 3)
Context Switching | 20458137.33 (SE +/- 80641.32, N = 3) | 20744069.17 (SE +/- 211441.81, N = 3)
Glibc Qsort Data Sorting | 937.17 (SE +/- 3.85, N = 3) | 934.51 (SE +/- 3.17, N = 3)
System V Message Passing | 6134754.47 (SE +/- 83162.42, N = 3) | 6948990.47 (SE +/- 84690.67, N = 4)

1. (CC) gcc options: -O2 -std=gnu99 -lm -latomic -lc -lcrypt -ldl -ljpeg -lrt -lz -pthread
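
Several of the Stress-NG differences are small relative to the reported standard errors, so it can help to compute both the relative change and a rough noise check. The following is a minimal sketch, not part of the Phoronix Test Suite: the function names and the threshold `k = 2.0` (two combined standard errors) are assumptions chosen for illustration, and the four rows are copied from the results above.

```python
import math

# (test, 20220510 value, 20220510 SE, 20220507 value, 20220507 SE),
# copied from the Stress-NG results above (Bogo Ops/s, more is better).
STRESS_NG = [
    ("MMAP", 4758.63, 39.04, 4664.62, 35.45),
    ("Forking", 86425.70, 781.93, 86555.94, 1050.04),
    ("SENDFILE", 1228020.63, 7676.13, 1308526.97, 4804.43),
    ("Matrix Math", 330917.14, 500.14, 307118.55, 211.97),
]


def percent_change(new, old):
    """Relative change of `new` versus `old`, in percent."""
    return (new - old) / old * 100.0


def exceeds_noise(new, se_new, old, se_old, k=2.0):
    """True if |new - old| is larger than k combined standard errors.

    k = 2.0 is an assumed rule of thumb, not a Phoronix Test Suite setting.
    """
    combined_se = math.sqrt(se_new ** 2 + se_old ** 2)
    return abs(new - old) > k * combined_se


def summarize(rows):
    """Return (test, percent change, exceeds-noise flag) for each row."""
    return [
        (name, percent_change(new, old), exceeds_noise(new, se_new, old, se_old))
        for name, new, se_new, old, se_old in rows
    ]


if __name__ == "__main__":
    for name, change, clear in summarize(STRESS_NG):
        label = "beyond run-to-run noise" if clear else "within run-to-run noise"
        print(f"{name:12s} {change:+6.2f}%  {label}")
```

With these inputs, the Matrix Math and SENDFILE differences exceed the assumed threshold, while MMAP and Forking fall within it.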

Algebraic Multi-Grid Benchmark

AMG is a parallel algebraic multigrid solver for linear systems arising from problems on unstructured grids. The driver provided with AMG builds linear systems for various 3-dimensional problems. Learn more via the OpenBenchmarking.org test page.

Algebraic Multi-Grid Benchmark 1.2 (Figure Of Merit, More Is Better)

Tumbleweed 20220510 | 2158861667 (SE +/- 136645.93, N = 3)
Tumbleweed 20220507 | 2159126667 (SE +/- 1049155.59, N = 3)

1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 1.0 (Frames Per Second, More Is Better)

Encoder Mode - Input | Tumbleweed 20220510 | Tumbleweed 20220507
Preset 10 - Bosphorus 4K | 80.18 (SE +/- 0.85, N = 3) | 80.05 (SE +/- 0.57, N = 3)
Preset 12 - Bosphorus 4K | 91.81 (SE +/- 0.47, N = 3) | 92.53 (SE +/- 1.07, N = 3)

1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq -pie

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0 (Frames Per Second, More Is Better)

Tuning - Input | Tumbleweed 20220510 | Tumbleweed 20220507
1 - Bosphorus 1080p | 30.09 (SE +/- 0.09, N = 3) | 29.83 (SE +/- 0.13, N = 3)
7 - Bosphorus 1080p | 175.95 (SE +/- 2.16, N = 3) | 174.38 (SE +/- 1.95, N = 3)
10 - Bosphorus 1080p | 266.83 (SE +/- 3.03, N = 4) | 264.92 (SE +/- 2.84, N = 3)

1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 (Frames Per Second, More Is Better)

Tuning - Input | Tumbleweed 20220510 | Tumbleweed 20220507
VMAF Optimized - Bosphorus 1080p | 217.89 (SE +/- 0.66, N = 3) | 221.22 (SE +/- 1.95, N = 3)
PSNR/SSIM Optimized - Bosphorus 1080p | 213.09 (SE +/- 1.50, N = 3) | 214.71 (SE +/- 2.48, N = 3)
Visual Quality Optimized - Bosphorus 1080p | 178.91 (SE +/- 0.79, N = 3) | 175.65 (SE +/- 1.35, N = 15)

1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.11 (Inferences Per Minute, More Is Better)

Model - Device - Executor | Tumbleweed 20220510 | Tumbleweed 20220507
GPT-2 - CPU - Standard | 9620 (SE +/- 213.42, N = 12) | 9395 (SE +/- 110.66, N = 3)
yolov4 - CPU - Standard | 634 (SE +/- 7.65, N = 12) | 622 (SE +/- 7.13, N = 4)
ArcFace ResNet-100 - CPU - Standard | 2132 (SE +/- 23.47, N = 3) | 1820 (SE +/- 16.03, N = 12)
super-resolution-10 - CPU - Standard | 12510 (SE +/- 25.63, N = 3) | 12429 (SE +/- 28.06, N = 3)

1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

Zstd Compression

This test measures the time needed to compress/decompress a sample file (a FreeBSD disk image - FreeBSD-12.2-RELEASE-amd64-memstick.img) using Zstd compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

Zstd Compression 1.5.0 (MB/s, More Is Better)

Compression Level | Tumbleweed 20220510 | Tumbleweed 20220507
3 - Compression Speed | 6509.6 (SE +/- 54.93, N = 15) | 6426.1 (SE +/- 80.06, N = 4)
19 - Compression Speed | 88.1 (SE +/- 0.24, N = 3) | 85.8 (SE +/- 0.42, N = 3)
19 - Decompression Speed | 2650.9 (SE +/- 8.47, N = 3) | 2659.8 (SE +/- 4.98, N = 3)
3, Long Mode - Compression Speed | 301.7 (SE +/- 2.88, N = 3) | 294.9 (SE +/- 0.56, N = 3)
3, Long Mode - Decompression Speed | 3226.6 (SE +/- 3.18, N = 3) | 3232.9 (SE +/- 7.60, N = 3)
8, Long Mode - Compression Speed | 337.8 (SE +/- 2.40, N = 15) | 323.6 (SE +/- 2.82, N = 3)
19, Long Mode - Compression Speed | 46.8 (SE +/- 0.42, N = 15) | 47.7 (SE +/- 0.50, N = 3)
19, Long Mode - Decompression Speed | 2755.5 (SE +/- 3.84, N = 15) | 2742.1 (SE +/- 1.47, N = 3)

1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression (MB/s, More Is Better)

Compression Level | Tumbleweed 20220510 | Tumbleweed 20220507
3 - Compression Speed | 6645.9 (SE +/- 72.01, N = 3) | 6695.2 (SE +/- 49.94, N = 3)
3 - Decompression Speed | 3000.5 (SE +/- 1.91, N = 3) | 3062.5 (SE +/- 2.28, N = 3)
19 - Compression Speed | 85.4 (SE +/- 0.81, N = 15) | 84.3 (SE +/- 0.58, N = 15)
19 - Decompression Speed | 2515.7 (SE +/- 2.92, N = 15) | 2576.2 (SE +/- 4.43, N = 15)
3, Long Mode - Compression Speed | 292.3 (SE +/- 3.53, N = 4) | 278.6 (SE +/- 1.27, N = 3)
3, Long Mode - Decompression Speed | 3215.4 (SE +/- 7.96, N = 4) | 3251.9 (SE +/- 17.69, N = 3)
8, Long Mode - Compression Speed | 318.7 (SE +/- 1.70, N = 3) | 314.3 (SE +/- 1.64, N = 3)
8, Long Mode - Decompression Speed | 3196.6 (SE +/- 13.36, N = 3) | 3248.6 (SE +/- 13.20, N = 3)
19, Long Mode - Compression Speed | 42.1 (SE +/- 0.25, N = 3) | 43.1 (SE +/- 0.46, N = 15)
19, Long Mode - Decompression Speed | 2626.5 (SE +/- 3.90, N = 3) | 2674.2 (SE +/- 1.22, N = 15)

1. *** zstd command line interface 64-bits v1.5.2, by Yann Collet ***

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Stockfish 13 - Total Time (Nodes Per Second, More Is Better)

Tumbleweed 20220510 | 182150796 (SE +/- 1359631.79, N = 15)
Tumbleweed 20220507 | 184534600 (SE +/- 1769551.70, N = 3)

1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver

GROMACS

This is a test of the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package using the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

GROMACS 2021.2 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better)

Tumbleweed 20220510 | 9.121 (SE +/- 0.026, N = 3)
Tumbleweed 20220507 | 9.052 (SE +/- 0.027, N = 3)

1. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 29Oct2020 (ns/day, More Is Better)

Model | Tumbleweed 20220510 | Tumbleweed 20220507
20k Atoms | 35.26 (SE +/- 0.10, N = 3) | 34.10 (SE +/- 0.05, N = 3)
Rhodopsin Protein | 25.85 (SE +/- 0.57, N = 12) | 24.61 (SE +/- 0.45, N = 15)

1. (CXX) g++ options: -O3 -lm

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

nginx 1.21.1 (Requests Per Second, More Is Better)

Concurrent Requests | Tumbleweed 20220510 | Tumbleweed 20220507
100 | 79403.07 (SE +/- 303.63, N = 3) | 79426.07 (SE +/- 417.99, N = 3)
200 | 196528.56 (SE +/- 1519.46, N = 15) | 197968.99 (SE +/- 1923.63, N = 3)
500 | 206578.82 (SE +/- 1939.73, N = 6) | 208849.70 (SE +/- 2546.05, N = 4)
1000 | 211423.33 (SE +/- 2147.64, N = 5) | 214270.04 (SE +/- 822.90, N = 3)

1. (CC) gcc options: -lcrypt -lz -O3 -march=native

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the Golang "Bombardier" program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

Apache HTTP Server 2.4.48 (Requests Per Second, More Is Better)

Concurrent Requests | Tumbleweed 20220510 | Tumbleweed 20220507
200 | 194927.31 (SE +/- 2234.54, N = 13) | 197075.88 (SE +/- 1210.47, N = 3)
500 | 205683.86 (SE +/- 1501.57, N = 3) | 206530.83 (SE +/- 2654.22, N = 3)
1000 | 210703.41 (SE +/- 2186.63, N = 3) | 213476.35 (SE +/- 1527.83, N = 3)

1. (CC) gcc options: -O2 (the charts additionally list the per-run options "-shared -fPIC" and "-lcrypt -lpthread")

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31 (samples/s, More Is Better)

Threads - Buffer Length - Filter Length | Tumbleweed 20220510 | Tumbleweed 20220507
128 - 256 - 57 | 1595800000 (SE +/- 10500000.00, N = 3) | 1410166667 (SE +/- 3735118.62, N = 3)
160 - 256 - 57 | 1715666667 (SE +/- 15460954.40, N = 3) | 1458400000 (SE +/- 7379927.73, N = 3)

1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)

Tumbleweed 20220510 | 386.05 (SE +/- 3.07, N = 3)
Tumbleweed 20220507 | 392.96 (SE +/- 4.68, N = 3)

PHPBench

PHPBench is a benchmark suite for PHP. It performs a large number of simple tests in order to bench various aspects of the PHP interpreter. PHPBench can be used to compare hardware, operating systems, PHP versions, PHP accelerators and caches, compiler options, etc. Learn more via the OpenBenchmarking.org test page.

PHPBench 0.8.1 - PHP Benchmark Suite (Score, More Is Better)

Tumbleweed 20220510 | 650951 (SE +/- 5984.74, N = 3)
Tumbleweed 20220507 | 655272 (SE +/- 118.82, N = 3)

Pennant

Pennant is an application focused on hydrodynamics on general unstructured meshes in 2D. Learn more via the OpenBenchmarking.org test page.

Pennant 1.0.1 - Test: sedovbig (Hydro Cycle Time - Seconds, Fewer Is Better)

Tumbleweed 20220510 | 15.57 (SE +/- 0.17, N = 3)
Tumbleweed 20220507 | 15.70 (SE +/- 0.05, N = 3)

1. (CXX) g++ options: -fopenmp -lmpi

PyBench

This test profile reports the total time of the different average timed test results from PyBench. PyBench reports average test times for different functions such as BuiltinFunctionCalls and NestedForLoops, with this total result providing a rough estimate as to Python's average performance on a given system. This test profile runs PyBench each time for 20 rounds. Learn more via the OpenBenchmarking.org test page.

PyBench 2018-02-16 - Total For Average Test Times
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 1110 (SE +/- 1.20, N = 3)
  Tumbleweed 20220507: 1206 (SE +/- 1.20, N = 3)
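
To put the two PyBench totals in relative terms, here is a minimal Python sketch. The helper name `percent_change` is hypothetical and not part of the Phoronix Test Suite; the two input values are the PyBench totals reported above, and the sketch assumes the 20220507 run is the baseline.

```python
def percent_change(before: float, after: float) -> float:
    """Return the change from `before` to `after` as a percentage of `before`."""
    if before == 0:
        raise ValueError("before must be non-zero")
    return (after - before) / before * 100.0


# PyBench totals in milliseconds (fewer is better), copied from the result above.
pybench_ms = {
    "Tumbleweed 20220507": 1206.0,  # baseline run (assumption: treated as "before")
    "Tumbleweed 20220510": 1110.0,  # run after the GCC 12 upgrade
}

change = percent_change(
    pybench_ms["Tumbleweed 20220507"],
    pybench_ms["Tumbleweed 20220510"],
)
# A negative value means less total time, which is an improvement for this test.
print(f"PyBench total time change: {change:+.2f}%")
```

For a "More Is Better" test the sign reads the other way around: a positive change is the improvement.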

PyPerformance

PyPerformance is the reference Python performance benchmark suite. Learn more via the OpenBenchmarking.org test page.

PyPerformance 1.0.0 - Benchmark: go
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 313 (SE +/- 4.51, N = 3)
  Tumbleweed 20220507: 328 (SE +/- 2.85, N = 3)

PyPerformance 1.0.0 - Benchmark: 2to3
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 525 (SE +/- 0.88, N = 3)
  Tumbleweed 20220507: 558 (SE +/- 0.58, N = 3)

PyPerformance 1.0.0 - Benchmark: chaos
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 180 (SE +/- 0.33, N = 3)
  Tumbleweed 20220507: 208 (SE +/- 0.33, N = 3)

PyPerformance 1.0.0 - Benchmark: float
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 187 (SE +/- 0.33, N = 3)
  Tumbleweed 20220507: 205 (SE +/- 0.33, N = 3)

PyPerformance 1.0.0 - Benchmark: nbody
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 189 (SE +/- 0.00, N = 3)
  Tumbleweed 20220507: 219 (SE +/- 0.00, N = 3)

PyPerformance 1.0.0 - Benchmark: pathlib
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 28.6 (SE +/- 0.03, N = 3)
  Tumbleweed 20220507: 31.1 (SE +/- 0.03, N = 3)

PyPerformance 1.0.0 - Benchmark: json_loads
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 36.9 (SE +/- 0.03, N = 3)
  Tumbleweed 20220507: 37.5 (SE +/- 0.03, N = 3)

PyPerformance 1.0.0 - Benchmark: crypto_pyaes
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 170 (SE +/- 0.00, N = 3)
  Tumbleweed 20220507: 194 (SE +/- 0.00, N = 3)

PyPerformance 1.0.0 - Benchmark: regex_compile
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 272 (SE +/- 0.00, N = 3)
  Tumbleweed 20220507: 295 (SE +/- 0.33, N = 3)

PyPerformance 1.0.0 - Benchmark: python_startup
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 25.5 (SE +/- 0.07, N = 3)
  Tumbleweed 20220507: 26.5 (SE +/- 0.03, N = 3)

PyPerformance 1.0.0 - Benchmark: django_template
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 76.5 (SE +/- 0.06, N = 3)
  Tumbleweed 20220507: 85.7 (SE +/- 0.10, N = 3)

PyPerformance 1.0.0 - Benchmark: pickle_pure_python
Milliseconds, Fewer Is Better
  Tumbleweed 20220510: 691 (SE +/- 0.33, N = 3)
  Tumbleweed 20220507: 790 (SE +/- 0.33, N = 3)

Renaissance

Renaissance is a suite of benchmarks designed to test the JVM, with workloads ranging from Apache Spark to a Twitter-like service to Scala and other features. Learn more via the OpenBenchmarking.org test page.

Renaissance 0.12 - Test: Random Forest
ms, Fewer Is Better
  Tumbleweed 20220510: 1507.3 (SE +/- 15.97, N = 3; MIN: 1376.07 / MAX: 2006.76)
  Tumbleweed 20220507: 1503.2 (SE +/- 16.29, N = 3; MIN: 1329.44 / MAX: 1908.05)

Renaissance 0.12 - Test: ALS Movie Lens
ms, Fewer Is Better
  Tumbleweed 20220510: 18758.7 (SE +/- 117.66, N = 3; MIN: 18497.37 / MAX: 21352.15)
  Tumbleweed 20220507: 18753.2 (SE +/- 181.53, N = 3; MIN: 18390.55 / MAX: 21033.08)

Renaissance 0.12 - Test: Apache Spark ALS
ms, Fewer Is Better
  Tumbleweed 20220510: 2964.9 (SE +/- 4.56, N = 3; MIN: 2728.19 / MAX: 3331.71)
  Tumbleweed 20220507: 2978.4 (SE +/- 23.76, N = 3; MIN: 2688.22 / MAX: 3561.27)

Renaissance 0.12 - Test: Apache Spark Bayes
ms, Fewer Is Better
  Tumbleweed 20220510: 1183.5 (SE +/- 16.54, N = 15; MIN: 618.02 / MAX: 1647.68)
  Tumbleweed 20220507: 1130.7 (SE +/- 26.13, N = 12; MIN: 602.03 / MAX: 1997.71)
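
Results with a large spread, such as Apache Spark Bayes above, are easier to read when the gap between the two means is compared against the reported standard errors. The following Python sketch is only a rough heuristic, not a formal significance test and not something the Phoronix Test Suite computes here. It assumes the two runs are independent, so their standard errors combine in quadrature. The function name `gap_in_standard_errors` is hypothetical.

```python
import math


def gap_in_standard_errors(mean_a: float, se_a: float,
                           mean_b: float, se_b: float) -> float:
    """Express (mean_a - mean_b) in units of the combined standard error.

    Assumes the two measurements are independent, so the standard error
    of their difference is sqrt(se_a**2 + se_b**2).
    """
    combined_se = math.hypot(se_a, se_b)
    if combined_se == 0:
        raise ValueError("combined standard error is zero")
    return (mean_a - mean_b) / combined_se


# Apache Spark Bayes results in ms (fewer is better), copied from above.
gap = gap_in_standard_errors(
    mean_a=1183.5, se_a=16.54,   # Tumbleweed 20220510
    mean_b=1130.7, se_b=26.13,   # Tumbleweed 20220507
)
print(f"Difference is {gap:.2f} combined standard errors")
```

A gap of only one or two combined standard errors suggests the difference may be within run-to-run noise; a much larger gap is more likely to reflect a real change.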

Renaissance 0.12 - Test: Apache Spark PageRank
ms, Fewer Is Better
  Tumbleweed 20220510: 3455.4 (SE +/- 29.12, N = 3; MIN: 3005.61 / MAX: 4731.78)
  Tumbleweed 20220507: 3586.2 (SE +/- 43.42, N = 3; MIN: 3041.24 / MAX: 4391.89)

Renaissance 0.12 - Test: Finagle HTTP Requests
ms, Fewer Is Better
  Tumbleweed 20220510: 5708.6 (SE +/- 149.97, N = 15; MIN: 3401.45 / MAX: 8925.31)
  Tumbleweed 20220507: 5779.2 (SE +/- 179.26, N = 15; MIN: 3643.61 / MAX: 8878.75)

Renaissance 0.12 - Test: In-Memory Database Shootout
ms, Fewer Is Better
  Tumbleweed 20220510: 19686.6 (SE +/- 151.02, N = 3; MIN: 19418.39 / MAX: 22255.62)
  Tumbleweed 20220507: 21253.2 (SE +/- 201.25, N = 3; MIN: 20240.05 / MAX: 24744)

Renaissance 0.12 - Test: Akka Unbalanced Cobwebbed Tree
ms, Fewer Is Better
  Tumbleweed 20220510: 34659.4 (SE +/- 149.63, N = 3; MIN: 27645.02 / MAX: 34954.13)
  Tumbleweed 20220507: 34923.6 (SE +/- 184.63, N = 3; MIN: 27025.46 / MAX: 35226.8)

Renaissance 0.12 - Test: Genetic Algorithm Using Jenetics + Futures
ms, Fewer Is Better
  Tumbleweed 20220510: 7069.7 (SE +/- 54.20, N = 3; MIN: 6787.55 / MAX: 7755.21)
  Tumbleweed 20220507: 6895.9 (SE +/- 44.24, N = 3; MIN: 6518.54 / MAX: 7318.42)

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.6 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Tumbleweed 20220510: 676.39 (SE +/- 8.40, N = 15; MIN: 593.14)
  Tumbleweed 20220507: 678.30 (SE +/- 8.36, N = 4; MIN: 627.42)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

oneDNN 2.6 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Tumbleweed 20220510: 435.59 (SE +/- 5.15, N = 15; MIN: 370.6)
  Tumbleweed 20220507: 430.98 (SE +/- 6.92, N = 15; MIN: 379.38)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

oneDNN 2.6 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Tumbleweed 20220510: 658.22 (SE +/- 6.86, N = 4; MIN: 617.41)
  Tumbleweed 20220507: 688.68 (SE +/- 4.64, N = 3; MIN: 653.48)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

oneDNN 2.6 - Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Tumbleweed 20220510: 436.17 (SE +/- 5.96, N = 15; MIN: 385.48)
  Tumbleweed 20220507: 431.52 (SE +/- 8.37, N = 15; MIN: 361.41)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

oneDNN 2.6 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better
  Tumbleweed 20220510: 695.65 (SE +/- 3.41, N = 3; MIN: 659.77)
  Tumbleweed 20220507: 705.40 (SE +/- 2.91, N = 3; MIN: 672.82)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

oneDNN 2.6 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better
  Tumbleweed 20220510: 437.89 (SE +/- 7.24, N = 12; MIN: 370.22)
  Tumbleweed 20220507: 424.35 (SE +/- 8.22, N = 12; MIN: 366.93)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl -lpthread

DaCapo Benchmark

This test runs the DaCapo Benchmarks written in Java and intended to test system/CPU performance. Learn more via the OpenBenchmarking.org test page.

DaCapo Benchmark 9.12-MR1 - Java Test: H2
msec, Fewer Is Better
  Tumbleweed 20220510: 13335 (SE +/- 128.87, N = 4)
  Tumbleweed 20220507: 13062 (SE +/- 153.28, N = 4)

WireGuard + Linux Networking Stack Stress Test

This is a benchmark of the WireGuard secure VPN tunnel and Linux networking stack stress test. The test runs on the local host but does require root permissions to run. The way it works is it creates three namespaces. ns0 has a loopback device. ns1 and ns2 each have wireguard devices. Those two wireguard devices send traffic through the loopback device of ns0. The end result of this is that tests wind up testing encryption and decryption at the same time -- a pretty CPU and scheduler-heavy workflow. Learn more via the OpenBenchmarking.org test page.

WireGuard + Linux Networking Stack Stress Test
Seconds, Fewer Is Better
  Tumbleweed 20220510: 575.21 (SE +/- 8.32, N = 9)
  Tumbleweed 20220507: 580.28 (SE +/- 8.20, N = 3)

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d
Seconds, Fewer Is Better
  Tumbleweed 20220510: 293.19 (SE +/- 0.25, N = 3)
  Tumbleweed 20220507: 294.41 (SE +/- 0.24, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction
Seconds, Fewer Is Better
  Tumbleweed 20220510: 11.34 (SE +/- 0.05, N = 3)
  Tumbleweed 20220507: 11.42 (SE +/- 0.04, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi

libavif avifenc

This is a test of the AOMedia libavif library testing the encoding of a JPEG image to AV1 Image Format (AVIF). Learn more via the OpenBenchmarking.org test page.

libavif avifenc 0.10 - Encoder Speed: 0
Seconds, Fewer Is Better
  Tumbleweed 20220510: 85.03 (SE +/- 0.17, N = 3)
  Tumbleweed 20220507: 85.51 (SE +/- 0.65, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc 0.10 - Encoder Speed: 2
Seconds, Fewer Is Better
  Tumbleweed 20220510: 47.94 (SE +/- 0.09, N = 3)
  Tumbleweed 20220507: 47.82 (SE +/- 0.63, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc 0.10 - Encoder Speed: 6
Seconds, Fewer Is Better
  Tumbleweed 20220510: 6.060 (SE +/- 0.077, N = 3)
  Tumbleweed 20220507: 5.878 (SE +/- 0.064, N = 15)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc 0.10 - Encoder Speed: 6, Lossless
Seconds, Fewer Is Better
  Tumbleweed 20220510: 10.070 (SE +/- 0.107, N = 5)
  Tumbleweed 20220507: 9.789 (SE +/- 0.095, N = 15)
1. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc 0.10 - Encoder Speed: 10, Lossless
Seconds, Fewer Is Better
  Tumbleweed 20220510: 7.315 (SE +/- 0.070, N = 3)
  Tumbleweed 20220507: 7.097 (SE +/- 0.075, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. The sample scene used is the Teapot scene ray-traced to 8K x 8K with 32 samples. Learn more via the OpenBenchmarking.org test page.

Tachyon 0.99.2 - Total Time
Seconds, Fewer Is Better
  Tumbleweed 20220510: 24.20 (SE +/- 0.06, N = 3)
  Tumbleweed 20220507: 23.63 (SE +/- 0.04, N = 3)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

ASTC Encoder

ASTC Encoder (astcenc) is an encoder for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile runs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 3.2 - Preset: Medium
Seconds, Fewer Is Better
  Tumbleweed 20220510: 5.1969 (SE +/- 0.1298, N = 15)
  Tumbleweed 20220507: 5.5042 (SE +/- 0.0533, N = 15)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder 3.2 - Preset: Thorough
Seconds, Fewer Is Better
  Tumbleweed 20220510: 7.1589 (SE +/- 0.0624, N = 15)
  Tumbleweed 20220507: 7.0018 (SE +/- 0.0853, N = 4)
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder 3.2 - Preset: Exhaustive
Seconds, Fewer Is Better
  Tumbleweed 20220510: 8.6924 (SE +/- 0.0186, N = 3)
  Tumbleweed 20220507: 8.9358 (SE +/- 0.0119, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

101 Results Shown

Stress-NG:
  MMAP
  Crypto
  Malloc
  Forking
  SENDFILE
  CPU Stress
  Semaphores
  Matrix Math
  Vector Math
  Memory Copying
  Context Switching
  Glibc Qsort Data Sorting
  System V Message Passing
Algebraic Multi-Grid Benchmark
SVT-AV1:
  Preset 10 - Bosphorus 4K
  Preset 12 - Bosphorus 4K
SVT-HEVC:
  1 - Bosphorus 1080p
  7 - Bosphorus 1080p
  10 - Bosphorus 1080p
SVT-VP9:
  VMAF Optimized - Bosphorus 1080p
  PSNR/SSIM Optimized - Bosphorus 1080p
  Visual Quality Optimized - Bosphorus 1080p
ONNX Runtime:
  GPT-2 - CPU - Standard
  yolov4 - CPU - Standard
  ArcFace ResNet-100 - CPU - Standard
  super-resolution-10 - CPU - Standard
Zstd Compression:
  3 - Compression Speed
  19 - Compression Speed
  19 - Decompression Speed
  3, Long Mode - Compression Speed
  3, Long Mode - Decompression Speed
  8, Long Mode - Compression Speed
  19, Long Mode - Compression Speed
  19, Long Mode - Decompression Speed
Zstd Compression:
  3 - Compression Speed
  3 - Decompression Speed
  19 - Compression Speed
  19 - Decompression Speed
  3, Long Mode - Compression Speed
  3, Long Mode - Decompression Speed
  8, Long Mode - Compression Speed
  8, Long Mode - Decompression Speed
  19, Long Mode - Compression Speed
  19, Long Mode - Decompression Speed
Stockfish
GROMACS
LAMMPS Molecular Dynamics Simulator:
  20k Atoms
  Rhodopsin Protein
nginx:
  100
  200
  500
  1000
Apache HTTP Server:
  200
  500
  1000
Liquid-DSP:
  128 - 256 - 57
  160 - 256 - 57
Numpy Benchmark
PHPBench
Pennant
PyBench
PyPerformance:
  go
  2to3
  chaos
  float
  nbody
  pathlib
  json_loads
  crypto_pyaes
  regex_compile
  python_startup
  django_template
  pickle_pure_python
Renaissance:
  Rand Forest
  ALS Movie Lens
  Apache Spark ALS
  Apache Spark Bayes
  Apache Spark PageRank
  Finagle HTTP Requests
  In-Memory Database Shootout
  Akka Unbalanced Cobwebbed Tree
  Genetic Algorithm Using Jenetics + Futures
oneDNN:
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
DaCapo Benchmark
WireGuard + Linux Networking Stack Stress Test
Xcompact3d Incompact3d:
  X3D-benchmarking input.i3d
  input.i3d 193 Cells Per Direction
libavif avifenc:
  0
  2
  6
  6, Lossless
  10, Lossless
Tachyon
ASTC Encoder:
  Medium
  Thorough
  Exhaustive