AMD EPYC 7763 Cooling Performance

AMD EPYC 7763 64-Core CPU benchmarks by Michael Larabel evaluating some heatsink fans in a 4U server.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2104096-IB-HEATSINK430

Run Management

Result Identifier | Date | Test Duration
Noctua NH-U9 TR4-SP3 | April 08 2021 | 8 Hours, 12 Minutes
Dynatron A26 | April 09 2021 | 8 Hours, 45 Minutes
Dynatron A38 | April 09 2021 | 11 Hours, 17 Minutes

Average test duration: 9 Hours, 25 Minutes

AMD EPYC 7763 Cooling Performance Benchmarks

Processor: AMD EPYC 7763 64-Core @ 2.45GHz (64 Cores / 128 Threads)
Motherboard: Supermicro H12SSL-i v1.01 (2.0 BIOS)
Chipset: AMD Starship/Matisse
Memory: 126GB
Disk: 3841GB Micron_9300_MTFDHAL3T8TDP
Graphics: llvmpipe
Network: 2 x Broadcom NetXtreme BCM5720 2-port PCIe
OS: Ubuntu 20.04
Kernel: 5.12.0-051200rc6daily20210408-generic (x86_64) 20210407
Desktop: GNOME Shell 3.36.4
Display Server: X Server 1.20.8
OpenGL: 3.3 Mesa 20.0.8 (LLVM 10.0.0 128 bits)
Compiler: GCC 9.3.0
File-System: ext4
Screen Resolution: 1024x768

System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,gm2 --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-9-HskZEa/gcc-9-9.3.0/debian/tmp-nvptx/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq ondemand (Boost: Enabled) - CPU Microcode: 0xa001119
- Python 3.8.2
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Full AMD retpoline IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): relative performance of Noctua NH-U9 TR4-SP3, Dynatron A26, and Dynatron A38 on a 100% to 102% scale across Stockfish, Xcompact3d Incompact3d, Timed Erlang/OTP Compilation, Chaos Group V-RAY, ViennaCL, OpenSCAD, Mobile Neural Network, Timed Node.js Compilation, ASTC Encoder, Timed GDB GNU Debugger Compilation, LuaRadio, GROMACS, IndigoBench, AOM AV1, Timed Apache Compilation, SVT-AV1, Timed Linux Kernel Compilation, NAMD, simdjson, srsLTE, GNU Radio, SVT-HEVC, Liquid-DSP, GNU GMP GMPbench, Timed Mesa Compilation, SVT-VP9, Blender, and oneDNN.
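An overview chart of this kind condenses many tests into one relative score per cooler. The sketch below shows one common way to do that: normalize each result to a baseline system, invert the ratio for "fewer is better" tests, and take the geometric mean. It uses only four results copied from this file, and treating the Noctua cooler as the baseline is an assumption made for illustration. It is not necessarily how the Phoronix Test Suite computes its own chart.

```python
from math import prod

# A small subset of results copied from this result file:
# (test name, higher_is_better, {system: result})
SUBSET = [
    ("IndigoBench Supercar (M samples/s)", True,
     {"Noctua NH-U9 TR4-SP3": 24.19, "Dynatron A26": 24.26, "Dynatron A38": 24.36}),
    ("Blender BMW27 (seconds)", False,
     {"Noctua NH-U9 TR4-SP3": 31.92, "Dynatron A26": 31.85, "Dynatron A38": 31.92}),
    ("Timed Linux Kernel Compilation (seconds)", False,
     {"Noctua NH-U9 TR4-SP3": 26.93, "Dynatron A26": 26.88, "Dynatron A38": 26.84}),
    ("Stockfish (nodes per second)", True,
     {"Noctua NH-U9 TR4-SP3": 156656685, "Dynatron A26": 160107651, "Dynatron A38": 158512004}),
]


def geometric_mean_vs_baseline(tests, baseline):
    """Return {system: geometric mean of per-test ratios against the baseline}."""
    systems = list(tests[0][2])
    scores = {}
    for system in systems:
        ratios = []
        for _name, higher_is_better, values in tests:
            ratio = values[system] / values[baseline]
            # For time-based tests a smaller value is better, so invert the ratio.
            ratios.append(ratio if higher_is_better else 1.0 / ratio)
        scores[system] = prod(ratios) ** (1.0 / len(ratios))
    return scores


scores = geometric_mean_vs_baseline(SUBSET, baseline="Noctua NH-U9 TR4-SP3")
for system, score in scores.items():
    print(f"{system}: {score * 100:.2f}%")
```

The geometric mean is used so that a test reporting hundreds of millions of nodes per second does not outweigh one reporting tens of seconds.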

AMD EPYC 7763 Cooling Performance

Test | Noctua NH-U9 TR4-SP3 | Dynatron A26 | Dynatron A38
indigobench: CPU - Supercar | 24.190 | 24.256 | 24.360
indigobench: CPU - Bedroom | 11.394 | 11.399 | 11.404
v-ray: CPU | 57912 | 58270 | 58504
blender: BMW27 - CPU-Only | 31.92 | 31.85 | 31.92
blender: Classroom - CPU-Only | 80.96 | 80.86 | 80.92
blender: Fishy Cat - CPU-Only | 45.98 | 45.80 | 46.01
blender: Pabellon Barcelona - CPU-Only | 93.59 | 93.72 | 93.72
blender: Barbershop - CPU-Only | 111.61 | 111.48 | 111.53
build-linux-kernel: Time To Compile | 26.930 | 26.877 | 26.844
build-gdb: Time To Compile | 98.814 | 99.326 | 99.086
build-apache: Time To Compile | 23.590 | 23.673 | 23.586
build-mesa: Time To Compile | 19.835 | 19.796 | 19.800
build-nodejs: Time To Compile | 110.706 | 111.318 | 110.941
build-erlang: Time To Compile | 134.328 | 132.497 | 133.119
aom-av1: Speed 9 Realtime - Bosphorus 1080p | 104.57 | 101.70 | 101.27
aom-av1: Speed 9 Realtime - Bosphorus 4K | 37.45 | 38.06 | 37.46
aom-av1: Speed 8 Realtime - Bosphorus 1080p | 86.13 | 86.56 | 87.63
aom-av1: Speed 8 Realtime - Bosphorus 4K | 34.1 | 34.29 | 34.73
aom-av1: Speed 6 Realtime - Bosphorus 1080p | 24.61 | 24.71 | 24.82
aom-av1: Speed 6 Realtime - Bosphorus 4K | 16.09 | 16.25 | 16.14
aom-av1: Speed 6 Two-Pass - Bosphorus 1080p | 21.31 | 21.47 | 21.46
aom-av1: Speed 6 Two-Pass - Bosphorus 4K | 9.15 | 9.16 | 9.35
aom-av1: Speed 4 Two-Pass - Bosphorus 1080p | 6.74 | 6.71 | 6.68
aom-av1: Speed 4 Two-Pass - Bosphorus 4K | 4.72 | 4.83 | 4.77
aom-av1: Speed 0 Two-Pass - Bosphorus 1080p | 0.5 | 0.50 | 0.50
aom-av1: Speed 0 Two-Pass - Bosphorus 4K | 0.2 | 0.2 | 0.2
svt-vp9: Visual Quality Optimized - Bosphorus 1080p | 345.24 | 347.78 | 346.82
svt-vp9: PSNR/SSIM Optimized - Bosphorus 1080p | 471.60 | 471.09 | 470.69
svt-vp9: VMAF Optimized - Bosphorus 1080p | 468.76 | 468.59 | 467.96
svt-hevc: 1 - Bosphorus 1080p | 37.76 | 37.89 | 37.92
svt-hevc: 7 - Bosphorus 1080p | 324.42 | 323.60 | 322.43
svt-hevc: 10 - Bosphorus 1080p | 607.02 | 602.55 | 605.97
svt-av1: Enc Mode 8 - 1080p | 92.861 | 92.347 | 93.048
svt-av1: Enc Mode 4 - 1080p | 9.473 | 9.452 | 9.355
svt-av1: Enc Mode 0 - 1080p | 0.130 | 0.130 | 0.130
viennacl: CPU BLAS - sCOPY | 1035 | 1052 | 1044
viennacl: CPU BLAS - sAXPY | 627 | 640 | 640
viennacl: CPU BLAS - dCOPY | 1401 | 1409 | 1399
viennacl: CPU BLAS - dAXPY | 1222 | 1205 | 1204
viennacl: CPU BLAS - dDOT | 1115 | 1116 | 1112
viennacl: CPU BLAS - dGEMV-N | 88.6 | 82.6 | 78.9
viennacl: CPU BLAS - dGEMV-T | 793 | 781 | 779
viennacl: CPU BLAS - dGEMM-NN | 88.1 | 88.6 | 88.2
viennacl: CPU BLAS - dGEMM-NT | 86.1 | 86.6 | 86.3
viennacl: CPU BLAS - dGEMM-TN | 92.1 | 92.1 | 91.9
viennacl: CPU BLAS - dGEMM-TT | 89.7 | 89.9 | 89.9
viennacl: CPU BLAS - sDOT | 637 | 640 | 636
gmpbench: Total Time | 5098.8 | 5099.2 | 5089.1
openscad: Projector Mount Swivel | 100.902 | 100.723 | 101.045
openscad: Leonardo Phone Case Slim | 18.699 | 18.525 | 18.734
openscad: Pistol | 109.833 | 108.617 | 108.794
openscad: Retro Car | 19.063 | 18.886 | 18.927
openscad: Mini-ITX Case | 45.841 | 45.232 | 45.482
stockfish: Total Time | 156656685 | 160107651 | 158512004
incompact3d: input.i3d 129 Cells Per Direction | 5.15653072 | 5.15065159 | 5.15051767
incompact3d: input.i3d 193 Cells Per Direction | 22.5096487 | 22.3308512 | 22.7087644
incompact3d: X3D-benchmarking input.i3d | 625.812337 | 667.268494 | 627.667114
onednn: Convolution Batch Shapes Auto - f32 - CPU | 0.880047 | 0.879752 | 0.879480
onednn: Convolution Batch Shapes Auto - u8s8f32 - CPU | 1.65220 | 1.65671 | 1.67256
onednn: Deconvolution Batch shapes_1d - f32 - CPU | 7.13608 | 7.17839 | 7.18002
onednn: Deconvolution Batch shapes_1d - u8s8f32 - CPU | 0.605058 | 0.607655 | 0.603621
onednn: Deconvolution Batch shapes_3d - f32 - CPU | 3.03820 | 3.02459 | 3.04352
onednn: Deconvolution Batch shapes_3d - u8s8f32 - CPU | 0.781851 | 0.783464 | 0.781060
onednn: IP Shapes 1D - f32 - CPU | 1.17796 | 1.17863 | 1.18087
onednn: IP Shapes 1D - u8s8f32 - CPU | 1.17758 | 1.18685 | 1.17906
onednn: IP Shapes 3D - f32 - CPU | 3.66307 | 3.64941 | 3.61762
onednn: IP Shapes 3D - u8s8f32 - CPU | 0.663059 | 0.650624 | 0.640331
onednn: Matrix Multiply Batch Shapes Transformer - f32 - CPU | 0.384929 | 0.381495 | 0.385207
onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU | 0.720928 | 0.726789 | 0.721586
onednn: Recurrent Neural Network Training - f32 - CPU | 1373.91 | 1378.86 | 1400.19
onednn: Recurrent Neural Network Training - u8s8f32 - CPU | 1385.32 | 1369.98 | 1377.49
onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU | 1370.82 | 1391.14 | 1381.77
onednn: Recurrent Neural Network Inference - f32 - CPU | 665.228 | 666.049 | 665.526
onednn: Recurrent Neural Network Inference - u8s8f32 - CPU | 664.988 | 664.807 | 665.265
onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU | 664.680 | 666.483 | 666.130
mnn: SqueezeNetV1.0 | 5.785 | 5.797 | 5.878
mnn: resnet-v2-50 | 22.130 | 22.397 | 22.269
mnn: MobileNetV2_224 | 3.757 | 3.789 | 3.780
mnn: mobilenet-v1-1.0 | 2.328 | 2.342 | 2.332
mnn: inception-v3 | 28.171 | 28.451 | 28.328
astcenc: Medium | 4.9176 | 4.9412 | 4.9424
astcenc: Thorough | 7.9925 | 8.0062 | 7.9898
astcenc: Exhaustive | 20.4428 | 20.6196 | 20.5591
simdjson: PartialTweets | 3.64 | 3.63 | 3.63
simdjson: LargeRand | 0.96 | 0.96 | 0.96
simdjson: Kostya | 2.83 | 2.83 | 2.83
simdjson: DistinctUserID | 4.01 | 4.00 | 3.98
srslte: OFDM_Test | 115633333 | 116233333 | 117866667
srslte: PHY_DL_Test | 257.4 | 257.1 | 255.9
srslte: PHY_DL_Test | 94.3 | 94.3 | 93.7
gnuradio: Five Back to Back FIR Filters | 560.9 | 567.7 | 560.1
gnuradio: Signal Source (Cosine) | 3322.2 | 3324.7 | 3303.6
gnuradio: FIR Filter | 639.0 | 641.6 | 642.0
gnuradio: IIR Filter | 608.6 | 604.7 | 607.0
gnuradio: FM Deemphasis Filter | 765.3 | 760.3 | 764.1
gnuradio: Hilbert Transform | 378.2 | 377.3 | 376.2
luaradio: Five Back to Back FIR Filters | 1101.6 | 1110.6 | 1097.4
luaradio: FM Deemphasis Filter | 344.7 | 344.2 | 343.4
luaradio: Hilbert Transform | 93.5 | 93.6 | 93.5
luaradio: Complex Phase | 591.8 | 591.6 | 591.0
liquid-dsp: 16 - 256 - 57 | 803310000 | 800190000 | 804146667
liquid-dsp: 32 - 256 - 57 | 1613766667 | 1614933333 | 1614566667
liquid-dsp: 64 - 256 - 57 | 2792400000 | 2782866667 | 2793733333
liquid-dsp: 128 - 256 - 57 | 3017933333 | 3028066667 | 3028133333
gromacs: water_GMX50_bare | 5.577 | 5.582 | 5.599
namd: ATPase Simulation - 327,506 Atoms | 0.38110 | 0.38164 | 0.38215

IndigoBench

This is a test of Indigo Renderer's IndigoBench benchmark. Learn more via the OpenBenchmarking.org test page.

IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better)
Dynatron A38: 24.36 (SE +/- 0.06, N = 3)
Dynatron A26: 24.26 (SE +/- 0.03, N = 3)
Noctua NH-U9 TR4-SP3: 24.19 (SE +/- 0.03, N = 3)

IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better)
Dynatron A38: 11.40 (SE +/- 0.01, N = 3)
Dynatron A26: 11.40 (SE +/- 0.03, N = 3)
Noctua NH-U9 TR4-SP3: 11.39 (SE +/- 0.03, N = 3)
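Each result in this file is a mean with a standard error (SE) over N runs, so small gaps between coolers are best read against that error. The following is a minimal sketch, using the IndigoBench Supercar figures above, of expressing a gap both as a percentage and in units of the combined standard error. Combining the two errors in quadrature assumes the runs are independent, which this result file does not state.

```python
from math import sqrt

# IndigoBench 4.4 Supercar results (M samples/s, more is better),
# copied from this result file: (mean, standard error, number of runs).
results = {
    "Dynatron A38": (24.36, 0.06, 3),
    "Dynatron A26": (24.26, 0.03, 3),
    "Noctua NH-U9 TR4-SP3": (24.19, 0.03, 3),
}


def compare(a, b):
    """Return (percent difference of a over b, difference in combined-SE units)."""
    mean_a, se_a, _ = results[a]
    mean_b, se_b, _ = results[b]
    diff = mean_a - mean_b
    # Assumption: independent runs, so the errors add in quadrature.
    combined_se = sqrt(se_a ** 2 + se_b ** 2)
    return 100.0 * diff / mean_b, diff / combined_se


pct, z = compare("Dynatron A38", "Noctua NH-U9 TR4-SP3")
print(f"A38 vs Noctua: {pct:+.2f}% ({z:.1f} combined standard errors)")
```

With only three runs per cooler, a gap of a few combined standard errors on a sub-1% difference is weak evidence, so it should be read as a hint rather than a ranking.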

Chaos Group V-RAY

This is a test of Chaos Group's V-RAY benchmark. V-RAY is a commercial renderer that can integrate with various creator software products like SketchUp and 3ds Max. The V-RAY benchmark is standalone and supports CPU and NVIDIA CUDA/RTX based rendering. Learn more via the OpenBenchmarking.org test page.

Chaos Group V-RAY 5 - Mode: CPU (vsamples, More Is Better)
Dynatron A38: 58504 (SE +/- 477.68, N = 3)
Dynatron A26: 58270 (SE +/- 814.01, N = 3)
Noctua NH-U9 TR4-SP3: 57912 (SE +/- 265.36, N = 3)

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles benchmark with various sample files. GPU computing via OpenCL, NVIDIA OptiX, and NVIDIA CUDA is supported. Learn more via the OpenBenchmarking.org test page.

Blender 2.92 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
Dynatron A38: 31.92 (SE +/- 0.07, N = 3)
Dynatron A26: 31.85 (SE +/- 0.04, N = 3)
Noctua NH-U9 TR4-SP3: 31.92 (SE +/- 0.10, N = 3)

Blender 2.92 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
Dynatron A38: 80.92 (SE +/- 0.08, N = 3)
Dynatron A26: 80.86 (SE +/- 0.07, N = 3)
Noctua NH-U9 TR4-SP3: 80.96 (SE +/- 0.08, N = 3)

Blender 2.92 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better)
Dynatron A38: 46.01 (SE +/- 0.06, N = 3)
Dynatron A26: 45.80 (SE +/- 0.09, N = 3)
Noctua NH-U9 TR4-SP3: 45.98 (SE +/- 0.08, N = 3)

Blender 2.92 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
Dynatron A38: 93.72 (SE +/- 0.01, N = 3)
Dynatron A26: 93.72 (SE +/- 0.08, N = 3)
Noctua NH-U9 TR4-SP3: 93.59 (SE +/- 0.06, N = 3)

Blender 2.92 - Blend File: Barbershop - Compute: CPU-Only (Seconds, Fewer Is Better)
Dynatron A38: 111.53 (SE +/- 0.10, N = 3)
Dynatron A26: 111.48 (SE +/- 0.08, N = 3)
Noctua NH-U9 TR4-SP3: 111.61 (SE +/- 0.05, N = 3)

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 5.10.20 - Time To Compile (Seconds, Fewer Is Better)
Dynatron A38: 26.84 (SE +/- 0.26, N = 9)
Dynatron A26: 26.88 (SE +/- 0.26, N = 9)
Noctua NH-U9 TR4-SP3: 26.93 (SE +/- 0.28, N = 8)

Timed GDB GNU Debugger Compilation

This test times how long it takes to build the GNU Debugger (GDB) in a default configuration. Learn more via the OpenBenchmarking.org test page.

Timed GDB GNU Debugger Compilation 9.1 - Time To Compile (Seconds, Fewer Is Better)
Dynatron A38: 99.09 (SE +/- 0.07, N = 3)
Dynatron A26: 99.33 (SE +/- 0.12, N = 3)
Noctua NH-U9 TR4-SP3: 98.81 (SE +/- 0.12, N = 3)

Timed Apache Compilation

This test times how long it takes to build the Apache HTTPD web server. Learn more via the OpenBenchmarking.org test page.

Timed Apache Compilation 2.4.41 - Time To Compile (Seconds, Fewer Is Better)
Dynatron A38: 23.59 (SE +/- 0.01, N = 3)
Dynatron A26: 23.67 (SE +/- 0.02, N = 3)
Noctua NH-U9 TR4-SP3: 23.59 (SE +/- 0.01, N = 3)

Timed Mesa Compilation

This test profile times how long it takes to compile Mesa with Meson/Ninja. To minimize build dependencies and avoid versioning conflicts, this test is just the core Mesa build without LLVM or the extra Gallium3D/Mesa drivers enabled. Learn more via the OpenBenchmarking.org test page.

Timed Mesa Compilation 21.0 - Time To Compile (Seconds, Fewer Is Better)
Dynatron A38: 19.80 (SE +/- 0.04, N = 3)
Dynatron A26: 19.80 (SE +/- 0.06, N = 3)
Noctua NH-U9 TR4-SP3: 19.84 (SE +/- 0.09, N = 3)

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 15.11 - Time To Compile (Seconds, Fewer Is Better)
Dynatron A38: 110.94 (SE +/- 0.30, N = 3)
Dynatron A26: 111.32 (SE +/- 0.11, N = 3)
Noctua NH-U9 TR4-SP3: 110.71 (SE +/- 0.28, N = 3)

Timed Erlang/OTP Compilation

This test times how long it takes to compile Erlang/OTP. Erlang is a programming language and run-time for massively scalable soft real-time systems with high availability requirements. Learn more via the OpenBenchmarking.org test page.

Timed Erlang/OTP Compilation 23.2 - Time To Compile (Seconds, Fewer Is Better)
Dynatron A38: 133.12 (SE +/- 0.29, N = 3)
Dynatron A26: 132.50 (SE +/- 0.39, N = 3)
Noctua NH-U9 TR4-SP3: 134.33 (SE +/- 0.26, N = 3)

AOM AV1

This is a test of the AOMedia AV1 encoder (libaom) developed by AOMedia and Google. Learn more via the OpenBenchmarking.org test page.

AOM AV1 3.0 - Encoder Mode: Speed 9 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 101.27 (SE +/- 0.67, N = 6)
Dynatron A26: 101.70 (SE +/- 1.09, N = 6)
Noctua NH-U9 TR4-SP3: 104.57 (SE +/- 1.05, N = 7)

AOM AV1 3.0 - Encoder Mode: Speed 9 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Dynatron A38: 37.46 (SE +/- 0.48, N = 5)
Dynatron A26: 38.06 (SE +/- 0.46, N = 6)
Noctua NH-U9 TR4-SP3: 37.45 (SE +/- 0.43, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 8 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 87.63 (SE +/- 0.58, N = 6)
Dynatron A26: 86.56 (SE +/- 0.59, N = 6)
Noctua NH-U9 TR4-SP3: 86.13 (SE +/- 0.36, N = 6)

AOM AV1 3.0 - Encoder Mode: Speed 8 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Dynatron A38: 34.73 (SE +/- 0.28, N = 3)
Dynatron A26: 34.29 (SE +/- 0.22, N = 3)
Noctua NH-U9 TR4-SP3: 34.10 (SE +/- 0.07, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Realtime - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 24.82 (SE +/- 0.10, N = 3)
Dynatron A26: 24.71 (SE +/- 0.18, N = 3)
Noctua NH-U9 TR4-SP3: 24.61 (SE +/- 0.05, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Realtime - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Dynatron A38: 16.14 (SE +/- 0.05, N = 3)
Dynatron A26: 16.25 (SE +/- 0.06, N = 3)
Noctua NH-U9 TR4-SP3: 16.09 (SE +/- 0.03, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 21.46 (SE +/- 0.03, N = 3)
Dynatron A26: 21.47 (SE +/- 0.06, N = 3)
Noctua NH-U9 TR4-SP3: 21.31 (SE +/- 0.04, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 6 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Dynatron A38: 9.35 (SE +/- 0.11, N = 6)
Dynatron A26: 9.16 (SE +/- 0.10, N = 3)
Noctua NH-U9 TR4-SP3: 9.15 (SE +/- 0.07, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 6.68 (SE +/- 0.00, N = 3)
Dynatron A26: 6.71 (SE +/- 0.01, N = 3)
Noctua NH-U9 TR4-SP3: 6.74 (SE +/- 0.02, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 4 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Dynatron A38: 4.77 (SE +/- 0.04, N = 3)
Dynatron A26: 4.83 (SE +/- 0.02, N = 3)
Noctua NH-U9 TR4-SP3: 4.72 (SE +/- 0.03, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 0.50 (SE +/- 0.00, N = 3)
Dynatron A26: 0.50 (SE +/- 0.00, N = 3)
Noctua NH-U9 TR4-SP3: 0.50 (SE +/- 0.00, N = 3)

AOM AV1 3.0 - Encoder Mode: Speed 0 Two-Pass - Input: Bosphorus 4K (Frames Per Second, More Is Better)
Dynatron A38: 0.2 (SE +/- 0.00, N = 3)
Dynatron A26: 0.2 (SE +/- 0.00, N = 3)
Noctua NH-U9 TR4-SP3: 0.2 (SE +/- 0.00, N = 3)

All AOM AV1 3.0 results: (CXX) g++ options: -O3 -std=c++11 -U_FORTIFY_SOURCE -lm -lpthread

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

SVT-VP9 0.3 - Tuning: Visual Quality Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 346.82 (SE +/- 6.16, N = 15)
Dynatron A26: 347.78 (SE +/- 6.37, N = 15)
Noctua NH-U9 TR4-SP3: 345.24 (SE +/- 5.87, N = 15)

SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 470.69 (SE +/- 1.32, N = 10)
Dynatron A26: 471.09 (SE +/- 1.26, N = 10)
Noctua NH-U9 TR4-SP3: 471.60 (SE +/- 1.14, N = 10)

SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 467.96 (SE +/- 0.90, N = 10)
Dynatron A26: 468.59 (SE +/- 1.01, N = 10)
Noctua NH-U9 TR4-SP3: 468.76 (SE +/- 1.02, N = 10)

All SVT-VP9 0.3 results: (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-HEVC

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-HEVC CPU-based multi-threaded video encoder for the HEVC / H.265 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-HEVC 1.5.0 - Tuning: 1 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 37.92 (SE +/- 0.10, N = 4)
Dynatron A26: 37.89 (SE +/- 0.12, N = 4)
Noctua NH-U9 TR4-SP3: 37.76 (SE +/- 0.05, N = 4)

SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 322.43 (SE +/- 0.82, N = 10)
Dynatron A26: 323.60 (SE +/- 0.66, N = 10)
Noctua NH-U9 TR4-SP3: 324.42 (SE +/- 1.48, N = 10)

SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
Dynatron A38: 605.97 (SE +/- 0.85, N = 12)
Dynatron A26: 602.55 (SE +/- 1.45, N = 12)
Noctua NH-U9 TR4-SP3: 607.02 (SE +/- 1.56, N = 12)

All SVT-HEVC 1.5.0 results: (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt

SVT-AV1

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-AV1 CPU-based multi-threaded video encoder for the AV1 video format with a sample 1080p YUV video file. Learn more via the OpenBenchmarking.org test page.

SVT-AV1 0.8 - Encoder Mode: Enc Mode 8 - Input: 1080p (Frames Per Second, More Is Better)
Dynatron A38: 93.05 (SE +/- 0.62, N = 6)
Dynatron A26: 92.35 (SE +/- 0.38, N = 6)
Noctua NH-U9 TR4-SP3: 92.86 (SE +/- 0.39, N = 6)

SVT-AV1 0.8 - Encoder Mode: Enc Mode 4 - Input: 1080p (Frames Per Second, More Is Better)
Dynatron A38: 9.355 (SE +/- 0.114, N = 6)
Dynatron A26: 9.452 (SE +/- 0.083, N = 4)
Noctua NH-U9 TR4-SP3: 9.473 (SE +/- 0.108, N = 4)

SVT-AV1 0.8 - Encoder Mode: Enc Mode 0 - Input: 1080p (Frames Per Second, More Is Better)
Dynatron A38: 0.130 (SE +/- 0.001, N = 3)
Dynatron A26: 0.130 (SE +/- 0.000, N = 3)
Noctua NH-U9 TR4-SP3: 0.130 (SE +/- 0.001, N = 3)

All SVT-AV1 0.8 results: (CXX) g++ options: -O3 -fcommon -fPIE -fPIC -pie

ViennaCL

ViennaCL is an open-source linear algebra library written in C++ and with support for OpenCL and OpenMP. This test profile makes use of ViennaCL's built-in benchmarks. Learn more via the OpenBenchmarking.org test page.

ViennaCL 1.7.1 - Test: CPU BLAS - sCOPY (GB/s, More Is Better)
Dynatron A38: 1044 (SE +/- 27.57, N = 15)
Dynatron A26: 1052 (SE +/- 28.51, N = 14)
Noctua NH-U9 TR4-SP3: 1035 (SE +/- 30.05, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - sAXPY (GB/s, More Is Better)
Dynatron A38: 640 (SE +/- 1.86, N = 15)
Dynatron A26: 640 (SE +/- 2.67, N = 14)
Noctua NH-U9 TR4-SP3: 627 (SE +/- 2.32, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dCOPY (GB/s, More Is Better)
Dynatron A38: 1399 (SE +/- 3.16, N = 15)
Dynatron A26: 1409 (SE +/- 3.55, N = 14)
Noctua NH-U9 TR4-SP3: 1401 (SE +/- 5.47, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dAXPY (GB/s, More Is Better)
Dynatron A38: 1204 (SE +/- 1.31, N = 15)
Dynatron A26: 1205 (SE +/- 2.28, N = 14)
Noctua NH-U9 TR4-SP3: 1222 (SE +/- 2.23, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dDOT (GB/s, More Is Better)
Dynatron A38: 1112 (SE +/- 2.00, N = 15)
Dynatron A26: 1116 (SE +/- 1.73, N = 14)
Noctua NH-U9 TR4-SP3: 1115 (SE +/- 2.15, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-N (GB/s, More Is Better)
Dynatron A38: 78.9 (SE +/- 4.88, N = 15)
Dynatron A26: 82.6 (SE +/- 9.76, N = 14)
Noctua NH-U9 TR4-SP3: 88.6 (SE +/- 7.63, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMV-T (GB/s, More Is Better)
Dynatron A38: 779 (SE +/- 3.44, N = 15)
Dynatron A26: 781 (SE +/- 1.78, N = 14)
Noctua NH-U9 TR4-SP3: 793 (SE +/- 1.64, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NN (GFLOPs/s, More Is Better)
Dynatron A38: 88.2 (SE +/- 0.34, N = 15)
Dynatron A26: 88.6 (SE +/- 0.08, N = 14)
Noctua NH-U9 TR4-SP3: 88.1 (SE +/- 0.52, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-NT (GFLOPs/s, More Is Better)
Dynatron A38: 86.3 (SE +/- 0.30, N = 15)
Dynatron A26: 86.6 (SE +/- 0.06, N = 14)
Noctua NH-U9 TR4-SP3: 86.1 (SE +/- 0.28, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TN (GFLOPs/s, More Is Better)
Dynatron A38: 91.9 (SE +/- 0.13, N = 15)
Dynatron A26: 92.1 (SE +/- 0.03, N = 14)
Noctua NH-U9 TR4-SP3: 92.1 (SE +/- 0.04, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - dGEMM-TT (GFLOPs/s, More Is Better)
Dynatron A38: 89.9 (SE +/- 0.05, N = 15)
Dynatron A26: 89.9 (SE +/- 0.02, N = 14)
Noctua NH-U9 TR4-SP3: 89.7 (SE +/- 0.13, N = 15)

ViennaCL 1.7.1 - Test: CPU BLAS - sDOT (GB/s, More Is Better)
Dynatron A38: 636 (SE +/- 1.17, N = 13)
Dynatron A26: 640 (SE +/- 1.19, N = 14)
Noctua NH-U9 TR4-SP3: 637 (SE +/- 0.77, N = 14)

All ViennaCL 1.7.1 results: (CXX) g++ options: -fopenmp -O3 -rdynamic -lOpenCL

GNU GMP GMPbench

GMPbench is a test of the GNU Multiple Precision Arithmetic (GMP) Library. GMPbench is a single-threaded integer benchmark that leverages the GMP library to stress the CPU with widening integer multiplication. Learn more via the OpenBenchmarking.org test page.

GNU GMP GMPbench 6.2.1 - Total Time (GMPbench Score, More Is Better)
Dynatron A38: 5089.1
Dynatron A26: 5099.2
Noctua NH-U9 TR4-SP3: 5098.8

(CC) gcc options: -O3 -fomit-frame-pointer -lm

OpenSCAD

OpenSCAD is a programmer-focused solid 3D CAD modeller. OpenSCAD is free software and allows creating 3D CAD objects in a script-based modelling environment. This test profile uses the system-provided OpenSCAD program and times how long it takes to render different SCAD assets to PNG output. Learn more via the OpenBenchmarking.org test page.

OpenSCAD - Render: Projector Mount Swivel (Seconds, Fewer Is Better)
Dynatron A38: 101.05 (SE +/- 0.14, N = 3)
Dynatron A26: 100.72 (SE +/- 0.42, N = 3)
Noctua NH-U9 TR4-SP3: 100.90 (SE +/- 0.29, N = 3)

OpenSCAD - Render: Leonardo Phone Case Slim (Seconds, Fewer Is Better)
Dynatron A38: 18.73 (SE +/- 0.10, N = 3)
Dynatron A26: 18.53 (SE +/- 0.04, N = 3)
Noctua NH-U9 TR4-SP3: 18.70 (SE +/- 0.10, N = 3)

OpenSCAD - Render: Pistol (Seconds, Fewer Is Better)
Dynatron A38: 108.79 (SE +/- 0.31, N = 3)
Dynatron A26: 108.62 (SE +/- 0.10, N = 3)
Noctua NH-U9 TR4-SP3: 109.83 (SE +/- 0.20, N = 3)

OpenSCAD - Render: Retro Car (Seconds, Fewer Is Better)
Dynatron A38: 18.93 (SE +/- 0.03, N = 3)
Dynatron A26: 18.89 (SE +/- 0.05, N = 3)
Noctua NH-U9 TR4-SP3: 19.06 (SE +/- 0.02, N = 3)

OpenSCAD - Render: Mini-ITX Case (Seconds, Fewer Is Better)
Dynatron A38: 45.48 (SE +/- 0.17, N = 3)
Dynatron A26: 45.23 (SE +/- 0.17, N = 3)
Noctua NH-U9 TR4-SP3: 45.84 (SE +/- 0.06, N = 3)

All OpenSCAD results: OpenSCAD version 2019.05

Stockfish

This is a test of Stockfish, an advanced open-source C++11 chess benchmark that can scale up to 512 CPU threads. Learn more via the OpenBenchmarking.org test page.

Stockfish 13 - Total Time (Nodes Per Second, More Is Better)
Dynatron A38: 158512004 (SE +/- 1246901.76, N = 3)
Dynatron A26: 160107651 (SE +/- 2061799.88, N = 15)
Noctua NH-U9 TR4-SP3: 156656685 (SE +/- 2161918.89, N = 4)

(CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fprofile-use -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equations and as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better)
Dynatron A38: 5.15051767 (SE +/- 0.02393955, N = 7)
Dynatron A26: 5.15065159 (SE +/- 0.02227690, N = 7)
Noctua NH-U9 TR4-SP3: 5.15653072 (SE +/- 0.01747253, N = 7)

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better)
Dynatron A38: 22.71 (SE +/- 0.08, N = 3)
Dynatron A26: 22.33 (SE +/- 0.30, N = 3)
Noctua NH-U9 TR4-SP3: 22.51 (SE +/- 0.04, N = 3)

Xcompact3d Incompact3d 2021-03-11 - Input: X3D-benchmarking input.i3d (Seconds, Fewer Is Better)
Dynatron A38: 627.67 (SE +/- 0.23, N = 3)
Dynatron A26: 667.27 (SE +/- 11.65, N = 9)
Noctua NH-U9 TR4-SP3: 625.81 (SE +/- 0.48, N = 3)

All Xcompact3d Incompact3d results: (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi
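The X3D-benchmarking result for the Dynatron A26 stands out: its standard error (11.65 over 9 runs) is far larger than the other two coolers' (0.23 and 0.48 over 3 runs each). Below is a minimal sketch of flagging such noisy results by relative standard error, and of recovering the run-to-run standard deviation from SE and N. The 1% cutoff is a hypothetical threshold chosen for illustration, not a value used by the Phoronix Test Suite.

```python
from math import sqrt

# Xcompact3d Incompact3d, X3D-benchmarking input.i3d (seconds, fewer is better),
# copied from this result file: (mean, standard error, number of runs).
runs = {
    "Dynatron A38": (627.67, 0.23, 3),
    "Dynatron A26": (667.27, 11.65, 9),
    "Noctua NH-U9 TR4-SP3": (625.81, 0.48, 3),
}

NOISY_THRESHOLD_PCT = 1.0  # hypothetical cutoff, chosen only for this example


def relative_se_pct(mean, se):
    """Standard error as a percentage of the mean."""
    return 100.0 * se / mean


def sample_std_dev(se, n):
    """Recover the run-to-run standard deviation from SE = sd / sqrt(n)."""
    return se * sqrt(n)


noisy = [
    name
    for name, (mean, se, _n) in runs.items()
    if relative_se_pct(mean, se) > NOISY_THRESHOLD_PCT
]

for name, (mean, se, n) in runs.items():
    print(f"{name}: relative SE {relative_se_pct(mean, se):.2f}%, "
          f"std dev {sample_std_dev(se, n):.2f} s over {n} runs")
print("Flagged as noisy:", noisy)
```

A flagged result like this one is better treated as inconclusive than as evidence that one cooler is slower, since the spread between runs is much larger than the gap being measured on the other two coolers.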

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, making use of its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of the Intel oneAPI initiative. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 0.879480 (SE +/- 0.000819, N = 7, MIN: 0.84)
Dynatron A26: 0.879752 (SE +/- 0.000564, N = 7, MIN: 0.84)
Noctua NH-U9 TR4-SP3: 0.880047 (SE +/- 0.000667, N = 7, MIN: 0.84)

oneDNN 2.1.2 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 1.67256 (SE +/- 0.01439, N = 7, MIN: 1.57)
Dynatron A26: 1.65671 (SE +/- 0.00304, N = 7, MIN: 1.57)
Noctua NH-U9 TR4-SP3: 1.65220 (SE +/- 0.00301, N = 7, MIN: 1.57)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 7.18002 (SE +/- 0.01979, N = 3, MIN: 6.18)
Dynatron A26: 7.17839 (SE +/- 0.02988, N = 3, MIN: 6.17)
Noctua NH-U9 TR4-SP3: 7.13608 (SE +/- 0.03799, N = 3, MIN: 6.04)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 0.603621 (SE +/- 0.000705, N = 3, MIN: 0.56)
Dynatron A26: 0.607655 (SE +/- 0.001607, N = 3, MIN: 0.56)
Noctua NH-U9 TR4-SP3: 0.605058 (SE +/- 0.001828, N = 3, MIN: 0.56)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 3.04352 (SE +/- 0.00693, N = 9, MIN: 2.34)
Dynatron A26: 3.02459 (SE +/- 0.00911, N = 9, MIN: 2.24)
Noctua NH-U9 TR4-SP3: 3.03820 (SE +/- 0.00904, N = 9, MIN: 2.21)

oneDNN 2.1.2 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 0.781060 (SE +/- 0.002274, N = 9, MIN: 0.72)
Dynatron A26: 0.783464 (SE +/- 0.002597, N = 9, MIN: 0.72)
Noctua NH-U9 TR4-SP3: 0.781851 (SE +/- 0.002596, N = 9, MIN: 0.72)

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 1.18087 (SE +/- 0.00303, N = 4, MIN: 1.1)
Dynatron A26: 1.17863 (SE +/- 0.00176, N = 4, MIN: 1.12)
Noctua NH-U9 TR4-SP3: 1.17796 (SE +/- 0.00196, N = 4, MIN: 1.1)

oneDNN 2.1.2 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 1.17906 (SE +/- 0.01187, N = 4, MIN: 0.99)
Dynatron A26: 1.18685 (SE +/- 0.00760, N = 4, MIN: 0.98)
Noctua NH-U9 TR4-SP3: 1.17758 (SE +/- 0.00777, N = 4, MIN: 1)

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 3.61762 (SE +/- 0.03350, N = 5, MIN: 3.36)
Dynatron A26: 3.64941 (SE +/- 0.02260, N = 5, MIN: 3.4)
Noctua NH-U9 TR4-SP3: 3.66307 (SE +/- 0.04102, N = 5, MIN: 3.37)

oneDNN 2.1.2 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 0.640331 (SE +/- 0.005445, N = 5, MIN: 0.58)
Dynatron A26: 0.650624 (SE +/- 0.004042, N = 5, MIN: 0.59)
Noctua NH-U9 TR4-SP3: 0.663059 (SE +/- 0.006455, N = 5, MIN: 0.61)

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 0.385207 (SE +/- 0.004169, N = 4, MIN: 0.36)
Dynatron A26: 0.381495 (SE +/- 0.001495, N = 4, MIN: 0.36)
Noctua NH-U9 TR4-SP3: 0.384929 (SE +/- 0.005034, N = 4, MIN: 0.36)

oneDNN 2.1.2 - Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU (ms, Fewer Is Better)
Dynatron A38: 0.721586 (SE +/- 0.001725, N = 4, MIN: 0.67)
Dynatron A26: 0.726789 (SE +/- 0.001153, N = 4, MIN: 0.67)
Noctua NH-U9 TR4-SP3: 0.720928 (SE +/- 0.002487, N = 4, MIN: 0.67)

All oneDNN 2.1.2 results: (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUDynatron A38Dynatron A26Noctua NH-U9 TR4-SP330060090012001500SE +/- 14.96, N = 3SE +/- 9.11, N = 3SE +/- 3.07, N = 31400.191378.861373.91MIN: 1335.97MIN: 1326.59MIN: 1332.451. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPUDynatron A38Dynatron A26Noctua NH-U9 TR4-SP330060090012001500SE +/- 4.57, N = 3SE +/- 3.15, N = 3SE +/- 3.58, N = 31377.491369.981385.32MIN: 1335.99MIN: 1325.96MIN: 1350.611. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUDynatron A38Dynatron A26Noctua NH-U9 TR4-SP330060090012001500SE +/- 8.30, N = 3SE +/- 4.55, N = 3SE +/- 8.02, N = 31381.771391.141370.82MIN: 1330.53MIN: 1343.15MIN: 1322.761. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUDynatron A38Dynatron A26Noctua NH-U9 TR4-SP3140280420560700SE +/- 1.18, N = 3SE +/- 1.29, N = 3SE +/- 0.56, N = 3665.53666.05665.23MIN: 639.97MIN: 638.64MIN: 638.881. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUDynatron A38Dynatron A26Noctua NH-U9 TR4-SP3140280420560700SE +/- 0.96, N = 3SE +/- 1.25, N = 3SE +/- 1.05, N = 3665.27664.81664.99MIN: 637.88MIN: 637.03MIN: 636.721. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.1.2Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUDynatron A38Dynatron A26Noctua NH-U9 TR4-SP3140280420560700SE +/- 1.16, N = 3SE +/- 1.24, N = 3SE +/- 1.52, N = 3666.13666.48664.68MIN: 639.05MIN: 640.16MIN: 637.441. (CXX) g++ options: -O3 -march=native -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread -ldl
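Many of the oneDNN gaps between coolers are small relative to the reported standard errors. As a rough way to judge that, the sketch below divides the difference of two means by their combined standard error, using the Recurrent Neural Network Training (f32) figures. It assumes the values are listed in the same order as the cooler labels; the function name z_score is illustrative and not part of the Phoronix Test Suite, and with N = 3 runs this is only a heuristic, not a rigorous significance test.

```python
import math


def z_score(mean_a, se_a, mean_b, se_b):
    """Difference of two means divided by their combined standard error."""
    return (mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


# (mean in ms, standard error) for RNN Training, f32, fewer is better.
# Mapping of values to coolers is an assumption based on label order.
a38 = (1400.19, 14.96)
noctua = (1373.91, 3.07)

z = z_score(a38[0], a38[1], noctua[0], noctua[1])
print(f"difference: {a38[0] - noctua[0]:.2f} ms, z = {z:.2f}")
```

A ratio below about 2 suggests the gap is within what run-to-run noise could plausibly produce, so the roughly 26 ms difference here should not be read as a clear win for either cooler.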

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.1.3 - Model: SqueezeNetV1.0 (ms, fewer is better)
  Dynatron A38: 5.878 (SE +/- 0.015, N = 3, MIN: 5.64 / MAX: 6.84)
  Dynatron A26: 5.797 (SE +/- 0.027, N = 3, MIN: 5.54 / MAX: 7.52)
  Noctua NH-U9 TR4-SP3: 5.785 (SE +/- 0.039, N = 3, MIN: 5.58 / MAX: 6.64)

Mobile Neural Network 1.1.3 - Model: resnet-v2-50 (ms, fewer is better)
  Dynatron A38: 22.27 (SE +/- 0.10, N = 3, MIN: 21.57 / MAX: 30.86)
  Dynatron A26: 22.40 (SE +/- 0.09, N = 3, MIN: 21.64 / MAX: 41.29)
  Noctua NH-U9 TR4-SP3: 22.13 (SE +/- 0.07, N = 3, MIN: 21.58 / MAX: 32.33)

Mobile Neural Network 1.1.3 - Model: MobileNetV2_224 (ms, fewer is better)
  Dynatron A38: 3.780 (SE +/- 0.016, N = 3, MIN: 3.66 / MAX: 6.67)
  Dynatron A26: 3.789 (SE +/- 0.021, N = 3, MIN: 3.66 / MAX: 4.64)
  Noctua NH-U9 TR4-SP3: 3.757 (SE +/- 0.017, N = 3, MIN: 3.63 / MAX: 6.09)

Mobile Neural Network 1.1.3 - Model: mobilenet-v1-1.0 (ms, fewer is better)
  Dynatron A38: 2.332 (SE +/- 0.014, N = 3, MIN: 2.28 / MAX: 2.65)
  Dynatron A26: 2.342 (SE +/- 0.012, N = 3, MIN: 2.29 / MAX: 2.64)
  Noctua NH-U9 TR4-SP3: 2.328 (SE +/- 0.010, N = 3, MIN: 2.28 / MAX: 2.55)

Mobile Neural Network 1.1.3 - Model: inception-v3 (ms, fewer is better)
  Dynatron A38: 28.33 (SE +/- 0.10, N = 3, MIN: 27.14 / MAX: 44.36)
  Dynatron A26: 28.45 (SE +/- 0.13, N = 3, MIN: 27.16 / MAX: 42.86)
  Noctua NH-U9 TR4-SP3: 28.17 (SE +/- 0.03, N = 3, MIN: 27.06 / MAX: 43.23)

1. (CXX) g++ options for all Mobile Neural Network results above: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ASTC Encoder

ASTC Encoder (astcenc) is an encoder for the Adaptive Scalable Texture Compression (ASTC) format commonly used with the OpenGL, OpenGL ES, and Vulkan graphics APIs. This test profile runs a coding test covering both compression and decompression. Learn more via the OpenBenchmarking.org test page.

ASTC Encoder 2.4 - Preset: Medium (Seconds, fewer is better)
  Dynatron A38: 4.9424 (SE +/- 0.0054, N = 7)
  Dynatron A26: 4.9412 (SE +/- 0.0032, N = 7)
  Noctua NH-U9 TR4-SP3: 4.9176 (SE +/- 0.0039, N = 7)

ASTC Encoder 2.4 - Preset: Thorough (Seconds, fewer is better)
  Dynatron A38: 7.9898 (SE +/- 0.0064, N = 6)
  Dynatron A26: 8.0062 (SE +/- 0.0081, N = 6)
  Noctua NH-U9 TR4-SP3: 7.9925 (SE +/- 0.0055, N = 6)

ASTC Encoder 2.4 - Preset: Exhaustive (Seconds, fewer is better)
  Dynatron A38: 20.56 (SE +/- 0.02, N = 3)
  Dynatron A26: 20.62 (SE +/- 0.02, N = 3)
  Noctua NH-U9 TR4-SP3: 20.44 (SE +/- 0.05, N = 3)

1. (CXX) g++ options for all ASTC Encoder results above: -O3 -flto -pthread

simdjson

This is a benchmark of SIMDJSON, a high-performance JSON parser. SIMDJSON aims to be the fastest JSON parser and is used by projects like Microsoft FishStore, Yandex ClickHouse, Shopify, and others. Learn more via the OpenBenchmarking.org test page.

simdjson 0.8.2 - Throughput Test: PartialTweets (GB/s, more is better)
  Dynatron A38: 3.63 (SE +/- 0.00, N = 3)
  Dynatron A26: 3.63 (SE +/- 0.00, N = 3)
  Noctua NH-U9 TR4-SP3: 3.64 (SE +/- 0.01, N = 3)

simdjson 0.8.2 - Throughput Test: LargeRandom (GB/s, more is better)
  Dynatron A38: 0.96 (SE +/- 0.00, N = 3)
  Dynatron A26: 0.96 (SE +/- 0.00, N = 3)
  Noctua NH-U9 TR4-SP3: 0.96 (SE +/- 0.00, N = 3)

simdjson 0.8.2 - Throughput Test: Kostya (GB/s, more is better)
  Dynatron A38: 2.83 (SE +/- 0.01, N = 3)
  Dynatron A26: 2.83 (SE +/- 0.00, N = 3)
  Noctua NH-U9 TR4-SP3: 2.83 (SE +/- 0.00, N = 3)

simdjson 0.8.2 - Throughput Test: DistinctUserID (GB/s, more is better)
  Dynatron A38: 3.98 (SE +/- 0.01, N = 3)
  Dynatron A26: 4.00 (SE +/- 0.01, N = 3)
  Noctua NH-U9 TR4-SP3: 4.01 (SE +/- 0.01, N = 3)

1. (CXX) g++ options for all simdjson results above: -O3 -pthread

srsLTE

srsLTE is an open-source LTE software radio suite created by Software Radio Systems (SRS). srsLTE can be used for building your own software-defined radio (SDR) LTE mobile network. Learn more via the OpenBenchmarking.org test page.

srsLTE 20.10.1 - Test: OFDM_Test (Samples / Second, more is better)
  Dynatron A38: 117866667 (SE +/- 1178039.80, N = 3)
  Dynatron A26: 116233333 (SE +/- 1902921.73, N = 3)
  Noctua NH-U9 TR4-SP3: 115633333 (SE +/- 1770436.23, N = 3)

srsLTE 20.10.1 - Test: PHY_DL_Test (eNb Mb/s, more is better)
  Dynatron A38: 255.9 (SE +/- 0.84, N = 3)
  Dynatron A26: 257.1 (SE +/- 0.18, N = 3)
  Noctua NH-U9 TR4-SP3: 257.4 (SE +/- 0.22, N = 3)

srsLTE 20.10.1 - Test: PHY_DL_Test (UE Mb/s, more is better)
  Dynatron A38: 93.7 (SE +/- 0.52, N = 3)
  Dynatron A26: 94.3 (SE +/- 0.23, N = 3)
  Noctua NH-U9 TR4-SP3: 94.3 (SE +/- 0.46, N = 3)

1. (CXX) g++ options for all srsLTE results above: -std=c++11 -fno-strict-aliasing -march=native -mfpmath=sse -mavx2 -fvisibility=hidden -O3 -fno-trapping-math -fno-math-errno -rdynamic -lpthread -lmbedcrypto -lconfig++ -lsctp -lbladeRF -lm -lfftw3f

GNU Radio

GNU Radio is a free software development toolkit providing signal processing blocks to implement software-defined radios (SDR) and signal processing systems. Learn more via the OpenBenchmarking.org test page.

GNU Radio - Test: Five Back to Back FIR Filters (MiB/s, more is better)
  Dynatron A38: 560.1 (SE +/- 5.99, N = 9)
  Dynatron A26: 567.7 (SE +/- 5.98, N = 3)
  Noctua NH-U9 TR4-SP3: 560.9 (SE +/- 8.06, N = 4)

GNU Radio - Test: Signal Source (Cosine) (MiB/s, more is better)
  Dynatron A38: 3303.6 (SE +/- 17.69, N = 9)
  Dynatron A26: 3324.7 (SE +/- 26.65, N = 3)
  Noctua NH-U9 TR4-SP3: 3322.2 (SE +/- 6.28, N = 4)

GNU Radio - Test: FIR Filter (MiB/s, more is better)
  Dynatron A38: 642.0 (SE +/- 0.99, N = 9)
  Dynatron A26: 641.6 (SE +/- 1.06, N = 3)
  Noctua NH-U9 TR4-SP3: 639.0 (SE +/- 0.72, N = 4)

GNU Radio - Test: IIR Filter (MiB/s, more is better)
  Dynatron A38: 607.0 (SE +/- 1.20, N = 9)
  Dynatron A26: 604.7 (SE +/- 1.25, N = 3)
  Noctua NH-U9 TR4-SP3: 608.6 (SE +/- 1.08, N = 4)

GNU Radio - Test: FM Deemphasis Filter (MiB/s, more is better)
  Dynatron A38: 764.1 (SE +/- 1.58, N = 9)
  Dynatron A26: 760.3 (SE +/- 1.05, N = 3)
  Noctua NH-U9 TR4-SP3: 765.3 (SE +/- 1.86, N = 4)

GNU Radio - Test: Hilbert Transform (MiB/s, more is better)
  Dynatron A38: 376.2 (SE +/- 0.76, N = 9)
  Dynatron A26: 377.3 (SE +/- 1.82, N = 3)
  Noctua NH-U9 TR4-SP3: 378.2 (SE +/- 1.21, N = 4)

1. Footnote for all GNU Radio results above: 3.8.1.0

LuaRadio

LuaRadio is a lightweight software-defined radio (SDR) framework built atop LuaJIT. LuaRadio provides a suite of source, sink, and processing blocks, with a simple API for defining flow graphs, running flow graphs, creating blocks, and creating data types. Learn more via the OpenBenchmarking.org test page.

LuaRadio 0.9.1 - Test: Five Back to Back FIR Filters (MiB/s, more is better)
  Dynatron A38: 1097.4 (SE +/- 2.63, N = 3)
  Dynatron A26: 1110.6 (SE +/- 1.05, N = 3)
  Noctua NH-U9 TR4-SP3: 1101.6 (SE +/- 3.75, N = 3)

LuaRadio 0.9.1 - Test: FM Deemphasis Filter (MiB/s, more is better)
  Dynatron A38: 343.4 (SE +/- 0.41, N = 3)
  Dynatron A26: 344.2 (SE +/- 0.22, N = 3)
  Noctua NH-U9 TR4-SP3: 344.7 (SE +/- 0.20, N = 3)

LuaRadio 0.9.1 - Test: Hilbert Transform (MiB/s, more is better)
  Dynatron A38: 93.5 (SE +/- 0.03, N = 3)
  Dynatron A26: 93.6 (SE +/- 0.03, N = 3)
  Noctua NH-U9 TR4-SP3: 93.5 (SE +/- 0.06, N = 3)

LuaRadio 0.9.1 - Test: Complex Phase (MiB/s, more is better)
  Dynatron A38: 591.0 (SE +/- 0.49, N = 3)
  Dynatron A26: 591.6 (SE +/- 0.72, N = 3)
  Noctua NH-U9 TR4-SP3: 591.8 (SE +/- 0.62, N = 3)

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 2021.01.31 - Threads: 16 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
  Dynatron A38: 804146667 (SE +/- 2740148.01, N = 3)
  Dynatron A26: 800190000 (SE +/- 5535858.86, N = 3)
  Noctua NH-U9 TR4-SP3: 803310000 (SE +/- 6476302.96, N = 3)

Liquid-DSP 2021.01.31 - Threads: 32 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
  Dynatron A38: 1614566667 (SE +/- 2380709.51, N = 3)
  Dynatron A26: 1614933333 (SE +/- 3637917.60, N = 3)
  Noctua NH-U9 TR4-SP3: 1613766667 (SE +/- 3887729.99, N = 3)

Liquid-DSP 2021.01.31 - Threads: 64 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
  Dynatron A38: 2793733333 (SE +/- 3347304.06, N = 3)
  Dynatron A26: 2782866667 (SE +/- 3268196.92, N = 3)
  Noctua NH-U9 TR4-SP3: 2792400000 (SE +/- 5372460.64, N = 3)

Liquid-DSP 2021.01.31 - Threads: 128 - Buffer Length: 256 - Filter Length: 57 (samples/s, more is better)
  Dynatron A38: 3028133333 (SE +/- 266666.67, N = 3)
  Dynatron A26: 3028066667 (SE +/- 448454.13, N = 3)
  Noctua NH-U9 TR4-SP3: 3017933333 (SE +/- 2577035.33, N = 3)

1. (CC) gcc options for all Liquid-DSP results above: -O3 -pthread -lm -lc -lliquid

GROMACS

This test runs the GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package on the CPU with the water_GMX50 data. Learn more via the OpenBenchmarking.org test page.

GROMACS 2021 - Input: water_GMX50_bare (Ns Per Day, more is better)
  Dynatron A38: 5.599 (SE +/- 0.010, N = 3)
  Dynatron A26: 5.582 (SE +/- 0.018, N = 3)
  Noctua NH-U9 TR4-SP3: 5.577 (SE +/- 0.003, N = 3)
  1. (CXX) g++ options: -O3 -pthread

NAMD

NAMD is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. NAMD was developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. Learn more via the OpenBenchmarking.org test page.

NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, fewer is better)
  Dynatron A38: 0.38215 (SE +/- 0.00076, N = 3)
  Dynatron A26: 0.38164 (SE +/- 0.00041, N = 3)
  Noctua NH-U9 TR4-SP3: 0.38110 (SE +/- 0.00051, N = 3)
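NAMD reports days/ns (fewer is better), whereas GROMACS above reports ns per day (more is better). The two units are reciprocals, so a minimal sketch of the conversion is shown here. The helper name is hypothetical, and the mapping of values to coolers assumes they appear in the same order as the cooler labels. The two tests simulate different systems, so the converted NAMD figures are not directly comparable with the GROMACS numbers.

```python
def days_per_ns_to_ns_per_day(days_per_ns):
    """Convert a days/ns figure into ns/day (its reciprocal)."""
    if days_per_ns <= 0:
        raise ValueError("days/ns must be positive")
    return 1.0 / days_per_ns


# NAMD 2.14, ATPase Simulation, 327,506 atoms (days/ns).
namd_days_per_ns = {
    "Dynatron A38": 0.38215,
    "Dynatron A26": 0.38164,
    "Noctua NH-U9 TR4-SP3": 0.38110,
}

for cooler, value in namd_days_per_ns.items():
    print(f"{cooler}: {days_per_ns_to_ns_per_day(value):.3f} ns/day")
```

Expressed this way, all three coolers land at roughly 2.62 ns/day, which makes the near-identical performance easier to see.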

CPU Temperature Monitor

CPU Temperature Monitor - Phoronix Test Suite System Monitoring (Celsius)
  Dynatron A38: Min: 40.25 / Avg: 51.96 / Max: 70.25
  Dynatron A26: Min: 41 / Avg: 59.01 / Max: 79.25
  Noctua NH-U9 TR4-SP3: Min: 41.5 / Avg: 56.86 / Max: 79.5
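Since performance is nearly identical across the coolers, the temperature readings are where they differ most. The sketch below computes how far each cooler sits above the coolest one for a given statistic. The function name delta_vs_coolest is illustrative, and the mapping of readings to coolers assumes they are listed in the same order as the cooler labels.

```python
# CPU temperature readings in Celsius over the full benchmark run.
temps = {
    "Dynatron A38": {"min": 40.25, "avg": 51.96, "max": 70.25},
    "Dynatron A26": {"min": 41.0, "avg": 59.01, "max": 79.25},
    "Noctua NH-U9 TR4-SP3": {"min": 41.5, "avg": 56.86, "max": 79.5},
}


def delta_vs_coolest(readings, key):
    """Degrees above the lowest reading for the chosen statistic."""
    best = min(r[key] for r in readings.values())
    return {name: round(r[key] - best, 2) for name, r in readings.items()}


for key in ("avg", "max"):
    print(key, delta_vs_coolest(temps, key))
```

Under that ordering assumption, the Dynatron A38 has the lowest average and peak temperatures, with the other two coolers several degrees higher on average and about 9 degrees higher at peak.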

107 Results Shown

IndigoBench:
  CPU - Supercar
  CPU - Bedroom
Chaos Group V-RAY
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Fishy Cat - CPU-Only
  Pabellon Barcelona - CPU-Only
  Barbershop - CPU-Only
Timed Linux Kernel Compilation
Timed GDB GNU Debugger Compilation
Timed Apache Compilation
Timed Mesa Compilation
Timed Node.js Compilation
Timed Erlang/OTP Compilation
AOM AV1:
  Speed 9 Realtime - Bosphorus 1080p
  Speed 9 Realtime - Bosphorus 4K
  Speed 8 Realtime - Bosphorus 1080p
  Speed 8 Realtime - Bosphorus 4K
  Speed 6 Realtime - Bosphorus 1080p
  Speed 6 Realtime - Bosphorus 4K
  Speed 6 Two-Pass - Bosphorus 1080p
  Speed 6 Two-Pass - Bosphorus 4K
  Speed 4 Two-Pass - Bosphorus 1080p
  Speed 4 Two-Pass - Bosphorus 4K
  Speed 0 Two-Pass - Bosphorus 1080p
  Speed 0 Two-Pass - Bosphorus 4K
SVT-VP9:
  Visual Quality Optimized - Bosphorus 1080p
  PSNR/SSIM Optimized - Bosphorus 1080p
  VMAF Optimized - Bosphorus 1080p
SVT-HEVC:
  1 - Bosphorus 1080p
  7 - Bosphorus 1080p
  10 - Bosphorus 1080p
SVT-AV1:
  Enc Mode 8 - 1080p
  Enc Mode 4 - 1080p
  Enc Mode 0 - 1080p
ViennaCL:
  CPU BLAS - sCOPY
  CPU BLAS - sAXPY
  CPU BLAS - dCOPY
  CPU BLAS - dAXPY
  CPU BLAS - dDOT
  CPU BLAS - dGEMV-N
  CPU BLAS - dGEMV-T
  CPU BLAS - dGEMM-NN
  CPU BLAS - dGEMM-NT
  CPU BLAS - dGEMM-TN
  CPU BLAS - dGEMM-TT
  CPU BLAS - sDOT
GNU GMP GMPbench
OpenSCAD:
  Projector Mount Swivel
  Leonardo Phone Case Slim
  Pistol
  Retro Car
  Mini-ITX Case
Stockfish
Xcompact3d Incompact3d:
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
  X3D-benchmarking input.i3d
oneDNN:
  Convolution Batch Shapes Auto - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  IP Shapes 1D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  Matrix Multiply Batch Shapes Transformer - f32 - CPU
  Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Mobile Neural Network:
  SqueezeNetV1.0
  resnet-v2-50
  MobileNetV2_224
  mobilenet-v1-1.0
  inception-v3
ASTC Encoder:
  Medium
  Thorough
  Exhaustive
simdjson:
  PartialTweets
  LargeRandom
  Kostya
  DistinctUserID
srsLTE:
  OFDM_Test
  PHY_DL_Test (eNb Mb/s)
  PHY_DL_Test (UE Mb/s)
GNU Radio:
  Five Back to Back FIR Filters
  Signal Source (Cosine)
  FIR Filter
  IIR Filter
  FM Deemphasis Filter
  Hilbert Transform
LuaRadio:
  Five Back to Back FIR Filters
  FM Deemphasis Filter
  Hilbert Transform
  Complex Phase
Liquid-DSP:
  16 - 256 - 57
  32 - 256 - 57
  64 - 256 - 57
  128 - 256 - 57
GROMACS
NAMD
CPU Temperature Monitor