AMD 3D V-Cache Comparison

Tests for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2204297-NE-CC929132156
Run Management

Result Identifier    Date Run         Test Duration
Ryzen 9 5950X        April 26 2022    14 Hours, 53 Minutes
Ryzen 7 5800X3D      April 26 2022    14 Hours, 56 Minutes
Ryzen 7 5800X        April 27 2022    18 Hours, 29 Minutes
Ryzen 9 5900X        April 28 2022    14 Hours, 16 Minutes
Core i9 12900K       April 28 2022    14 Hours, 51 Minutes
Average                               15 Hours, 29 Minutes



AMD 3D V-Cache Comparison - System Table

The first system lists every field; each later system lists the fields the table shows for it.

Ryzen 9 5950X
  Processor: AMD Ryzen 9 5950X 16-Core @ 3.40GHz (16 Cores / 32 Threads)
  Motherboard: ASUS ROG CROSSHAIR VIII HERO (WI-FI) (4006 BIOS)
  Chipset: AMD Starship/Matisse
  Memory: 32GB
  Disk: 1000GB Sabrent Rocket 4.0 1TB
  Graphics: AMD Radeon RX 6800 16GB (2475/1000MHz)
  Audio: AMD Navi 21 HDMI Audio
  Monitor: ASUS MG28U
  Network: Realtek RTL8125 2.5GbE + Intel I211 + Intel Wi-Fi 6 AX200
  OS: Ubuntu 22.04
  Kernel: 5.17.4-051704-generic (x86_64)
  Desktop: GNOME Shell 42.0
  Display Server: X Server + Wayland
  OpenGL: 4.6 Mesa 22.2.0-devel (git-092ac67 2022-04-21 jammy-oibaf-ppa) (LLVM 14.0.0 DRM 3.44)
  Vulkan: 1.3.211
  Compiler: GCC 11.2.0
  File-System: ext4
  Screen Resolution: 3840x2160

Ryzen 7 5800X3D
  Processor: AMD Ryzen 7 5800X3D 8-Core @ 3.40GHz (8 Cores / 16 Threads)
  Motherboard: ASRock X570 Pro4 (P4.30 BIOS)
  Memory: 16GB
  Graphics: AMD Radeon RX 6800 XT 16GB (2575/1000MHz)
  Monitor: ASUS VP28U
  Network: Intel I211

Ryzen 7 5800X
  Processor: AMD Ryzen 7 5800X 8-Core @ 3.80GHz (8 Cores / 16 Threads)

Ryzen 9 5900X
  Processor: AMD Ryzen 9 5900X 12-Core @ 3.70GHz (12 Cores / 24 Threads)
  Motherboard: ASUS ROG CROSSHAIR VIII HERO (3904 BIOS)
  Graphics: NVIDIA NV134 8GB
  Audio: NVIDIA GP104 HD Audio
  Monitor: ASUS MG28U
  Network: Realtek RTL8125 2.5GbE + Intel I211
  Display Driver: nouveau
  OpenGL: 4.3 Mesa 22.2.0-devel (git-092ac67 2022-04-21 jammy-oibaf-ppa)

Core i9 12900K
  Processor: Intel Core i9-12900K @ 5.20GHz (16 Cores / 24 Threads)
  Motherboard: ASUS ROG STRIX Z690-E GAMING WIFI (1003 BIOS)
  Chipset: Intel Device 7aa7
  Memory: 32GB
  Graphics: AMD Radeon RX 6800 XT 16GB (2575/1000MHz)
  Audio: Intel Device 7ad0
  Monitor: ASUS VP28U
  Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411
  OpenGL: 4.6 Mesa 22.2.0-devel (git-092ac67 2022-04-21 jammy-oibaf-ppa) (LLVM 14.0.0 DRM 3.44)

Kernel Details
  Transparent Huge Pages: madvise

Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-11-gBFGDP/gcc-11-11.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  Ryzen 9 5950X: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016
  Ryzen 7 5800X3D: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201205
  Ryzen 7 5800X: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016
  Ryzen 9 5900X: Scaling Governor: acpi-cpufreq schedutil (Boost: Enabled) - CPU Microcode: 0xa201016
  Core i9 12900K: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x18 - Thermald 2.4.9

Python Details
  Python 3.10.4

Security Details
  Ryzen 9 5950X, Ryzen 7 5800X3D, Ryzen 7 5800X, Ryzen 9 5900X (identical entries): itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling + srbds: Not affected + tsx_async_abort: Not affected
  Core i9 12900K: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Not affected + tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite): relative performance chart for Ryzen 9 5950X, Ryzen 7 5800X3D, Ryzen 7 5800X, Ryzen 9 5900X and Core i9 12900K, on an axis from 100% to 323%. Tests covered: LeelaChessZero, Xcompact3d Incompact3d, OpenFOAM, ASKAP, Caffe, WebP2 Image Encode, TNN, NCNN, Mlpack Benchmark, Mobile Neural Network, ONNX Runtime, oneDNN, Numpy Benchmark, ECP-CANDLE, Open Porous Media Git.
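The Result Overview axis starts at 100%, which suggests each result is scaled against the slowest system. The following is a minimal sketch of that kind of normalization, not the Phoronix Test Suite's own code. It uses the Hogbom Clean OpenMP figures from this result file; the function name `relative_percent` and its `higher_is_better` parameter are illustrative assumptions.

```python
# Hogbom Clean OpenMP results from this result file
# (iterations per second, more is better).
results = {
    "Core i9 12900K": 540.54,
    "Ryzen 9 5900X": 238.67,
    "Ryzen 7 5800X": 221.73,
    "Ryzen 7 5800X3D": 527.72,
    "Ryzen 9 5950X": 220.04,
}


def relative_percent(values, higher_is_better=True):
    """Scale results so that the slowest system sits at 100%.

    Hypothetical helper: assumes the overview normalizes against the
    slowest result, which the source page does not state explicitly.
    """
    if higher_is_better:
        worst = min(values.values())
        return {name: 100.0 * v / worst for name, v in values.items()}
    # For time-based metrics the largest value is the slowest system.
    worst = max(values.values())
    return {name: 100.0 * worst / v for name, v in values.items()}


pct = relative_percent(results)
for name, value in pct.items():
    print(f"{name}: {value:.0f}%")
```

Normalizing per test before combining keeps tests with large absolute numbers from dominating the overview.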

AMD 3D V-Cache Comparisononnx: yolov4 - CPU - Standardonnx: yolov4 - CPU - Parallelonnx: fcn-resnet101-11 - CPU - Standardonnx: fcn-resnet101-11 - CPU - Parallelonnx: super-resolution-10 - CPU - Standardonnx: super-resolution-10 - CPU - Parallelonnx: bertsquad-12 - CPU - Standardonnx: bertsquad-12 - CPU - Parallelonnx: GPT-2 - CPU - Standardonnx: GPT-2 - CPU - Parallelonnx: ArcFace ResNet-100 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Parallelaskap: Hogbom Clean OpenMPaskap: tConvolve OpenMP - Griddingaskap: tConvolve OpenMP - Degriddingaskap: tConvolve MT - Griddingaskap: tConvolve MT - Degriddingaskap: tConvolve MPI - Degriddingaskap: tConvolve MPI - Griddinglczero: BLASlczero: Eigennumpy: caffe: AlexNet - CPU - 100caffe: AlexNet - CPU - 200caffe: GoogleNet - CPU - 100caffe: GoogleNet - CPU - 200onednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Convolution Batch Shapes Auto - u8s8f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: IP Shapes 1D - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPUonednn: IP Shapes 3D - f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - f32 - CPUonednn: Recurrent Neural Network Inference - u8s8f32 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUncnn: CPU - mobilenetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU - shufflenet-v2ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - vgg16ncnn: CPU - resnet18ncnn: CPU - alexnetncnn: CPU - resnet50ncnn: CPU - yolov4-tinyncnn: CPU - squeezenet_ssdncnn: CPU - regnety_400mtnn: CPU - DenseNettnn: CPU - MobileNet v2tnn: CPU - 
SqueezeNet v1.1tnn: CPU - SqueezeNet v2mnn: mobilenetV3mnn: squeezenetv1.1mnn: resnet-v2-50mnn: SqueezeNetV1.0mnn: MobileNetV2_224mnn: mobilenet-v1-1.0mnn: inception-v3ecp-candle: P1B2ecp-candle: P3B1ecp-candle: P3B2mlpack: scikit_svmmlpack: scikit_linearridgeregressionmlpack: scikit_qdamlpack: scikit_icaopenfoam: Motorbike 30Mopenfoam: Motorbike 60Mincompact3d: input.i3d 129 Cells Per Directionincompact3d: input.i3d 193 Cells Per Directionopm-git: Flow MPI Norne - 1opm-git: Flow MPI Norne - 2opm-git: Flow MPI Norne - 4opm-git: Flow MPI Norne - 8opm-git: Flow MPI Norne-4C MSW - 1opm-git: Flow MPI Norne-4C MSW - 2opm-git: Flow MPI Norne-4C MSW - 4opm-git: Flow MPI Norne-4C MSW - 8opm-git: Flow MPI Extra - 1opm-git: Flow MPI Extra - 2opm-git: Flow MPI Extra - 4opm-git: Flow MPI Extra - 8webp2: Defaultwebp2: Quality 75, Compression Effort 7webp2: Quality 95, Compression Effort 7webp2: Quality 100, Compression Effort 5webp2: Quality 100, Lossless CompressionRyzen 9 5950XRyzen 7 5800X3DRyzen 7 5800XRyzen 9 5900XCore i9 
12900K4872969888616964368005547062553416681540220.0362755.723214.58784.5071346.576643.646728.09668684591.5436658733529662419371016.705118.03214.755331.069263.626091.618993.896470.8230097.796520.4760972703.332745.442750.181814.131783.851820.4012.274.293.774.163.875.231.8012.9256.5514.2811.0824.4020.4914.559.612425.594224.264213.32350.9402.3913.98820.5155.2313.2752.63925.72631.3921309.594654.35316.561.6955.0641.7798.441382.5433.9481214156.162526270.81203.89232.61357.86569.14441.07377.03575.131074.33712.62690.62808.052.165110.284234.9273.249536.76757230810772410746068225188826691919391223527.7236946.118741.6930.5741594.956453.228746.5212541160603.0930222604948086216202212.622710.38007.276231.776415.571092.483932.897171.237856.557650.6047802683.162691.672691.241382.161387.861385.797.622.131.902.112.013.011.067.3142.6010.279.6218.1314.8012.445.182609.843233.179222.33253.2211.0822.59016.3564.2131.8311.67623.19329.9921023.315553.74516.171.6338.2734.8880.291090.2127.1814976126.861033224.10187.49223.97361.91476.91403.35357.33581.46997.34672.48643.96781.983.170178.013376.9565.169864.9354312734967362841035574806832562110421107221.7321709.253367.67854.7071471.583976.304256.05867854491.5633905676618982517914618.753516.35348.342032.036006.370232.839133.291361.407288.935521.457523053.303055.843065.261859.141864.541847.7011.632.612.282.352.243.601.2210.2255.9713.0211.8320.2219.7816.765.933016.479272.037266.18763.6051.1562.80518.4424.5361.9421.81625.69234.8411158.674390.26920.291.8252.5739.16177.601270.7237.2105344141.210954281.47234.63275.40447.63601.31511.31454.34733.911154.09799.40829.491027.173.619199.645420.0325.875974.59153929911585754555919495407862568715191421238.6672936.853732.72837.9441541.677668.058258.02954886605.1734329686768949217872516.139816.73825.386891.300494.387731.997423.445481.0069417.548430.5197332873.452908.282904.331785.051742.061776.1911.423.913.463.883.444.771.6311.4450.7512.5010.0121.2120.6313.518.432563.757224.199213.77650.8521.8383.39424.4994.7582.9754.0
7223.69230.2411174.469667.28216.151.5851.3939.3796.161277.5532.1536293144.473750273.89203.98233.63361.84572.60441.14378.33581.351073.71713.34691.38810.332.439129.733268.6123.709614.812693629101111474745169889191108280881975362540.5414631.827793.772721.153872.034198.514472.7322012161665.342559051713678331340455.875366.002148.737501.352125.253942.223362.636111.052003.446430.8120472881.722881.102881.321613.811616.061617.1011.113.412.903.103.105.341.469.9428.249.667.5816.8415.8613.277.391792.972170.583133.97038.9151.1742.40523.0854.1512.412.89124.30120.941429.611503.26410.541.6027.3132.9384.47487.8014.553045055.6096713247.89223.20280.85515.71537.38475.95451.00825.561053.38717.04671.56891.782.062101.568215.5822.936476.451OpenBenchmarking.org

ONNX Runtime

ONNX Runtime is developed by Microsoft and partners as an open-source, cross-platform, high-performance machine learning inferencing and training accelerator. This test profile runs the ONNX Runtime with various models available from the ONNX Zoo. Learn more via the OpenBenchmarking.org test page.

ONNX Runtime 1.11 - Model: yolov4 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better
  Core i9 12900K: 693 (SE +/- 1.42, N = 3)
  Ryzen 9 5900X: 539 (SE +/- 17.59, N = 12)
  Ryzen 7 5800X: 431 (SE +/- 28.87, N = 12)
  Ryzen 7 5800X3D: 572 (SE +/- 42.62, N = 12)
  Ryzen 9 5950X: 487 (SE +/- 17.40, N = 9)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt
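Each result carries an "SE +/- x, N = n" annotation: a standard error over N runs. As a rough sketch, and assuming the conventional definition (sample standard deviation divided by the square root of N, which may differ from what the Phoronix Test Suite computes internally), it can be reproduced like this. The per-run values below are hypothetical, since the result file reports only the aggregate.

```python
import math
import statistics


def standard_error(samples):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two runs to estimate a standard error")
    return statistics.stdev(samples) / math.sqrt(n)


# Hypothetical per-run results (inferences per minute); NOT taken from
# the result file, which only publishes the mean, SE and N.
runs = [690.0, 693.0, 696.0]

mean = statistics.fmean(runs)
se = standard_error(runs)
print(f"{mean:.0f} (SE +/- {se:.2f}, N = {len(runs)})")
```

A large SE relative to the mean, as in the N = 12 entries above, indicates run-to-run variance worth keeping in mind when comparing close results.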

ONNX Runtime 1.11 - Model: yolov4 - Device: CPU - Executor: Parallel
Inferences Per Minute, More Is Better
  Core i9 12900K: 629 (SE +/- 1.09, N = 3)
  Ryzen 9 5900X: 299 (SE +/- 0.33, N = 3)
  Ryzen 7 5800X: 273 (SE +/- 0.44, N = 3)
  Ryzen 7 5800X3D: 308 (SE +/- 0.60, N = 3)
  Ryzen 9 5950X: 296 (SE +/- 0.44, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: fcn-resnet101-11 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better
  Core i9 12900K: 101 (SE +/- 0.00, N = 3)
  Ryzen 9 5900X: 115 (SE +/- 0.44, N = 3)
  Ryzen 7 5800X: 49 (SE +/- 0.00, N = 3)
  Ryzen 7 5800X3D: 107 (SE +/- 0.17, N = 3)
  Ryzen 9 5950X: 98 (SE +/- 4.73, N = 12)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel
Inferences Per Minute, More Is Better
  Core i9 12900K: 111 (SE +/- 0.44, N = 3)
  Ryzen 9 5900X: 85 (SE +/- 0.00, N = 3)
  Ryzen 7 5800X: 67 (SE +/- 0.29, N = 3)
  Ryzen 7 5800X3D: 72 (SE +/- 0.17, N = 3)
  Ryzen 9 5950X: 88 (SE +/- 0.33, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: super-resolution-10 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better
  Core i9 12900K: 4747 (SE +/- 10.33, N = 3)
  Ryzen 9 5900X: 7545 (SE +/- 318.33, N = 12)
  Ryzen 7 5800X: 3628 (SE +/- 10.27, N = 3)
  Ryzen 7 5800X3D: 4107 (SE +/- 11.77, N = 3)
  Ryzen 9 5950X: 6169 (SE +/- 21.53, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: super-resolution-10 - Device: CPU - Executor: Parallel
Inferences Per Minute, More Is Better
  Core i9 12900K: 4516 (SE +/- 46.82, N = 4)
  Ryzen 9 5900X: 5591 (SE +/- 5.61, N = 3)
  Ryzen 7 5800X: 4103 (SE +/- 10.11, N = 3)
  Ryzen 7 5800X3D: 4606 (SE +/- 15.67, N = 3)
  Ryzen 9 5950X: 6436 (SE +/- 22.98, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: bertsquad-12 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better
  Core i9 12900K: 988 (SE +/- 0.76, N = 3)
  Ryzen 9 5900X: 949 (SE +/- 4.80, N = 3)
  Ryzen 7 5800X: 557 (SE +/- 0.17, N = 3)
  Ryzen 7 5800X3D: 822 (SE +/- 57.02, N = 12)
  Ryzen 9 5950X: 800 (SE +/- 1.20, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: bertsquad-12 - Device: CPU - Executor: Parallel
Inferences Per Minute, More Is Better
  Core i9 12900K: 919 (SE +/- 11.29, N = 3)
  Ryzen 9 5900X: 540 (SE +/- 0.60, N = 3)
  Ryzen 7 5800X: 480 (SE +/- 0.29, N = 3)
  Ryzen 7 5800X3D: 518 (SE +/- 1.17, N = 3)
  Ryzen 9 5950X: 554 (SE +/- 0.29, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: GPT-2 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better
  Core i9 12900K: 11082 (SE +/- 20.71, N = 3)
  Ryzen 9 5900X: 7862 (SE +/- 10.85, N = 3)
  Ryzen 7 5800X: 6832 (SE +/- 104.18, N = 12)
  Ryzen 7 5800X3D: 8826 (SE +/- 14.95, N = 3)
  Ryzen 9 5950X: 7062 (SE +/- 58.87, N = 8)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: GPT-2 - Device: CPU - Executor: Parallel
Inferences Per Minute, More Is Better
  Core i9 12900K: 8088 (SE +/- 20.34, N = 3)
  Ryzen 9 5900X: 5687 (SE +/- 11.00, N = 3)
  Ryzen 7 5800X: 5621 (SE +/- 8.85, N = 3)
  Ryzen 7 5800X3D: 6919 (SE +/- 5.78, N = 3)
  Ryzen 9 5950X: 5534 (SE +/- 11.18, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard
Inferences Per Minute, More Is Better
  Core i9 12900K: 1975 (SE +/- 0.67, N = 3)
  Ryzen 9 5900X: 1519 (SE +/- 45.67, N = 12)
  Ryzen 7 5800X: 1042 (SE +/- 1.33, N = 3)
  Ryzen 7 5800X3D: 1939 (SE +/- 3.42, N = 3)
  Ryzen 9 5950X: 1668 (SE +/- 46.80, N = 9)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ONNX Runtime 1.11 - Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel
Inferences Per Minute, More Is Better
  Core i9 12900K: 362 (SE +/- 0.17, N = 3)
  Ryzen 9 5900X: 1421 (SE +/- 1.17, N = 3)
  Ryzen 7 5800X: 1107 (SE +/- 4.25, N = 3)
  Ryzen 7 5800X3D: 1223 (SE +/- 3.51, N = 3)
  Ryzen 9 5950X: 1540 (SE +/- 8.09, N = 3)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto -fno-fat-lto-objects -ldl -lrt

ASKAP

ASKAP is a set of benchmarks from the Australian SKA Pathfinder. The principal ASKAP benchmarks are the Hogbom Clean Benchmark (tHogbomClean) and the Convolutional Resampling Benchmark (tConvolve); some earlier ASKAP benchmarks are also included for OpenCL and CUDA execution of tConvolve. Learn more via the OpenBenchmarking.org test page.

ASKAP 1.0 - Test: Hogbom Clean OpenMP
Iterations Per Second, More Is Better
  Core i9 12900K: 540.54 (SE +/- 0.00, N = 5)
  Ryzen 9 5900X: 238.67 (SE +/- 0.52, N = 4)
  Ryzen 7 5800X: 221.73 (SE +/- 0.57, N = 3)
  Ryzen 7 5800X3D: 527.72 (SE +/- 1.80, N = 4)
  Ryzen 9 5950X: 220.04 (SE +/- 1.00, N = 4)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Gridding
Million Grid Points Per Second, More Is Better
  Core i9 12900K: 4631.82 (SE +/- 34.51, N = 6)
  Ryzen 9 5900X: 2936.85 (SE +/- 10.74, N = 6)
  Ryzen 7 5800X: 1709.25 (SE +/- 14.53, N = 15)
  Ryzen 7 5800X3D: 6946.11 (SE +/- 78.12, N = 15)
  Ryzen 9 5950X: 2755.72 (SE +/- 27.21, N = 6)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve OpenMP - Degridding
Million Grid Points Per Second, More Is Better
  Core i9 12900K: 7793.77 (SE +/- 37.29, N = 6)
  Ryzen 9 5900X: 3732.72 (SE +/- 10.98, N = 6)
  Ryzen 7 5800X: 3367.67 (SE +/- 6.53, N = 15)
  Ryzen 7 5800X3D: 8741.60 (SE +/- 38.17, N = 15)
  Ryzen 9 5950X: 3214.58 (SE +/- 11.91, N = 6)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MT - Gridding
Million Grid Points Per Second, More Is Better
  Core i9 12900K: 2721.15 (SE +/- 2.70, N = 3)
  Ryzen 9 5900X: 837.94 (SE +/- 0.90, N = 3)
  Ryzen 7 5800X: 854.71 (SE +/- 2.02, N = 3)
  Ryzen 7 5800X3D: 930.57 (SE +/- 2.64, N = 3)
  Ryzen 9 5950X: 784.51 (SE +/- 1.65, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MT - Degridding
Million Grid Points Per Second, More Is Better
  Core i9 12900K: 3872.03 (SE +/- 2.07, N = 3)
  Ryzen 9 5900X: 1541.67 (SE +/- 3.45, N = 3)
  Ryzen 7 5800X: 1471.58 (SE +/- 5.57, N = 3)
  Ryzen 7 5800X3D: 1594.95 (SE +/- 1.83, N = 3)
  Ryzen 9 5950X: 1346.57 (SE +/- 0.92, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MPI - Degridding
Mpix/sec, More Is Better
  Core i9 12900K: 4198.51 (SE +/- 19.39, N = 3)
  Ryzen 9 5900X: 7668.05 (SE +/- 49.47, N = 3)
  Ryzen 7 5800X: 3976.30 (SE +/- 34.79, N = 3)
  Ryzen 7 5800X3D: 6453.22 (SE +/- 53.33, N = 3)
  Ryzen 9 5950X: 6643.64 (SE +/- 48.56, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

ASKAP 1.0 - Test: tConvolve MPI - Gridding
Mpix/sec, More Is Better
  Core i9 12900K: 4472.73
  Ryzen 9 5900X: 8258.02
  Ryzen 7 5800X: 4256.05
  Ryzen 7 5800X3D: 8746.52
  Ryzen 9 5950X: 6728.09
Standard errors as listed (four entries for five systems): SE +/- 12.67, N = 3; SE +/- 58.16, N = 3; SE +/- 45.52, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28 - Backend: BLAS
Nodes Per Second, More Is Better
  Core i9 12900K: 2201 (SE +/- 22.18, N = 3)
  Ryzen 9 5900X: 954 (SE +/- 11.02, N = 3)
  Ryzen 7 5800X: 867 (SE +/- 6.36, N = 3)
  Ryzen 7 5800X3D: 1254 (SE +/- 6.08, N = 3)
  Ryzen 9 5950X: 668 (SE +/- 2.73, N = 3)
1. (CXX) g++ options: -flto -pthread

LeelaChessZero 0.28 - Backend: Eigen
Nodes Per Second, More Is Better
  Core i9 12900K: 2161 (SE +/- 19.84, N = 7)
  Ryzen 9 5900X: 886 (SE +/- 7.53, N = 9)
  Ryzen 7 5800X: 854 (SE +/- 8.85, N = 9)
  Ryzen 7 5800X3D: 1160 (SE +/- 8.89, N = 3)
  Ryzen 9 5950X: 684 (SE +/- 7.32, N = 9)
1. (CXX) g++ options: -flto -pthread

Numpy Benchmark

This test measures general NumPy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark
Score, More Is Better
  Core i9 12900K: 665.34 (SE +/- 0.68, N = 3)
  Ryzen 9 5900X: 605.17 (SE +/- 8.26, N = 3)
  Ryzen 7 5800X: 491.56 (SE +/- 0.43, N = 3)
  Ryzen 7 5800X3D: 603.09 (SE +/- 1.21, N = 3)
  Ryzen 9 5950X: 591.54 (SE +/- 0.86, N = 3)

Caffe

This is a benchmark of the Caffe deep learning framework. It currently supports the AlexNet and GoogleNet models and execution on both CPUs and NVIDIA GPUs. Learn more via the OpenBenchmarking.org test page.

Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 100
Milli-Seconds, Fewer Is Better
  Core i9 12900K: 25590 (SE +/- 17.32, N = 3)
  Ryzen 9 5900X: 34329 (SE +/- 38.11, N = 3)
  Ryzen 7 5800X: 33905 (SE +/- 7.06, N = 3)
  Ryzen 7 5800X3D: 30222 (SE +/- 24.84, N = 3)
  Ryzen 9 5950X: 36658 (SE +/- 30.55, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe 2020-02-13 - Model: AlexNet - Acceleration: CPU - Iterations: 200
Milli-Seconds, Fewer Is Better
  Core i9 12900K: 51713 (SE +/- 323.04, N = 3)
  Ryzen 9 5900X: 68676 (SE +/- 57.49, N = 3)
  Ryzen 7 5800X: 67661 (SE +/- 49.12, N = 3)
  Ryzen 7 5800X3D: 60494 (SE +/- 94.00, N = 3)
  Ryzen 9 5950X: 73352 (SE +/- 178.17, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe 2020-02-13 - Model: GoogleNet - Acceleration: CPU - Iterations: 100
Milli-Seconds, Fewer Is Better
  Core i9 12900K: 67833 (SE +/- 741.46, N = 4)
  Ryzen 9 5900X: 89492 (SE +/- 260.55, N = 3)
  Ryzen 7 5800X: 89825 (SE +/- 34.64, N = 3)
  Ryzen 7 5800X3D: 80862 (SE +/- 158.22, N = 3)
  Ryzen 9 5950X: 96624 (SE +/- 34.42, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas

Caffe 2020-02-13 - Model: GoogleNet - Acceleration: CPU - Iterations: 200
Milli-Seconds, Fewer Is Better
  Core i9 12900K: 134045 (SE +/- 116.45, N = 3)
  Ryzen 9 5900X: 178725 (SE +/- 46.44, N = 3)
  Ryzen 7 5800X: 179146 (SE +/- 39.00, N = 3)
  Ryzen 7 5800X3D: 162022 (SE +/- 248.50, N = 3)
  Ryzen 9 5950X: 193710 (SE +/- 215.28, N = 3)
1. (CXX) g++ options: -fPIC -O3 -rdynamic -lglog -lgflags -lprotobuf -lcrypto -lcurl -lpthread -lsz -lz -ldl -lm -llmdb -lopenblas
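For "Fewer Is Better" results such as the Caffe timings above, a speedup is the baseline time divided by the candidate time, and a geometric mean is a common way to combine several such ratios. The sketch below applies this to the Ryzen 7 5800X and Ryzen 7 5800X3D figures from this result file. The `speedup` helper and the dictionary layout are illustrative assumptions, not part of the Phoronix Test Suite.

```python
from statistics import geometric_mean

# Caffe CPU timings from this result file, in milliseconds (fewer is better).
caffe_ms = {
    "AlexNet, 100 iterations": {"Ryzen 7 5800X": 33905, "Ryzen 7 5800X3D": 30222},
    "AlexNet, 200 iterations": {"Ryzen 7 5800X": 67661, "Ryzen 7 5800X3D": 60494},
    "GoogleNet, 100 iterations": {"Ryzen 7 5800X": 89825, "Ryzen 7 5800X3D": 80862},
    "GoogleNet, 200 iterations": {"Ryzen 7 5800X": 179146, "Ryzen 7 5800X3D": 162022},
}


def speedup(baseline_ms, candidate_ms):
    """For time-based metrics, speedup is baseline time over candidate time."""
    return baseline_ms / candidate_ms


# One ratio per test: how much faster the 5800X3D finishes than the 5800X.
ratios = [
    speedup(row["Ryzen 7 5800X"], row["Ryzen 7 5800X3D"])
    for row in caffe_ms.values()
]
overall = geometric_mean(ratios)

for test, ratio in zip(caffe_ms, ratios):
    print(f"{test}: {ratio:.3f}x")
print(f"Geometric mean speedup: {overall:.3f}x")
```

The geometric mean is used rather than the arithmetic mean because it treats ratios symmetrically: swapping baseline and candidate yields the reciprocal.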

oneDNN

This is a test of Intel oneDNN, an Intel-optimized library for Deep Neural Networks, using its built-in benchdnn functionality. The result is the total perf time reported. Intel oneDNN was formerly known as DNNL (Deep Neural Network Library) and MKL-DNN before being rebranded as part of Intel oneAPI. Learn more via the OpenBenchmarking.org test page.

oneDNN 2.6 - Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 5.87536 (SE +/- 0.00220, N = 7; MIN: 5.78)
  Ryzen 9 5900X: 16.13980 (SE +/- 0.29208, N = 15; -lpthread - MIN: 15.35)
  Ryzen 7 5800X: 18.75350 (SE +/- 0.12899, N = 7; -lpthread - MIN: 18.34)
  Ryzen 7 5800X3D: 12.62270 (SE +/- 0.01917, N = 7; -lpthread - MIN: 12.28)
  Ryzen 9 5950X: 16.70510 (SE +/- 0.00896, N = 7; -lpthread - MIN: 16.31)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 6.00214 (SE +/- 0.00249, N = 7; MIN: 5.9)
  Ryzen 9 5900X: 16.73820 (SE +/- 0.04249, N = 7; -lpthread - MIN: 16.08)
  Ryzen 7 5800X: 16.35340 (SE +/- 0.22593, N = 15; -lpthread - MIN: 14.78)
  Ryzen 7 5800X3D: 10.38000 (SE +/- 0.04088, N = 7; -lpthread - MIN: 9.78)
  Ryzen 9 5950X: 18.03210 (SE +/- 0.02368, N = 7; -lpthread - MIN: 17.6)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 8.73750 (SE +/- 0.16860, N = 12; MIN: 4.15)
  Ryzen 9 5900X: 5.38689 (SE +/- 0.07639, N = 3; -lpthread - MIN: 4)
  Ryzen 7 5800X: 8.34203 (SE +/- 0.11645, N = 15; -lpthread - MIN: 5.88)
  Ryzen 7 5800X3D: 7.27623 (SE +/- 0.06264, N = 3; -lpthread - MIN: 5.11)
  Ryzen 9 5950X: 4.75533 (SE +/- 0.22726, N = 15; -lpthread - MIN: 3.38)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 1.35212 (SE +/- 0.00773, N = 3; MIN: 1.28)
  Ryzen 9 5900X: 1.30049 (SE +/- 0.00489, N = 3; -lpthread - MIN: 1.2)
  Ryzen 7 5800X: 2.03600 (SE +/- 0.00040, N = 3; -lpthread - MIN: 2.01)
  Ryzen 7 5800X3D: 1.77641 (SE +/- 0.00274, N = 3; -lpthread - MIN: 1.74)
  Ryzen 9 5950X: 1.06926 (SE +/- 0.00299, N = 3; -lpthread - MIN: 0.96)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 5.25394 (SE +/- 0.00117, N = 9; MIN: 5.16)
  Ryzen 9 5900X: 4.38773 (SE +/- 0.00484, N = 9; -lpthread - MIN: 4.17)
  Ryzen 7 5800X: 6.37023 (SE +/- 0.00140, N = 9; -lpthread - MIN: 6.32)
  Ryzen 7 5800X3D: 5.57109 (SE +/- 0.00483, N = 9; -lpthread - MIN: 5.45)
  Ryzen 9 5950X: 3.62609 (SE +/- 0.00224, N = 9; -lpthread - MIN: 3.43)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 2.22336 (SE +/- 0.00121, N = 9; MIN: 2.2)
  Ryzen 9 5900X: 1.99742 (SE +/- 0.00298, N = 9; -lpthread - MIN: 1.82)
  Ryzen 7 5800X: 2.83913 (SE +/- 0.00474, N = 9; -lpthread - MIN: 2.8)
  Ryzen 7 5800X3D: 2.48393 (SE +/- 0.00503, N = 9; -lpthread - MIN: 2.41)
  Ryzen 9 5950X: 1.61899 (SE +/- 0.00702, N = 9; -lpthread - MIN: 1.45)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 2.63611 (SE +/- 0.00329, N = 4; MIN: 2.5)
  Ryzen 9 5900X: 3.44548 (SE +/- 0.02871, N = 4; -lpthread - MIN: 3.05)
  Ryzen 7 5800X: 3.29136 (SE +/- 0.00733, N = 4; -lpthread - MIN: 3.12)
  Ryzen 7 5800X3D: 2.89717 (SE +/- 0.00793, N = 4; -lpthread - MIN: 2.81)
  Ryzen 9 5950X: 3.89647 (SE +/- 0.01755, N = 4; -lpthread - MIN: 3.66)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 1.052000 (SE +/- 0.008609, N = 4; MIN: 1.01)
  Ryzen 9 5900X: 1.006941 (SE +/- 0.007658, N = 4; -lpthread - MIN: 0.94)
  Ryzen 7 5800X: 1.407280 (SE +/- 0.000645, N = 4; -lpthread - MIN: 1.39)
  Ryzen 7 5800X3D: 1.237850 (SE +/- 0.001247, N = 4; -lpthread - MIN: 1.21)
  Ryzen 9 5950X: 0.823009 (SE +/- 0.006926, N = 15; -lpthread - MIN: 0.7)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 3.44643 (SE +/- 0.00441, N = 5; MIN: 3.4)
  Ryzen 9 5900X: 7.54843 (SE +/- 0.02525, N = 5; -lpthread - MIN: 7.29)
  Ryzen 7 5800X: 8.93552 (SE +/- 0.01310, N = 5; -lpthread - MIN: 8.33)
  Ryzen 7 5800X3D: 6.55765 (SE +/- 0.01065, N = 5; -lpthread - MIN: 6.32)
  Ryzen 9 5950X: 7.79652 (SE +/- 0.05069, N = 5; -lpthread - MIN: 7.29)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 0.812047 (SE +/- 0.002976, N = 5; MIN: 0.79)
  Ryzen 9 5900X: 0.519733 (SE +/- 0.003942, N = 5; -lpthread - MIN: 0.47)
  Ryzen 7 5800X: 1.457520 (SE +/- 0.002722, N = 5; -lpthread - MIN: 1.35)
  Ryzen 7 5800X3D: 0.604780 (SE +/- 0.002108, N = 5; -lpthread - MIN: 0.58)
  Ryzen 9 5950X: 0.476097 (SE +/- 0.002527, N = 5; -lpthread - MIN: 0.42)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 2881.72 (SE +/- 0.27, N = 3; MIN: 2874.65)
  Ryzen 9 5900X: 2873.45 (SE +/- 23.67, N = 3; -lpthread - MIN: 2830.55)
  Ryzen 7 5800X: 3053.30 (SE +/- 2.27, N = 3; -lpthread - MIN: 3045.76)
  Ryzen 7 5800X3D: 2683.16 (SE +/- 5.12, N = 3; -lpthread - MIN: 2665.66)
  Ryzen 9 5950X: 2703.33 (SE +/- 10.92, N = 3; -lpthread - MIN: 2665.5)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 2881.10 (SE +/- 1.61, N = 3; MIN: 2869.73)
  Ryzen 9 5900X: 2908.28 (SE +/- 15.10, N = 3; -lpthread - MIN: 2862.45)
  Ryzen 7 5800X: 3055.84 (SE +/- 0.89, N = 3; -lpthread - MIN: 3050.91)
  Ryzen 7 5800X3D: 2691.67 (SE +/- 1.77, N = 3; -lpthread - MIN: 2680.17)
  Ryzen 9 5950X: 2745.44 (SE +/- 23.91, N = 3; -lpthread - MIN: 2684.63)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

oneDNN 2.6 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU
ms, Fewer Is Better
  Core i9 12900K: 2881.32 (SE +/- 0.76, N = 3; MIN: 2872.06)
  Ryzen 9 5900X: 2904.33 (SE +/- 32.41, N = 3; -lpthread - MIN: 2837.45)
  Ryzen 7 5800X: 3065.26 (SE +/- 1.56, N = 3; -lpthread - MIN: 3058.73)
  Ryzen 7 5800X3D: 2691.24 (SE +/- 2.34, N = 3; -lpthread - MIN: 2678.39)
  Ryzen 9 5950X: 2750.18 (SE +/- 26.41, N = 3; -lpthread - MIN: 2685.91)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPUCore i9 12900KRyzen 9 5900XRyzen 7 5800XRyzen 7 5800X3DRyzen 9 5950X400800120016002000SE +/- 0.28, N = 3SE +/- 8.68, N = 3SE +/- 5.60, N = 3SE +/- 2.31, N = 3SE +/- 8.84, N = 31613.811785.051859.141382.161814.13MIN: 1608.23-lpthread - MIN: 1762.05-lpthread - MIN: 1846.97-lpthread - MIN: 1372.35-lpthread - MIN: 1783.131. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPUCore i9 12900KRyzen 9 5900XRyzen 7 5800XRyzen 7 5800X3DRyzen 9 5950X400800120016002000SE +/- 1.95, N = 3SE +/- 13.72, N = 3SE +/- 4.89, N = 3SE +/- 0.61, N = 3SE +/- 22.20, N = 31616.061742.061864.541387.861783.85MIN: 1608.68-lpthread - MIN: 1715.13-lpthread - MIN: 1851.66-lpthread - MIN: 1380.9-lpthread - MIN: 1730.181. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.6Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUCore i9 12900KRyzen 9 5900XRyzen 7 5800XRyzen 7 5800X3DRyzen 9 5950X400800120016002000SE +/- 2.95, N = 3SE +/- 5.06, N = 3SE +/- 9.79, N = 3SE +/- 1.81, N = 3SE +/- 19.76, N = 51617.101776.191847.701385.791820.40MIN: 1608.6-lpthread - MIN: 1756.45-lpthread - MIN: 1820.32-lpthread - MIN: 1375.42-lpthread - MIN: 1761.071. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -std=c++11 -pie -ldl

NCNN

NCNN is a high-performance neural network inference framework developed by Tencent and optimized for mobile and other platforms. Learn more via the OpenBenchmarking.org test page.

NCNN 20210720 - Target: CPU - Model: mobilenet (ms, Fewer Is Better):
  Core i9 12900K: 11.11 (SE +/- 0.26, N = 15; MIN: 8.87 / MAX: 13.96)
  Ryzen 9 5900X: 11.42 (SE +/- 0.01, N = 3; MIN: 11.16 / MAX: 18.48)
  Ryzen 7 5800X: 11.63 (SE +/- 0.13, N = 15; MIN: 11.06 / MAX: 12.93)
  Ryzen 7 5800X3D: 7.62 (SE +/- 0.08, N = 15; MIN: 7.25 / MAX: 9.92)
  Ryzen 9 5950X: 12.27 (SE +/- 0.15, N = 4; MIN: 11.71 / MAX: 18.81)

NCNN 20210720 - Target: CPU-v2-v2 - Model: mobilenet-v2 (ms, Fewer Is Better):
  Core i9 12900K: 3.41 (SE +/- 0.12, N = 15; MIN: 2.72 / MAX: 5.86)
  Ryzen 9 5900X: 3.91 (SE +/- 0.01, N = 3; MIN: 3.82 / MAX: 4.11)
  Ryzen 7 5800X: 2.61 (SE +/- 0.01, N = 15; MIN: 2.54 / MAX: 3.77)
  Ryzen 7 5800X3D: 2.13 (SE +/- 0.00, N = 15; MIN: 2.03 / MAX: 2.88)
  Ryzen 9 5950X: 4.29 (SE +/- 0.01, N = 4; MIN: 4.15 / MAX: 7.55)

NCNN 20210720 - Target: CPU-v3-v3 - Model: mobilenet-v3 (ms, Fewer Is Better):
  Core i9 12900K: 2.90 (SE +/- 0.06, N = 15; MIN: 2.53 / MAX: 4.55)
  Ryzen 9 5900X: 3.46 (SE +/- 0.01, N = 3; MIN: 3.39 / MAX: 3.66)
  Ryzen 7 5800X: 2.28 (SE +/- 0.00, N = 15; MIN: 2.22 / MAX: 3.92)
  Ryzen 7 5800X3D: 1.90 (SE +/- 0.00, N = 15; MIN: 1.84 / MAX: 2.38)
  Ryzen 9 5950X: 3.77 (SE +/- 0.00, N = 4; MIN: 3.7 / MAX: 4.73)

NCNN 20210720 - Target: CPU - Model: shufflenet-v2 (ms, Fewer Is Better):
  Core i9 12900K: 3.10 (SE +/- 0.08, N = 14; MIN: 2.68 / MAX: 4.51)
  Ryzen 9 5900X: 3.88 (SE +/- 0.01, N = 3; MIN: 3.82 / MAX: 4.06)
  Ryzen 7 5800X: 2.35 (SE +/- 0.00, N = 15; MIN: 2.31 / MAX: 3.74)
  Ryzen 7 5800X3D: 2.11 (SE +/- 0.01, N = 15; MIN: 2.07 / MAX: 3)
  Ryzen 9 5950X: 4.16 (SE +/- 0.01, N = 4; MIN: 4.05 / MAX: 4.9)

NCNN 20210720 - Target: CPU - Model: mnasnet (ms, Fewer Is Better):
  Core i9 12900K: 3.10 (SE +/- 0.07, N = 15; MIN: 2.66 / MAX: 4.79)
  Ryzen 9 5900X: 3.44 (SE +/- 0.01, N = 3; MIN: 3.39 / MAX: 3.72)
  Ryzen 7 5800X: 2.24 (SE +/- 0.00, N = 15; MIN: 2.2 / MAX: 3.68)
  Ryzen 7 5800X3D: 2.01 (SE +/- 0.00, N = 15; MIN: 1.97 / MAX: 2.76)
  Ryzen 9 5950X: 3.87 (SE +/- 0.05, N = 4; MIN: 3.76 / MAX: 10.7)

NCNN 20210720 - Target: CPU - Model: efficientnet-b0 (ms, Fewer Is Better):
  Core i9 12900K: 5.34 (SE +/- 0.11, N = 15; MIN: 4.35 / MAX: 9.28)
  Ryzen 9 5900X: 4.77 (SE +/- 0.01, N = 3; MIN: 4.7 / MAX: 5)
  Ryzen 7 5800X: 3.60 (SE +/- 0.01, N = 15; MIN: 3.53 / MAX: 5.22)
  Ryzen 7 5800X3D: 3.01 (SE +/- 0.01, N = 15; MIN: 2.93 / MAX: 13.07)
  Ryzen 9 5950X: 5.23 (SE +/- 0.06, N = 4; MIN: 5.09 / MAX: 6.68)

NCNN 20210720 - Target: CPU - Model: blazeface (ms, Fewer Is Better):
  Core i9 12900K: 1.46 (SE +/- 0.05, N = 15; MIN: 1.15 / MAX: 2.96)
  Ryzen 9 5900X: 1.63 (SE +/- 0.00, N = 3; MIN: 1.61 / MAX: 1.81)
  Ryzen 7 5800X: 1.22 (SE +/- 0.00, N = 15; MIN: 1.19 / MAX: 2.1)
  Ryzen 7 5800X3D: 1.06 (SE +/- 0.00, N = 15; MIN: 1.04 / MAX: 4.31)
  Ryzen 9 5950X: 1.80 (SE +/- 0.02, N = 4; MIN: 1.75 / MAX: 2.15)

NCNN 20210720 - Target: CPU - Model: googlenet (ms, Fewer Is Better):
  Core i9 12900K: 9.94 (SE +/- 0.21, N = 15; MIN: 7.91 / MAX: 14.3)
  Ryzen 9 5900X: 11.44 (SE +/- 0.02, N = 3; MIN: 11.28 / MAX: 11.82)
  Ryzen 7 5800X: 10.22 (SE +/- 0.02, N = 15; MIN: 9.79 / MAX: 18.21)
  Ryzen 7 5800X3D: 7.31 (SE +/- 0.05, N = 15; MIN: 7.05 / MAX: 14.4)
  Ryzen 9 5950X: 12.92 (SE +/- 0.28, N = 4; MIN: 12.09 / MAX: 15.32)

NCNN 20210720 - Target: CPU - Model: vgg16 (ms, Fewer Is Better):
  Core i9 12900K: 28.24 (SE +/- 0.48, N = 15; MIN: 25.72 / MAX: 45.6)
  Ryzen 9 5900X: 50.75 (SE +/- 0.09, N = 3; MIN: 49.97 / MAX: 60.2)
  Ryzen 7 5800X: 55.97 (SE +/- 0.07, N = 15; MIN: 54.91 / MAX: 64.62)
  Ryzen 7 5800X3D: 42.60 (SE +/- 0.13, N = 15; MIN: 41.52 / MAX: 50.92)
  Ryzen 9 5950X: 56.55 (SE +/- 0.08, N = 4; MIN: 55.53 / MAX: 62.98)

NCNN 20210720 - Target: CPU - Model: resnet18 (ms, Fewer Is Better):
  Core i9 12900K: 9.66 (SE +/- 0.15, N = 15; MIN: 7.55 / MAX: 14.7)
  Ryzen 9 5900X: 12.50 (SE +/- 0.05, N = 3; MIN: 12.28 / MAX: 12.82)
  Ryzen 7 5800X: 13.02 (SE +/- 0.03, N = 15; MIN: 12.77 / MAX: 32.91)
  Ryzen 7 5800X3D: 10.27 (SE +/- 0.05, N = 15; MIN: 9.71 / MAX: 12.09)
  Ryzen 9 5950X: 14.28 (SE +/- 0.17, N = 4; MIN: 13.94 / MAX: 16.83)

NCNN 20210720 - Target: CPU - Model: alexnet (ms, Fewer Is Better):
  Core i9 12900K: 7.58 (SE +/- 0.05, N = 15; MIN: 7.18 / MAX: 9.34)
  Ryzen 9 5900X: 10.01 (SE +/- 0.01, N = 3; MIN: 9.92 / MAX: 12.23)
  Ryzen 7 5800X: 11.83 (SE +/- 0.01, N = 15; MIN: 11.63 / MAX: 18.46)
  Ryzen 7 5800X3D: 9.62 (SE +/- 0.03, N = 15; MIN: 8.9 / MAX: 11.14)
  Ryzen 9 5950X: 11.08 (SE +/- 0.13, N = 4; MIN: 10.7 / MAX: 12.6)

NCNN 20210720 - Target: CPU - Model: resnet50 (ms, Fewer Is Better):
  Core i9 12900K: 16.84 (SE +/- 0.07, N = 15; MIN: 16.32 / MAX: 21.73)
  Ryzen 9 5900X: 21.21 (SE +/- 0.04, N = 3; MIN: 20.94 / MAX: 23.21)
  Ryzen 7 5800X: 20.22 (SE +/- 0.07, N = 15; MIN: 19.74 / MAX: 28.22)
  Ryzen 7 5800X3D: 18.13 (SE +/- 0.09, N = 15; MIN: 17.64 / MAX: 24.72)
  Ryzen 9 5950X: 24.40 (SE +/- 0.34, N = 4; MIN: 23.71 / MAX: 27.09)

NCNN 20210720 - Target: CPU - Model: yolov4-tiny (ms, Fewer Is Better):
  Core i9 12900K: 15.86 (SE +/- 0.33, N = 15; MIN: 14.24 / MAX: 21)
  Ryzen 9 5900X: 20.63 (SE +/- 0.40, N = 3; MIN: 19.48 / MAX: 21.6)
  Ryzen 7 5800X: 19.78 (SE +/- 0.21, N = 15; MIN: 18.64 / MAX: 21.1)
  Ryzen 7 5800X3D: 14.80 (SE +/- 0.19, N = 15; MIN: 14 / MAX: 16.98)
  Ryzen 9 5950X: 20.49 (SE +/- 0.32, N = 4; MIN: 19.6 / MAX: 21.74)

NCNN 20210720 - Target: CPU - Model: squeezenet_ssd (ms, Fewer Is Better):
  Core i9 12900K: 13.27 (SE +/- 0.17, N = 15; MIN: 12.19 / MAX: 43.4)
  Ryzen 9 5900X: 13.51 (SE +/- 0.02, N = 3; MIN: 13.16 / MAX: 20.71)
  Ryzen 7 5800X: 16.76 (SE +/- 0.06, N = 15; MIN: 16.11 / MAX: 23.01)
  Ryzen 7 5800X3D: 12.44 (SE +/- 0.05, N = 15; MIN: 12.03 / MAX: 14.14)
  Ryzen 9 5950X: 14.55 (SE +/- 0.09, N = 4; MIN: 13.65 / MAX: 21.26)

NCNN 20210720 - Target: CPU - Model: regnety_400m (ms, Fewer Is Better):
  Core i9 12900K: 7.39 (SE +/- 0.22, N = 15; MIN: 6.17 / MAX: 27.59)
  Ryzen 9 5900X: 8.43 (SE +/- 0.01, N = 3; MIN: 8.36 / MAX: 8.75)
  Ryzen 7 5800X: 5.93 (SE +/- 0.01, N = 15; MIN: 5.83 / MAX: 7.68)
  Ryzen 7 5800X3D: 5.18 (SE +/- 0.01, N = 15; MIN: 5.07 / MAX: 12.17)
  Ryzen 9 5950X: 9.61 (SE +/- 0.08, N = 4; MIN: 9.37 / MAX: 11.03)

1. (CXX) g++ options (same for each NCNN result above): -O3 -rdynamic -lgomp -lpthread

TNN

TNN is an open-source deep learning reasoning framework developed by Tencent. Learn more via the OpenBenchmarking.org test page.

TNN 0.3 - Target: CPU - Model: DenseNet (ms, Fewer Is Better):
  Core i9 12900K: 1792.97 (SE +/- 1.88, N = 3; MIN: 1751.55 / MAX: 1868.31)
  Ryzen 9 5900X: 2563.76 (SE +/- 3.11, N = 3; MIN: 2481.34 / MAX: 2640.2)
  Ryzen 7 5800X: 3016.48 (SE +/- 0.87, N = 3; MIN: 2943.1 / MAX: 3087.97)
  Ryzen 7 5800X3D: 2609.84 (SE +/- 1.54, N = 3; MIN: 2559.5 / MAX: 2657.73)
  Ryzen 9 5950X: 2425.59 (SE +/- 10.08, N = 3; MIN: 2339.59 / MAX: 2518.62)

TNN 0.3 - Target: CPU - Model: MobileNet v2 (ms, Fewer Is Better):
  Core i9 12900K: 170.58 (SE +/- 0.31, N = 4; MIN: 157.87 / MAX: 209.34)
  Ryzen 9 5900X: 224.20 (SE +/- 0.47, N = 4; MIN: 218.68 / MAX: 249.25)
  Ryzen 7 5800X: 272.04 (SE +/- 0.18, N = 3; MIN: 270.94 / MAX: 276.3)
  Ryzen 7 5800X3D: 233.18 (SE +/- 0.14, N = 4; MIN: 232.19 / MAX: 237.22)
  Ryzen 9 5950X: 224.26 (SE +/- 0.71, N = 4; MIN: 219.36 / MAX: 242.57)

TNN 0.3 - Target: CPU - Model: SqueezeNet v1.1 (ms, Fewer Is Better):
  Core i9 12900K: 133.97 (SE +/- 0.06, N = 5; MIN: 133.46 / MAX: 134.81)
  Ryzen 9 5900X: 213.78 (SE +/- 1.66, N = 4; MIN: 210.63 / MAX: 219.97)
  Ryzen 7 5800X: 266.19 (SE +/- 0.18, N = 3; MIN: 265.87 / MAX: 266.68)
  Ryzen 7 5800X3D: 222.33 (SE +/- 0.03, N = 4; MIN: 222.15 / MAX: 222.66)
  Ryzen 9 5950X: 213.32 (SE +/- 1.16, N = 4; MIN: 209.34 / MAX: 215.21)

TNN 0.3 - Target: CPU - Model: SqueezeNet v2 (ms, Fewer Is Better):
  Core i9 12900K: 38.92 (SE +/- 0.08, N = 10; MIN: 38.37 / MAX: 39.92)
  Ryzen 9 5900X: 50.85 (SE +/- 0.22, N = 9; MIN: 49.98 / MAX: 52.59)
  Ryzen 7 5800X: 63.61 (SE +/- 0.12, N = 8; MIN: 62.79 / MAX: 64.28)
  Ryzen 7 5800X3D: 53.22 (SE +/- 0.13, N = 9; MIN: 52.43 / MAX: 54.16)
  Ryzen 9 5950X: 50.94 (SE +/- 0.08, N = 9; MIN: 50.34 / MAX: 52.46)

1. (CXX) g++ options (same for each TNN result above): -fopenmp -pthread -fvisibility=hidden -fvisibility=default -O3 -rdynamic -ldl

Mobile Neural Network

MNN (Mobile Neural Network) is a highly efficient, lightweight deep learning framework developed by Alibaba. Learn more via the OpenBenchmarking.org test page.

Mobile Neural Network 1.2 - Model: mobilenetV3 (ms, Fewer Is Better):
  Core i9 12900K: 1.174 (SE +/- 0.007, N = 3; MIN: 1.15 / MAX: 2.06)
  Ryzen 9 5900X: 1.838 (SE +/- 0.018, N = 3; MIN: 1.79 / MAX: 2.07)
  Ryzen 7 5800X: 1.156 (SE +/- 0.005, N = 3; MIN: 1.14 / MAX: 1.73)
  Ryzen 7 5800X3D: 1.082 (SE +/- 0.006, N = 3; MIN: 1.06 / MAX: 2.25)
  Ryzen 9 5950X: 2.391 (SE +/- 0.003, N = 3; MIN: 1.87 / MAX: 3.85)

Mobile Neural Network 1.2 - Model: squeezenetv1.1 (ms, Fewer Is Better):
  Core i9 12900K: 2.405 (SE +/- 0.049, N = 3; MIN: 2.33 / MAX: 3.56)
  Ryzen 9 5900X: 3.394 (SE +/- 0.098, N = 3; MIN: 3.15 / MAX: 4.21)
  Ryzen 7 5800X: 2.805 (SE +/- 0.017, N = 3; MIN: 2.76 / MAX: 10.42)
  Ryzen 7 5800X3D: 2.590 (SE +/- 0.006, N = 3; MIN: 2.55 / MAX: 4.48)
  Ryzen 9 5950X: 3.988 (SE +/- 0.098, N = 3; MIN: 3.72 / MAX: 4.79)

Mobile Neural Network 1.2 - Model: resnet-v2-50 (ms, Fewer Is Better):
  Core i9 12900K: 23.09 (SE +/- 1.14, N = 3; MIN: 21.7 / MAX: 30.17)
  Ryzen 9 5900X: 24.50 (SE +/- 0.14, N = 3; MIN: 23.86 / MAX: 52.44)
  Ryzen 7 5800X: 18.44 (SE +/- 0.03, N = 3; MIN: 18.25 / MAX: 25.86)
  Ryzen 7 5800X3D: 16.36 (SE +/- 0.10, N = 3; MIN: 16 / MAX: 24.17)
  Ryzen 9 5950X: 20.52 (SE +/- 0.14, N = 3; MIN: 19.84 / MAX: 24.05)

Mobile Neural Network 1.2 - Model: SqueezeNetV1.0 (ms, Fewer Is Better):
  Core i9 12900K: 4.151 (SE +/- 0.177, N = 3; MIN: 3.91 / MAX: 6.32)
  Ryzen 9 5900X: 4.758 (SE +/- 0.058, N = 3; MIN: 4.59 / MAX: 12.51)
  Ryzen 7 5800X: 4.536 (SE +/- 0.032, N = 3; MIN: 4.47 / MAX: 5.71)
  Ryzen 7 5800X3D: 4.213 (SE +/- 0.010, N = 3; MIN: 4.16 / MAX: 5.46)
  Ryzen 9 5950X: 5.231 (SE +/- 0.113, N = 3; MIN: 4.95 / MAX: 6.51)

Mobile Neural Network 1.2 - Model: MobileNetV2_224 (ms, Fewer Is Better):
  Core i9 12900K: 2.410 (SE +/- 0.020, N = 3; MIN: 2.36 / MAX: 3.86)
  Ryzen 9 5900X: 2.975 (SE +/- 0.045, N = 3; MIN: 2.88 / MAX: 3.44)
  Ryzen 7 5800X: 1.942 (SE +/- 0.029, N = 3; MIN: 1.9 / MAX: 3.6)
  Ryzen 7 5800X3D: 1.831 (SE +/- 0.011, N = 3; MIN: 1.79 / MAX: 2.92)
  Ryzen 9 5950X: 3.275 (SE +/- 0.014, N = 3; MIN: 3.21 / MAX: 10.86)

Mobile Neural Network 1.2 - Model: mobilenet-v1-1.0 (ms, Fewer Is Better):
  Core i9 12900K: 2.891 (SE +/- 0.008, N = 3; MIN: 2.85 / MAX: 8.54)
  Ryzen 9 5900X: 4.072 (SE +/- 0.037, N = 3; MIN: 3.95 / MAX: 4.31)
  Ryzen 7 5800X: 1.816 (SE +/- 0.011, N = 3; MIN: 1.79 / MAX: 3.08)
  Ryzen 7 5800X3D: 1.676 (SE +/- 0.010, N = 3; MIN: 1.63 / MAX: 2.97)
  Ryzen 9 5950X: 2.639 (SE +/- 0.067, N = 3; MIN: 2.52 / MAX: 11.27)

Mobile Neural Network 1.2 - Model: inception-v3 (ms, Fewer Is Better):
  Core i9 12900K: 24.30 (SE +/- 0.66, N = 3; MIN: 22.9 / MAX: 36.09)
  Ryzen 9 5900X: 23.69 (SE +/- 0.37, N = 3; MIN: 22.99 / MAX: 31.36)
  Ryzen 7 5800X: 25.69 (SE +/- 0.09, N = 3; MIN: 25.45 / MAX: 31.44)
  Ryzen 7 5800X3D: 23.19 (SE +/- 0.09, N = 3; MIN: 22.93 / MAX: 31)
  Ryzen 9 5950X: 25.73 (SE +/- 0.13, N = 3; MIN: 25.09 / MAX: 33.94)

1. (CXX) g++ options (same for each Mobile Neural Network result above): -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

ECP-CANDLE

The CANDLE benchmark codes implement deep learning architectures relevant to problems in cancer. These architectures address problems at different biological scales, specifically problems at the molecular, cellular and population scales. Learn more via the OpenBenchmarking.org test page.

ECP-CANDLE 0.4 - Benchmark: P1B2 (Seconds, Fewer Is Better):
  Core i9 12900K: 20.94
  Ryzen 9 5900X: 30.24
  Ryzen 7 5800X: 34.84
  Ryzen 7 5800X3D: 29.99
  Ryzen 9 5950X: 31.39

ECP-CANDLE 0.4 - Benchmark: P3B1 (Seconds, Fewer Is Better):
  Core i9 12900K: 429.61
  Ryzen 9 5900X: 1174.47
  Ryzen 7 5800X: 1158.67
  Ryzen 7 5800X3D: 1023.32
  Ryzen 9 5950X: 1309.59

ECP-CANDLE 0.4 - Benchmark: P3B2 (Seconds, Fewer Is Better):
  Core i9 12900K: 1503.26
  Ryzen 9 5900X: 667.28
  Ryzen 7 5800X: 390.27
  Ryzen 7 5800X3D: 553.75
  Ryzen 9 5950X: 654.35

Mlpack Benchmark

Mlpack benchmark scripts for machine learning libraries. Learn more via the OpenBenchmarking.org test page.

Mlpack Benchmark - Benchmark: scikit_svm (Seconds, Fewer Is Better):
  Core i9 12900K: 10.54 (SE +/- 0.01, N = 4)
  Ryzen 9 5900X: 16.15 (SE +/- 0.08, N = 3)
  Ryzen 7 5800X: 20.29 (SE +/- 0.01, N = 3)
  Ryzen 7 5800X3D: 16.17 (SE +/- 0.01, N = 3)
  Ryzen 9 5950X: 16.56 (SE +/- 0.03, N = 3)

Mlpack Benchmark - Benchmark: scikit_linearridgeregression (Seconds, Fewer Is Better):
  Core i9 12900K: 1.60 (SE +/- 0.05, N = 12)
  Ryzen 9 5900X: 1.58 (SE +/- 0.00, N = 3)
  Ryzen 7 5800X: 1.82 (SE +/- 0.02, N = 7)
  Ryzen 7 5800X3D: 1.63 (SE +/- 0.00, N = 3)
  Ryzen 9 5950X: 1.69 (SE +/- 0.01, N = 3)

Mlpack Benchmark - Benchmark: scikit_qda (Seconds, Fewer Is Better):
  Core i9 12900K: 27.31 (SE +/- 0.13, N = 3)
  Ryzen 9 5900X: 51.39 (SE +/- 0.34, N = 3)
  Ryzen 7 5800X: 52.57 (SE +/- 0.10, N = 3)
  Ryzen 7 5800X3D: 38.27 (SE +/- 0.08, N = 3)
  Ryzen 9 5950X: 55.06 (SE +/- 0.20, N = 3)

Mlpack Benchmark - Benchmark: scikit_ica (Seconds, Fewer Is Better):
  Core i9 12900K: 32.93 (SE +/- 0.02, N = 3)
  Ryzen 9 5900X: 39.37 (SE +/- 0.24, N = 3)
  Ryzen 7 5800X: 39.16 (SE +/- 0.05, N = 3)
  Ryzen 7 5800X3D: 34.88 (SE +/- 0.03, N = 3)
  Ryzen 9 5950X: 41.77 (SE +/- 0.12, N = 3)

OpenFOAM

OpenFOAM is the leading free, open source software for computational fluid dynamics (CFD). Learn more via the OpenBenchmarking.org test page.

OpenFOAM 8 - Input: Motorbike 30M (Seconds, Fewer Is Better):
  Core i9 12900K: 84.47 (SE +/- 0.18, N = 3)
  Ryzen 9 5900X: 96.16 (SE +/- 0.15, N = 3)
  Ryzen 7 5800X: 177.60 (SE +/- 0.25, N = 3)
  Ryzen 7 5800X3D: 80.29 (SE +/- 0.32, N = 3)
  Ryzen 9 5950X: 98.44 (SE +/- 0.24, N = 3)

OpenFOAM 8 - Input: Motorbike 60M (Seconds, Fewer Is Better):
  Core i9 12900K: 487.80 (SE +/- 0.14, N = 3)
  Ryzen 9 5900X: 1277.55 (SE +/- 0.77, N = 3)
  Ryzen 7 5800X: 1270.72 (SE +/- 0.23, N = 3)
  Ryzen 7 5800X3D: 1090.21 (SE +/- 1.12, N = 3)
  Ryzen 9 5950X: 1382.54 (SE +/- 0.26, N = 3)

1. (CXX) g++ options (same for both OpenFOAM results above): -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -ldynamicMesh -lgenericPatchFields -lOpenFOAM -ldl -lm
Additional per-system note shown on each graph (the system it belongs to is not identifiable from this extraction): -lfoamToVTK -llagrangian -lfileFormats

Xcompact3d Incompact3d

Xcompact3d Incompact3d is a Fortran-MPI based, finite difference high-performance code for solving the incompressible Navier-Stokes equation along with as many scalar transport equations as needed. Learn more via the OpenBenchmarking.org test page.

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 129 Cells Per Direction (Seconds, Fewer Is Better):
  Core i9 12900K: 14.55 (SE +/- 0.01, N = 4)
  Ryzen 9 5900X: 32.15 (SE +/- 0.32, N = 3)
  Ryzen 7 5800X: 37.21 (SE +/- 0.02, N = 3)
  Ryzen 7 5800X3D: 27.18 (SE +/- 0.01, N = 3)
  Ryzen 9 5950X: 33.95 (SE +/- 0.05, N = 3)

Xcompact3d Incompact3d 2021-03-11 - Input: input.i3d 193 Cells Per Direction (Seconds, Fewer Is Better):
  Core i9 12900K: 55.61 (SE +/- 0.58, N = 3)
  Ryzen 9 5900X: 144.47 (SE +/- 0.05, N = 3)
  Ryzen 7 5800X: 141.21 (SE +/- 1.17, N = 9)
  Ryzen 7 5800X3D: 126.86 (SE +/- 0.11, N = 3)
  Ryzen 9 5950X: 156.16 (SE +/- 0.16, N = 3)

1. (F9X) gfortran options (same for both Xcompact3d results above): -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Open Porous Media Git

This is a test of Open Porous Media, a set of open-source tools concerning simulation of flow and transport of fluids in porous media. This test profile builds OPM and its dependencies from upstream Git. Learn more via the OpenBenchmarking.org test page.

Open Porous Media Git - OPM Benchmark: Flow MPI Norne - Threads: 1 (Seconds, Fewer Is Better):
  Core i9 12900K: 247.89 (SE +/- 0.07, N = 3)
  Ryzen 9 5900X: 273.89 (SE +/- 0.23, N = 3)
  Ryzen 7 5800X: 281.47 (SE +/- 0.17, N = 3)
  Ryzen 7 5800X3D: 224.10 (SE +/- 0.10, N = 3)
  Ryzen 9 5950X: 270.81 (SE +/- 1.36, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne - Threads: 2 (Seconds, Fewer Is Better):
  Core i9 12900K: 223.20 (SE +/- 0.42, N = 3)
  Ryzen 9 5900X: 203.98 (SE +/- 0.16, N = 3)
  Ryzen 7 5800X: 234.63 (SE +/- 0.53, N = 3)
  Ryzen 7 5800X3D: 187.49 (SE +/- 0.14, N = 3)
  Ryzen 9 5950X: 203.89 (SE +/- 0.42, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne - Threads: 4 (Seconds, Fewer Is Better):
  Core i9 12900K: 280.85 (SE +/- 0.28, N = 3)
  Ryzen 9 5900X: 233.63 (SE +/- 0.18, N = 3)
  Ryzen 7 5800X: 275.40 (SE +/- 0.16, N = 3)
  Ryzen 7 5800X3D: 223.97 (SE +/- 0.14, N = 3)
  Ryzen 9 5950X: 232.61 (SE +/- 0.31, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne - Threads: 8 (Seconds, Fewer Is Better):
  Core i9 12900K: 515.71 (SE +/- 0.26, N = 3)
  Ryzen 9 5900X: 361.84 (SE +/- 0.23, N = 3)
  Ryzen 7 5800X: 447.63 (SE +/- 0.04, N = 3)
  Ryzen 7 5800X3D: 361.91 (SE +/- 0.25, N = 3)
  Ryzen 9 5950X: 357.86 (SE +/- 0.16, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 1 (Seconds, Fewer Is Better):
  Core i9 12900K: 537.38 (SE +/- 1.75, N = 3)
  Ryzen 9 5900X: 572.60 (SE +/- 0.96, N = 3)
  Ryzen 7 5800X: 601.31 (SE +/- 1.44, N = 3)
  Ryzen 7 5800X3D: 476.91 (SE +/- 0.27, N = 3)
  Ryzen 9 5950X: 569.14 (SE +/- 2.73, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 2 (Seconds, Fewer Is Better):
  Core i9 12900K: 475.95 (SE +/- 0.98, N = 3)
  Ryzen 9 5900X: 441.14 (SE +/- 0.35, N = 3)
  Ryzen 7 5800X: 511.31 (SE +/- 0.29, N = 3)
  Ryzen 7 5800X3D: 403.35 (SE +/- 0.45, N = 3)
  Ryzen 9 5950X: 441.07 (SE +/- 0.66, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 4 (Seconds, Fewer Is Better):
  Core i9 12900K: 451.00 (SE +/- 0.53, N = 3)
  Ryzen 9 5900X: 378.33 (SE +/- 0.78, N = 3)
  Ryzen 7 5800X: 454.34 (SE +/- 0.49, N = 3)
  Ryzen 7 5800X3D: 357.33 (SE +/- 0.21, N = 3)
  Ryzen 9 5950X: 377.03 (SE +/- 0.39, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Norne-4C MSW - Threads: 8 (Seconds, Fewer Is Better):
  Core i9 12900K: 825.56 (SE +/- 0.65, N = 3)
  Ryzen 9 5900X: 581.35 (SE +/- 0.16, N = 3)
  Ryzen 7 5800X: 733.91 (SE +/- 0.32, N = 3)
  Ryzen 7 5800X3D: 581.46 (SE +/- 0.28, N = 3)
  Ryzen 9 5950X: 575.13 (SE +/- 0.32, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Extra - Threads: 1 (Seconds, Fewer Is Better):
  Core i9 12900K: 1053.38 (SE +/- 2.12, N = 3)
  Ryzen 9 5900X: 1073.71 (SE +/- 4.75, N = 3)
  Ryzen 7 5800X: 1154.09 (SE +/- 5.46, N = 3)
  Ryzen 7 5800X3D: 997.34 (SE +/- 2.03, N = 3)
  Ryzen 9 5950X: 1074.33 (SE +/- 3.75, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Extra - Threads: 2 (Seconds, Fewer Is Better):
  Core i9 12900K: 717.04 (SE +/- 0.67, N = 3)
  Ryzen 9 5900X: 713.34 (SE +/- 1.56, N = 3)
  Ryzen 7 5800X: 799.40 (SE +/- 2.38, N = 3)
  Ryzen 7 5800X3D: 672.48 (SE +/- 0.69, N = 3)
  Ryzen 9 5950X: 712.62 (SE +/- 1.57, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Extra - Threads: 4 (Seconds, Fewer Is Better):
  Core i9 12900K: 671.56 (SE +/- 0.81, N = 3)
  Ryzen 9 5900X: 691.38 (SE +/- 0.23, N = 3)
  Ryzen 7 5800X: 829.49 (SE +/- 0.80, N = 3)
  Ryzen 7 5800X3D: 643.96 (SE +/- 0.34, N = 3)
  Ryzen 9 5950X: 690.62 (SE +/- 0.65, N = 3)

Open Porous Media Git - OPM Benchmark: Flow MPI Extra - Threads: 8 (Seconds, Fewer Is Better):
  Core i9 12900K: 891.78 (SE +/- 0.86, N = 3)
  Ryzen 9 5900X: 810.33 (SE +/- 0.19, N = 3)
  Ryzen 7 5800X: 1027.17 (SE +/- 0.32, N = 3)
  Ryzen 7 5800X3D: 781.98 (SE +/- 1.20, N = 3)
  Ryzen 9 5950X: 808.05 (SE +/- 0.30, N = 3)

Notes (same for each Open Porous Media result above):
  1. (CXX) g++ options: -pipe -pthread -fopenmp -O3 -mtune=native -UNDEBUG -lm -ldl -lrt
  2. Core i9 12900K: Build Time Thu Apr 28 06:45:36 PM EDT 2022
  3. Ryzen 9 5900X: Build Time Mon Apr 25 06:10:54 PM EDT 2022
  4. Ryzen 7 5800X: Build Time Mon Apr 25 06:10:54 PM EDT 2022
  5. Ryzen 7 5800X3D: Build Time Mon Apr 25 06:10:54 PM EDT 2022
  6. Ryzen 9 5950X: Build Time Mon Apr 25 06:10:54 PM EDT 2022
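
The Flow MPI Norne times above do not fall steadily as the thread count grows, so one way to read them is as speedup and parallel efficiency relative to the single-thread run. The following is a minimal sketch using the Ryzen 7 5800X3D Norne times; the helper name `scaling` and the dictionary layout are illustrative assumptions and not part of the Phoronix Test Suite or OPM.

```python
# Flow MPI Norne run times in seconds (fewer is better) for the
# Ryzen 7 5800X3D, keyed by the "Threads" value of each result above.
norne_5800x3d = {1: 224.10, 2: 187.49, 4: 223.97, 8: 361.91}


def scaling(times):
    """Return {threads: (speedup, efficiency)} relative to the 1-thread run.

    Hypothetical helper for illustration only.
    """
    base = times[1]
    result = {}
    for threads, seconds in sorted(times.items()):
        speedup = base / seconds        # > 1 means faster than the 1-thread run
        efficiency = speedup / threads  # 1.0 would be ideal linear scaling
        result[threads] = (speedup, efficiency)
    return result


if __name__ == "__main__":
    for threads, (speedup, efficiency) in scaling(norne_5800x3d).items():
        print(f"{threads} threads: speedup {speedup:.2f}x, efficiency {efficiency:.2f}")
```

A speedup below 1 at a given thread count means that configuration ran slower than the single-thread run on the same system.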

WebP2 Image Encode

This is a test of Google's libwebp2 library with the WebP2 image encode utility, using a sample 6000x4000 pixel JPEG image as the input, similar to the WebP/libwebp test profile. WebP2 is currently experimental and under heavy development as the eventual successor to WebP. Compared to WebP, WebP2 supports 10-bit HDR, more efficient lossy compression, improved lossless compression, animation support, and full multi-threading support. Learn more via the OpenBenchmarking.org test page.

WebP2 Image Encode 20220422 - Encode Settings: Default (Seconds, Fewer Is Better):
  Core i9 12900K: 2.062 (SE +/- 0.010, N = 11)
  Ryzen 9 5900X: 2.439 (SE +/- 0.008, N = 10)
  Ryzen 7 5800X: 3.619 (SE +/- 0.006, N = 8)
  Ryzen 7 5800X3D: 3.170 (SE +/- 0.007, N = 9)
  Ryzen 9 5950X: 2.165 (SE +/- 0.013, N = 10)

WebP2 Image Encode 20220422 - Encode Settings: Quality 75, Compression Effort 7 (Seconds, Fewer Is Better):
  Core i9 12900K: 101.57 (SE +/- 0.23, N = 3)
  Ryzen 9 5900X: 129.73 (SE +/- 0.51, N = 3)
  Ryzen 7 5800X: 199.65 (SE +/- 0.74, N = 3)
  Ryzen 7 5800X3D: 178.01 (SE +/- 0.62, N = 3)
  Ryzen 9 5950X: 110.28 (SE +/- 1.48, N = 3)

WebP2 Image Encode 20220422 - Encode Settings: Quality 95, Compression Effort 7 (Seconds, Fewer Is Better):
  Core i9 12900K: 215.58 (SE +/- 0.51, N = 3)
  Ryzen 9 5900X: 268.61 (SE +/- 0.76, N = 3)
  Ryzen 7 5800X: 420.03 (SE +/- 1.68, N = 3)
  Ryzen 7 5800X3D: 376.96 (SE +/- 0.77, N = 3)
  Ryzen 9 5950X: 234.93 (SE +/- 1.17, N = 3)

WebP2 Image Encode 20220422 - Encode Settings: Quality 100, Compression Effort 5 (Seconds, Fewer Is Better):
  Core i9 12900K: 2.936 (SE +/- 0.003, N = 9)
  Ryzen 9 5900X: 3.709 (SE +/- 0.003, N = 8)
  Ryzen 7 5800X: 5.875 (SE +/- 0.004, N = 7)
  Ryzen 7 5800X3D: 5.169 (SE +/- 0.003, N = 7)
  Ryzen 9 5950X: 3.249 (SE +/- 0.006, N = 9)

WebP2 Image Encode 20220422 - Encode Settings: Quality 100, Lossless Compression (Seconds, Fewer Is Better):
  Core i9 12900K: 476.45 (SE +/- 0.96, N = 3)
  Ryzen 9 5900X: 614.81 (SE +/- 4.10, N = 3)
  Ryzen 7 5800X: 974.59 (SE +/- 1.21, N = 3)
  Ryzen 7 5800X3D: 864.94 (SE +/- 2.17, N = 3)
  Ryzen 9 5950X: 536.77 (SE +/- 2.59, N = 3)

1. (CXX) g++ options (same for each WebP2 result above): -msse4.2 -fno-rtti -O3 -ldl
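
For time-based results like these, a per-test speedup is the baseline time divided by the candidate time, and a geometric mean summarizes the ratios across tests. The sketch below applies that to the WebP2 encode times above for the Ryzen 7 5800X (baseline) and Ryzen 7 5800X3D; the function names `speedups` and `geometric_mean` are illustrative assumptions, and this is not necessarily how OpenBenchmarking.org computes its own geometric mean view.

```python
from math import prod

# WebP2 encode times in seconds (fewer is better), copied from the results above.
# Order: Default; Quality 75, Effort 7; Quality 95, Effort 7;
#        Quality 100, Effort 5; Quality 100, Lossless.
ryzen_5800x = [3.619, 199.65, 420.03, 5.875, 974.59]
ryzen_5800x3d = [3.170, 178.01, 376.96, 5.169, 864.94]


def speedups(baseline, candidate):
    """Per-test speedup of candidate over baseline for fewer-is-better times."""
    return [b / c for b, c in zip(baseline, candidate)]


def geometric_mean(values):
    """Geometric mean of positive values (hypothetical helper for illustration)."""
    return prod(values) ** (1.0 / len(values))


if __name__ == "__main__":
    ratios = speedups(ryzen_5800x, ryzen_5800x3d)
    for ratio in ratios:
        print(f"speedup: {ratio:.3f}x")
    print(f"geometric mean speedup: {geometric_mean(ratios):.3f}x")
```

The geometric mean is used rather than the arithmetic mean so that tests with very different absolute run times (a few seconds versus several hundred) contribute equally to the summary.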

96 Results Shown

ONNX Runtime:
  yolov4 - CPU - Standard
  yolov4 - CPU - Parallel
  fcn-resnet101-11 - CPU - Standard
  fcn-resnet101-11 - CPU - Parallel
  super-resolution-10 - CPU - Standard
  super-resolution-10 - CPU - Parallel
  bertsquad-12 - CPU - Standard
  bertsquad-12 - CPU - Parallel
  GPT-2 - CPU - Standard
  GPT-2 - CPU - Parallel
  ArcFace ResNet-100 - CPU - Standard
  ArcFace ResNet-100 - CPU - Parallel
ASKAP:
  Hogbom Clean OpenMP
  tConvolve OpenMP - Gridding
  tConvolve OpenMP - Degridding
  tConvolve MT - Gridding
  tConvolve MT - Degridding
  tConvolve MPI - Degridding
  tConvolve MPI - Gridding
LeelaChessZero:
  BLAS
  Eigen
Numpy Benchmark
Caffe:
  AlexNet - CPU - 100
  AlexNet - CPU - 200
  GoogleNet - CPU - 100
  GoogleNet - CPU - 200
oneDNN:
  Convolution Batch Shapes Auto - f32 - CPU
  Convolution Batch Shapes Auto - u8s8f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_1d - u8s8f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Deconvolution Batch shapes_3d - u8s8f32 - CPU
  IP Shapes 1D - f32 - CPU
  IP Shapes 1D - u8s8f32 - CPU
  IP Shapes 3D - f32 - CPU
  IP Shapes 3D - u8s8f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Training - u8s8f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - f32 - CPU
  Recurrent Neural Network Inference - u8s8f32 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
NCNN:
  CPU - mobilenet
  CPU-v2-v2 - mobilenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU - shufflenet-v2
  CPU - mnasnet
  CPU - efficientnet-b0
  CPU - blazeface
  CPU - googlenet
  CPU - vgg16
  CPU - resnet18
  CPU - alexnet
  CPU - resnet50
  CPU - yolov4-tiny
  CPU - squeezenet_ssd
  CPU - regnety_400m
TNN:
  CPU - DenseNet
  CPU - MobileNet v2
  CPU - SqueezeNet v1.1
  CPU - SqueezeNet v2
Mobile Neural Network:
  mobilenetV3
  squeezenetv1.1
  resnet-v2-50
  SqueezeNetV1.0
  MobileNetV2_224
  mobilenet-v1-1.0
  inception-v3
ECP-CANDLE:
  P1B2
  P3B1
  P3B2
Mlpack Benchmark:
  scikit_svm
  scikit_linearridgeregression
  scikit_qda
  scikit_ica
OpenFOAM:
  Motorbike 30M
  Motorbike 60M
Xcompact3d Incompact3d:
  input.i3d 129 Cells Per Direction
  input.i3d 193 Cells Per Direction
Open Porous Media Git:
  Flow MPI Norne - 1
  Flow MPI Norne - 2
  Flow MPI Norne - 4
  Flow MPI Norne - 8
  Flow MPI Norne-4C MSW - 1
  Flow MPI Norne-4C MSW - 2
  Flow MPI Norne-4C MSW - 4
  Flow MPI Norne-4C MSW - 8
  Flow MPI Extra - 1
  Flow MPI Extra - 2
  Flow MPI Extra - 4
  Flow MPI Extra - 8
WebP2 Image Encode:
  Default
  Quality 75, Compression Effort 7
  Quality 95, Compression Effort 7
  Quality 100, Compression Effort 5
  Quality 100, Lossless Compression