Intel Optimized Power Mode Xeon Platinum Benchmarks

2 x INTEL XEON PLATINUM 8592 testing by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2312153-NE-XEONEMRPO30
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

C++ Boost Tests 3 Tests
Timed Code Compilation 3 Tests
C/C++ Compiler Tests 9 Tests
Compression Tests 2 Tests
CPU Massive 12 Tests
Creator Workloads 9 Tests
Database Test Suite 4 Tests
Encoding 6 Tests
Game Development 2 Tests
HPC - High Performance Computing 6 Tests
Java Tests 2 Tests
Common Kernel Benchmarks 2 Tests
Machine Learning 3 Tests
Multi-Core 15 Tests
Intel oneAPI 2 Tests
OpenMPI Tests 3 Tests
Programmer / Developer System Benchmarks 4 Tests
Python Tests 7 Tests
Scientific Computing 2 Tests
Server 8 Tests
Server CPU Tests 9 Tests
Single-Threaded 2 Tests
Video Encoding 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
Default
December 14 2023
  16 Hours, 25 Minutes
Optimized Power Mode
December 15 2023
  1 Day, 52 Minutes
Invert Hiding All Results Option
  20 Hours, 38 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


Intel Optimized Power Mode Xeon Platinum BenchmarksOpenBenchmarking.orgPhoronix Test Suite2 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads)Quanta Cloud S6Q-MB-MPS (3B05.TEL4P1 BIOS)Intel Device 1bce1008GB3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Intel X710 for 10GBASE-TUbuntu 23.106.5.0-13-generic (x86_64)GCC 13.2.0ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelCompilerFile-SystemScreen ResolutionIntel Optimized Power Mode Xeon Platinum Benchmarks PerformanceSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x21000161- OpenJDK Runtime Environment (build 11.0.21+9-post-Ubuntu-0ubuntu123.10)- Python 3.11.6- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected

Default vs. Optimized Power Mode ComparisonPhoronix Test SuiteBaseline+13.5%+13.5%+27%+27%+40.5%+40.5%+54%+54%15%10.7%8.4%4.6%4.2%3.7%2.7%2.5%1:1054.1%CPU - 256 - ResNet-5041.2%CPU - 256 - ResNet-15239.3%CPU - 64 - ResNet-15238.5%50035.1%CPU - 64 - ResNet-5033.7%motorBike - Execution Time33.3%100031.7%GET - 5024.5%12 - Compression Speed22.6%CPU - 64 - Efficientnet_v2_l20.2%GhostRider - 1M19.4%Bosphorus 4K - Faster19.1%Bosphorus 4K - Ultra Fast18.3%CPU - 256 - Efficientnet_v2_l18.3%19 - Compression Speed17.9%10 - G.M.O.A.Q16.9%e.G.B.S - 120016.5%19, Long Mode - Compression Speed15.7%Barbershop - CPU-OnlyPreset 12 - Bosphorus 1080p14.4%e.G.B.S - 240014.4%IMDB14.2%Bosphorus 4K - Super Fast14.2%Bosphorus 4K - Very Fast14.2%Create - 100 - 10000014%50012.4%19 - D.S12.3%Preset 8 - Bosphorus 1080p12.3%Preset 8 - Bosphorus 4K12.2%libx265 - Live11.5%Bosphorus 1080p - Faster10.9%12 - D.S10.9%1B32 - 256 - 5710.5%Bosphorus 1080p - Fast10.3%VMAF Optimized - Bosphorus 1080p10.1%64 - 256 - 579.9%Preset 12 - Bosphorus 4K9.9%Preset 4 - Bosphorus 4K9.8%Preset 13 - Bosphorus 4K9.6%Preset 13 - Bosphorus 1080p9.4%R.O.R.S.I9.4%Bosphorus 4K - Ultra Fast9.4%Bosphorus 1080p - Very Fast9.2%Bosphorus 4K - Fast9.2%Bosphorus 1080p - Ultra Fast8.7%Bosphorus 1080p - Super Fast8.6%d.S.M.S - Mesh Time19, Long Mode - D.S8.2%P.S.O - Bosphorus 1080p8%Time To Compile7.5%100 - 1000 - Read Write - Average Latency7.2%100 - 1000 - Read Write7.1%V.Q.O - Bosphorus 1080p6.9%Bosphorus 1080p - Ultra Fast6.9%Bosphorus 4K - Very Fast6.8%Preset 4 - Bosphorus 1080p6.5%10005.9%Unix Makefiles5.9%TPC-H Parquet5.8%Bosphorus 4K - Super Fast5.7%Compression Rating5.4%libx265 - Platform5.2%1:1005.1%Bosphorus 1080p - Super Fast5%128 - 256 - 575%Bosphorus 1080p - Very Fast4.8%SET - 5004.8%R.5.S.I - A.M.S4.7%R.5.S.I - A.M.S4.7%Classroom - CPU-Onlylibx265 - Video On Demand4.2%RT.ldr_alb_nrm.3840x2160 - CPU-Onlylibx265 - Upload3.8%128 - 256 - 32A.G.R.R.0.F.I - CPU3.6%32 - 256 - 5123.4%256 - 256 - 573.2%defconfig3%GET - 5002.8%128 - 256 - 512A.G.R.R.0.F.I - CPU2.5%Bumper BeamNinja2.5%10 - Q0119%10 - Q0240.2%10 - Q0319.3%10 - Q0413.7%10 - Q0539.4%10 - Q0626.8%10 - Q0722.5%10 - Q0830.5%10 - Q0923.3%10 - Q1018.4%10 - Q1133.9%10 - Q1226.8%10 - Q1315.8%10 - Q1413.3%10 - Q158.7%10 - Q1613.8%10 - Q173.4%10 - Q1810.1%10 - Q1912.6%10 - Q2019.6%10 - Q2117.7%10 - Q2210.5%MemcachedPyTorchPyTorchPyTorchnginxPyTorchOpenFOAMnginxRedisZstd CompressionPyTorchXmrigVVenCuvg266PyTorchZstd CompressionApache Spark TPC-HeasyWaveZstd CompressionBlenderSVT-AV1easyWaveDuckDBuvg266uvg266Apache HadoopApache HTTP ServerZstd CompressionSVT-AV1SVT-AV1FFmpegVVenCZstd CompressionY-CruncherLiquid-DSPVVenCSVT-VP9Liquid-DSPSVT-AV1SVT-AV1SVT-AV1SVT-AV1OpenRadiossKvazaaruvg266VVenCuvg266uvg266OpenFOAMZstd CompressionSVT-VP9Timed GCC CompilationPostgreSQLPostgreSQLSVT-VP9KvazaarKvazaarSVT-AV1Apache HTTP ServerTimed LLVM CompilationDuckDBKvazaar7-Zip CompressionFFmpegMemcachedKvazaarLiquid-DSPKvazaarRedisNeural Magic DeepSparseNeural Magic DeepSparseBlenderFFmpegIntel Open Image DenoiseFFmpegLiquid-DSPOpenVINOLiquid-DSPLiquid-DSPTimed Linux Kernel CompilationRedisLiquid-DSPOpenVINOOpenRadiossTimed LLVM CompilationApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HApache Spark TPC-HDefaultOptimized Power Mode

Intel Optimized Power Mode Xeon Platinum Benchmarksxmrig: KawPow - 1Mxmrig: Monero - 1Mxmrig: Wownero - 1Mxmrig: GhostRider - 1Mxmrig: CryptoNight-Heavy - 1Mxmrig: CryptoNight-Femto UPX2 - 1Mquantlib: Multi-Threadedquantlib: Single-Threadedopenradioss: Bumper Beamopenradioss: Chrysler Neon 1Mopenradioss: Cell Phone Drop Testopenradioss: Bird Strike on Windshieldopenradioss: Rubber O-Ring Seal Installationopenradioss: INIVOL and Fluid Structure Interaction Drop Containerdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streampytorch: CPU - 64 - ResNet-50pytorch: CPU - 256 - ResNet-50pytorch: CPU - 64 - ResNet-152pytorch: CPU - 256 - ResNet-152pytorch: CPU - 64 - Efficientnet_v2_lpytorch: CPU - 256 - Efficientnet_v2_lopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUy-cruncher: 1Bopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenfoam: motorBike - Execution Timeopenfoam: drivaerFastback, Small Mesh Size - Mesh Timeopenfoam: drivaerFastback, Small Mesh Size - Execution Timeqmcpack: Li2_STO_aecompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingbuild-llvm: Ninjabuild-llvm: Unix Makefilescompress-zstd: 12 - Compression Speedcompress-zstd: 12 - Decompression Speedcompress-zstd: 19 - Compression Speedcompress-zstd: 19 - Decompression Speedcompress-zstd: 19, Long Mode - Compression Speedcompress-zstd: 19, Long Mode - Decompression Speedbuild-gcc: Time To Compilebuild-linux-kernel: defconfigbuild-linux-kernel: allmodconfigkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 1080p - Slowkvazaar: Bosphorus 1080p - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Super Fastkvazaar: Bosphorus 4K - Ultra Fastkvazaar: Bosphorus 1080p - Very Fastkvazaar: Bosphorus 1080p - Super Fastkvazaar: Bosphorus 1080p - Ultra Fastsvt-vp9: VMAF Optimized - Bosphorus 1080psvt-vp9: PSNR/SSIM Optimized - Bosphorus 1080psvt-vp9: Visual Quality Optimized - Bosphorus 1080psvt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 4 - Bosphorus 1080psvt-av1: Preset 8 - Bosphorus 1080psvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080pblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Barbershop - CPU-Onlyffmpeg: libx265 - Liveffmpeg: libx265 - Uploadffmpeg: libx265 - Platformffmpeg: libx265 - Video On Demanduvg266: Bosphorus 4K - Slowuvg266: Bosphorus 4K - Mediumuvg266: Bosphorus 1080p - Slowuvg266: Bosphorus 1080p - Mediumuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastuvg266: Bosphorus 1080p - Very Fastuvg266: Bosphorus 1080p - Super Fastuvg266: Bosphorus 1080p - Ultra Fastvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fastervvenc: Bosphorus 1080p - Fastvvenc: Bosphorus 1080p - Fasteroidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyeasywave: e2Asean Grid + BengkuluSept2007 Source - 2400easywave: e2Asean Grid + BengkuluSept2007 Source - 1200liquid-dsp: 32 - 256 - 32liquid-dsp: 32 - 256 - 57liquid-dsp: 64 - 256 - 32liquid-dsp: 64 - 256 - 57liquid-dsp: 128 - 256 - 32liquid-dsp: 128 - 256 - 57liquid-dsp: 256 - 256 - 32liquid-dsp: 256 - 256 - 57liquid-dsp: 32 - 256 - 512liquid-dsp: 64 - 256 - 512liquid-dsp: 128 - 256 - 512liquid-dsp: 256 - 256 - 512spark-tpch: 10 - Geometric Mean Of All Queriesduckdb: IMDBduckdb: TPC-H Parquetnginx: 500nginx: 1000apache: 500apache: 1000hadoop: Create - 100 - 100000memcached: 1:10memcached: 1:100redis: GET - 50redis: SET - 50redis: GET - 500redis: SET - 500pgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencypgbench: 100 - 1000 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencyspark-tpch: 10 - Q01spark-tpch: 10 - Q02spark-tpch: 10 - Q03spark-tpch: 10 - Q04spark-tpch: 10 - Q05spark-tpch: 10 - Q06spark-tpch: 10 - Q07spark-tpch: 10 - Q08spark-tpch: 10 - Q09spark-tpch: 10 - Q10spark-tpch: 10 - Q11spark-tpch: 10 - Q12spark-tpch: 10 - Q13spark-tpch: 10 - Q14spark-tpch: 10 - Q15spark-tpch: 10 - Q16spark-tpch: 10 - Q17spark-tpch: 10 - Q18spark-tpch: 10 - Q19spark-tpch: 10 - Q20spark-tpch: 10 - Q21spark-tpch: 10 - Q22DefaultOptimized Power Mode70383.270469.875002.316522.570573.069677.8388447.23650.584.7885.8225.74111.2381.6997.87137.3439463.08544580.631813.94511778.594535.931411244.79415.6732834.176776.5445160.4064397.77311791.273735.6695862.138274.07221235.406351.7033182.1591350.90081899.222333.6385137.3540462.993243.9144.5817.6717.852.622.59748.1042.72539.69236.708930.0014.3224795.175.1995.152390.2553.511106.3228.8548795.972.4510205.6112.513330.1938.41121577.550.404.1328830.0422323.46982294.38468951663731196.518177.431333.61422.719.11173.59.901183.8706.25923.756149.79241.4541.99127.25131.1469.3671.0577.53268.49263.88282.05542.98542.17463.277.29769.466218.452218.76320.952142.265502.707628.75512.8934.94149.49131.8127.0053.9853.6327.2029.7980.3088.4656.0657.8759.59160.63158.23166.476.92110.83619.91532.0914.5389.98436.546121176666714291000002476033333281016666736118666674329866667618560000058355000005082450001012336667147390000021445666678.35919683121.967148.55273684.71243384.1990823.0680826.7154903360511.123441439.714744838.93060186.423240001.582503177.678930891.1246396215.6348.841855697.6981140812.590769778.1769722312.944283492.5661711711.4746182812.0530392317.5612882011.879122105.997661757.933933265.644020565.778444924.245615804.9724410412.6007382112.894165685.094584628.5617198926.481879554.4956595169196.770298.875701.713832.470338.669847.9391468.43641.482.7386.3025.37111.7389.3798.49138.0114461.14694590.337513.90951768.910836.128610738.13525.9385834.270676.5042162.5895392.45151776.525235.9561863.347473.98961237.025751.6176183.7104346.74111883.657333.9032138.1497460.685132.8331.5812.7612.812.182.19750.4342.59538.20237.368943.8814.3024922.404.6975.132392.3853.451100.7628.9949494.062.4210168.0612.553297.8138.79117369.550.415.5089727.7087323.34883392.67265445963813698.905187.837272.21282.716.21044.68.561094.2759.21624.476151.27641.1341.69125.18130.6464.9767.2170.87256.19251.33263.96493.26502.05433.536.64561.914198.742199.54919.669126.697439.477574.59912.6733.40129.94118.2426.0251.3151.4526.7129.6278.9386.7849.1050.6850.36147.08145.75153.176.3409.09618.05328.9264.72102.92842.55912045000001293333333246086666725565733333745233333412500000063014833335656600000491426667999663333151346666721410000009.77304127139.326157.101202507.42184861.9980817.7376315.6548142180648.213274425.323811819.503050498.003152359.752389636.678784681.1415969516.75210.5253196410.7942171115.018873089.2947692918.041547783.2531764214.0577049315.7232208221.6599576114.062806408.0304369910.057694166.534819476.548335144.614900045.6564852813.0284314914.199875835.7373820010.2426731931.161191124.96957016OpenBenchmarking.org

Xmrig

Xmrig is an open-source cross-platform CPU/GPU miner for RandomX, KawPow, CryptoNight and AstroBWT. This test profile is setup to measure the Xmrig CPU mining performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: KawPow - Hash Count: 1MOptimized Power ModeDefault15K30K45K60K75KSE +/- 943.50, N = 3SE +/- 82.14, N = 469196.770383.21. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Monero - Hash Count: 1MOptimized Power ModeDefault15K30K45K60K75KSE +/- 41.44, N = 3SE +/- 27.69, N = 470298.870469.81. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Wownero - Hash Count: 1MOptimized Power ModeDefault16K32K48K64K80KSE +/- 24.14, N = 4SE +/- 677.05, N = 475701.775002.31. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: GhostRider - Hash Count: 1MOptimized Power ModeDefault4K8K12K16K20KSE +/- 22.03, N = 3SE +/- 67.39, N = 313832.416522.51. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: CryptoNight-Heavy - Hash Count: 1MOptimized Power ModeDefault15K30K45K60K75KSE +/- 96.69, N = 3SE +/- 49.63, N = 470338.670573.01. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: CryptoNight-Femto UPX2 - Hash Count: 1MOptimized Power ModeDefault15K30K45K60K75KSE +/- 443.65, N = 3SE +/- 844.15, N = 469847.969677.81. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

QuantLib

QuantLib is an open-source library/framework around quantitative finance for modeling, trading and risk management scenarios. QuantLib is written in C++ with Boost and its built-in benchmark used reports the QuantLib Benchmark Index benchmark score. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Multi-ThreadedOptimized Power ModeDefault80K160K240K320K400KSE +/- 52.43, N = 3SE +/- 127.57, N = 3391468.4388447.21. (CXX) g++ options: -O3 -march=native -fPIE -pie

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.32Configuration: Single-ThreadedOptimized Power ModeDefault8001600240032004000SE +/- 2.92, N = 3SE +/- 1.44, N = 33641.43650.51. (CXX) g++ options: -O3 -march=native -fPIE -pie

OpenRadioss

OpenRadioss is an open-source AGPL-licensed finite element solver for dynamic event analysis OpenRadioss is based on Altair Radioss and open-sourced in 2022. This open-source finite element solver is benchmarked with various example models available from https://www.openradioss.org/models/ and https://github.com/OpenRadioss/ModelExchange/tree/main/Examples. This test is currently using a reference OpenRadioss binary build offered via GitHub. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bumper BeamOptimized Power ModeDefault20406080100SE +/- 0.13, N = 3SE +/- 0.18, N = 382.7384.78

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Chrysler Neon 1MOptimized Power ModeDefault20406080100SE +/- 0.17, N = 3SE +/- 0.14, N = 386.3085.82

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Cell Phone Drop TestOptimized Power ModeDefault612182430SE +/- 0.12, N = 3SE +/- 0.14, N = 325.3725.74

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Bird Strike on WindshieldOptimized Power ModeDefault306090120150SE +/- 0.28, N = 3SE +/- 0.65, N = 3111.73111.23

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: Rubber O-Ring Seal InstallationOptimized Power ModeDefault20406080100SE +/- 0.13, N = 3SE +/- 0.03, N = 389.3781.69

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2023.09.15Model: INIVOL and Fluid Structure Interaction Drop ContainerOptimized Power ModeDefault20406080100SE +/- 0.08, N = 3SE +/- 0.07, N = 398.4997.87

Neural Magic DeepSparse

This is a benchmark of Neural Magic's DeepSparse using its built-in deepsparse.benchmark utility and various models from their SparseZoo (https://sparsezoo.neuralmagic.com/). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault306090120150SE +/- 0.12, N = 3SE +/- 0.21, N = 3138.01137.34

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault100200300400500SE +/- 0.42, N = 3SE +/- 0.27, N = 3461.15463.09

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault10002000300040005000SE +/- 2.64, N = 3SE +/- 6.99, N = 34590.344580.63

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault48121620SE +/- 0.01, N = 3SE +/- 0.02, N = 313.9113.95

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault400800120016002000SE +/- 6.86, N = 3SE +/- 3.42, N = 31768.911778.59

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault816243240SE +/- 0.14, N = 3SE +/- 0.08, N = 336.1335.93

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault2K4K6K8K10KSE +/- 12.41, N = 3SE +/- 7.42, N = 310738.1411244.79

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault1.33622.67244.00865.34486.681SE +/- 0.0065, N = 3SE +/- 0.0033, N = 35.93855.6732

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault2004006008001000SE +/- 0.60, N = 3SE +/- 0.97, N = 3834.27834.18

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault20406080100SE +/- 0.04, N = 3SE +/- 0.04, N = 376.5076.54

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault4080120160200SE +/- 0.33, N = 3SE +/- 1.36, N = 3162.59160.41

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault90180270360450SE +/- 0.71, N = 3SE +/- 3.53, N = 3392.45397.77

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault400800120016002000SE +/- 3.31, N = 3SE +/- 7.80, N = 31776.531791.27

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault816243240SE +/- 0.07, N = 3SE +/- 0.16, N = 335.9635.67

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault2004006008001000SE +/- 0.10, N = 3SE +/- 0.30, N = 3863.35862.14

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault1632486480SE +/- 0.03, N = 3SE +/- 0.05, N = 373.9974.07

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault30060090012001500SE +/- 2.51, N = 3SE +/- 1.85, N = 31237.031235.41

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault1224364860SE +/- 0.07, N = 3SE +/- 0.05, N = 351.6251.70

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault4080120160200SE +/- 0.02, N = 3SE +/- 0.43, N = 3183.71182.16

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault80160240320400SE +/- 0.05, N = 3SE +/- 0.69, N = 3346.74350.90

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault400800120016002000SE +/- 0.88, N = 3SE +/- 1.67, N = 31883.661899.22

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault816243240SE +/- 0.01, N = 3SE +/- 0.03, N = 333.9033.64

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault306090120150SE +/- 0.30, N = 3SE +/- 0.09, N = 3138.15137.35

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamOptimized Power ModeDefault100200300400500SE +/- 0.70, N = 3SE +/- 0.03, N = 3460.69462.99

PyTorch

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: ResNet-50Optimized Power ModeDefault1020304050SE +/- 0.39, N = 3SE +/- 0.55, N = 432.8343.91MIN: 15.85 / MAX: 36.48MIN: 17.29 / MAX: 46.23

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: ResNet-50Optimized Power ModeDefault1020304050SE +/- 0.02, N = 3SE +/- 0.51, N = 431.5844.58MIN: 14.91 / MAX: 34.67MIN: 19.44 / MAX: 46.72

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: ResNet-152Optimized Power ModeDefault48121620SE +/- 0.12, N = 12SE +/- 0.23, N = 312.7617.67MIN: 5.75 / MAX: 15.71MIN: 10.89 / MAX: 18.38

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: ResNet-152Optimized Power ModeDefault48121620SE +/- 0.17, N = 3SE +/- 0.13, N = 1112.8117.85MIN: 6.6 / MAX: 13.9MIN: 7.02 / MAX: 19.19

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 64 - Model: Efficientnet_v2_lOptimized Power ModeDefault0.58951.1791.76852.3582.9475SE +/- 0.02, N = 3SE +/- 0.02, N = 32.182.62MIN: 0.98 / MAX: 3.41MIN: 1.32 / MAX: 3.9

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 256 - Model: Efficientnet_v2_lOptimized Power ModeDefault0.58281.16561.74842.33122.914SE +/- 0.02, N = 3SE +/- 0.00, N = 32.192.59MIN: 0.98 / MAX: 3.3MIN: 1.19 / MAX: 4

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUOptimized Power ModeDefault160320480640800SE +/- 0.90, N = 3SE +/- 0.71, N = 3750.43748.101. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Detection FP16 - Device: CPUOptimized Power ModeDefault1020304050SE +/- 0.05, N = 3SE +/- 0.04, N = 342.5942.72MIN: 32.12 / MAX: 128.96MIN: 31.92 / MAX: 84.11. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault120240360480600SE +/- 1.35, N = 3SE +/- 0.65, N = 3538.20539.691. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault50100150200250SE +/- 0.55, N = 3SE +/- 0.35, N = 3237.36236.70MIN: 160.6 / MAX: 286.17MIN: 157.14 / MAX: 268.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault2K4K6K8K10KSE +/- 4.32, N = 3SE +/- 5.30, N = 38943.888930.001. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault48121620SE +/- 0.01, N = 3SE +/- 0.01, N = 314.3014.32MIN: 12.18 / MAX: 47.3MIN: 12.39 / MAX: 39.161. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUOptimized Power ModeDefault5K10K15K20K25KSE +/- 6.86, N = 3SE +/- 42.09, N = 324922.4024795.171. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

Y-Cruncher

OpenBenchmarking.orgSeconds, Fewer Is BetterY-Cruncher 0.8.2Pi Digits To Calculate: 1BOptimized Power ModeDefault1.16982.33963.50944.67925.849SE +/- 0.009, N = 5SE +/- 0.018, N = 54.6975.199

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Face Detection Retail FP16-INT8 - Device: CPUOptimized Power ModeDefault1.15882.31763.47644.63525.794SE +/- 0.00, N = 3SE +/- 0.01, N = 35.135.15MIN: 4.64 / MAX: 29.56MIN: 4.63 / MAX: 27.91. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUOptimized Power ModeDefault5001000150020002500SE +/- 7.16, N = 3SE +/- 3.42, N = 32392.382390.251. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Road Segmentation ADAS FP16-INT8 - Device: CPUOptimized Power ModeDefault1224364860SE +/- 0.16, N = 3SE +/- 0.08, N = 353.4553.51MIN: 45.01 / MAX: 111.52MIN: 43.94 / MAX: 101.681. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUOptimized Power ModeDefault2004006008001000SE +/- 3.05, N = 3SE +/- 9.12, N = 31100.761106.321. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Machine Translation EN To DE FP16 - Device: CPUOptimized Power ModeDefault714212835SE +/- 0.08, N = 3SE +/- 0.23, N = 328.9928.85MIN: 22.06 / MAX: 222.83MIN: 21.11 / MAX: 222.121. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault11K22K33K44K55KSE +/- 229.19, N = 3SE +/- 490.25, N = 349494.0648795.971. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUOptimized Power ModeDefault0.55131.10261.65392.20522.7565SE +/- 0.00, N = 3SE +/- 0.01, N = 32.422.45MIN: 1.96 / MAX: 28.3MIN: 2.03 / MAX: 23.831. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUOptimized Power ModeDefault2K4K6K8K10KSE +/- 6.61, N = 3SE +/- 7.22, N = 310168.0610205.611. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUOptimized Power ModeDefault3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 312.5512.51MIN: 10.77 / MAX: 42.91MIN: 10.84 / MAX: 42.361. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUOptimized Power ModeDefault7001400210028003500SE +/- 6.73, N = 3SE +/- 7.06, N = 33297.813330.191. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Handwritten English Recognition FP16-INT8 - Device: CPUOptimized Power ModeDefault918273645SE +/- 0.08, N = 3SE +/- 0.08, N = 338.7938.41MIN: 36.26 / MAX: 60.12MIN: 35.95 / MAX: 58.711. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUOptimized Power ModeDefault30K60K90K120K150KSE +/- 1361.46, N = 3SE +/- 775.18, N = 3117369.55121577.551. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2023.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUOptimized Power ModeDefault0.09230.18460.27690.36920.4615SE +/- 0.01, N = 3SE +/- 0.00, N = 30.410.40MIN: 0.19 / MAX: 15.84MIN: 0.18 / MAX: 14.471. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: motorBike - Execution TimeOptimized Power ModeDefault1.23952.4793.71854.9586.19755.508974.132881. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Mesh TimeOptimized Power ModeDefault71421283527.7130.041. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Small Mesh Size - Execution TimeOptimized Power ModeDefault61218243023.3523.471. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

QMCPACK

QMCPACK is a modern high-performance open-source Quantum Monte Carlo (QMC) simulation code making use of MPI for this benchmark of the H20 example code. QMCPACK is an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids. QMCPACK is supported by the U.S. Department of Energy. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.17.1Input: Li2_STO_aeOptimized Power ModeDefault20406080100SE +/- 1.02, N = 5SE +/- 0.40, N = 392.6794.381. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl

7-Zip Compression

This is a test of 7-Zip compression/decompression with its integrated benchmark feature. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatingOptimized Power ModeDefault150K300K450K600K750KSE +/- 1440.87, N = 3SE +/- 869.44, N = 36544596895161. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatingOptimized Power ModeDefault140K280K420K560K700KSE +/- 9574.07, N = 3SE +/- 4040.78, N = 36381366373111. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaOptimized Power ModeDefault20406080100SE +/- 0.20, N = 3SE +/- 0.29, N = 398.9196.52

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix MakefilesOptimized Power ModeDefault4080120160200SE +/- 0.58, N = 3SE +/- 0.30, N = 3187.84177.43

Zstd Compression

This test measures the time needed to compress/decompress a sample file (silesia.tar) using Zstd (Zstandard) compression with options for different compression levels / settings. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Compression SpeedOptimized Power ModeDefault70140210280350SE +/- 2.98, N = 5SE +/- 3.46, N = 3272.2333.61. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 12 - Decompression SpeedOptimized Power ModeDefault30060090012001500SE +/- 1.74, N = 5SE +/- 1.98, N = 31282.71422.71. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Compression SpeedOptimized Power ModeDefault510152025SE +/- 0.19, N = 3SE +/- 0.06, N = 316.219.11. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19 - Decompression SpeedOptimized Power ModeDefault30060090012001500SE +/- 2.93, N = 3SE +/- 1.19, N = 31044.61173.51. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Compression SpeedOptimized Power ModeDefault3691215SE +/- 0.01, N = 3SE +/- 0.02, N = 38.569.901. (CC) gcc options: -O3 -pthread -lz -llzma

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.5.4Compression Level: 19, Long Mode - Decompression SpeedOptimized Power ModeDefault30060090012001500SE +/- 2.17, N = 3SE +/- 16.60, N = 31094.21183.81. (CC) gcc options: -O3 -pthread -lz -llzma

Timed GCC Compilation

This test times how long it takes to build the GNU Compiler Collection (GCC) open-source compiler. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GCC Compilation 13.2Time To CompileOptimized Power ModeDefault160320480640800SE +/- 4.73, N = 3SE +/- 2.37, N = 3759.22706.26

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigOptimized Power ModeDefault612182430SE +/- 0.18, N = 11SE +/- 0.21, N = 824.4823.76

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfigOptimized Power ModeDefault306090120150SE +/- 0.48, N = 3SE +/- 0.51, N = 3151.28149.79

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowOptimized Power ModeDefault918273645SE +/- 0.06, N = 4SE +/- 0.06, N = 441.1341.451. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumOptimized Power ModeDefault1020304050SE +/- 0.05, N = 4SE +/- 0.06, N = 441.6941.991. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: SlowOptimized Power ModeDefault306090120150SE +/- 0.35, N = 8SE +/- 0.38, N = 8125.18127.251. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: MediumOptimized Power ModeDefault306090120150SE +/- 0.58, N = 8SE +/- 0.46, N = 8130.64131.141. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastOptimized Power ModeDefault1530456075SE +/- 0.44, N = 5SE +/- 0.26, N = 564.9769.361. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastOptimized Power ModeDefault1632486480SE +/- 0.12, N = 5SE +/- 0.25, N = 567.2171.051. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastOptimized Power ModeDefault20406080100SE +/- 0.70, N = 6SE +/- 0.23, N = 670.8777.531. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: Very FastOptimized Power ModeDefault60120180240300SE +/- 1.04, N = 10SE +/- 1.16, N = 10256.19268.491. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: Super FastOptimized Power ModeDefault60120180240300SE +/- 1.69, N = 10SE +/- 1.09, N = 10251.33263.881. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 1080p - Video Preset: Ultra FastOptimized Power ModeDefault60120180240300SE +/- 1.93, N = 10SE +/- 2.07, N = 11263.96282.051. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

SVT-VP9

This is a test of the Intel Open Visual Cloud Scalable Video Technology SVT-VP9 CPU-based multi-threaded video encoder for the VP9 video format with a sample YUV input video file. Learn more via the OpenBenchmarking.org test page.

Tuning: VMAF Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: VMAF Optimized - Input: Bosphorus 1080pOptimized Power ModeDefault120240360480600SE +/- 9.88, N = 15SE +/- 2.30, N = 9493.26542.981. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: PSNR/SSIM Optimized - Input: Bosphorus 1080pOptimized Power ModeDefault120240360480600SE +/- 3.72, N = 8SE +/- 4.34, N = 9502.05542.171. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

Tuning: Visual Quality Optimized - Input: Bosphorus 4K

Default: The test quit with a non-zero exit status.

Optimized Power Mode: The test quit with a non-zero exit status.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-VP9 0.3Tuning: Visual Quality Optimized - Input: Bosphorus 1080pOptimized Power ModeDefault100200300400500SE +/- 2.86, N = 8SE +/- 1.93, N = 8433.53463.271. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 4 - Input: Bosphorus 4KOptimized Power ModeDefault246810SE +/- 0.006, N = 3SE +/- 0.098, N = 36.6457.2971. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 8 - Input: Bosphorus 4KOptimized Power ModeDefault1530456075SE +/- 0.56, N = 4SE +/- 0.39, N = 461.9169.471. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 12 - Input: Bosphorus 4KOptimized Power ModeDefault50100150200250SE +/- 2.06, N = 5SE +/- 0.90, N = 6198.74218.451. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 13 - Input: Bosphorus 4KOptimized Power ModeDefault50100150200250SE +/- 1.56, N = 9SE +/- 1.12, N = 5199.55218.761. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 4 - Input: Bosphorus 1080pOptimized Power ModeDefault510152025SE +/- 0.11, N = 5SE +/- 0.16, N = 519.6720.951. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 8 - Input: Bosphorus 1080pOptimized Power ModeDefault306090120150SE +/- 0.77, N = 6SE +/- 0.36, N = 7126.70142.271. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 12 - Input: Bosphorus 1080pOptimized Power ModeDefault110220330440550SE +/- 4.01, N = 15SE +/- 3.53, N = 9439.48502.711. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.8Encoder Mode: Preset 13 - Input: Bosphorus 1080pOptimized Power ModeDefault140280420560700SE +/- 3.16, N = 10SE +/- 4.69, N = 11574.60628.761. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Blender

Blender is an open-source 3D creation and modeling software project. This test is of Blender's Cycles performance with various sample files. GPU computing via NVIDIA OptiX and NVIDIA CUDA is currently supported as well as HIP for AMD Radeon GPUs and Intel oneAPI for Intel Graphics. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: BMW27 - Compute: CPU-OnlyOptimized Power ModeDefault3691215SE +/- 0.04, N = 4SE +/- 0.03, N = 412.6712.89

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Classroom - Compute: CPU-OnlyOptimized Power ModeDefault816243240SE +/- 0.07, N = 3SE +/- 0.33, N = 333.4034.94

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.0Blend File: Barbershop - Compute: CPU-OnlyOptimized Power ModeDefault306090120150SE +/- 0.11, N = 3SE +/- 6.68, N = 9129.94149.49

FFmpeg

This is a benchmark of the FFmpeg multimedia framework. The FFmpeg test profile is making use of a modified version of vbench from Columbia University's Architecture and Design Lab (ARCADE) [http://arcade.cs.columbia.edu/vbench/] that is a benchmark for video-as-a-service workloads. The test profile offers the options of a range of vbench scenarios based on freely distributable video content and offers the options of using the x264 or x265 video encoders for transcoding. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: LiveOptimized Power ModeDefault306090120150SE +/- 0.79, N = 3SE +/- 1.03, N = 3118.24131.811. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: UploadOptimized Power ModeDefault612182430SE +/- 0.06, N = 3SE +/- 0.06, N = 326.0227.001. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: PlatformOptimized Power ModeDefault1224364860SE +/- 0.14, N = 3SE +/- 0.11, N = 351.3153.981. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

OpenBenchmarking.orgFPS, More Is BetterFFmpeg 6.1Encoder: libx265 - Scenario: Video On DemandOptimized Power ModeDefault1224364860SE +/- 0.10, N = 3SE +/- 0.04, N = 351.4553.631. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma

uvg266

uvg266 is an open-source VVC/H.266 (Versatile Video Coding) encoder based on Kvazaar as part of the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: SlowOptimized Power ModeDefault612182430SE +/- 0.08, N = 3SE +/- 0.08, N = 326.7127.20

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: MediumOptimized Power ModeDefault714212835SE +/- 0.19, N = 3SE +/- 0.08, N = 329.6229.79

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: SlowOptimized Power ModeDefault20406080100SE +/- 0.19, N = 6SE +/- 0.10, N = 678.9380.30

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: MediumOptimized Power ModeDefault20406080100SE +/- 0.25, N = 6SE +/- 0.08, N = 686.7888.46

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Very FastOptimized Power ModeDefault1326395265SE +/- 0.45, N = 15SE +/- 0.07, N = 549.1056.06

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Super FastOptimized Power ModeDefault1326395265SE +/- 0.48, N = 4SE +/- 0.15, N = 550.6857.87

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 4K - Video Preset: Ultra FastOptimized Power ModeDefault1326395265SE +/- 0.21, N = 4SE +/- 0.37, N = 550.3659.59

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Very FastOptimized Power ModeDefault4080120160200SE +/- 1.07, N = 11SE +/- 0.74, N = 8147.08160.63

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Super FastOptimized Power ModeDefault306090120150SE +/- 1.05, N = 15SE +/- 1.01, N = 15145.75158.23

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.4.1Video Input: Bosphorus 1080p - Video Preset: Ultra FastOptimized Power ModeDefault4080120160200SE +/- 1.12, N = 15SE +/- 1.12, N = 9153.17166.47

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: FastOptimized Power ModeDefault246810SE +/- 0.066, N = 3SE +/- 0.067, N = 36.3406.9211. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: FasterOptimized Power ModeDefault3691215SE +/- 0.072, N = 3SE +/- 0.081, N = 39.09610.8361. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: FastOptimized Power ModeDefault510152025SE +/- 0.11, N = 3SE +/- 0.15, N = 318.0519.921. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: FasterOptimized Power ModeDefault714212835SE +/- 0.30, N = 4SE +/- 0.36, N = 328.9332.091. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Intel Open Image Denoise

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.1Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-OnlyOptimized Power ModeDefault1.0622.1243.1864.2485.31SE +/- 0.01, N = 6SE +/- 0.01, N = 64.724.53

easyWave

The easyWave software allows simulating tsunami generation and propagation in the context of early warning systems. EasyWave supports making use of OpenMP for CPU multi-threading and there are also GPU ports available but not currently incorporated as part of this test profile. The easyWave tsunami generation software is run with one of the example/reference input files for measuring the CPU execution time. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400Optimized Power ModeDefault20406080100SE +/- 1.07, N = 3SE +/- 1.41, N = 15102.9389.981. (CXX) g++ options: -O3 -fopenmp

OpenBenchmarking.orgSeconds, Fewer Is BettereasyWave r34Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200Optimized Power ModeDefault1020304050SE +/- 0.41, N = 15SE +/- 0.28, N = 342.5636.551. (CXX) g++ options: -O3 -fopenmp

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32Optimized Power ModeDefault300M600M900M1200M1500MSE +/- 1193035.34, N = 3SE +/- 1273228.62, N = 3120450000012117666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57Optimized Power ModeDefault300M600M900M1200M1500MSE +/- 16392511.84, N = 3SE +/- 9950041.88, N = 3129333333314291000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32Optimized Power ModeDefault500M1000M1500M2000M2500MSE +/- 13974301.81, N = 3SE +/- 6835284.27, N = 3246086666724760333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57Optimized Power ModeDefault600M1200M1800M2400M3000MSE +/- 20734086.19, N = 15SE +/- 24169149.30, N = 3255657333328101666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 32Optimized Power ModeDefault800M1600M2400M3200M4000MSE +/- 3883440.63, N = 3SE +/- 1386041.53, N = 3374523333336118666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 57Optimized Power ModeDefault900M1800M2700M3600M4500MSE +/- 34188302.09, N = 3SE +/- 11478143.48, N = 3412500000043298666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 32Optimized Power ModeDefault1300M2600M3900M5200M6500MSE +/- 61606333.10, N = 6SE +/- 7150058.27, N = 3630148333361856000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 57Optimized Power ModeDefault1200M2400M3600M4800M6000MSE +/- 19835069.95, N = 3SE +/- 11767044.38, N = 3565660000058355000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512Optimized Power ModeDefault110M220M330M440M550MSE +/- 3588947.54, N = 3SE +/- 5123258.57, N = 64914266675082450001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512Optimized Power ModeDefault200M400M600M800M1000MSE +/- 5205466.14, N = 3SE +/- 10474795.68, N = 399966333310123366671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512Optimized Power ModeDefault300M600M900M1200M1500MSE +/- 13505101.92, N = 3SE +/- 4762352.36, N = 3151346666714739000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 256 - Buffer Length: 256 - Filter Length: 512Optimized Power ModeDefault500M1000M1500M2000M2500MSE +/- 1101514.11, N = 3SE +/- 2968913.01, N = 3214100000021445666671. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Apache Spark TPC-H

This is a benchmark of Apache Spark using TPC-H data-set. Apache Spark is an open-source unified analytics engine for large-scale data processing and dealing with big data. This test profile benchmarks the Apache Spark in a single-system configuration using spark-submit. The test makes use of https://github.com/ssavvides/tpch-spark/ for facilitating the TPC-H benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterApache Spark TPC-H 3.5Scale Factor: 10 - Geometric Mean Of All QueriesOptimized Power ModeDefault3691215SE +/- 0.09145601, N = 7SE +/- 0.07178690, N = 39.773041278.35919683MIN: 4.46 / MAX: 32.81MIN: 4.15 / MAX: 27.27

DuckDB

DuckDB is an in-progress SQL OLAP database management system optimized for analytics and features a vectorized and parallel engine. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: IMDBOptimized Power ModeDefault306090120150SE +/- 0.43, N = 3SE +/- 0.49, N = 3139.33121.971. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

OpenBenchmarking.orgSeconds, Fewer Is BetterDuckDB 0.9.1Benchmark: TPC-H ParquetOptimized Power ModeDefault306090120150SE +/- 0.38, N = 3SE +/- 0.31, N = 3157.10148.551. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500Optimized Power ModeDefault60K120K180K240K300KSE +/- 10.52, N = 3SE +/- 25.30, N = 3202507.42273684.711. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000Optimized Power ModeDefault50K100K150K200K250KSE +/- 72.63, N = 2SE +/- 397.04, N = 3184861.99243384.191. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache HTTP Server

This is a test of the Apache HTTPD web server. This Apache HTTPD web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 500Optimized Power ModeDefault20K40K60K80K100KSE +/- 184.86, N = 3SE +/- 265.12, N = 380817.7390823.061. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is BetterApache HTTP Server 2.4.56Concurrent Requests: 1000Optimized Power ModeDefault20K40K60K80K100K76315.6580826.711. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Apache Hadoop

This is a benchmark of the Apache Hadoop making use of its built-in name-node throughput benchmark (NNThroughputBenchmark). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps per sec, More Is BetterApache Hadoop 3.3.6Operation: Create - Threads: 100 - Files: 100000Optimized Power ModeDefault12002400360048006000SE +/- 62.35, N = 3SE +/- 32.35, N = 348145490

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10Optimized Power ModeDefault700K1400K2100K2800K3500KSE +/- 43551.88, N = 15SE +/- 29533.59, N = 152180648.213360511.121. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100Optimized Power ModeDefault700K1400K2100K2800K3500KSE +/- 42725.42, N = 3SE +/- 25091.90, N = 33274425.323441439.711. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Redis

Redis is an open-source in-memory data structure store, used as a database, cache, and message broker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 50Optimized Power ModeDefault1000K2000K3000K4000K5000KSE +/- 1281.58, N = 3SE +/- 8167.04, N = 43811819.504744838.901. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 50Optimized Power ModeDefault700K1400K2100K2800K3500KSE +/- 1584.15, N = 3SE +/- 12726.60, N = 33050498.003060186.421. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: GET - Parallel Connections: 500Optimized Power ModeDefault700K1400K2100K2800K3500KSE +/- 36270.68, N = 3SE +/- 482.98, N = 33152359.753240001.581. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 7.0.4Test: SET - Parallel Connections: 500Optimized Power ModeDefault500K1000K1500K2000K2500KSE +/- 19286.75, N = 15SE +/- 7546.74, N = 32389636.672503177.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

PostgreSQL

This is a benchmark of PostgreSQL using the integrated pgbench for facilitating the database benchmarks. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlyOptimized Power ModeDefault200K400K600K800K1000KSE +/- 11877.48, N = 12SE +/- 17460.42, N = 108784688930891. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencyOptimized Power ModeDefault0.25670.51340.77011.02681.2835SE +/- 0.016, N = 12SE +/- 0.022, N = 101.1411.1241. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read WriteOptimized Power ModeDefault14K28K42K56K70KSE +/- 214.64, N = 3SE +/- 70.87, N = 359695639621. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 16Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average LatencyOptimized Power ModeDefault48121620SE +/- 0.06, N = 3SE +/- 0.02, N = 316.7515.631. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

CPU Peak Freq (Highest CPU Core Frequency) Monitor

OpenBenchmarking.orgMegahertzCPU Peak Freq (Highest CPU Core Frequency) MonitorPhoronix Test Suite System MonitoringOptimized Power ModeDefault10002000300040005000Min: 800 / Avg: 3437 / Max: 5154Min: 500 / Avg: 3374.53 / Max: 5474

CPU Power Consumption Monitor

OpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringOptimized Power ModeDefault140280420560700Min: 88.73 / Avg: 366.23 / Max: 802.53Min: 101.15 / Avg: 445.93 / Max: 802.3

159 Results Shown

Xmrig:
  KawPow - 1M
  Monero - 1M
  Wownero - 1M
  GhostRider - 1M
  CryptoNight-Heavy - 1M
  CryptoNight-Femto UPX2 - 1M
QuantLib:
  Multi-Threaded
  Single-Threaded
OpenRadioss:
  Bumper Beam
  Chrysler Neon 1M
  Cell Phone Drop Test
  Bird Strike on Windshield
  Rubber O-Ring Seal Installation
  INIVOL and Fluid Structure Interaction Drop Container
Neural Magic DeepSparse:
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
    items/sec
    ms/batch
PyTorch:
  CPU - 64 - ResNet-50
  CPU - 256 - ResNet-50
  CPU - 64 - ResNet-152
  CPU - 256 - ResNet-152
  CPU - 64 - Efficientnet_v2_l
  CPU - 256 - Efficientnet_v2_l
OpenVINO:
  Person Detection FP16 - CPU:
    FPS
    ms
  Face Detection FP16-INT8 - CPU:
    FPS
    ms
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Face Detection Retail FP16-INT8 - CPU:
    FPS
Y-Cruncher
OpenVINO:
  Face Detection Retail FP16-INT8 - CPU
  Road Segmentation ADAS FP16-INT8 - CPU
  Road Segmentation ADAS FP16-INT8 - CPU
  Machine Translation EN To DE FP16 - CPU
  Machine Translation EN To DE FP16 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Handwritten English Recognition FP16-INT8 - CPU
  Handwritten English Recognition FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
OpenFOAM:
  motorBike - Execution Time
  drivaerFastback, Small Mesh Size - Mesh Time
  drivaerFastback, Small Mesh Size - Execution Time
QMCPACK
7-Zip Compression:
  Compression Rating
  Decompression Rating
Timed LLVM Compilation:
  Ninja
  Unix Makefiles
Zstd Compression:
  12 - Compression Speed
  12 - Decompression Speed
  19 - Compression Speed
  19 - Decompression Speed
  19, Long Mode - Compression Speed
  19, Long Mode - Decompression Speed
Timed GCC Compilation
Timed Linux Kernel Compilation:
  defconfig
  allmodconfig
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Super Fast
  Bosphorus 1080p - Ultra Fast
SVT-VP9:
  VMAF Optimized - Bosphorus 1080p
  PSNR/SSIM Optimized - Bosphorus 1080p
  Visual Quality Optimized - Bosphorus 1080p
SVT-AV1:
  Preset 4 - Bosphorus 4K
  Preset 8 - Bosphorus 4K
  Preset 12 - Bosphorus 4K
  Preset 13 - Bosphorus 4K
  Preset 4 - Bosphorus 1080p
  Preset 8 - Bosphorus 1080p
  Preset 12 - Bosphorus 1080p
  Preset 13 - Bosphorus 1080p
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Barbershop - CPU-Only
FFmpeg:
  libx265 - Live
  libx265 - Upload
  libx265 - Platform
  libx265 - Video On Demand
uvg266:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 1080p - Slow
  Bosphorus 1080p - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
  Bosphorus 1080p - Very Fast
  Bosphorus 1080p - Super Fast
  Bosphorus 1080p - Ultra Fast
VVenC:
  Bosphorus 4K - Fast
  Bosphorus 4K - Faster
  Bosphorus 1080p - Fast
  Bosphorus 1080p - Faster
Intel Open Image Denoise
easyWave:
  e2Asean Grid + BengkuluSept2007 Source - 2400
  e2Asean Grid + BengkuluSept2007 Source - 1200
Liquid-DSP:
  32 - 256 - 32
  32 - 256 - 57
  64 - 256 - 32
  64 - 256 - 57
  128 - 256 - 32
  128 - 256 - 57
  256 - 256 - 32
  256 - 256 - 57
  32 - 256 - 512
  64 - 256 - 512
  128 - 256 - 512
  256 - 256 - 512
Apache Spark TPC-H
DuckDB:
  IMDB
  TPC-H Parquet
nginx:
  500
  1000
Apache HTTP Server:
  500
  1000
Apache Hadoop
Memcached:
  1:10
  1:100
Redis:
  GET - 50
  SET - 50
  GET - 500
  SET - 500
PostgreSQL:
  100 - 1000 - Read Only
  100 - 1000 - Read Only - Average Latency
  100 - 1000 - Read Write
  100 - 1000 - Read Write - Average Latency
CPU Peak Freq (Highest CPU Core Frequency) Monitor:
  Phoronix Test Suite System Monitoring:
    Megahertz
    Watts