AMD EPYC Zen 5 SMT Comparison

AMD EPYC 9575F 1P SMT comparison benchmarks by Michael Larabel for a future article. Fresh tests repeated with SMT on/off from SMCI BIOS toggle.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2501314-NE-AMDEPYCZE72
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Sensor Monitoring

Show Accumulated Sensor Monitoring Data For Displayed Results
Generate Power Efficiency / Performance Per Watt Results

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
SMT Enabled - Default
January 30
  9 Hours, 36 Minutes
SMT Disabled
January 30
  9 Hours, 38 Minutes
Invert Behavior (Only Show Selected Data)
  9 Hours, 37 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC Zen 5 SMT ComparisonOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.13.0-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768ProcessorsMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionAMD EPYC Zen 5 SMT Comparison BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116 - Python 3.12.7- SMT Enabled - Default: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - SMT Disabled: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

SMT Enabled - Default vs. SMT Disabled ComparisonPhoronix Test SuiteBaseline+24.8%+24.8%+49.6%+49.6%+74.4%+74.4%99%89.9%89.2%85.7%84.8%78.9%75%70.8%22.7%17.4%15.8%14.7%14.3%12%11.5%10.8%8.7%8.5%8.5%8.4%8%7.8%7%6.7%5.6%5.6%5.1%4.6%3.3%2.9%2.8%2.7%2.2%2.2%2.2%2.1%2.1%2%R.S.A.F.I - CPUV.D.F.I - CPUResizingP.V.B.D.F - CPUUpdate RandM.T.E.T.D.F - CPUP.R.I.R.F - CPURead While Writing72.7%Update RandD.R61.3%Pathtracer ISPC - Crown56.8%Pathtracer ISPC - Asian Dragon56.2%Pathtracer ISPC - Asian Dragon Obj55.9%WPA PSK54%Read While Writing47.9%bcrypt44.3%Blowfish44.2%ChaCha20-Poly130542.9%CoreMark Size 666 - I.P.S41.9%H.T.M41.5%CPU Stress41.3%ChaCha2041.2%R.C.a.P - CPU41.1%LuxCore Benchmark - CPU40.4%Danish Mood - CPU39.8%Vector Math38.6%Chess Benchmark38.1%3 - 4K - 1 - Path Tracer - CPU37.7%2 - 4K - 1 - Path Tracer - CPU37.4%1 - 4K - 1 - Path Tracer - CPU36.3%1:10036.1%3 - 4K - 32 - Path Tracer - CPU34.4%Enhanced34.1%2 - 4K - 32 - Path Tracer - CPU34.1%SHA25634%100 - 800 - Read Only - Average Latency33.9%Orange Juice - CPU33.8%1 - 4K - 32 - Path Tracer - CPU33.5%100 - 800 - Read Only33.4%v.I33.3%100 - 1000 - Read Only33.1%100 - 1000 - Read Only - Average Latency32.7%Pabellon Barcelona - CPU-Only32.1%MD531.4%Compression Rating30.2%Junkshop - CPU-Only30.1%Barbershop - CPU-Only29.7%100 - 800 - Read Write29.6%100 - 800 - Read Write - Average Latency29.5%DLSC - CPU28.5%Context Switching27.5%Classroom - CPU-Only26.6%BMW27 - CPU-Only25.8%R.R.W.R23.8%Noise-Gaussian23.8%100 - 1000 - Read Write23.7%100 - 1000 - Read Write - Average Latency23.7%SP.BInteger Math21.8%128 - 256 - 51221.6%AVX-512 VNNI19.2%EP.CN.S.P.L.F - CPU16.5%RSA409616.3%50015.9%CG.Callmodconfig15.8%P.R.I.R.F - CPU15.1%FT.CCPU CacheA.w.3.5.A14.1%S.w.1.0.6.A13.9%Ninja13.4%HMAC-SHA51213%Swirl12.4%N.S.P.L.F - CPU12.4%MG.CM.T.E.T.D.F - CPU12%100011.5%LU.C1:1011.4%P.P.B.T.T11%Total Time10.8%EP.DF.D.R.F.I - CPU10.6%I.B.O10%Time To Compile9.6%P.P.B.T.T9.4%SHA5129.1%Bosphorus 4K - SlowBosphorus 4K - FasterBosphorus 4K - MediumHWB Color SpaceBosphorus 4K - FastBosphorus 4K - SlowTime To Compile7.7%P.V.B.D.F - CPU7.6%Time To Compile7.1%BT.C1e137%Bosphorus 4K - MediumAES-256-GCM6.6%Bosphorus 4K - Very Fast6.1%Bosphorus 4K - Very Fast5.9%F.D.R.F.I - CPU5.7%V.D.F.I - CPU5.7%Preset 13 - Bosphorus 4KPreset 8 - Bosphorus 4KS.F.P.RBosphorus 4K - Ultra FastNUMA4.4%Bosphorus 4K - Super FastPreset 5 - Bosphorus 4KSP.C1.R.H.D.S.RH.E.R.F.I - CPU2.3%Bosphorus 4K - Ultra FastMemory Copying2.2%RotateCPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - T.G.1H.E.R.F.I - CPU2.1%Rand Read5K - 162.1%1.R.H.D.F.R.C.CPreset 3 - Bosphorus 4K4K - 162%OpenVINOOpenVINOGraphicsMagickOpenVINORocksDBOpenVINOOpenVINOSpeedbSpeedb7-Zip CompressionEmbreeEmbreeEmbreeJohn The RipperRocksDBJohn The RipperJohn The RipperOpenSSLCoremarkStress-NGStress-NGOpenSSLLuxCoreRenderLuxCoreRenderLuxCoreRenderStress-NGStockfishOSPRay StudioOSPRay StudioOSPRay StudioMemcachedOSPRay StudioGraphicsMagickOSPRay StudioOpenSSLPostgreSQLLuxCoreRenderOSPRay StudioPostgreSQLOpenVKLPostgreSQLPostgreSQLBlenderJohn The Ripper7-Zip CompressionBlenderBlenderPostgreSQLPostgreSQLLuxCoreRenderStress-NGBlenderBlenderRocksDBGraphicsMagickPostgreSQLPostgreSQLNAS Parallel BenchmarksStress-NGLiquid-DSPStress-NGNAS Parallel BenchmarksOpenVINOOpenSSLnginxNAS Parallel BenchmarksTimed Linux Kernel CompilationOpenVINONAS Parallel BenchmarksStress-NGNAMDNAMDTimed LLVM CompilationJohn The RipperGraphicsMagickOpenVINONAS Parallel BenchmarksOpenVINOnginxNAS Parallel BenchmarksMemcachedsrsRAN ProjectTachyonNAS Parallel BenchmarksOpenVINOStress-NGTimed Node.js CompilationsrsRAN ProjectOpenSSLKvazaarVVenCKvazaarGraphicsMagickVVenCuvg266Timed Eigen CompilationOpenVINOTimed Gem5 CompilationNAS Parallel BenchmarksPrimesieveuvg266OpenSSLKvazaaruvg266OpenVINOOpenVINOSVT-AV1SVT-AV1ACES DGEMMuvg266Stress-NGKvazaarSVT-AV1NAS Parallel BenchmarksClickHouseOpenVINOKvazaarStress-NGGraphicsMagickLlama.cppOpenVINOSpeedbC-RayClickHouseSVT-AV1C-RaySMT Enabled - DefaultSMT Disabled

AMD EPYC Zen 5 SMT Comparisonstress-ng: CPU Stressstress-ng: Memory Copyingstress-ng: Vector Mathstress-ng: Context Switchingstress-ng: CPU Cachestress-ng: NUMAstress-ng: AVX-512 VNNIstress-ng: Integer Mathstress-ng: Integer Bit Operationsstress-ng: Hyperbolic Trigonometric Mathopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPUllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128lammps: 20k Atomsnpb: BT.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: LU.Cnpb: SP.Bnpb: SP.Cnpb: IS.Dnpb: MG.Cnpb: CG.Cnamd: ATPase with 327,506 Atomsnamd: STMV with 1,066,628 Atomsopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUmt-dgemm: Sustained Floating-Point Ratecoremark: CoreMark Size 666 - Iterations Per Secondprimesieve: 1e13stockfish: Chess Benchmarkcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingjohn-the-ripper: MD5john-the-ripper: Blowfishjohn-the-ripper: HMAC-SHA512john-the-ripper: bcryptjohn-the-ripper: WPA PSKbuild-llvm: Ninjabuild-linux-kernel: allmodconfigkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Very Fastpalabos: 500laghos: Sedov Blast Wave, ube_922_hex.meshlaghos: Triple Point Problemkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Super Fastkvazaar: Bosphorus 4K - Ultra Fastgraphics-magick: HWB Color Spacegraphics-magick: Noise-Gaussiangraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Swirltachyon: Total Timesvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 5 - Bosphorus 4Ksvt-av1: Preset 3 - Bosphorus 4Kc-ray: 4K - 16c-ray: 5K - 16blender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Junkshop - CPU-Onlyuvg266: Bosphorus 4K - Slowuvg266: Bosphorus 4K - Mediumuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fasterembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer ISPC - Crownopenvkl: vklBenchmarkCPU ISPCluxcorerender: DLSC - CPUluxcorerender: Rainbow Colors and Prism - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Orange Juice - CPUluxcorerender: Danish Mood - CPUospray-studio: 1 - 4K - 1 - Path Tracer - CPUospray-studio: 1 - 4K - 32 - Path Tracer - CPUospray-studio: 2 - 4K - 1 - Path Tracer - CPUospray-studio: 2 - 4K - 32 - Path Tracer - CPUospray-studio: 3 - 4K - 1 - Path Tracer - CPUospray-studio: 3 - 4K - 32 - Path Tracer - CPUbuild-eigen: Time To Compilebuild-gem5: Time To Compilebuild-nodejs: Time To Compilefinancebench: Bonds OpenMPfinancebench: Repo OpenMPliquid-dsp: 64 - 256 - 512liquid-dsp: 128 - 256 - 512srsran: PUSCH Processor Benchmark, Throughput Totalsrsran: PDSCH Processor Benchmark, Throughput Totalrustls: handshake - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256rustls: handshake - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384rustls: handshake - TLS13_CHACHA20_POLY1305_SHA256rustls: handshake-ticket - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256rustls: handshake-ticket - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384rustls: handshake-ticket - TLS13_CHACHA20_POLY1305_SHA256rustls: handshake-resume - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256rustls: handshake-resume - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384rustls: handshake-resume - TLS13_CHACHA20_POLY1305_SHA256speedb: Rand Readspeedb: Read While Writingspeedb: Read Rand Write Randspeedb: Update Randnginx: 500nginx: 1000openssl: RSA4096openssl: RSA4096openssl: SHA256openssl: SHA512openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20openssl: ChaCha20-Poly1305clickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runmemcached: 1:100memcached: 1:10rocksdb: Rand Readrocksdb: Read While Writingrocksdb: Read Rand Write Randrocksdb: Update Randpgbench: 100 - 800 - Read Writepgbench: 100 - 800 - Read Write - Average Latencypgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 1000 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencypgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencyopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time To First Tokenopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time Per Output TokenSMT Enabled - DefaultSMT Disabled207237.2926334.21553414.2752341805.752704684.592094.0713250045.886977901.8419006107.55488803.1478.0650.43117.0352.9353.787329182.789641.8610642.01149810.23284866.89185256.17147801.327000.73159653.7862539.0012.972773.743177670.874.146268.385.05790.5740.4422207.982.83186.9720.072341.2813.639483.913.366539.339.465217.6455954035383.56781524.57624443782263731853321018460333199040427958667199325859423101.113190.67440.4993.25770.439562.40295.0941.35108.96112.0047728133821327529767916.3825456.699199.93960.38816.95333.60659.69814.8841.2546.69146.5720.0127.7530.8474.7576.4778.3612.11425.608137.9946118.3267111.9620239115.1131.5412.2721.9811.4910173524910273554112034132528.337121.003124.15627898.72135417136.7102861515900000183380000020677.2118394.6117122.32708240.91111705.913173966.532250219.43507739.284285728.042766520.21408236.41547341883103863643653992532848574247.25563863.8545520.01684833.11142477381534565608673312937976620831189125968417735906787330502160719143774.37797.78811.0413615896.007150348.985355125811203919674253046964171266926.31548367690.1651146228.72547434900.21115.8412.81146670.4725765.78399202.6041052099.153091182.202004.8711117419.405730701.5217279400.36345394.6778.5550.49119.5652.8653.658352266.6611316.8811786.07171779.61317628.32227311.90151971.937078.61178843.5372412.0411.367093.285527258.662.185824.652.72706.1822.6020077.072.963116.4520.502331.656.858237.151.925615.2710.635484.7897022844496.90631426.28617697322148952533053614054000138073378876000138135558160114.663220.74144.0187.85772.342566.24298.5244.85112.56114.5051722725240328129260418.1512482.463211.07162.11917.29634.28060.95418.7252.2361.69190.1326.0429.9232.9170.5976.7181.9713.08027.77688.340275.883571.4190179411.7622.368.7416.438.2213864707114114764416565555630.523129.535136.08727904.24674517164.1028641512433333150780000018626.3108233.2117111.87708330.20111713.223180516.792238161.49505551.884279636.212750461.27409213.5155890245060127893620451910104495324.89505544.0545691.71448237.7852499963674183237457712859061524731115036388103521259281067351349708757790.42819.66825.3110001671.486416279.96545826282814171759970261286685977828.18136250790.2219268010.79035646430.28015.8912.73OpenBenchmarking.org

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: CPU StressSMT DisabledSMT Enabled - Default40K80K120K160K200KSE +/- 99.86, N = 3SE +/- 559.15, N = 3146670.47207237.291. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Memory CopyingSMT DisabledSMT Enabled - Default6K12K18K24K30KSE +/- 57.84, N = 3SE +/- 54.76, N = 325765.7826334.211. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Vector MathSMT DisabledSMT Enabled - Default120K240K360K480K600KSE +/- 951.54, N = 3SE +/- 306.60, N = 3399202.60553414.271. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Context SwitchingSMT DisabledSMT Enabled - Default11M22M33M44M55MSE +/- 142871.30, N = 3SE +/- 146419.18, N = 341052099.1552341805.751. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: CPU CacheSMT Enabled - DefaultSMT Disabled700K1400K2100K2800K3500KSE +/- 42296.58, N = 12SE +/- 35933.89, N = 32704684.593091182.201. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: NUMASMT DisabledSMT Enabled - Default400800120016002000SE +/- 3.80, N = 3SE +/- 8.23, N = 32004.872094.071. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: AVX-512 VNNISMT DisabledSMT Enabled - Default3M6M9M12M15MSE +/- 2571.79, N = 3SE +/- 37914.71, N = 311117419.4013250045.881. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Integer MathSMT DisabledSMT Enabled - Default1.5M3M4.5M6M7.5MSE +/- 1218.12, N = 3SE +/- 11122.15, N = 35730701.526977901.841. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Integer Bit OperationsSMT DisabledSMT Enabled - Default4M8M12M16M20MSE +/- 10193.47, N = 3SE +/- 3213.22, N = 317279400.3619006107.551. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Hyperbolic Trigonometric MathSMT DisabledSMT Enabled - Default100K200K300K400K500KSE +/- 124.87, N = 3SE +/- 357.40, N = 3345394.67488803.141. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenVINO GenAI

OpenBenchmarking.orgtokens/s, More Is BetterOpenVINO GenAI 2024.5Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPUSMT Enabled - DefaultSMT Disabled20406080100SE +/- 0.62, N = 4SE +/- 0.26, N = 478.0678.55

Llama.cpp

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128SMT Enabled - DefaultSMT Disabled1122334455SE +/- 0.04, N = 4SE +/- 0.03, N = 450.4350.491. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128SMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.46, N = 7SE +/- 0.84, N = 7117.03119.561. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128SMT DisabledSMT Enabled - Default1224364860SE +/- 0.05, N = 4SE +/- 0.04, N = 452.8652.931. (CXX) g++ options: -O3

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsSMT DisabledSMT Enabled - Default1224364860SE +/- 0.14, N = 3SE +/- 0.24, N = 353.6653.791. (CXX) g++ options: -O3 -lm -ldl

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CSMT Enabled - DefaultSMT Disabled80K160K240K320K400KSE +/- 2468.08, N = 15SE +/- 491.86, N = 5329182.78352266.661. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CSMT Enabled - DefaultSMT Disabled2K4K6K8K10KSE +/- 94.51, N = 15SE +/- 313.55, N = 159641.8611316.881. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DSMT Enabled - DefaultSMT Disabled3K6K9K12K15KSE +/- 551.90, N = 15SE +/- 523.50, N = 1210642.0111786.071. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CSMT Enabled - DefaultSMT Disabled40K80K120K160K200KSE +/- 2690.96, N = 15SE +/- 76.78, N = 9149810.23171779.611. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CSMT Enabled - DefaultSMT Disabled70K140K210K280K350KSE +/- 2151.09, N = 15SE +/- 973.24, N = 6284866.89317628.321. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BSMT Enabled - DefaultSMT Disabled50K100K150K200K250KSE +/- 1833.56, N = 15SE +/- 1995.83, N = 15185256.17227311.901. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CSMT Enabled - DefaultSMT Disabled30K60K90K120K150KSE +/- 194.21, N = 5SE +/- 133.32, N = 5147801.32151971.931. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DSMT Enabled - DefaultSMT Disabled15003000450060007500SE +/- 68.21, N = 6SE +/- 15.47, N = 67000.737078.611. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CSMT Enabled - DefaultSMT Disabled40K80K120K160K200KSE +/- 463.56, N = 11SE +/- 337.13, N = 11159653.78178843.531. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CSMT Enabled - DefaultSMT Disabled16K32K48K64K80KSE +/- 1042.31, N = 15SE +/- 301.36, N = 1062539.0072412.041. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: ATPase with 327,506 AtomsSMT DisabledSMT Enabled - Default3691215SE +/- 0.00, N = 6SE +/- 0.09, N = 711.3712.97

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: STMV with 1,066,628 AtomsSMT DisabledSMT Enabled - Default0.84221.68442.52663.36884.211SE +/- 0.00445, N = 3SE +/- 0.00568, N = 43.285523.74317

OpenVINO

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Vehicle Detection FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default16003200480064008000SE +/- 9.98, N = 3SE +/- 4.72, N = 37258.667670.871. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Vehicle Detection FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled0.93151.8632.79453.7264.6575SE +/- 0.00, N = 3SE +/- 0.00, N = 34.142.18MIN: 2.3 / MAX: 17.74MIN: 1.73 / MAX: 8.31. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT DisabledSMT Enabled - Default13002600390052006500SE +/- 29.65, N = 3SE +/- 3.28, N = 35824.656268.381. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled1.13632.27263.40894.54525.6815SE +/- 0.00, N = 3SE +/- 0.01, N = 35.052.72MIN: 3.07 / MAX: 13.2MIN: 2.28 / MAX: 17.031. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Machine Translation EN To DE FP16 - Device: CPUSMT DisabledSMT Enabled - Default2004006008001000SE +/- 1.50, N = 3SE +/- 0.82, N = 3706.18790.571. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Machine Translation EN To DE FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled918273645SE +/- 0.04, N = 3SE +/- 0.05, N = 340.4422.60MIN: 22.28 / MAX: 61.21MIN: 19.42 / MAX: 38.391. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Face Detection Retail FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default5K10K15K20K25KSE +/- 26.61, N = 3SE +/- 42.14, N = 320077.0722207.981. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Face Detection Retail FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default0.6661.3321.9982.6643.33SE +/- 0.00, N = 3SE +/- 0.00, N = 32.962.80MIN: 2.34 / MAX: 10.85MIN: 1.63 / MAX: 16.341. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Handwritten English Recognition FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default7001400210028003500SE +/- 2.44, N = 3SE +/- 2.16, N = 33116.453186.971. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Handwritten English Recognition FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default510152025SE +/- 0.02, N = 3SE +/- 0.01, N = 320.5020.07MIN: 19.01 / MAX: 29.66MIN: 12.13 / MAX: 35.831. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Road Segmentation ADAS FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default5001000150020002500SE +/- 3.04, N = 3SE +/- 2.80, N = 32331.652341.281. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Road Segmentation ADAS FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.02, N = 3SE +/- 0.01, N = 313.636.85MIN: 7.43 / MAX: 29.08MIN: 5.51 / MAX: 16.011. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Person Re-Identification Retail FP16 - Device: CPUSMT DisabledSMT Enabled - Default2K4K6K8K10KSE +/- 9.64, N = 3SE +/- 12.91, N = 38237.159483.911. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Person Re-Identification Retail FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled0.7561.5122.2683.0243.78SE +/- 0.00, N = 3SE +/- 0.00, N = 33.361.92MIN: 1.65 / MAX: 16.78MIN: 1.7 / MAX: 10.141. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Noise Suppression Poconet-Like FP16 - Device: CPUSMT DisabledSMT Enabled - Default14002800420056007000SE +/- 2.51, N = 3SE +/- 2.29, N = 35615.276539.331. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Noise Suppression Poconet-Like FP16 - Device: CPUSMT DisabledSMT Enabled - Default3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 310.639.46MIN: 6.99 / MAX: 22.9MIN: 5.8 / MAX: 27.031. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

ACES DGEMM

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateSMT Enabled - DefaultSMT Disabled12002400360048006000SE +/- 4.91, N = 5SE +/- 8.97, N = 55217.655484.791. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondSMT DisabledSMT Enabled - Default900K1800K2700K3600K4500KSE +/- 11150.98, N = 3SE +/- 2817.99, N = 32844496.914035383.571. (CC) gcc options: -O2 -lrt" -lrt

Primesieve

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 12.6Length: 1e13SMT DisabledSMT Enabled - Default612182430SE +/- 0.05, N = 3SE +/- 0.05, N = 326.2924.581. (CXX) g++ options: -O3

Stockfish

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 17Chess BenchmarkSMT DisabledSMT Enabled - Default50M100M150M200M250MSE +/- 4557708.76, N = 15SE +/- 3026146.64, N = 131769732212444378221. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingSMT DisabledSMT Enabled - Default140K280K420K560K700KSE +/- 5895.67, N = 3SE +/- 5367.73, N = 34895256373181. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingSMT DisabledSMT Enabled - Default110K220K330K440K550KSE +/- 214.48, N = 3SE +/- 447.27, N = 33305365332101. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5SMT DisabledSMT Enabled - Default4M8M12M16M20MSE +/- 8660.25, N = 3SE +/- 49208.17, N = 314054000184603331. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishSMT DisabledSMT Enabled - Default40K80K120K160K200KSE +/- 46.23, N = 3SE +/- 89.67, N = 31380731990401. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512SMT DisabledSMT Enabled - Default90M180M270M360M450MSE +/- 2440649.98, N = 3SE +/- 1354899.42, N = 33788760004279586671. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptSMT DisabledSMT Enabled - Default40K80K120K160K200KSE +/- 48.59, N = 3SE +/- 205.13, N = 31381351993251. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKSMT DisabledSMT Enabled - Default200K400K600K800K1000KSE +/- 541.28, N = 3SE +/- 400.95, N = 35581608594231. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaSMT DisabledSMT Enabled - Default306090120150SE +/- 0.08, N = 3SE +/- 0.19, N = 3114.66101.11

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfigSMT DisabledSMT Enabled - Default50100150200250SE +/- 0.20, N = 3SE +/- 0.29, N = 3220.74190.67

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowSMT Enabled - DefaultSMT Disabled1020304050SE +/- 0.03, N = 4SE +/- 0.09, N = 440.4944.011. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastSMT DisabledSMT Enabled - Default20406080100SE +/- 0.05, N = 6SE +/- 0.04, N = 687.8593.251. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Palabos

OpenBenchmarking.orgMega Site Updates Per Second, More Is BetterPalabos 2.3Grid Size: 500SMT Enabled - DefaultSMT Disabled170340510680850SE +/- 0.76, N = 3SE +/- 1.81, N = 3770.44772.341. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

Laghos

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshSMT Enabled - DefaultSMT Disabled120240360480600SE +/- 2.09, N = 3SE +/- 3.98, N = 3562.40566.241. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point ProblemSMT Enabled - DefaultSMT Disabled70140210280350SE +/- 3.57, N = 3SE +/- 2.92, N = 3295.09298.521. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumSMT Enabled - DefaultSMT Disabled1020304050SE +/- 0.04, N = 4SE +/- 0.02, N = 441.3544.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.18, N = 7SE +/- 0.04, N = 7108.96112.561. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.28, N = 7SE +/- 0.24, N = 7112.00114.501. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceSMT Enabled - DefaultSMT Disabled110220330440550SE +/- 4.51, N = 3SE +/- 3.51, N = 34775171. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianSMT DisabledSMT Enabled - Default60120180240300SE +/- 0.67, N = 3SE +/- 0.33, N = 32272811. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedSMT DisabledSMT Enabled - Default70140210280350SE +/- 0.00, N = 3SE +/- 2.19, N = 32523381. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: ResizingSMT Enabled - DefaultSMT Disabled90180270360450SE +/- 0.67, N = 3SE +/- 5.24, N = 32134031. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: RotateSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.33, N = 3SE +/- 1.76, N = 32752811. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenSMT DisabledSMT Enabled - Default60120180240300SE +/- 0.88, N = 3SE +/- 0.33, N = 32922971. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SwirlSMT DisabledSMT Enabled - Default150300450600750SE +/- 0.33, N = 3SE +/- 2.96, N = 36046791. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. The sample scene used is the Teapot scene ray-traced to 8K x 8K with 32 samples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99.2Total TimeSMT DisabledSMT Enabled - Default48121620SE +/- 0.04, N = 3SE +/- 0.00, N = 418.1516.381. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 13 - Input: Bosphorus 4KSMT Enabled - DefaultSMT Disabled100200300400500SE +/- 11.10, N = 15SE +/- 12.29, N = 15456.70482.461. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 8 - Input: Bosphorus 4KSMT Enabled - DefaultSMT Disabled50100150200250SE +/- 0.95, N = 4SE +/- 0.89, N = 4199.94211.071. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 5 - Input: Bosphorus 4KSMT Enabled - DefaultSMT Disabled1428425670SE +/- 0.21, N = 3SE +/- 0.02, N = 360.3962.121. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 3 - Input: Bosphorus 4KSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.05, N = 3SE +/- 0.03, N = 316.9517.301. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

C-Ray

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16SMT DisabledSMT Enabled - Default816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 334.2833.611. (CC) gcc options: -lpthread -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16SMT DisabledSMT Enabled - Default1428425670SE +/- 0.03, N = 3SE +/- 0.01, N = 360.9559.701. (CC) gcc options: -lpthread -lm

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: BMW27 - Compute: CPU-OnlySMT DisabledSMT Enabled - Default510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 418.7214.88

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: CPU-OnlySMT DisabledSMT Enabled - Default1224364860SE +/- 0.04, N = 3SE +/- 0.09, N = 352.2341.25

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: CPU-OnlySMT DisabledSMT Enabled - Default1428425670SE +/- 0.03, N = 3SE +/- 0.08, N = 361.6946.69

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: CPU-OnlySMT DisabledSMT Enabled - Default4080120160200SE +/- 0.16, N = 3SE +/- 0.21, N = 3190.13146.57

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: CPU-OnlySMT DisabledSMT Enabled - Default612182430SE +/- 0.04, N = 3SE +/- 0.01, N = 326.0420.01

uvg266

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: SlowSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.02, N = 3SE +/- 0.02, N = 327.7529.92

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: MediumSMT Enabled - DefaultSMT Disabled816243240SE +/- 0.04, N = 3SE +/- 0.09, N = 330.8432.91

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Very FastSMT DisabledSMT Enabled - Default20406080100SE +/- 0.06, N = 5SE +/- 0.10, N = 670.5974.75

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Super FastSMT Enabled - DefaultSMT Disabled20406080100SE +/- 0.06, N = 6SE +/- 0.04, N = 676.4776.71

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Ultra FastSMT Enabled - DefaultSMT Disabled20406080100SE +/- 0.06, N = 6SE +/- 0.08, N = 678.3681.97

VVenC

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.13Video Input: Bosphorus 4K - Video Preset: FastSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.12, N = 3SE +/- 0.13, N = 312.1113.081. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.13Video Input: Bosphorus 4K - Video Preset: FasterSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.07, N = 3SE +/- 0.07, N = 325.6127.781. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian DragonSMT DisabledSMT Enabled - Default306090120150SE +/- 0.04, N = 5SE +/- 0.05, N = 788.34137.99MIN: 87.7 / MAX: 89.3MIN: 136.67 / MAX: 139.92

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon ObjSMT DisabledSMT Enabled - Default306090120150SE +/- 0.01, N = 4SE +/- 0.08, N = 475.88118.33MIN: 75.31 / MAX: 76.83MIN: 116.67 / MAX: 120.05

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: CrownSMT DisabledSMT Enabled - Default306090120150SE +/- 0.03, N = 5SE +/- 0.04, N = 671.42111.96MIN: 70.44 / MAX: 72.43MIN: 110.19 / MAX: 114.2

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 2.0.0Benchmark: vklBenchmarkCPU ISPCSMT DisabledSMT Enabled - Default5001000150020002500SE +/- 1.45, N = 3SE +/- 0.33, N = 317942391MIN: 141 / MAX: 24183MIN: 188 / MAX: 30677

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUSMT DisabledSMT Enabled - Default48121620SE +/- 0.08, N = 3SE +/- 0.04, N = 311.7615.11MIN: 11.25 / MAX: 13.36MIN: 14.74 / MAX: 17.3

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPUSMT DisabledSMT Enabled - Default714212835SE +/- 0.06, N = 5SE +/- 0.10, N = 622.3631.54MIN: 20.03 / MAX: 22.83MIN: 27.49 / MAX: 32.55

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPUSMT DisabledSMT Enabled - Default3691215SE +/- 0.04, N = 3SE +/- 0.09, N = 38.7412.27MIN: 4.38 / MAX: 9.71MIN: 6.11 / MAX: 13.85

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPUSMT DisabledSMT Enabled - Default510152025SE +/- 0.15, N = 3SE +/- 0.07, N = 316.4321.98MIN: 14.19 / MAX: 19.88MIN: 19.44 / MAX: 27.65

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUSMT DisabledSMT Enabled - Default3691215SE +/- 0.08, N = 3SE +/- 0.04, N = 38.2211.49MIN: 4.38 / MAX: 9.13MIN: 6.03 / MAX: 12.77

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUSMT DisabledSMT Enabled - Default30060090012001500SE +/- 0.88, N = 3SE +/- 0.33, N = 313861017

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUSMT DisabledSMT Enabled - Default10K20K30K40K50KSE +/- 101.57, N = 3SE +/- 73.90, N = 34707135249

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUSMT DisabledSMT Enabled - Default30060090012001500SE +/- 1.86, N = 3SE +/- 0.88, N = 314111027

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUSMT DisabledSMT Enabled - Default10K20K30K40K50KSE +/- 32.74, N = 3SE +/- 14.15, N = 34764435541

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUSMT DisabledSMT Enabled - Default400800120016002000SE +/- 0.00, N = 3SE +/- 1.20, N = 316561203

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUSMT DisabledSMT Enabled - Default12K24K36K48K60KSE +/- 45.71, N = 3SE +/- 91.26, N = 35555641325

Timed Eigen Compilation

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.4.0Time To CompileSMT DisabledSMT Enabled - Default714212835SE +/- 0.01, N = 3SE +/- 0.05, N = 330.5228.34

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To CompileSMT DisabledSMT Enabled - Default306090120150SE +/- 1.06, N = 12SE +/- 0.25, N = 3129.54121.00

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileSMT DisabledSMT Enabled - Default306090120150SE +/- 0.20, N = 3SE +/- 0.10, N = 3136.09124.16

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPSMT DisabledSMT Enabled - Default6K12K18K24K30KSE +/- 17.22, N = 3SE +/- 11.50, N = 327904.2527898.721. (CXX) g++ options: -O3 -march=native -fopenmp

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPSMT DisabledSMT Enabled - Default4K8K12K16K20KSE +/- 18.58, N = 3SE +/- 15.72, N = 317164.1017136.711. (CXX) g++ options: -O3 -march=native -fopenmp

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512SMT DisabledSMT Enabled - Default300M600M900M1200M1500MSE +/- 1589898.67, N = 3SE +/- 1734935.16, N = 3151243333315159000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512SMT DisabledSMT Enabled - Default400M800M1200M1600M2000MSE +/- 3661056.31, N = 3SE +/- 3470350.61, N = 3150780000018338000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

srsRAN Project

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 24.10Test: PUSCH Processor Benchmark, Throughput TotalSMT DisabledSMT Enabled - Default4K8K12K16K20KSE +/- 176.66, N = 3SE +/- 157.68, N = 318626.320677.21. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 24.10Test: PDSCH Processor Benchmark, Throughput TotalSMT DisabledSMT Enabled - Default30K60K90K120K150KSE +/- 524.68, N = 6SE +/- 550.27, N = 4108233.2118394.61. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl

Rustls

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256SMT DisabledSMT Enabled - Default30K60K90K120K150KSE +/- 69.93, N = 4SE +/- 115.83, N = 4117111.87117122.321. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384SMT Enabled - DefaultSMT Disabled150K300K450K600K750KSE +/- 862.31, N = 3SE +/- 432.07, N = 3708240.91708330.201. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake - Suite: TLS13_CHACHA20_POLY1305_SHA256SMT Enabled - DefaultSMT Disabled20K40K60K80K100KSE +/- 81.03, N = 4SE +/- 21.33, N = 4111705.91111713.221. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-ticket - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256SMT Enabled - DefaultSMT Disabled700K1400K2100K2800K3500KSE +/- 7293.15, N = 3SE +/- 11239.17, N = 33173966.533180516.791. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-ticket - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384SMT DisabledSMT Enabled - Default500K1000K1500K2000K2500KSE +/- 2755.66, N = 3SE +/- 3277.78, N = 32238161.492250219.431. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-ticket - Suite: TLS13_CHACHA20_POLY1305_SHA256SMT DisabledSMT Enabled - Default110K220K330K440K550KSE +/- 1009.73, N = 3SE +/- 1861.82, N = 3505551.88507739.281. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-resume - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256SMT DisabledSMT Enabled - Default900K1800K2700K3600K4500KSE +/- 24017.92, N = 3SE +/- 13138.92, N = 34279636.214285728.041. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-resume - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384SMT DisabledSMT Enabled - Default600K1200K1800K2400K3000KSE +/- 4997.81, N = 3SE +/- 4024.00, N = 32750461.272766520.211. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-resume - Suite: TLS13_CHACHA20_POLY1305_SHA256SMT Enabled - DefaultSMT Disabled90K180K270K360K450KSE +/- 2869.68, N = 3SE +/- 5190.96, N = 3408236.41409213.511. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Speedb

Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random ReadSMT Enabled - DefaultSMT Disabled120M240M360M480M600MSE +/- 1478695.31, N = 3SE +/- 1746081.52, N = 35473418835589024501. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read While WritingSMT DisabledSMT Enabled - Default2M4M6M8M10MSE +/- 52167.60, N = 8SE +/- 210416.01, N = 156012789103863641. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read Random Write RandomSMT DisabledSMT Enabled - Default800K1600K2400K3200K4000KSE +/- 9776.13, N = 3SE +/- 2099.62, N = 3362045136539921. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Update RandomSMT Enabled - DefaultSMT Disabled200K400K600K800K1000KSE +/- 369.07, N = 3SE +/- 1284.63, N = 35328489101041. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500SMT DisabledSMT Enabled - Default120K240K360K480K600KSE +/- 183.72, N = 3SE +/- 1273.97, N = 3495324.89574247.251. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000SMT DisabledSMT Enabled - Default120K240K360K480K600KSE +/- 357.80, N = 3SE +/- 889.76, N = 3505544.05563863.851. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.3Algorithm: RSA4096SMT Enabled - DefaultSMT Disabled10K20K30K40K50KSE +/- 114.98, N = 3SE +/- 103.99, N = 345520.045691.71. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.3Algorithm: RSA4096SMT DisabledSMT Enabled - Default400K800K1200K1600K2000KSE +/- 1433.70, N = 3SE +/- 1937.35, N = 31448237.71684833.11. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: SHA256SMT DisabledSMT Enabled - Default20000M40000M60000M80000M100000MSE +/- 258379314.76, N = 3SE +/- 110969434.67, N = 3852499963671142477381531. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: SHA512SMT DisabledSMT Enabled - Default10000M20000M30000M40000M50000MSE +/- 93655164.94, N = 3SE +/- 96053213.66, N = 341832374577456560867331. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: AES-128-GCMSMT DisabledSMT Enabled - Default300000M600000M900000M1200000M1500000MSE +/- 1010958412.39, N = 3SE +/- 1917677180.61, N = 3128590615247312937976620831. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: AES-256-GCMSMT DisabledSMT Enabled - Default300000M600000M900000M1200000M1500000MSE +/- 470340232.96, N = 3SE +/- 970142767.67, N = 3111503638810311891259684171. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: ChaCha20SMT DisabledSMT Enabled - Default160000M320000M480000M640000M800000MSE +/- 147174578.01, N = 3SE +/- 25742159.30, N = 35212592810677359067873301. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: ChaCha20-Poly1305SMT DisabledSMT Enabled - Default110000M220000M330000M440000M550000MSE +/- 44777742.02, N = 3SE +/- 333074643.80, N = 33513497087575021607191431. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold CacheSMT Enabled - DefaultSMT Disabled2004006008001000SE +/- 3.81, N = 3SE +/- 7.72, N = 3774.37790.42MIN: 66.08 / MAX: 8571.43MIN: 58.77 / MAX: 8571.43

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second RunSMT Enabled - DefaultSMT Disabled2004006008001000SE +/- 8.66, N = 3SE +/- 4.42, N = 3797.78819.66MIN: 67.04 / MAX: 8571.43MIN: 59.41 / MAX: 8571.43

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third RunSMT Enabled - DefaultSMT Disabled2004006008001000SE +/- 9.65, N = 3SE +/- 2.69, N = 3811.04825.31MIN: 66.52 / MAX: 8571.43MIN: 59.64 / MAX: 8571.43

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100SMT DisabledSMT Enabled - Default3M6M9M12M15MSE +/- 73959.82, N = 3SE +/- 119540.65, N = 1510001671.4813615896.001. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10SMT DisabledSMT Enabled - Default1.5M3M4.5M6M7.5MSE +/- 31229.00, N = 3SE +/- 14479.53, N = 36416279.967150348.981. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadSMT Enabled - DefaultSMT Disabled120M240M360M480M600MSE +/- 715514.99, N = 3SE +/- 4177488.26, N = 105355125815458262821. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingSMT DisabledSMT Enabled - Default3M6M9M12M15MSE +/- 88547.74, N = 4SE +/- 63473.09, N = 38141717120391961. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write RandomSMT DisabledSMT Enabled - Default1.6M3.2M4.8M6.4M8MSE +/- 42893.92, N = 3SE +/- 36529.56, N = 3599702674253041. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update RandomSMT Enabled - DefaultSMT Disabled300K600K900K1200K1500KSE +/- 692.00, N = 3SE +/- 11648.08, N = 369641712866851. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

PostgreSQL

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read WriteSMT DisabledSMT Enabled - Default30K60K90K120K150KSE +/- 144.52, N = 3SE +/- 322.78, N = 3977821266921. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average LatencySMT DisabledSMT Enabled - Default246810SE +/- 0.012, N = 3SE +/- 0.016, N = 38.1816.3151. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read OnlySMT DisabledSMT Enabled - Default1000K2000K3000K4000K5000KSE +/- 8567.74, N = 3SE +/- 14712.71, N = 3362507948367691. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average LatencySMT DisabledSMT Enabled - Default0.04970.09940.14910.19880.2485SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2210.1651. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read WriteSMT DisabledSMT Enabled - Default20K40K60K80K100KSE +/- 151.71, N = 3SE +/- 547.52, N = 3926801146221. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average LatencySMT DisabledSMT Enabled - Default3691215SE +/- 0.018, N = 3SE +/- 0.042, N = 310.7908.7251. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlySMT DisabledSMT Enabled - Default1000K2000K3000K4000K5000KSE +/- 1560.76, N = 3SE +/- 17122.75, N = 3356464347434901. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencySMT DisabledSMT Enabled - Default0.0630.1260.1890.2520.315SE +/- 0.000, N = 3SE +/- 0.001, N = 30.2800.2111. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

154 Results Shown

Stress-NG:
  CPU Stress
  Memory Copying
  Vector Math
  Context Switching
  CPU Cache
  NUMA
  AVX-512 VNNI
  Integer Math
  Integer Bit Operations
  Hyperbolic Trigonometric Math
OpenVINO GenAI
Llama.cpp:
  CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128
  CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128
  CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128
LAMMPS Molecular Dynamics Simulator
NAS Parallel Benchmarks:
  BT.C
  EP.C
  EP.D
  FT.C
  LU.C
  SP.B
  SP.C
  IS.D
  MG.C
  CG.C
NAMD:
  ATPase with 327,506 Atoms
  STMV with 1,066,628 Atoms
OpenVINO:
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Person Vehicle Bike Detection FP16 - CPU:
    FPS
    ms
  Machine Translation EN To DE FP16 - CPU:
    FPS
    ms
  Face Detection Retail FP16-INT8 - CPU:
    FPS
    ms
  Handwritten English Recognition FP16-INT8 - CPU:
    FPS
    ms
  Road Segmentation ADAS FP16-INT8 - CPU:
    FPS
    ms
  Person Re-Identification Retail FP16 - CPU:
    FPS
    ms
  Noise Suppression Poconet-Like FP16 - CPU:
    FPS
    ms
ACES DGEMM
Coremark
Primesieve
Stockfish
7-Zip Compression:
  Compression Rating
  Decompression Rating
John The Ripper:
  MD5
  Blowfish
  HMAC-SHA512
  bcrypt
  WPA PSK
Timed LLVM Compilation
Timed Linux Kernel Compilation
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Very Fast
Palabos
Laghos:
  Sedov Blast Wave, ube_922_hex.mesh
  Triple Point Problem
Kvazaar:
  Bosphorus 4K - Medium
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
GraphicsMagick:
  HWB Color Space
  Noise-Gaussian
  Enhanced
  Resizing
  Rotate
  Sharpen
  Swirl
Tachyon
SVT-AV1:
  Preset 13 - Bosphorus 4K
  Preset 8 - Bosphorus 4K
  Preset 5 - Bosphorus 4K
  Preset 3 - Bosphorus 4K
C-Ray:
  4K - 16
  5K - 16
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Pabellon Barcelona - CPU-Only
  Barbershop - CPU-Only
  Junkshop - CPU-Only
uvg266:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
VVenC:
  Bosphorus 4K - Fast
  Bosphorus 4K - Faster
Embree:
  Pathtracer ISPC - Asian Dragon
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer ISPC - Crown
OpenVKL
LuxCoreRender:
  DLSC - CPU
  Rainbow Colors and Prism - CPU
  LuxCore Benchmark - CPU
  Orange Juice - CPU
  Danish Mood - CPU
OSPRay Studio:
  1 - 4K - 1 - Path Tracer - CPU
  1 - 4K - 32 - Path Tracer - CPU
  2 - 4K - 1 - Path Tracer - CPU
  2 - 4K - 32 - Path Tracer - CPU
  3 - 4K - 1 - Path Tracer - CPU
  3 - 4K - 32 - Path Tracer - CPU
Timed Eigen Compilation
Timed Gem5 Compilation
Timed Node.js Compilation
FinanceBench:
  Bonds OpenMP
  Repo OpenMP
Liquid-DSP:
  64 - 256 - 512
  128 - 256 - 512
srsRAN Project:
  PUSCH Processor Benchmark, Throughput Total
  PDSCH Processor Benchmark, Throughput Total
Rustls:
  handshake - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
  handshake - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
  handshake - TLS13_CHACHA20_POLY1305_SHA256
  handshake-ticket - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
  handshake-ticket - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
  handshake-ticket - TLS13_CHACHA20_POLY1305_SHA256
  handshake-resume - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
  handshake-resume - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
  handshake-resume - TLS13_CHACHA20_POLY1305_SHA256
Speedb:
  Rand Read
  Read While Writing
  Read Rand Write Rand
  Update Rand
nginx:
  500
  1000
OpenSSL:
  RSA4096:
    sign/s
    verify/s
  SHA256:
    byte/s
  SHA512:
    byte/s
  AES-128-GCM:
    byte/s
  AES-256-GCM:
    byte/s
  ChaCha20:
    byte/s
  ChaCha20-Poly1305:
    byte/s
ClickHouse:
  100M Rows Hits Dataset, First Run / Cold Cache
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, Third Run
Memcached:
  1:100
  1:10
RocksDB:
  Rand Read
  Read While Writing
  Read Rand Write Rand
  Update Rand
PostgreSQL:
  100 - 800 - Read Write
  100 - 800 - Read Write - Average Latency
  100 - 800 - Read Only
  100 - 800 - Read Only - Average Latency
  100 - 1000 - Read Write
  100 - 1000 - Read Write - Average Latency
  100 - 1000 - Read Only
  100 - 1000 - Read Only - Average Latency