AMD EPYC Zen 5 SMT Comparison

AMD EPYC 9575F 1P SMT comparison benchmarks by Michael Larabel for a future article. Fresh tests repeated with SMT on/off from SMCI BIOS toggle.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2501314-NE-AMDEPYCZE72
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts
Allow Limiting Results To Certain Suite(s)

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Sensor Monitoring

Show Accumulated Sensor Monitoring Data For Displayed Results
Generate Power Efficiency / Performance Per Watt Results

Run Management

Highlight
Result
Toggle/Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
SMT Enabled - Default
January 30
  9 Hours, 36 Minutes
SMT Disabled
January 30
  9 Hours, 38 Minutes
Invert Behavior (Only Show Selected Data)
  9 Hours, 37 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC Zen 5 SMT ComparisonOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads)AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores)Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)AMD 1Ah12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF3201GB Micron_7450_MTFDKCB3T2TFSASPEED2 x Broadcom NetXtreme BCM5720 PCIeUbuntu 24.106.13.0-phx (x86_64)GNOME Shell 47.0X ServerGCC 14.2.0ext41024x768ProcessorsMotherboardChipsetMemoryDiskGraphicsNetworkOSKernelDesktopDisplay ServerCompilerFile-SystemScreen ResolutionAMD EPYC Zen 5 SMT Comparison BenchmarksSystem Logs- Transparent Huge Pages: madvise- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xb002116 - Python 3.12.7- SMT Enabled - Default: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected - SMT Disabled: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

SMT Enabled - Default vs. SMT Disabled ComparisonPhoronix Test SuiteBaseline+24.8%+24.8%+49.6%+49.6%+74.4%+74.4%99%89.9%89.2%85.7%84.8%78.9%75%70.8%22.7%17.4%15.8%14.7%14.3%12%11.5%10.8%8.7%8.5%8.5%8.4%8%7.8%7%6.7%5.6%5.6%5.1%4.6%3.3%2.9%2.8%2.7%2.2%2.2%2.2%2.1%2.1%2%R.S.A.F.I - CPUV.D.F.I - CPUResizingP.V.B.D.F - CPUUpdate RandM.T.E.T.D.F - CPUP.R.I.R.F - CPURead While Writing72.7%Update RandD.R61.3%Pathtracer ISPC - Crown56.8%Pathtracer ISPC - Asian Dragon56.2%Pathtracer ISPC - Asian Dragon Obj55.9%WPA PSK54%Read While Writing47.9%bcrypt44.3%Blowfish44.2%ChaCha20-Poly130542.9%CoreMark Size 666 - I.P.S41.9%H.T.M41.5%CPU Stress41.3%ChaCha2041.2%R.C.a.P - CPU41.1%LuxCore Benchmark - CPU40.4%Danish Mood - CPU39.8%Vector Math38.6%Chess Benchmark38.1%3 - 4K - 1 - Path Tracer - CPU37.7%2 - 4K - 1 - Path Tracer - CPU37.4%1 - 4K - 1 - Path Tracer - CPU36.3%1:10036.1%3 - 4K - 32 - Path Tracer - CPU34.4%Enhanced34.1%2 - 4K - 32 - Path Tracer - CPU34.1%SHA25634%100 - 800 - Read Only - Average Latency33.9%Orange Juice - CPU33.8%1 - 4K - 32 - Path Tracer - CPU33.5%100 - 800 - Read Only33.4%v.I33.3%100 - 1000 - Read Only33.1%100 - 1000 - Read Only - Average Latency32.7%Pabellon Barcelona - CPU-Only32.1%MD531.4%Compression Rating30.2%Junkshop - CPU-Only30.1%Barbershop - CPU-Only29.7%100 - 800 - Read Write29.6%100 - 800 - Read Write - Average Latency29.5%DLSC - CPU28.5%Context Switching27.5%Classroom - CPU-Only26.6%BMW27 - CPU-Only25.8%R.R.W.R23.8%Noise-Gaussian23.8%100 - 1000 - Read Write23.7%100 - 1000 - Read Write - Average Latency23.7%SP.BInteger Math21.8%128 - 256 - 51221.6%AVX-512 VNNI19.2%EP.CN.S.P.L.F - CPU16.5%RSA409616.3%50015.9%CG.Callmodconfig15.8%P.R.I.R.F - CPU15.1%FT.CCPU CacheA.w.3.5.A14.1%S.w.1.0.6.A13.9%Ninja13.4%HMAC-SHA51213%Swirl12.4%N.S.P.L.F - CPU12.4%MG.CM.T.E.T.D.F - CPU12%100011.5%LU.C1:1011.4%P.P.B.T.T11%Total Time10.8%EP.DF.D.R.F.I - CPU10.6%I.B.O10%Time To Compile9.6%P.P.B.T.T9.4%SHA5129.1%Bosphorus 4K - SlowBosphorus 4K - FasterBosphorus 4K - MediumHWB Color SpaceBosphorus 4K - FastBosphorus 4K - SlowTime To Compile7.7%P.V.B.D.F - CPU7.6%Time To Compile7.1%BT.C1e137%Bosphorus 4K - MediumAES-256-GCM6.6%Bosphorus 4K - Very Fast6.1%Bosphorus 4K - Very Fast5.9%F.D.R.F.I - CPU5.7%V.D.F.I - CPU5.7%Preset 13 - Bosphorus 4KPreset 8 - Bosphorus 4KS.F.P.RBosphorus 4K - Ultra FastNUMA4.4%Bosphorus 4K - Super FastPreset 5 - Bosphorus 4KSP.C1.R.H.D.S.RH.E.R.F.I - CPU2.3%Bosphorus 4K - Ultra FastMemory Copying2.2%RotateCPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - T.G.1H.E.R.F.I - CPU2.1%Rand Read5K - 162.1%1.R.H.D.F.R.C.CPreset 3 - Bosphorus 4K4K - 162%OpenVINOOpenVINOGraphicsMagickOpenVINORocksDBOpenVINOOpenVINOSpeedbSpeedb7-Zip CompressionEmbreeEmbreeEmbreeJohn The RipperRocksDBJohn The RipperJohn The RipperOpenSSLCoremarkStress-NGStress-NGOpenSSLLuxCoreRenderLuxCoreRenderLuxCoreRenderStress-NGStockfishOSPRay StudioOSPRay StudioOSPRay StudioMemcachedOSPRay StudioGraphicsMagickOSPRay StudioOpenSSLPostgreSQLLuxCoreRenderOSPRay StudioPostgreSQLOpenVKLPostgreSQLPostgreSQLBlenderJohn The Ripper7-Zip CompressionBlenderBlenderPostgreSQLPostgreSQLLuxCoreRenderStress-NGBlenderBlenderRocksDBGraphicsMagickPostgreSQLPostgreSQLNAS Parallel BenchmarksStress-NGLiquid-DSPStress-NGNAS Parallel BenchmarksOpenVINOOpenSSLnginxNAS Parallel BenchmarksTimed Linux Kernel CompilationOpenVINONAS Parallel BenchmarksStress-NGNAMDNAMDTimed LLVM CompilationJohn The RipperGraphicsMagickOpenVINONAS Parallel BenchmarksOpenVINOnginxNAS Parallel BenchmarksMemcachedsrsRAN ProjectTachyonNAS Parallel BenchmarksOpenVINOStress-NGTimed Node.js CompilationsrsRAN ProjectOpenSSLKvazaarVVenCKvazaarGraphicsMagickVVenCuvg266Timed Eigen CompilationOpenVINOTimed Gem5 CompilationNAS Parallel BenchmarksPrimesieveuvg266OpenSSLKvazaaruvg266OpenVINOOpenVINOSVT-AV1SVT-AV1ACES DGEMMuvg266Stress-NGKvazaarSVT-AV1NAS Parallel BenchmarksClickHouseOpenVINOKvazaarStress-NGGraphicsMagickLlama.cppOpenVINOSpeedbC-RayClickHouseSVT-AV1C-RaySMT Enabled - DefaultSMT Disabled

AMD EPYC Zen 5 SMT Comparisonopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Face Detection Retail FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Handwritten English Recognition FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Road Segmentation ADAS FP16-INT8 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Person Re-Identification Retail FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino: Noise Suppression Poconet-Like FP16 - CPUopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPUopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time To First Tokenopenvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time Per Output Tokenllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128stress-ng: CPU Stressstress-ng: Memory Copyingstress-ng: Vector Mathstress-ng: Context Switchingstress-ng: CPU Cachestress-ng: NUMAstress-ng: AVX-512 VNNIstress-ng: Integer Mathstress-ng: Integer Bit Operationsstress-ng: Hyperbolic Trigonometric Mathsrsran: PUSCH Processor Benchmark, Throughput Totalsrsran: PDSCH Processor Benchmark, Throughput Totalblender: BMW27 - CPU-Onlyblender: Classroom - CPU-Onlyblender: Pabellon Barcelona - CPU-Onlyblender: Barbershop - CPU-Onlyblender: Junkshop - CPU-Onlyluxcorerender: DLSC - CPUluxcorerender: Rainbow Colors and Prism - CPUluxcorerender: LuxCore Benchmark - CPUluxcorerender: Orange Juice - CPUluxcorerender: Danish Mood - CPUnpb: BT.Cnpb: EP.Cnpb: EP.Dnpb: FT.Cnpb: LU.Cnpb: SP.Bnpb: SP.Cnpb: IS.Dnpb: MG.Cnpb: CG.Cospray-studio: 1 - 4K - 1 - Path Tracer - CPUospray-studio: 1 - 4K - 32 - Path Tracer - CPUospray-studio: 2 - 4K - 1 - Path Tracer - CPUospray-studio: 2 - 4K - 32 - Path Tracer - CPUospray-studio: 3 - 4K - 1 - Path Tracer - CPUospray-studio: 3 - 4K - 32 - Path Tracer - CPUopenvkl: vklBenchmarkCPU ISPCembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer ISPC - Crownbuild-eigen: Time To Compilebuild-linux-kernel: allmodconfigbuild-llvm: Ninjabuild-nodejs: Time To Compilebuild-gem5: Time To Compilepalabos: 500laghos: Sedov Blast Wave, ube_922_hex.meshlaghos: Triple Point Problemsvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 5 - Bosphorus 4Ksvt-av1: Preset 3 - Bosphorus 4Kkvazaar: Bosphorus 4K - Slowkvazaar: Bosphorus 4K - Mediumkvazaar: Bosphorus 4K - Very Fastkvazaar: Bosphorus 4K - Super Fastkvazaar: Bosphorus 4K - Ultra Fastvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 4K - Fasteruvg266: Bosphorus 4K - Slowuvg266: Bosphorus 4K - Mediumuvg266: Bosphorus 4K - Very Fastuvg266: Bosphorus 4K - Super Fastuvg266: Bosphorus 4K - Ultra Fastliquid-dsp: 64 - 256 - 512liquid-dsp: 128 - 256 - 512nginx: 500nginx: 1000rustls: handshake - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256rustls: handshake - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384rustls: handshake - TLS13_CHACHA20_POLY1305_SHA256rustls: handshake-ticket - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256rustls: handshake-ticket - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384rustls: handshake-ticket - TLS13_CHACHA20_POLY1305_SHA256rustls: handshake-resume - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256rustls: handshake-resume - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384rustls: handshake-resume - TLS13_CHACHA20_POLY1305_SHA256openssl: RSA4096openssl: RSA4096openssl: SHA256openssl: SHA512openssl: AES-128-GCMopenssl: AES-256-GCMopenssl: ChaCha20openssl: ChaCha20-Poly1305john-the-ripper: MD5john-the-ripper: Blowfishjohn-the-ripper: HMAC-SHA512john-the-ripper: bcryptjohn-the-ripper: WPA PSKclickhouse: 100M Rows Hits Dataset, First Run / Cold Cacheclickhouse: 100M Rows Hits Dataset, Second Runclickhouse: 100M Rows Hits Dataset, Third Runpgbench: 100 - 800 - Read Writepgbench: 100 - 800 - Read Write - Average Latencypgbench: 100 - 800 - Read Onlypgbench: 100 - 800 - Read Only - Average Latencypgbench: 100 - 1000 - Read Writepgbench: 100 - 1000 - Read Write - Average Latencypgbench: 100 - 1000 - Read Onlypgbench: 100 - 1000 - Read Only - Average Latencymemcached: 1:100memcached: 1:10rocksdb: Rand Readrocksdb: Read While Writingrocksdb: Read Rand Write Randrocksdb: Update Randspeedb: Rand Readspeedb: Read While Writingspeedb: Read Rand Write Randspeedb: Update Randcoremark: CoreMark Size 666 - Iterations Per Secondgraphics-magick: HWB Color Spacegraphics-magick: Noise-Gaussiangraphics-magick: Enhancedgraphics-magick: Resizinggraphics-magick: Rotategraphics-magick: Sharpengraphics-magick: Swirlmt-dgemm: Sustained Floating-Point Ratelammps: 20k Atomsnamd: ATPase with 327,506 Atomsnamd: STMV with 1,066,628 Atomsfinancebench: Bonds OpenMPfinancebench: Repo OpenMPcompress-7zip: Compression Ratingcompress-7zip: Decompression Ratingprimesieve: 1e13stockfish: Chess Benchmarktachyon: Total Timec-ray: 4K - 16c-ray: 5K - 16SMT Enabled - DefaultSMT Disabled7670.874.146268.385.05790.5740.4422207.982.83186.9720.072341.2813.639483.913.366539.339.4678.0615.8412.8150.43117.0352.93207237.2926334.21553414.2752341805.752704684.592094.0713250045.886977901.8419006107.55488803.1420677.2118394.614.8841.2546.69146.5720.0115.1131.5412.2721.9811.49329182.789641.8610642.01149810.23284866.89185256.17147801.327000.73159653.7862539.001017352491027355411203413252391137.9946118.3267111.962028.337190.674101.113124.156121.003770.439562.40295.09456.699199.93960.38816.95340.4941.3593.25108.96112.0012.11425.60827.7530.8474.7576.4778.3615159000001833800000574247.25563863.85117122.32708240.91111705.913173966.532250219.43507739.284285728.042766520.21408236.4145520.01684833.1114247738153456560867331293797662083118912596841773590678733050216071914318460333199040427958667199325859423774.37797.78811.041266926.31548367690.1651146228.72547434900.21113615896.007150348.985355125811203919674253046964175473418831038636436539925328484035383.5678154772813382132752976795217.64559553.78712.972773.7431727898.72135417136.71028663731853321024.57624443782216.382533.60659.6987258.662.185824.652.72706.1822.6020077.072.963116.4520.502331.656.858237.151.925615.2710.6378.5515.8912.7350.49119.5652.86146670.4725765.78399202.6041052099.153091182.202004.8711117419.405730701.5217279400.36345394.6718626.3108233.218.7252.2361.69190.1326.0411.7622.368.7416.438.22352266.6611316.8811786.07171779.61317628.32227311.90151971.937078.61178843.5372412.04138647071141147644165655556179488.340275.883571.419030.523220.741114.663136.087129.535772.342566.24298.52482.463211.07162.11917.29644.0144.8587.85112.56114.5013.08027.77629.9232.9170.5976.7181.9715124333331507800000495324.89505544.05117111.87708330.20111713.223180516.792238161.49505551.884279636.212750461.27409213.5145691.71448237.785249996367418323745771285906152473111503638810352125928106735134970875714054000138073378876000138135558160790.42819.66825.31977828.18136250790.2219268010.79035646430.28010001671.486416279.96545826282814171759970261286685558902450601278936204519101042844496.9063145172272524032812926045484.78970253.65811.367093.2855227904.24674517164.10286448952533053626.28617697322118.151234.28060.954OpenBenchmarking.org

OpenVINO

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Vehicle Detection FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled16003200480064008000SE +/- 4.72, N = 3SE +/- 9.98, N = 37670.877258.661. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Vehicle Detection FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default0.93151.8632.79453.7264.6575SE +/- 0.00, N = 3SE +/- 0.00, N = 32.184.14MIN: 1.73 / MAX: 8.3MIN: 2.3 / MAX: 17.741. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled13002600390052006500SE +/- 3.28, N = 3SE +/- 29.65, N = 36268.385824.651. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Person Vehicle Bike Detection FP16 - Device: CPUSMT DisabledSMT Enabled - Default1.13632.27263.40894.54525.6815SE +/- 0.01, N = 3SE +/- 0.00, N = 32.725.05MIN: 2.28 / MAX: 17.03MIN: 3.07 / MAX: 13.21. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Machine Translation EN To DE FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled2004006008001000SE +/- 0.82, N = 3SE +/- 1.50, N = 3790.57706.181. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Machine Translation EN To DE FP16 - Device: CPUSMT DisabledSMT Enabled - Default918273645SE +/- 0.05, N = 3SE +/- 0.04, N = 322.6040.44MIN: 19.42 / MAX: 38.39MIN: 22.28 / MAX: 61.211. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Face Detection Retail FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled5K10K15K20K25KSE +/- 42.14, N = 3SE +/- 26.61, N = 322207.9820077.071. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Face Detection Retail FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled0.6661.3321.9982.6643.33SE +/- 0.00, N = 3SE +/- 0.00, N = 32.802.96MIN: 1.63 / MAX: 16.34MIN: 2.34 / MAX: 10.851. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Handwritten English Recognition FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled7001400210028003500SE +/- 2.16, N = 3SE +/- 2.44, N = 33186.973116.451. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Handwritten English Recognition FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled510152025SE +/- 0.01, N = 3SE +/- 0.02, N = 320.0720.50MIN: 12.13 / MAX: 35.83MIN: 19.01 / MAX: 29.661. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Road Segmentation ADAS FP16-INT8 - Device: CPUSMT Enabled - DefaultSMT Disabled5001000150020002500SE +/- 2.80, N = 3SE +/- 3.04, N = 32341.282331.651. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Road Segmentation ADAS FP16-INT8 - Device: CPUSMT DisabledSMT Enabled - Default48121620SE +/- 0.01, N = 3SE +/- 0.02, N = 36.8513.63MIN: 5.51 / MAX: 16.01MIN: 7.43 / MAX: 29.081. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Person Re-Identification Retail FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled2K4K6K8K10KSE +/- 12.91, N = 3SE +/- 9.64, N = 39483.918237.151. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Person Re-Identification Retail FP16 - Device: CPUSMT DisabledSMT Enabled - Default0.7561.5122.2683.0243.78SE +/- 0.00, N = 3SE +/- 0.00, N = 31.923.36MIN: 1.7 / MAX: 10.14MIN: 1.65 / MAX: 16.781. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2024.5Model: Noise Suppression Poconet-Like FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled14002800420056007000SE +/- 2.29, N = 3SE +/- 2.51, N = 36539.335615.271. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2024.5Model: Noise Suppression Poconet-Like FP16 - Device: CPUSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 39.4610.63MIN: 5.8 / MAX: 27.03MIN: 6.99 / MAX: 22.91. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO GenAI

OpenBenchmarking.orgtokens/s, More Is BetterOpenVINO GenAI 2024.5Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPUSMT DisabledSMT Enabled - Default20406080100SE +/- 0.26, N = 4SE +/- 0.62, N = 478.5578.06

Llama.cpp

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128SMT DisabledSMT Enabled - Default1122334455SE +/- 0.03, N = 4SE +/- 0.04, N = 450.4950.431. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128SMT DisabledSMT Enabled - Default306090120150SE +/- 0.84, N = 7SE +/- 0.46, N = 7119.56117.031. (CXX) g++ options: -O3

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b4397Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128SMT Enabled - DefaultSMT Disabled1224364860SE +/- 0.04, N = 4SE +/- 0.05, N = 452.9352.861. (CXX) g++ options: -O3

Stress-NG

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: CPU StressSMT Enabled - DefaultSMT Disabled40K80K120K160K200KSE +/- 559.15, N = 3SE +/- 99.86, N = 3207237.29146670.471. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Memory CopyingSMT Enabled - DefaultSMT Disabled6K12K18K24K30KSE +/- 54.76, N = 3SE +/- 57.84, N = 326334.2125765.781. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Vector MathSMT Enabled - DefaultSMT Disabled120K240K360K480K600KSE +/- 306.60, N = 3SE +/- 951.54, N = 3553414.27399202.601. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Context SwitchingSMT Enabled - DefaultSMT Disabled11M22M33M44M55MSE +/- 146419.18, N = 3SE +/- 142871.30, N = 352341805.7541052099.151. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: CPU CacheSMT DisabledSMT Enabled - Default700K1400K2100K2800K3500KSE +/- 35933.89, N = 3SE +/- 42296.58, N = 123091182.202704684.591. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: NUMASMT Enabled - DefaultSMT Disabled400800120016002000SE +/- 8.23, N = 3SE +/- 3.80, N = 32094.072004.871. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: AVX-512 VNNISMT Enabled - DefaultSMT Disabled3M6M9M12M15MSE +/- 37914.71, N = 3SE +/- 2571.79, N = 313250045.8811117419.401. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Integer MathSMT Enabled - DefaultSMT Disabled1.5M3M4.5M6M7.5MSE +/- 11122.15, N = 3SE +/- 1218.12, N = 36977901.845730701.521. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Integer Bit OperationsSMT Enabled - DefaultSMT Disabled4M8M12M16M20MSE +/- 3213.22, N = 3SE +/- 10193.47, N = 319006107.5517279400.361. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.18.09Test: Hyperbolic Trigonometric MathSMT Enabled - DefaultSMT Disabled100K200K300K400K500KSE +/- 357.40, N = 3SE +/- 124.87, N = 3488803.14345394.671. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

srsRAN Project

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 24.10Test: PUSCH Processor Benchmark, Throughput TotalSMT Enabled - DefaultSMT Disabled4K8K12K16K20KSE +/- 157.68, N = 3SE +/- 176.66, N = 320677.218626.31. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl

OpenBenchmarking.orgMbps, More Is BettersrsRAN Project 24.10Test: PDSCH Processor Benchmark, Throughput TotalSMT Enabled - DefaultSMT Disabled30K60K90K120K150KSE +/- 550.27, N = 4SE +/- 524.68, N = 6118394.6108233.21. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: BMW27 - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled510152025SE +/- 0.01, N = 4SE +/- 0.01, N = 314.8818.72

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Classroom - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled1224364860SE +/- 0.09, N = 3SE +/- 0.04, N = 341.2552.23

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Pabellon Barcelona - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled1428425670SE +/- 0.08, N = 3SE +/- 0.03, N = 346.6961.69

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Barbershop - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled4080120160200SE +/- 0.21, N = 3SE +/- 0.16, N = 3146.57190.13

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 4.3Blend File: Junkshop - Compute: CPU-OnlySMT Enabled - DefaultSMT Disabled612182430SE +/- 0.01, N = 3SE +/- 0.04, N = 320.0126.04

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: DLSC - Acceleration: CPUSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.04, N = 3SE +/- 0.08, N = 315.1111.76MIN: 14.74 / MAX: 17.3MIN: 11.25 / MAX: 13.36

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Rainbow Colors and Prism - Acceleration: CPUSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.10, N = 6SE +/- 0.06, N = 531.5422.36MIN: 27.49 / MAX: 32.55MIN: 20.03 / MAX: 22.83

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: LuxCore Benchmark - Acceleration: CPUSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.09, N = 3SE +/- 0.04, N = 312.278.74MIN: 6.11 / MAX: 13.85MIN: 4.38 / MAX: 9.71

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPUSMT Enabled - DefaultSMT Disabled510152025SE +/- 0.07, N = 3SE +/- 0.15, N = 321.9816.43MIN: 19.44 / MAX: 27.65MIN: 14.19 / MAX: 19.88

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPUSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.04, N = 3SE +/- 0.08, N = 311.498.22MIN: 6.03 / MAX: 12.77MIN: 4.38 / MAX: 9.13

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: BT.CSMT DisabledSMT Enabled - Default80K160K240K320K400KSE +/- 491.86, N = 5SE +/- 2468.08, N = 15352266.66329182.781. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.CSMT DisabledSMT Enabled - Default2K4K6K8K10KSE +/- 313.55, N = 15SE +/- 94.51, N = 1511316.889641.861. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.DSMT DisabledSMT Enabled - Default3K6K9K12K15KSE +/- 523.50, N = 12SE +/- 551.90, N = 1511786.0710642.011. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.CSMT DisabledSMT Enabled - Default40K80K120K160K200KSE +/- 76.78, N = 9SE +/- 2690.96, N = 15171779.61149810.231. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.CSMT DisabledSMT Enabled - Default70K140K210K280K350KSE +/- 973.24, N = 6SE +/- 2151.09, N = 15317628.32284866.891. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.BSMT DisabledSMT Enabled - Default50K100K150K200K250KSE +/- 1995.83, N = 15SE +/- 1833.56, N = 15227311.90185256.171. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.CSMT DisabledSMT Enabled - Default30K60K90K120K150KSE +/- 133.32, N = 5SE +/- 194.21, N = 5151971.93147801.321. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.DSMT DisabledSMT Enabled - Default15003000450060007500SE +/- 15.47, N = 6SE +/- 68.21, N = 67078.617000.731. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.CSMT DisabledSMT Enabled - Default40K80K120K160K200KSE +/- 337.13, N = 11SE +/- 463.56, N = 11178843.53159653.781. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.CSMT DisabledSMT Enabled - Default16K32K48K64K80KSE +/- 301.36, N = 10SE +/- 1042.31, N = 1572412.0462539.001. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.6

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUSMT Enabled - DefaultSMT Disabled30060090012001500SE +/- 0.33, N = 3SE +/- 0.88, N = 310171386

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUSMT Enabled - DefaultSMT Disabled10K20K30K40K50KSE +/- 73.90, N = 3SE +/- 101.57, N = 33524947071

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUSMT Enabled - DefaultSMT Disabled30060090012001500SE +/- 0.88, N = 3SE +/- 1.86, N = 310271411

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUSMT Enabled - DefaultSMT Disabled10K20K30K40K50KSE +/- 14.15, N = 3SE +/- 32.74, N = 33554147644

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPUSMT Enabled - DefaultSMT Disabled400800120016002000SE +/- 1.20, N = 3SE +/- 0.00, N = 312031656

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 1.0Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPUSMT Enabled - DefaultSMT Disabled12K24K36K48K60KSE +/- 91.26, N = 3SE +/- 45.71, N = 34132555556

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 2.0.0Benchmark: vklBenchmarkCPU ISPCSMT Enabled - DefaultSMT Disabled5001000150020002500SE +/- 0.33, N = 3SE +/- 1.45, N = 323911794MIN: 188 / MAX: 30677MIN: 141 / MAX: 24183

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian DragonSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.05, N = 7SE +/- 0.04, N = 5137.9988.34MIN: 136.67 / MAX: 139.92MIN: 87.7 / MAX: 89.3

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: Asian Dragon ObjSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.08, N = 4SE +/- 0.01, N = 4118.3375.88MIN: 116.67 / MAX: 120.05MIN: 75.31 / MAX: 76.83

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.3Binary: Pathtracer ISPC - Model: CrownSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.04, N = 6SE +/- 0.03, N = 5111.9671.42MIN: 110.19 / MAX: 114.2MIN: 70.44 / MAX: 72.43

Timed Eigen Compilation

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.4.0Time To CompileSMT Enabled - DefaultSMT Disabled714212835SE +/- 0.05, N = 3SE +/- 0.01, N = 328.3430.52

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.8Build: allmodconfigSMT Enabled - DefaultSMT Disabled50100150200250SE +/- 0.29, N = 3SE +/- 0.20, N = 3190.67220.74

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjaSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.19, N = 3SE +/- 0.08, N = 3101.11114.66

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built from the Chrome V8 JavaScript engine while itself is written in C/C++. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 21.7.2Time To CompileSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.10, N = 3SE +/- 0.20, N = 3124.16136.09

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 23.0.1Time To CompileSMT Enabled - DefaultSMT Disabled306090120150SE +/- 0.25, N = 3SE +/- 1.06, N = 12121.00129.54

Palabos

OpenBenchmarking.orgMega Site Updates Per Second, More Is BetterPalabos 2.3Grid Size: 500SMT DisabledSMT Enabled - Default170340510680850SE +/- 1.81, N = 3SE +/- 0.76, N = 3772.34770.441. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

Laghos

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshSMT DisabledSMT Enabled - Default120240360480600SE +/- 3.98, N = 3SE +/- 2.09, N = 3566.24562.401. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point ProblemSMT DisabledSMT Enabled - Default70140210280350SE +/- 2.92, N = 3SE +/- 3.57, N = 3298.52295.091. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 13 - Input: Bosphorus 4KSMT DisabledSMT Enabled - Default100200300400500SE +/- 12.29, N = 15SE +/- 11.10, N = 15482.46456.701. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 8 - Input: Bosphorus 4KSMT DisabledSMT Enabled - Default50100150200250SE +/- 0.89, N = 4SE +/- 0.95, N = 4211.07199.941. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 5 - Input: Bosphorus 4KSMT DisabledSMT Enabled - Default1428425670SE +/- 0.02, N = 3SE +/- 0.21, N = 362.1260.391. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 2.3Encoder Mode: Preset 3 - Input: Bosphorus 4KSMT DisabledSMT Enabled - Default48121620SE +/- 0.03, N = 3SE +/- 0.05, N = 317.3016.951. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: SlowSMT DisabledSMT Enabled - Default1020304050SE +/- 0.09, N = 4SE +/- 0.03, N = 444.0140.491. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: MediumSMT DisabledSMT Enabled - Default1020304050SE +/- 0.02, N = 4SE +/- 0.04, N = 444.8541.351. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Very FastSMT Enabled - DefaultSMT Disabled20406080100SE +/- 0.04, N = 6SE +/- 0.05, N = 693.2587.851. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Super FastSMT DisabledSMT Enabled - Default306090120150SE +/- 0.04, N = 7SE +/- 0.18, N = 7112.56108.961. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.2Video Input: Bosphorus 4K - Video Preset: Ultra FastSMT DisabledSMT Enabled - Default306090120150SE +/- 0.24, N = 7SE +/- 0.28, N = 7114.50112.001. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

VVenC

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.13Video Input: Bosphorus 4K - Video Preset: FastSMT DisabledSMT Enabled - Default3691215SE +/- 0.13, N = 3SE +/- 0.12, N = 313.0812.111. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.13Video Input: Bosphorus 4K - Video Preset: FasterSMT DisabledSMT Enabled - Default714212835SE +/- 0.07, N = 3SE +/- 0.07, N = 327.7825.611. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

uvg266

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: SlowSMT DisabledSMT Enabled - Default714212835SE +/- 0.02, N = 3SE +/- 0.02, N = 329.9227.75

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: MediumSMT DisabledSMT Enabled - Default816243240SE +/- 0.09, N = 3SE +/- 0.04, N = 332.9130.84

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Very FastSMT Enabled - DefaultSMT Disabled20406080100SE +/- 0.10, N = 6SE +/- 0.06, N = 574.7570.59

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Super FastSMT DisabledSMT Enabled - Default20406080100SE +/- 0.04, N = 6SE +/- 0.06, N = 676.7176.47

OpenBenchmarking.orgFrames Per Second, More Is Betteruvg266 0.8.0Video Input: Bosphorus 4K - Video Preset: Ultra FastSMT DisabledSMT Enabled - Default20406080100SE +/- 0.08, N = 6SE +/- 0.06, N = 681.9778.36

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512SMT Enabled - DefaultSMT Disabled300M600M900M1200M1500MSE +/- 1734935.16, N = 3SE +/- 1589898.67, N = 3151590000015124333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 128 - Buffer Length: 256 - Filter Length: 512SMT Enabled - DefaultSMT Disabled400M800M1200M1600M2000MSE +/- 3470350.61, N = 3SE +/- 3661056.31, N = 3183380000015078000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web-server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 500SMT Enabled - DefaultSMT Disabled120K240K360K480K600KSE +/- 1273.97, N = 3SE +/- 183.72, N = 3574247.25495324.891. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 1000SMT Enabled - DefaultSMT Disabled120K240K360K480K600KSE +/- 889.76, N = 3SE +/- 357.80, N = 3563863.85505544.051. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Rustls

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256SMT Enabled - DefaultSMT Disabled30K60K90K120K150KSE +/- 115.83, N = 4SE +/- 69.93, N = 4117122.32117111.871. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384SMT DisabledSMT Enabled - Default150K300K450K600K750KSE +/- 432.07, N = 3SE +/- 862.31, N = 3708330.20708240.911. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake - Suite: TLS13_CHACHA20_POLY1305_SHA256SMT DisabledSMT Enabled - Default20K40K60K80K100KSE +/- 21.33, N = 4SE +/- 81.03, N = 4111713.22111705.911. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-ticket - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256SMT DisabledSMT Enabled - Default700K1400K2100K2800K3500KSE +/- 11239.17, N = 3SE +/- 7293.15, N = 33180516.793173966.531. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-ticket - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384SMT Enabled - DefaultSMT Disabled500K1000K1500K2000K2500KSE +/- 3277.78, N = 3SE +/- 2755.66, N = 32250219.432238161.491. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-ticket - Suite: TLS13_CHACHA20_POLY1305_SHA256SMT Enabled - DefaultSMT Disabled110K220K330K440K550KSE +/- 1861.82, N = 3SE +/- 1009.73, N = 3507739.28505551.881. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-resume - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256SMT Enabled - DefaultSMT Disabled900K1800K2700K3600K4500KSE +/- 13138.92, N = 3SE +/- 24017.92, N = 34285728.044279636.211. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-resume - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384SMT Enabled - DefaultSMT Disabled600K1200K1800K2400K3000KSE +/- 4024.00, N = 3SE +/- 4997.81, N = 32766520.212750461.271. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenBenchmarking.orghandshakes/s, More Is BetterRustls 0.23.17Benchmark: handshake-resume - Suite: TLS13_CHACHA20_POLY1305_SHA256SMT DisabledSMT Enabled - Default90K180K270K360K450KSE +/- 5190.96, N = 3SE +/- 2869.68, N = 3409213.51408236.411. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.3Algorithm: RSA4096SMT DisabledSMT Enabled - Default10K20K30K40K50KSE +/- 103.99, N = 3SE +/- 114.98, N = 345691.745520.01. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.3Algorithm: RSA4096SMT Enabled - DefaultSMT Disabled400K800K1200K1600K2000KSE +/- 1937.35, N = 3SE +/- 1433.70, N = 31684833.11448237.71. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: SHA256SMT Enabled - DefaultSMT Disabled20000M40000M60000M80000M100000MSE +/- 110969434.67, N = 3SE +/- 258379314.76, N = 3114247738153852499963671. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: SHA512SMT Enabled - DefaultSMT Disabled10000M20000M30000M40000M50000MSE +/- 96053213.66, N = 3SE +/- 93655164.94, N = 345656086733418323745771. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: AES-128-GCMSMT Enabled - DefaultSMT Disabled300000M600000M900000M1200000M1500000MSE +/- 1917677180.61, N = 3SE +/- 1010958412.39, N = 3129379766208312859061524731. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: AES-256-GCMSMT Enabled - DefaultSMT Disabled300000M600000M900000M1200000M1500000MSE +/- 970142767.67, N = 3SE +/- 470340232.96, N = 3118912596841711150363881031. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: ChaCha20SMT Enabled - DefaultSMT Disabled160000M320000M480000M640000M800000MSE +/- 25742159.30, N = 3SE +/- 147174578.01, N = 37359067873305212592810671. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.3Algorithm: ChaCha20-Poly1305SMT Enabled - DefaultSMT Disabled110000M220000M330000M440000M550000MSE +/- 333074643.80, N = 3SE +/- 44777742.02, N = 35021607191433513497087571. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: MD5SMT Enabled - DefaultSMT Disabled4M8M12M16M20MSE +/- 49208.17, N = 3SE +/- 8660.25, N = 318460333140540001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: BlowfishSMT Enabled - DefaultSMT Disabled40K80K120K160K200KSE +/- 89.67, N = 3SE +/- 46.23, N = 31990401380731. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: HMAC-SHA512SMT Enabled - DefaultSMT Disabled90M180M270M360M450MSE +/- 1354899.42, N = 3SE +/- 2440649.98, N = 34279586673788760001. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: bcryptSMT Enabled - DefaultSMT Disabled40K80K120K160K200KSE +/- 205.13, N = 3SE +/- 48.59, N = 31993251381351. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

OpenBenchmarking.orgReal C/S, More Is BetterJohn The Ripper 2023.03.14Test: WPA PSKSMT Enabled - DefaultSMT Disabled200K400K600K800K1000KSE +/- 400.95, N = 3SE +/- 541.28, N = 38594235581601. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, First Run / Cold CacheSMT DisabledSMT Enabled - Default2004006008001000SE +/- 7.72, N = 3SE +/- 3.81, N = 3790.42774.37MIN: 58.77 / MAX: 8571.43MIN: 66.08 / MAX: 8571.43

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Second RunSMT DisabledSMT Enabled - Default2004006008001000SE +/- 4.42, N = 3SE +/- 8.66, N = 3819.66797.78MIN: 59.41 / MAX: 8571.43MIN: 67.04 / MAX: 8571.43

OpenBenchmarking.orgQueries Per Minute, Geo Mean, More Is BetterClickHouse 22.12.3.5100M Rows Hits Dataset, Third RunSMT DisabledSMT Enabled - Default2004006008001000SE +/- 2.69, N = 3SE +/- 9.65, N = 3825.31811.04MIN: 59.64 / MAX: 8571.43MIN: 66.52 / MAX: 8571.43

PostgreSQL

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read WriteSMT Enabled - DefaultSMT Disabled30K60K90K120K150KSE +/- 322.78, N = 3SE +/- 144.52, N = 3126692977821. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average LatencySMT Enabled - DefaultSMT Disabled246810SE +/- 0.016, N = 3SE +/- 0.012, N = 36.3158.1811. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read OnlySMT Enabled - DefaultSMT Disabled1000K2000K3000K4000K5000KSE +/- 14712.71, N = 3SE +/- 8567.74, N = 3483676936250791. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average LatencySMT Enabled - DefaultSMT Disabled0.04970.09940.14910.19880.2485SE +/- 0.001, N = 3SE +/- 0.000, N = 30.1650.2211. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read WriteSMT Enabled - DefaultSMT Disabled20K40K60K80K100KSE +/- 547.52, N = 3SE +/- 151.71, N = 3114622926801. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average LatencySMT Enabled - DefaultSMT Disabled3691215SE +/- 0.042, N = 3SE +/- 0.018, N = 38.72510.7901. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgTPS, More Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read OnlySMT Enabled - DefaultSMT Disabled1000K2000K3000K4000K5000KSE +/- 17122.75, N = 3SE +/- 1560.76, N = 3474349035646431. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

OpenBenchmarking.orgms, Fewer Is BetterPostgreSQL 17Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average LatencySMT Enabled - DefaultSMT Disabled0.0630.1260.1890.2520.315SE +/- 0.001, N = 3SE +/- 0.000, N = 30.2110.2801. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm

Memcached

Memcached is a high performance, distributed memory object caching system. This Memcached test profiles makes use of memtier_benchmark for excuting this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:100SMT Enabled - DefaultSMT Disabled3M6M9M12M15MSE +/- 119540.65, N = 15SE +/- 73959.82, N = 313615896.0010001671.481. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenBenchmarking.orgOps/sec, More Is BetterMemcached 1.6.19Set To Get Ratio: 1:10SMT Enabled - DefaultSMT Disabled1.5M3M4.5M6M7.5MSE +/- 14479.53, N = 3SE +/- 31229.00, N = 37150348.986416279.961. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Random ReadSMT DisabledSMT Enabled - Default120M240M360M480M600MSE +/- 4177488.26, N = 10SE +/- 715514.99, N = 35458262825355125811. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read While WritingSMT Enabled - DefaultSMT Disabled3M6M9M12M15MSE +/- 63473.09, N = 3SE +/- 88547.74, N = 41203919681417171. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Read Random Write RandomSMT Enabled - DefaultSMT Disabled1.6M3.2M4.8M6.4M8MSE +/- 36529.56, N = 3SE +/- 42893.92, N = 3742530459970261. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 9.0Test: Update RandomSMT DisabledSMT Enabled - Default300K600K900K1200K1500KSE +/- 11648.08, N = 3SE +/- 692.00, N = 312866856964171. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Speedb

Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random ReadSMT DisabledSMT Enabled - Default120M240M360M480M600MSE +/- 1746081.52, N = 3SE +/- 1478695.31, N = 35589024505473418831. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read While WritingSMT Enabled - DefaultSMT Disabled2M4M6M8M10MSE +/- 210416.01, N = 15SE +/- 52167.60, N = 81038636460127891. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read Random Write RandomSMT Enabled - DefaultSMT Disabled800K1600K2400K3200K4000KSE +/- 2099.62, N = 3SE +/- 9776.13, N = 3365399236204511. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Update RandomSMT DisabledSMT Enabled - Default200K400K600K800K1000KSE +/- 1284.63, N = 3SE +/- 369.07, N = 39101045328481. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per SecondSMT Enabled - DefaultSMT Disabled900K1800K2700K3600K4500KSE +/- 2817.99, N = 3SE +/- 11150.98, N = 34035383.572844496.911. (CC) gcc options: -O2 -lrt" -lrt

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: HWB Color SpaceSMT DisabledSMT Enabled - Default110220330440550SE +/- 3.51, N = 3SE +/- 4.51, N = 35174771. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: Noise-GaussianSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.33, N = 3SE +/- 0.67, N = 32812271. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: EnhancedSMT Enabled - DefaultSMT Disabled70140210280350SE +/- 2.19, N = 3SE +/- 0.00, N = 33382521. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: ResizingSMT DisabledSMT Enabled - Default90180270360450SE +/- 5.24, N = 3SE +/- 0.67, N = 34032131. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: RotateSMT DisabledSMT Enabled - Default60120180240300SE +/- 1.76, N = 3SE +/- 0.33, N = 32812751. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SharpenSMT Enabled - DefaultSMT Disabled60120180240300SE +/- 0.33, N = 3SE +/- 0.88, N = 32972921. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

OpenBenchmarking.orgIterations Per Minute, More Is BetterGraphicsMagick 1.3.43Operation: SwirlSMT Enabled - DefaultSMT Disabled150300450600750SE +/- 2.96, N = 3SE +/- 0.33, N = 36796041. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

ACES DGEMM

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RateSMT DisabledSMT Enabled - Default12002400360048006000SE +/- 8.97, N = 5SE +/- 4.91, N = 55484.795217.651. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 23Jun2022Model: 20k AtomsSMT Enabled - DefaultSMT Disabled1224364860SE +/- 0.24, N = 3SE +/- 0.14, N = 353.7953.661. (CXX) g++ options: -O3 -lm -ldl

NAMD

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: ATPase with 327,506 AtomsSMT Enabled - DefaultSMT Disabled3691215SE +/- 0.09, N = 7SE +/- 0.00, N = 612.9711.37

OpenBenchmarking.orgns/day, More Is BetterNAMD 3.0Input: STMV with 1,066,628 AtomsSMT Enabled - DefaultSMT Disabled0.84221.68442.52663.36884.211SE +/- 0.00568, N = 4SE +/- 0.00445, N = 33.743173.28552

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on Black-Sholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMPSMT Enabled - DefaultSMT Disabled6K12K18K24K30KSE +/- 11.50, N = 3SE +/- 17.22, N = 327898.7227904.251. (CXX) g++ options: -O3 -march=native -fopenmp

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMPSMT Enabled - DefaultSMT Disabled4K8K12K16K20KSE +/- 15.72, N = 3SE +/- 18.58, N = 317136.7117164.101. (CXX) g++ options: -O3 -march=native -fopenmp

7-Zip Compression

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Compression RatingSMT Enabled - DefaultSMT Disabled140K280K420K560K700KSE +/- 5367.73, N = 3SE +/- 5895.67, N = 36373184895251. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 24.05Test: Decompression RatingSMT Enabled - DefaultSMT Disabled110K220K330K440K550KSE +/- 447.27, N = 3SE +/- 214.48, N = 35332103305361. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Primesieve

OpenBenchmarking.orgSeconds, Fewer Is BetterPrimesieve 12.6Length: 1e13SMT Enabled - DefaultSMT Disabled612182430SE +/- 0.05, N = 3SE +/- 0.05, N = 324.5826.291. (CXX) g++ options: -O3

Stockfish

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 17Chess BenchmarkSMT Enabled - DefaultSMT Disabled50M100M150M200M250MSE +/- 3026146.64, N = 13SE +/- 4557708.76, N = 152444378221769732211. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. The sample scene used is the Teapot scene ray-traced to 8K x 8K with 32 samples. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTachyon 0.99.2Total TimeSMT Enabled - DefaultSMT Disabled48121620SE +/- 0.00, N = 4SE +/- 0.04, N = 316.3818.151. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

C-Ray

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 4K - Rays Per Pixel: 16SMT Enabled - DefaultSMT Disabled816243240SE +/- 0.01, N = 3SE +/- 0.01, N = 333.6134.281. (CC) gcc options: -lpthread -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterC-Ray 2.0Resolution: 5K - Rays Per Pixel: 16SMT Enabled - DefaultSMT Disabled1428425670SE +/- 0.01, N = 3SE +/- 0.03, N = 359.7060.951. (CC) gcc options: -lpthread -lm

154 Results Shown

OpenVINO:
  Vehicle Detection FP16-INT8 - CPU:
    FPS
    ms
  Person Vehicle Bike Detection FP16 - CPU:
    FPS
    ms
  Machine Translation EN To DE FP16 - CPU:
    FPS
    ms
  Face Detection Retail FP16-INT8 - CPU:
    FPS
    ms
  Handwritten English Recognition FP16-INT8 - CPU:
    FPS
    ms
  Road Segmentation ADAS FP16-INT8 - CPU:
    FPS
    ms
  Person Re-Identification Retail FP16 - CPU:
    FPS
    ms
  Noise Suppression Poconet-Like FP16 - CPU:
    FPS
    ms
OpenVINO GenAI
Llama.cpp:
  CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128
  CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128
  CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128
Stress-NG:
  CPU Stress
  Memory Copying
  Vector Math
  Context Switching
  CPU Cache
  NUMA
  AVX-512 VNNI
  Integer Math
  Integer Bit Operations
  Hyperbolic Trigonometric Math
srsRAN Project:
  PUSCH Processor Benchmark, Throughput Total
  PDSCH Processor Benchmark, Throughput Total
Blender:
  BMW27 - CPU-Only
  Classroom - CPU-Only
  Pabellon Barcelona - CPU-Only
  Barbershop - CPU-Only
  Junkshop - CPU-Only
LuxCoreRender:
  DLSC - CPU
  Rainbow Colors and Prism - CPU
  LuxCore Benchmark - CPU
  Orange Juice - CPU
  Danish Mood - CPU
NAS Parallel Benchmarks:
  BT.C
  EP.C
  EP.D
  FT.C
  LU.C
  SP.B
  SP.C
  IS.D
  MG.C
  CG.C
OSPRay Studio:
  1 - 4K - 1 - Path Tracer - CPU
  1 - 4K - 32 - Path Tracer - CPU
  2 - 4K - 1 - Path Tracer - CPU
  2 - 4K - 32 - Path Tracer - CPU
  3 - 4K - 1 - Path Tracer - CPU
  3 - 4K - 32 - Path Tracer - CPU
OpenVKL
Embree:
  Pathtracer ISPC - Asian Dragon
  Pathtracer ISPC - Asian Dragon Obj
  Pathtracer ISPC - Crown
Timed Eigen Compilation
Timed Linux Kernel Compilation
Timed LLVM Compilation
Timed Node.js Compilation
Timed Gem5 Compilation
Palabos
Laghos:
  Sedov Blast Wave, ube_922_hex.mesh
  Triple Point Problem
SVT-AV1:
  Preset 13 - Bosphorus 4K
  Preset 8 - Bosphorus 4K
  Preset 5 - Bosphorus 4K
  Preset 3 - Bosphorus 4K
Kvazaar:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
VVenC:
  Bosphorus 4K - Fast
  Bosphorus 4K - Faster
uvg266:
  Bosphorus 4K - Slow
  Bosphorus 4K - Medium
  Bosphorus 4K - Very Fast
  Bosphorus 4K - Super Fast
  Bosphorus 4K - Ultra Fast
Liquid-DSP:
  64 - 256 - 512
  128 - 256 - 512
nginx:
  500
  1000
Rustls:
  handshake - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
  handshake - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
  handshake - TLS13_CHACHA20_POLY1305_SHA256
  handshake-ticket - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
  handshake-ticket - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
  handshake-ticket - TLS13_CHACHA20_POLY1305_SHA256
  handshake-resume - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
  handshake-resume - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
  handshake-resume - TLS13_CHACHA20_POLY1305_SHA256
OpenSSL:
  RSA4096:
    sign/s
    verify/s
  SHA256:
    byte/s
  SHA512:
    byte/s
  AES-128-GCM:
    byte/s
  AES-256-GCM:
    byte/s
  ChaCha20:
    byte/s
  ChaCha20-Poly1305:
    byte/s
John The Ripper:
  MD5
  Blowfish
  HMAC-SHA512
  bcrypt
  WPA PSK
ClickHouse:
  100M Rows Hits Dataset, First Run / Cold Cache
  100M Rows Hits Dataset, Second Run
  100M Rows Hits Dataset, Third Run
PostgreSQL:
  100 - 800 - Read Write
  100 - 800 - Read Write - Average Latency
  100 - 800 - Read Only
  100 - 800 - Read Only - Average Latency
  100 - 1000 - Read Write
  100 - 1000 - Read Write - Average Latency
  100 - 1000 - Read Only
  100 - 1000 - Read Only - Average Latency
Memcached:
  1:100
  1:10
RocksDB:
  Rand Read
  Read While Writing
  Read Rand Write Rand
  Update Rand
Speedb:
  Rand Read
  Read While Writing
  Read Rand Write Rand
  Update Rand
Coremark
GraphicsMagick:
  HWB Color Space
  Noise-Gaussian
  Enhanced
  Resizing
  Rotate
  Sharpen
  Swirl
ACES DGEMM
LAMMPS Molecular Dynamics Simulator
NAMD:
  ATPase with 327,506 Atoms
  STMV with 1,066,628 Atoms
FinanceBench:
  Bonds OpenMP
  Repo OpenMP
7-Zip Compression:
  Compression Rating
  Decompression Rating
Primesieve
Stockfish
Tachyon
C-Ray:
  4K - 16
  5K - 16