AMD EPYC Zen 5 SMT Comparison

AMD EPYC 9575F 1P SMT comparison benchmarks by Michael Larabel for a future article. Fresh tests repeated with SMT on/off from SMCI BIOS toggle.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2501314-NE-AMDEPYCZE72

Run Management

SMT Enabled - Default: January 30, test duration 9 Hours, 36 Minutes
SMT Disabled: January 30, test duration 9 Hours, 38 Minutes

AMD EPYC Zen 5 SMT Comparison (OpenBenchmarking.org, Phoronix Test Suite)

Processor: AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores / 128 Threads) for SMT Enabled - Default; AMD EPYC 9575F 64-Core @ 3.30GHz (64 Cores) for SMT Disabled
Motherboard: Supermicro Super Server H13SSL-N v1.01 (3.0 BIOS)
Chipset: AMD 1Ah
Memory: 12 x 64GB DDR5-6000MT/s Micron MTC40F2046S1RC64BDY QSFF
Disk: 3201GB Micron_7450_MTFDKCB3T2TFS
Graphics: ASPEED
Network: 2 x Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 24.10
Kernel: 6.13.0-phx (x86_64)
Desktop: GNOME Shell 47.0
Display Server: X Server
Compiler: GCC 14.2.0
File-System: ext4
Screen Resolution: 1024x768

AMD EPYC Zen 5 SMT Comparison Benchmarks: System Logs

- Transparent Huge Pages: madvise
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled)
- CPU Microcode: 0xb002116
- Python 3.12.7
- SMT Enabled - Default: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- SMT Disabled: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of IBPB on VMEXIT only + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: disabled; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

SMT Enabled - Default vs. SMT Disabled Comparison (Phoronix Test Suite): chart of the per-test percentage difference between the two configurations relative to the baseline, with listed differences ranging from 2% to 99%. The individual results follow below.

AMD EPYC Zen 5 SMT Comparison (OpenBenchmarking.org): result overview table listing every test with its SMT Enabled - Default and SMT Disabled values. The per-test results follow below.

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 2.0.0 - Benchmark: vklBenchmarkCPU ISPC
Items / Sec, More Is Better
SMT Disabled: 1794 (SE +/- 1.45, N = 3; MIN: 141 / MAX: 24183)
SMT Enabled - Default: 2391 (SE +/- 0.33, N = 3; MIN: 188 / MAX: 30677)

Stockfish

Stockfish 17 - Chess Benchmark
Nodes Per Second, More Is Better
SMT Disabled: 176973221 (SE +/- 4557708.76, N = 15)
SMT Enabled - Default: 244437822 (SE +/- 3026146.64, N = 13)
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
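
Each result on this page carries an "SE +/- x, N = n" annotation, where N is the number of runs. As a rough illustration of how such a figure can be derived, the sketch below assumes the standard error is the sample standard deviation divided by the square root of N. That formula is an assumption, since this page does not state how the Phoronix Test Suite computes it, and the run values are hypothetical rather than taken from this result file.

```python
import math
import statistics


def standard_error(runs):
    """Standard error of the mean: sample standard deviation / sqrt(N)."""
    return statistics.stdev(runs) / math.sqrt(len(runs))


# Hypothetical run values, NOT taken from this result file.
runs = [120.0, 121.0, 122.0]
print(f"mean = {statistics.mean(runs):.2f}, "
      f"SE +/- {standard_error(runs):.2f}, N = {len(runs)}")
```

A larger N with a similar spread gives a smaller standard error, which is one reason noisier tests tend to be run more times.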

Timed Gem5 Compilation

This test times how long it takes to compile Gem5. Gem5 is a simulator for computer system architecture research. Gem5 is widely used for computer architecture research within the industry, academia, and more. Learn more via the OpenBenchmarking.org test page.

Timed Gem5 Compilation 23.0.1 - Time To Compile
Seconds, Fewer Is Better
SMT Disabled: 129.54 (SE +/- 1.06, N = 12)
SMT Enabled - Default: 121.00 (SE +/- 0.25, N = 3)
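
To compare the two configurations on a common scale, each pair of values can be turned into a percentage difference. The sketch below is one way to do that, assuming the difference is the better-direction ratio minus one (enabled/disabled for "More Is Better", disabled/enabled for "Fewer Is Better"). The function name `smt_gain_percent` is hypothetical; the input values are the OpenVKL and Timed Gem5 Compilation results above.

```python
def smt_gain_percent(enabled: float, disabled: float,
                     more_is_better: bool = True) -> float:
    """Percent advantage of SMT enabled over SMT disabled.

    A negative return value means SMT disabled was the faster configuration.
    """
    if more_is_better:
        ratio = enabled / disabled
    else:
        # For time-based results a smaller number is the better one.
        ratio = disabled / enabled
    return (ratio - 1.0) * 100.0


# OpenVKL vklBenchmarkCPU ISPC (Items / Sec, More Is Better)
print(round(smt_gain_percent(2391, 1794), 1))
# Timed Gem5 Compilation (Seconds, Fewer Is Better)
print(round(smt_gain_percent(121.00, 129.54, more_is_better=False), 1))
```

Under this assumption the two tests come out at roughly 33.3% and 7.1% in favor of SMT enabled.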

OpenSSL

OpenSSL is an open-source toolkit that implements SSL (Secure Sockets Layer) and TLS (Transport Layer Security) protocols. This test profile makes use of the built-in "openssl speed" benchmarking capabilities. Learn more via the OpenBenchmarking.org test page.

OpenSSL 3.3 - Algorithm: RSA4096
verify/s, More Is Better
SMT Disabled: 1448237.7 (SE +/- 1433.70, N = 3)
SMT Enabled - Default: 1684833.1 (SE +/- 1937.35, N = 3)

OpenSSL 3.3 - Algorithm: RSA4096
sign/s, More Is Better
SMT Enabled - Default: 45520.0 (SE +/- 114.98, N = 3)
SMT Disabled: 45691.7 (SE +/- 103.99, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Speedb

Speedb is a next-generation key-value storage engine that is RocksDB-compatible and aims for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.

Speedb 2.7 - Test: Read While Writing
Op/s, More Is Better
SMT Disabled: 6012789 (SE +/- 52167.60, N = 8)
SMT Enabled - Default: 10386364 (SE +/- 210416.01, N = 15)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Rustls

Rustls 0.23.17 - Benchmark: handshake-resume - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
handshakes/s, More Is Better
SMT Disabled: 2750461.27 (SE +/- 4997.81, N = 3)
SMT Enabled - Default: 2766520.21 (SE +/- 4024.00, N = 3)

Rustls 0.23.17 - Benchmark: handshake-ticket - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
handshakes/s, More Is Better
SMT Disabled: 2238161.49 (SE +/- 2755.66, N = 3)
SMT Enabled - Default: 2250219.43 (SE +/- 3277.78, N = 3)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Timed Linux Kernel Compilation 6.8 - Build: allmodconfig
Seconds, Fewer Is Better
SMT Disabled: 220.74 (SE +/- 0.20, N = 3)
SMT Enabled - Default: 190.67 (SE +/- 0.29, N = 3)

Memcached

Memcached is a high-performance, distributed memory object caching system. This Memcached test profile makes use of memtier_benchmark for executing this CPU/memory-focused server benchmark. Learn more via the OpenBenchmarking.org test page.

Memcached 1.6.19 - Set To Get Ratio: 1:100
Ops/sec, More Is Better
SMT Disabled: 10001671.48 (SE +/- 73959.82, N = 3)
SMT Enabled - Default: 13615896.00 (SE +/- 119540.65, N = 15)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OpenSSL

OpenSSL 3.3 - Algorithm: ChaCha20-Poly1305
byte/s, More Is Better
SMT Disabled: 351349708757 (SE +/- 44777742.02, N = 3)
SMT Enabled - Default: 502160719143 (SE +/- 333074643.80, N = 3)

OpenSSL 3.3 - Algorithm: ChaCha20
byte/s, More Is Better
SMT Disabled: 521259281067 (SE +/- 147174578.01, N = 3)
SMT Enabled - Default: 735906787330 (SE +/- 25742159.30, N = 3)

OpenSSL 3.3 - Algorithm: AES-256-GCM
byte/s, More Is Better
SMT Disabled: 1115036388103 (SE +/- 470340232.96, N = 3)
SMT Enabled - Default: 1189125968417 (SE +/- 970142767.67, N = 3)

OpenSSL 3.3 - Algorithm: AES-128-GCM
byte/s, More Is Better
SMT Disabled: 1285906152473 (SE +/- 1010958412.39, N = 3)
SMT Enabled - Default: 1293797662083 (SE +/- 1917677180.61, N = 3)

OpenSSL 3.3 - Algorithm: SHA512
byte/s, More Is Better
SMT Disabled: 41832374577 (SE +/- 93655164.94, N = 3)
SMT Enabled - Default: 45656086733 (SE +/- 96053213.66, N = 3)

OpenSSL 3.3 - Algorithm: SHA256
byte/s, More Is Better
SMT Disabled: 85249996367 (SE +/- 258379314.76, N = 3)
SMT Enabled - Default: 114247738153 (SE +/- 110969434.67, N = 3)
1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

Blender

Blender 4.3 - Blend File: Barbershop - Compute: CPU-Only
Seconds, Fewer Is Better
SMT Disabled: 190.13 (SE +/- 0.16, N = 3)
SMT Enabled - Default: 146.57 (SE +/- 0.21, N = 3)

LAMMPS Molecular Dynamics Simulator

LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. Learn more via the OpenBenchmarking.org test page.

LAMMPS Molecular Dynamics Simulator 23Jun2022 - Model: 20k Atoms
ns/day, More Is Better
SMT Disabled: 53.66 (SE +/- 0.14, N = 3)
SMT Enabled - Default: 53.79 (SE +/- 0.24, N = 3)
1. (CXX) g++ options: -O3 -lm -ldl

Rustls

Rustls 0.23.17 - Benchmark: handshake-resume - Suite: TLS13_CHACHA20_POLY1305_SHA256
handshakes/s, More Is Better
SMT Enabled - Default: 408236.41 (SE +/- 2869.68, N = 3)
SMT Disabled: 409213.51 (SE +/- 5190.96, N = 3)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 3 - Input: Bosphorus 4K
Frames Per Second, More Is Better
SMT Enabled - Default: 16.95 (SE +/- 0.05, N = 3)
SMT Disabled: 17.30 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Rustls

Rustls 0.23.17 - Benchmark: handshake-ticket - Suite: TLS13_CHACHA20_POLY1305_SHA256
handshakes/s, More Is Better
SMT Disabled: 505551.88 (SE +/- 1009.73, N = 3)
SMT Enabled - Default: 507739.28 (SE +/- 1861.82, N = 3)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

PostgreSQL

PostgreSQL 17 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency
ms, Fewer Is Better
SMT Disabled: 0.280 (SE +/- 0.000, N = 3)
SMT Enabled - Default: 0.211 (SE +/- 0.001, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only
TPS, More Is Better
SMT Disabled: 3564643 (SE +/- 1560.76, N = 3)
SMT Enabled - Default: 4743490 (SE +/- 17122.75, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency
ms, Fewer Is Better
SMT Disabled: 10.790 (SE +/- 0.018, N = 3)
SMT Enabled - Default: 8.725 (SE +/- 0.042, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write
TPS, More Is Better
SMT Disabled: 92680 (SE +/- 151.71, N = 3)
SMT Enabled - Default: 114622 (SE +/- 547.52, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency
ms, Fewer Is Better
SMT Disabled: 0.221 (SE +/- 0.000, N = 3)
SMT Enabled - Default: 0.165 (SE +/- 0.001, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Only
TPS, More Is Better
SMT Disabled: 3625079 (SE +/- 8567.74, N = 3)
SMT Enabled - Default: 4836769 (SE +/- 14712.71, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write - Average Latency
ms, Fewer Is Better
SMT Disabled: 8.181 (SE +/- 0.012, N = 3)
SMT Enabled - Default: 6.315 (SE +/- 0.016, N = 3)

PostgreSQL 17 - Scaling Factor: 100 - Clients: 800 - Mode: Read Write
TPS, More Is Better
SMT Disabled: 97782 (SE +/- 144.52, N = 3)
SMT Enabled - Default: 126692 (SE +/- 322.78, N = 3)
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpq -lpgcommon -lpgport -lm
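
The PostgreSQL latency and TPS figures above appear to be two views of the same measurement: with a fixed number of clients, average latency is roughly the client count divided by the throughput. The sketch below checks that relationship against the reported numbers. It assumes every client is kept busy for the whole run, and the helper name `avg_latency_ms` is hypothetical. Small differences in the last digit are expected because the page reports averages over several runs.

```python
def avg_latency_ms(clients: int, tps: float) -> float:
    """Approximate average latency in milliseconds for a closed-loop load."""
    return clients / tps * 1000.0


# (clients, reported TPS, reported average latency in ms), SMT Enabled - Default
reported = [
    (1000, 4743490, 0.211),   # Read Only
    (1000, 114622, 8.725),    # Read Write
    (800, 4836769, 0.165),    # Read Only
    (800, 126692, 6.315),     # Read Write
]

for clients, tps, latency in reported:
    estimate = avg_latency_ms(clients, tps)
    print(f"clients={clients} tps={tps}: estimated {estimate:.3f} ms, "
          f"reported {latency:.3f} ms")
```

Because the two metrics move together, the latency results add little independent information beyond the TPS results for the same client count and mode.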

Timed Node.js Compilation

This test profile times how long it takes to build/compile Node.js itself from source. Node.js is a JavaScript run-time built on the Chrome V8 JavaScript engine and is itself written in C/C++. Learn more via the OpenBenchmarking.org test page.

Timed Node.js Compilation 21.7.2 - Time To Compile
Seconds, Fewer Is Better
SMT Disabled: 136.09 (SE +/- 0.20, N = 3)
SMT Enabled - Default: 124.16 (SE +/- 0.10, N = 3)

RocksDB

This is a benchmark of Meta/Facebook's RocksDB as an embeddable persistent key-value store for fast storage based on Google's LevelDB. Learn more via the OpenBenchmarking.org test page.

RocksDB 9.0 - Test: Random Read
Op/s, More Is Better
SMT Enabled - Default: 535512581 (SE +/- 715514.99, N = 3)
SMT Disabled: 545826282 (SE +/- 4177488.26, N = 10)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

ClickHouse

ClickHouse is an open-source, high performance OLAP data management system. This test profile uses ClickHouse's standard benchmark recommendations per https://clickhouse.com/docs/en/operations/performance-test/ / https://github.com/ClickHouse/ClickBench/tree/main/clickhouse with the 100 million rows web analytics dataset. The reported value is the query processing time using the geometric mean of all separate queries performed as an aggregate. Learn more via the OpenBenchmarking.org test page.

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Third Run
Queries Per Minute, Geo Mean, More Is Better
SMT Enabled - Default: 811.04 (SE +/- 9.65, N = 3; MIN: 66.52 / MAX: 8571.43)
SMT Disabled: 825.31 (SE +/- 2.69, N = 3; MIN: 59.64 / MAX: 8571.43)

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, Second Run
Queries Per Minute, Geo Mean, More Is Better
SMT Enabled - Default: 797.78 (SE +/- 8.66, N = 3; MIN: 67.04 / MAX: 8571.43)
SMT Disabled: 819.66 (SE +/- 4.42, N = 3; MIN: 59.41 / MAX: 8571.43)

ClickHouse 22.12.3.5 - 100M Rows Hits Dataset, First Run / Cold Cache
Queries Per Minute, Geo Mean, More Is Better
SMT Enabled - Default: 774.37 (SE +/- 3.81, N = 3; MIN: 66.08 / MAX: 8571.43)
SMT Disabled: 790.42 (SE +/- 7.72, N = 3; MIN: 58.77 / MAX: 8571.43)
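
The ClickHouse figures are a geometric mean across all queries, which keeps a handful of very fast queries (note the MAX of 8571.43) from dominating the aggregate. The sketch below shows the calculation on hypothetical per-query rates. The input values are illustrative only and are not taken from this result file.

```python
import math


def geometric_mean(values):
    """Geometric mean computed via logarithms to avoid overflow on long lists."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Hypothetical per-query rates in queries per minute, NOT from this result file.
per_query_qpm = [60.0, 600.0, 6000.0]

print(f"geometric mean:  {geometric_mean(per_query_qpm):.1f}")
print(f"arithmetic mean: {sum(per_query_qpm) / len(per_query_qpm):.1f}")
```

With these inputs the geometric mean lands on the middle value, while the arithmetic mean is pulled far toward the fastest query.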

Timed LLVM Compilation

This test times how long it takes to compile/build the LLVM compiler stack. Learn more via the OpenBenchmarking.org test page.

Timed LLVM Compilation 16.0 - Build System: Ninja
Seconds, Fewer Is Better
SMT Disabled: 114.66 (SE +/- 0.08, N = 3)
SMT Enabled - Default: 101.11 (SE +/- 0.19, N = 3)

nginx

This is a benchmark of the lightweight Nginx HTTP(S) web server. This Nginx web server benchmark test profile makes use of the wrk program for facilitating the HTTP requests over a fixed period of time with a configurable number of concurrent clients/connections. HTTPS with a self-signed OpenSSL certificate is used by this test for local benchmarking. Learn more via the OpenBenchmarking.org test page.

nginx 1.23.2 - Connections: 1000
Requests Per Second, More Is Better
SMT Disabled: 505544.05 (SE +/- 357.80, N = 3)
SMT Enabled - Default: 563863.85 (SE +/- 889.76, N = 3)

nginx 1.23.2 - Connections: 500
Requests Per Second, More Is Better
SMT Disabled: 495324.89 (SE +/- 183.72, N = 3)
SMT Enabled - Default: 574247.25 (SE +/- 1273.97, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Stress-NG

Stress-NG 0.18.09 - Test: CPU Cache
Bogo Ops/s, More Is Better
SMT Enabled - Default: 2704684.59 (SE +/- 42296.58, N = 12)
SMT Disabled: 3091182.20 (SE +/- 35933.89, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Rustls

Rustls 0.23.17 - Benchmark: handshake-resume - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
handshakes/s, More Is Better
SMT Disabled: 4279636.21 (SE +/- 24017.92, N = 3)
SMT Enabled - Default: 4285728.04 (SE +/- 13138.92, N = 3)

Rustls 0.23.17 - Benchmark: handshake-ticket - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
handshakes/s, More Is Better
SMT Enabled - Default: 3173966.53 (SE +/- 7293.15, N = 3)
SMT Disabled: 3180516.79 (SE +/- 11239.17, N = 3)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

RocksDB

RocksDB 9.0 - Test: Read While Writing
Op/s, More Is Better
SMT Disabled: 8141717 (SE +/- 88547.74, N = 4)
SMT Enabled - Default: 12039196 (SE +/- 63473.09, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Memcached

Memcached 1.6.19 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better)
SMT Disabled: 6416279.96 (SE +/- 31229.00, N = 3)
SMT Enabled - Default: 7150348.98 (SE +/- 14479.53, N = 3)
1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
SMT Disabled: 1656 (SE +/- 0.00, N = 3)
SMT Enabled - Default: 1203 (SE +/- 1.20, N = 3)

OSPRay Studio 1.0 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
SMT Disabled: 1411 (SE +/- 1.86, N = 3)
SMT Enabled - Default: 1027 (SE +/- 0.88, N = 3)

OSPRay Studio 1.0 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
SMT Disabled: 1386 (SE +/- 0.88, N = 3)
SMT Enabled - Default: 1017 (SE +/- 0.33, N = 3)

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: Orange Juice - Acceleration: CPU (M samples/sec, More Is Better)
SMT Disabled: 16.43 (SE +/- 0.15, N = 3; MIN: 14.19 / MAX: 19.88)
SMT Enabled - Default: 21.98 (SE +/- 0.07, N = 3; MIN: 19.44 / MAX: 27.65)

LuxCoreRender 2.6 - Scene: Danish Mood - Acceleration: CPU (M samples/sec, More Is Better)
SMT Disabled: 8.22 (SE +/- 0.08, N = 3; MIN: 4.38 / MAX: 9.13)
SMT Enabled - Default: 11.49 (SE +/- 0.04, N = 3; MIN: 6.03 / MAX: 12.77)

LuxCoreRender 2.6 - Scene: LuxCore Benchmark - Acceleration: CPU (M samples/sec, More Is Better)
SMT Disabled: 8.74 (SE +/- 0.04, N = 3; MIN: 4.38 / MAX: 9.71)
SMT Enabled - Default: 12.27 (SE +/- 0.09, N = 3; MIN: 6.11 / MAX: 13.85)

LuxCoreRender 2.6 - Scene: DLSC - Acceleration: CPU (M samples/sec, More Is Better)
SMT Disabled: 11.76 (SE +/- 0.08, N = 3; MIN: 11.25 / MAX: 13.36)
SMT Enabled - Default: 15.11 (SE +/- 0.04, N = 3; MIN: 14.74 / MAX: 17.3)
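The four LuxCoreRender scenes above can be condensed into one figure by taking the geometric mean of the per-scene SMT-enabled / SMT-disabled ratios, the same kind of summary the page's "Show Overall Geometric Mean" option refers to. The following is a sketch, not the Phoronix Test Suite's own calculation: the dictionary layout and variable names are assumptions, and only the eight means come from the results above.

```python
from statistics import geometric_mean

# (SMT Disabled, SMT Enabled - Default) means in M samples/sec, More Is Better.
luxcore_results = {
    "Orange Juice": (16.43, 21.98),
    "Danish Mood": (8.22, 11.49),
    "LuxCore Benchmark": (8.74, 12.27),
    "DLSC": (11.76, 15.11),
}

# Per-scene ratio of the SMT-enabled mean to the SMT-disabled mean.
ratios = [enabled / disabled for disabled, enabled in luxcore_results.values()]

# The geometric mean is the usual choice for averaging ratios, because it
# treats a 2x gain and a 0.5x loss as cancelling out.
luxcore_geomean = geometric_mean(ratios)
print(f"LuxCoreRender, SMT enabled vs. disabled: {luxcore_geomean:.2f}x (geometric mean)")
```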

OpenVINO

OpenVINO 2024.5 - Model: Noise Suppression Poconet-Like FP16 - Device: CPU (ms, Fewer Is Better)
SMT Disabled: 10.63 (SE +/- 0.01, N = 3; MIN: 6.99 / MAX: 22.9)
SMT Enabled - Default: 9.46 (SE +/- 0.01, N = 3; MIN: 5.8 / MAX: 27.03)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Noise Suppression Poconet-Like FP16 - Device: CPU (FPS, More Is Better)
SMT Disabled: 5615.27 (SE +/- 2.51, N = 3)
SMT Enabled - Default: 6539.33 (SE +/- 2.29, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: EP.D (Total Mop/s, More Is Better)
SMT Enabled - Default: 10642.01 (SE +/- 551.90, N = 15)
SMT Disabled: 11786.07 (SE +/- 523.50, N = 12)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

OpenVINO

OpenVINO 2024.5 - Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better)
SMT Enabled - Default: 40.44 (SE +/- 0.04, N = 3; MIN: 22.28 / MAX: 61.21)
SMT Disabled: 22.60 (SE +/- 0.05, N = 3; MIN: 19.42 / MAX: 38.39)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better)
SMT Disabled: 706.18 (SE +/- 1.50, N = 3)
SMT Enabled - Default: 790.57 (SE +/- 0.82, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

C-Ray

C-Ray 2.0 - Resolution: 5K - Rays Per Pixel: 16 (Seconds, Fewer Is Better)
SMT Disabled: 60.95 (SE +/- 0.03, N = 3)
SMT Enabled - Default: 59.70 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -lpthread -lm

OpenVINO

OpenVINO 2024.5 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better)
SMT Enabled - Default: 5.05 (SE +/- 0.00, N = 3; MIN: 3.07 / MAX: 13.2)
SMT Disabled: 2.72 (SE +/- 0.01, N = 3; MIN: 2.28 / MAX: 17.03)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better)
SMT Disabled: 5824.65 (SE +/- 29.65, N = 3)
SMT Enabled - Default: 6268.38 (SE +/- 3.28, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Person Re-Identification Retail FP16 - Device: CPU (ms, Fewer Is Better)
SMT Enabled - Default: 3.36 (SE +/- 0.00, N = 3; MIN: 1.65 / MAX: 16.78)
SMT Disabled: 1.92 (SE +/- 0.00, N = 3; MIN: 1.7 / MAX: 10.14)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Person Re-Identification Retail FP16 - Device: CPU (FPS, More Is Better)
SMT Disabled: 8237.15 (SE +/- 9.64, N = 3)
SMT Enabled - Default: 9483.91 (SE +/- 12.91, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better)
SMT Enabled - Default: 13.63 (SE +/- 0.02, N = 3; MIN: 7.43 / MAX: 29.08)
SMT Disabled: 6.85 (SE +/- 0.01, N = 3; MIN: 5.51 / MAX: 16.01)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (FPS, More Is Better)
SMT Disabled: 2331.65 (SE +/- 3.04, N = 3)
SMT Enabled - Default: 2341.28 (SE +/- 2.80, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better)
SMT Disabled: 2.96 (SE +/- 0.00, N = 3; MIN: 2.34 / MAX: 10.85)
SMT Enabled - Default: 2.80 (SE +/- 0.00, N = 3; MIN: 1.63 / MAX: 16.34)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Face Detection Retail FP16-INT8 - Device: CPU (FPS, More Is Better)
SMT Disabled: 20077.07 (SE +/- 26.61, N = 3)
SMT Enabled - Default: 22207.98 (SE +/- 42.14, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better)
SMT Enabled - Default: 4.14 (SE +/- 0.00, N = 3; MIN: 2.3 / MAX: 17.74)
SMT Disabled: 2.18 (SE +/- 0.00, N = 3; MIN: 1.73 / MAX: 8.3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better)
SMT Disabled: 7258.66 (SE +/- 9.98, N = 3)
SMT Enabled - Default: 7670.87 (SE +/- 4.72, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better)
SMT Disabled: 20.50 (SE +/- 0.02, N = 3; MIN: 19.01 / MAX: 29.66)
SMT Enabled - Default: 20.07 (SE +/- 0.01, N = 3; MIN: 12.13 / MAX: 35.83)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

OpenVINO 2024.5 - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better)
SMT Disabled: 3116.45 (SE +/- 2.44, N = 3)
SMT Enabled - Default: 3186.97 (SE +/- 2.16, N = 3)
1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs

Speedb

Speedb is a next-generation key value storage engine that is RocksDB compatible and aiming for stability, efficiency, and performance. Learn more via the OpenBenchmarking.org test page.

Speedb 2.7 - Test: Update Random (Op/s, More Is Better)
SMT Enabled - Default: 532848 (SE +/- 369.07, N = 3)
SMT Disabled: 910104 (SE +/- 1284.63, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

RocksDB

RocksDB 9.0 - Test: Update Random (Op/s, More Is Better)
SMT Enabled - Default: 696417 (SE +/- 692.00, N = 3)
SMT Disabled: 1286685 (SE +/- 11648.08, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

John The Ripper

This is a benchmark of John The Ripper, which is a password cracker. Learn more via the OpenBenchmarking.org test page.

John The Ripper 2023.03.14 - Test: MD5 (Real C/S, More Is Better)
SMT Disabled: 14054000 (SE +/- 8660.25, N = 3)
SMT Enabled - Default: 18460333 (SE +/- 49208.17, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

GraphicsMagick

This is a test of GraphicsMagick with its OpenMP implementation that performs various imaging tests on a sample high resolution (currently 15400 x 6940) JPEG image. Learn more via the OpenBenchmarking.org test page.

GraphicsMagick 1.3.43 - Operation: Sharpen (Iterations Per Minute, More Is Better)
SMT Disabled: 292 (SE +/- 0.88, N = 3)
SMT Enabled - Default: 297 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick 1.3.43 - Operation: Noise-Gaussian (Iterations Per Minute, More Is Better)
SMT Disabled: 227 (SE +/- 0.67, N = 3)
SMT Enabled - Default: 281 (SE +/- 0.33, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

Speedb

Speedb 2.7 - Test: Read Random Write Random (Op/s, More Is Better)
SMT Disabled: 3620451 (SE +/- 9776.13, N = 3)
SMT Enabled - Default: 3653992 (SE +/- 2099.62, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: Rotate (Iterations Per Minute, More Is Better)
SMT Enabled - Default: 275 (SE +/- 0.33, N = 3)
SMT Disabled: 281 (SE +/- 1.76, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick 1.3.43 - Operation: Resizing (Iterations Per Minute, More Is Better)
SMT Enabled - Default: 213 (SE +/- 0.67, N = 3)
SMT Disabled: 403 (SE +/- 5.24, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick 1.3.43 - Operation: Enhanced (Iterations Per Minute, More Is Better)
SMT Disabled: 252 (SE +/- 0.00, N = 3)
SMT Enabled - Default: 338 (SE +/- 2.19, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

GraphicsMagick 1.3.43 - Operation: HWB Color Space (Iterations Per Minute, More Is Better)
SMT Enabled - Default: 477 (SE +/- 4.51, N = 3)
SMT Disabled: 517 (SE +/- 3.51, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

RocksDB

RocksDB 9.0 - Test: Read Random Write Random (Op/s, More Is Better)
SMT Disabled: 5997026 (SE +/- 42893.92, N = 3)
SMT Enabled - Default: 7425304 (SE +/- 36529.56, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

Speedb

Speedb 2.7 - Test: Random Read (Op/s, More Is Better)
SMT Enabled - Default: 547341883 (SE +/- 1478695.31, N = 3)
SMT Disabled: 558902450 (SE +/- 1746081.52, N = 3)
1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti

GraphicsMagick

GraphicsMagick 1.3.43 - Operation: Swirl (Iterations Per Minute, More Is Better)
SMT Disabled: 604 (SE +/- 0.33, N = 3)
SMT Enabled - Default: 679 (SE +/- 2.96, N = 3)
1. (CC) gcc options: -fopenmp -O2 -ltiff -ljbig -lsharpyuv -lwebp -lwebpmux -ljpeg -lXext -lSM -lICE -lX11 -lxml2 -lzstd -llzma -lbz2 -lz -lm -lpthread -lgomp

John The Ripper

John The Ripper 2023.03.14 - Test: HMAC-SHA512 (Real C/S, More Is Better)
SMT Disabled: 378876000 (SE +/- 2440649.98, N = 3)
SMT Enabled - Default: 427958667 (SE +/- 1354899.42, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Laghos

Laghos 3.1 - Test: Sedov Blast Wave, ube_922_hex.mesh (Major Kernels Total Rate, More Is Better)
SMT Enabled - Default: 562.40 (SE +/- 2.09, N = 3)
SMT Disabled: 566.24 (SE +/- 3.98, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Blender

Blender 4.3 - Blend File: Pabellon Barcelona - Compute: CPU-Only (Seconds, Fewer Is Better)
SMT Disabled: 61.69 (SE +/- 0.03, N = 3)
SMT Enabled - Default: 46.69 (SE +/- 0.08, N = 3)

OSPRay Studio

OSPRay Studio 1.0 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
SMT Disabled: 55556 (SE +/- 45.71, N = 3)
SMT Enabled - Default: 41325 (SE +/- 91.26, N = 3)

Palabos

Palabos 2.3 - Grid Size: 500 (Mega Site Updates Per Second, More Is Better)
SMT Enabled - Default: 770.44 (SE +/- 0.76, N = 3)
SMT Disabled: 772.34 (SE +/- 1.81, N = 3)
1. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm

VVenC

VVenC 1.13 - Video Input: Bosphorus 4K - Video Preset: Fast (Frames Per Second, More Is Better)
SMT Enabled - Default: 12.11 (SE +/- 0.12, N = 3)
SMT Disabled: 13.08 (SE +/- 0.13, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

Blender

Blender 4.3 - Blend File: Classroom - Compute: CPU-Only (Seconds, Fewer Is Better)
SMT Disabled: 52.23 (SE +/- 0.04, N = 3)
SMT Enabled - Default: 41.25 (SE +/- 0.09, N = 3)

OSPRay Studio

OSPRay Studio 1.0 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
SMT Disabled: 47644 (SE +/- 32.74, N = 3)
SMT Enabled - Default: 35541 (SE +/- 14.15, N = 3)

OSPRay Studio 1.0 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better)
SMT Disabled: 47071 (SE +/- 101.57, N = 3)
SMT Enabled - Default: 35249 (SE +/- 73.90, N = 3)

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 5 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
SMT Enabled - Default: 60.39 (SE +/- 0.21, N = 3)
SMT Disabled: 62.12 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Rustls

Rustls 0.23.17 - Benchmark: handshake - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 (handshakes/s, More Is Better)
SMT Enabled - Default: 708240.91 (SE +/- 862.31, N = 3)
SMT Disabled: 708330.20 (SE +/- 432.07, N = 3)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Laghos

Laghos 3.1 - Test: Triple Point Problem (Major Kernels Total Rate, More Is Better)
SMT Enabled - Default: 295.09 (SE +/- 3.57, N = 3)
SMT Disabled: 298.52 (SE +/- 2.92, N = 3)
1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and CPU benchmarking with OpenMP. The FinanceBench test cases are focused on the Black-Scholes-Merton Process with Analytic European Option engine, QMC (Sobol) Monte-Carlo method (Equity Option Example), Bonds Fixed-rate bond with flat forward curve, and Repo Securities repurchase agreement. FinanceBench was originally written by the Cavazos Lab at University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP (ms, Fewer Is Better)
SMT Disabled: 27904.25 (SE +/- 17.22, N = 3)
SMT Enabled - Default: 27898.72 (SE +/- 11.50, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

C-Ray

C-Ray 2.0 - Resolution: 4K - Rays Per Pixel: 16 (Seconds, Fewer Is Better)
SMT Disabled: 34.28 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 33.61 (SE +/- 0.01, N = 3)
1. (CC) gcc options: -lpthread -lm

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 329182.78 (SE +/- 2468.08, N = 15)
SMT Disabled: 352266.66 (SE +/- 491.86, N = 5)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

Liquid-DSP 1.6 - Threads: 128 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
SMT Disabled: 1507800000 (SE +/- 3661056.31, N = 3)
SMT Enabled - Default: 1833800000 (SE +/- 3470350.61, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP 1.6 - Threads: 64 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better)
SMT Disabled: 1512433333 (SE +/- 1589898.67, N = 3)
SMT Enabled - Default: 1515900000 (SE +/- 1734935.16, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

John The Ripper

John The Ripper 2023.03.14 - Test: WPA PSK (Real C/S, More Is Better)
SMT Disabled: 558160 (SE +/- 541.28, N = 3)
SMT Enabled - Default: 859423 (SE +/- 400.95, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper 2023.03.14 - Test: bcrypt (Real C/S, More Is Better)
SMT Disabled: 138135 (SE +/- 48.59, N = 3)
SMT Enabled - Default: 199325 (SE +/- 205.13, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

John The Ripper 2023.03.14 - Test: Blowfish (Real C/S, More Is Better)
SMT Disabled: 138073 (SE +/- 46.23, N = 3)
SMT Enabled - Default: 199040 (SE +/- 89.67, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lgmp -lm -lrt -lz -ldl -lcrypt -lbz2

Stress-NG

Stress-NG 0.18.09 - Test: NUMA (Bogo Ops/s, More Is Better)
SMT Disabled: 2004.87 (SE +/- 3.80, N = 3)
SMT Enabled - Default: 2094.07 (SE +/- 8.23, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: Memory Copying (Bogo Ops/s, More Is Better)
SMT Disabled: 25765.78 (SE +/- 57.84, N = 3)
SMT Enabled - Default: 26334.21 (SE +/- 54.76, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: Vector Math (Bogo Ops/s, More Is Better)
SMT Disabled: 399202.60 (SE +/- 951.54, N = 3)
SMT Enabled - Default: 553414.27 (SE +/- 306.60, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: CPU Stress (Bogo Ops/s, More Is Better)
SMT Disabled: 146670.47 (SE +/- 99.86, N = 3)
SMT Enabled - Default: 207237.29 (SE +/- 559.15, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: Integer Math (Bogo Ops/s, More Is Better)
SMT Disabled: 5730701.52 (SE +/- 1218.12, N = 3)
SMT Enabled - Default: 6977901.84 (SE +/- 11122.15, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: Hyperbolic Trigonometric Math (Bogo Ops/s, More Is Better)
SMT Disabled: 345394.67 (SE +/- 124.87, N = 3)
SMT Enabled - Default: 488803.14 (SE +/- 357.40, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: Integer Bit Operations (Bogo Ops/s, More Is Better)
SMT Disabled: 17279400.36 (SE +/- 10193.47, N = 3)
SMT Enabled - Default: 19006107.55 (SE +/- 3213.22, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: AVX-512 VNNI (Bogo Ops/s, More Is Better)
SMT Disabled: 11117419.40 (SE +/- 2571.79, N = 3)
SMT Enabled - Default: 13250045.88 (SE +/- 37914.71, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

Stress-NG 0.18.09 - Test: Context Switching (Bogo Ops/s, More Is Better)
SMT Disabled: 41052099.15 (SE +/- 142871.30, N = 3)
SMT Enabled - Default: 52341805.75 (SE +/- 146419.18, N = 3)
1. (CXX) g++ options: -lm -laio -lapparmor -latomic -lcrypt -ldl -ljpeg -lEGL -lGLESv2 -lmpfr -lgmp -lsctp -lz -lrt -lpthread -lc -std=gnu99 -O2 -fipa-pta -fivopts -fmodulo-sched

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
SMT Enabled - Default: 456.70 (SE +/- 11.10, N = 15)
SMT Disabled: 482.46 (SE +/- 12.29, N = 15)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Timed Eigen Compilation

Timed Eigen Compilation 3.4.0 - Time To Compile (Seconds, Fewer Is Better)
SMT Disabled: 30.52 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 28.34 (SE +/- 0.05, N = 3)

srsRAN Project

srsRAN Project 24.10 - Test: PUSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
SMT Disabled: 18626.3 (SE +/- 176.66, N = 3)
SMT Enabled - Default: 20677.2 (SE +/- 157.68, N = 3)
1. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 284866.89 (SE +/- 2151.09, N = 15)
SMT Disabled: 317628.32 (SE +/- 973.24, N = 6)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Primesieve

Primesieve 12.6 - Length: 1e13 (Seconds, Fewer Is Better)
SMT Disabled: 26.29 (SE +/- 0.05, N = 3)
SMT Enabled - Default: 24.58 (SE +/- 0.05, N = 3)
1. (CXX) g++ options: -O3

Coremark

This is a test of EEMBC CoreMark processor benchmark. Learn more via the OpenBenchmarking.org test page.

Coremark 1.0 - CoreMark Size 666 - Iterations Per Second (Iterations/Sec, More Is Better)
SMT Disabled: 2844496.91 (SE +/- 11150.98, N = 3)
SMT Enabled - Default: 4035383.57 (SE +/- 2817.99, N = 3)
1. (CC) gcc options: -O2 -lrt" -lrt

FinanceBench

FinanceBench 2016-07-25 - Benchmark: Repo OpenMP (ms, Fewer Is Better)
SMT Disabled: 17164.10 (SE +/- 18.58, N = 3)
SMT Enabled - Default: 17136.71 (SE +/- 15.72, N = 3)
1. (CXX) g++ options: -O3 -march=native -fopenmp

Blender

Blender 4.3 - Blend File: Junkshop - Compute: CPU-Only (Seconds, Fewer Is Better)
SMT Disabled: 26.04 (SE +/- 0.04, N = 3)
SMT Enabled - Default: 20.01 (SE +/- 0.01, N = 3)

VVenC

VVenC 1.13 - Video Input: Bosphorus 4K - Video Preset: Faster (Frames Per Second, More Is Better)
SMT Enabled - Default: 25.61 (SE +/- 0.07, N = 3)
SMT Disabled: 27.78 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects

7-Zip Compression

7-Zip Compression 24.05 - Test: Decompression Rating (MIPS, More Is Better)
SMT Disabled: 330536 (SE +/- 214.48, N = 3)
SMT Enabled - Default: 533210 (SE +/- 447.27, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression 24.05 - Test: Compression Rating (MIPS, More Is Better)
SMT Disabled: 489525 (SE +/- 5895.67, N = 3)
SMT Enabled - Default: 637318 (SE +/- 5367.73, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

NAMD

NAMD 3.0 - Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
SMT Disabled: 3.28552 (SE +/- 0.00445, N = 3)
SMT Enabled - Default: 3.74317 (SE +/- 0.00568, N = 4)

uvg266

uvg266 0.8.0 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
SMT Enabled - Default: 27.75 (SE +/- 0.02, N = 3)
SMT Disabled: 29.92 (SE +/- 0.02, N = 3)

Tachyon

This is a test of the threaded Tachyon, a parallel ray-tracing system, measuring the time to ray-trace a sample scene. The sample scene used is the Teapot scene ray-traced to 8K x 8K with 32 samples. Learn more via the OpenBenchmarking.org test page.

Tachyon 0.99.2 - Total Time (Seconds, Fewer Is Better)
SMT Disabled: 18.15 (SE +/- 0.04, N = 3)
SMT Enabled - Default: 16.38 (SE +/- 0.00, N = 4)
1. (CC) gcc options: -m64 -O3 -fomit-frame-pointer -ffast-math -ltachyon -lm -lpthread

OpenVINO GenAI

OpenVINO GenAI 2024.5 - Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time Per Output Token (ms, Fewer Is Better)
SMT Enabled - Default: 12.81 (SE +/- 0.10, N = 4)
SMT Disabled: 12.73 (SE +/- 0.04, N = 4)

OpenVINO GenAI 2024.5 - Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time To First Token (ms, Fewer Is Better)
SMT Disabled: 15.89 (SE +/- 0.03, N = 4)
SMT Enabled - Default: 15.84 (SE +/- 0.11, N = 4)

OpenVINO GenAI 2024.5 - Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU (tokens/s, More Is Better)
SMT Enabled - Default: 78.06 (SE +/- 0.62, N = 4)
SMT Disabled: 78.55 (SE +/- 0.26, N = 4)

Blender

Blender 4.3 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better)
SMT Disabled: 18.72 (SE +/- 0.01, N = 3)
SMT Enabled - Default: 14.88 (SE +/- 0.01, N = 4)

Rustls

Rustls 0.23.17 - Benchmark: handshake - Suite: TLS13_CHACHA20_POLY1305_SHA256 (handshakes/s, More Is Better)
SMT Enabled - Default: 111705.91 (SE +/- 81.03, N = 4)
SMT Disabled: 111713.22 (SE +/- 21.33, N = 4)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Kvazaar

This is a test of Kvazaar as a CPU-based H.265/HEVC video encoder written in the C programming language and optimized in Assembly. Kvazaar is the winner of the 2016 ACM Open-Source Software Competition and developed at the Ultra Video Group, Tampere University, Finland. Learn more via the OpenBenchmarking.org test page.

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Slow (Frames Per Second, More Is Better)
SMT Enabled - Default: 40.49 (SE +/- 0.03, N = 4)
SMT Disabled: 44.01 (SE +/- 0.09, N = 4)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

uvg266

uvg266 0.8.0 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
SMT Enabled - Default: 30.84 (SE +/- 0.04, N = 3)
SMT Disabled: 32.91 (SE +/- 0.09, N = 3)

Kvazaar

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better)
SMT Enabled - Default: 41.35 (SE +/- 0.04, N = 4)
SMT Disabled: 44.85 (SE +/- 0.02, N = 4)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Rustls

Rustls 0.23.17 - Benchmark: handshake - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (handshakes/s, More Is Better)
SMT Disabled: 117111.87 (SE +/- 69.93, N = 4)
SMT Enabled - Default: 117122.32 (SE +/- 115.83, N = 4)
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Llama.cpp

Llama.cpp b4397 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
SMT Enabled - Default: 50.43 (SE +/- 0.04, N = 4)
SMT Disabled: 50.49 (SE +/- 0.03, N = 4)
1. (CXX) g++ options: -O3

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 4.3 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj (Frames Per Second, More Is Better)
SMT Disabled: 75.88 (SE +/- 0.01, N = 4; MIN: 75.31 / MAX: 76.83)
SMT Enabled - Default: 118.33 (SE +/- 0.08, N = 4; MIN: 116.67 / MAX: 120.05)
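
To put a single number on results like this one, the relative change from enabling SMT can be worked out directly from the two means. The sketch below is illustrative only. The helper name `smt_uplift_percent` is hypothetical, and the sign convention (positive means SMT on is better) is an assumption chosen here. "Fewer Is Better" metrics such as seconds are handled by inverting the ratio.

```python
def smt_uplift_percent(smt_off, smt_on, more_is_better=True):
    """Percent change from enabling SMT; positive means SMT on is better.

    For "More Is Better" metrics (FPS, ns/day) the ratio is on/off.
    For "Fewer Is Better" metrics (seconds, ms) the ratio is off/on,
    so a shorter SMT-on time still yields a positive uplift.
    """
    ratio = smt_on / smt_off if more_is_better else smt_off / smt_on
    return (ratio - 1.0) * 100.0


# Embree Pathtracer ISPC, Asian Dragon Obj (Frames Per Second, More Is Better).
embree_uplift = smt_uplift_percent(75.88, 118.33)

# Blender BMW27, CPU-Only (Seconds, Fewer Is Better).
blender_uplift = smt_uplift_percent(18.72, 14.88, more_is_better=False)

# Kvazaar Bosphorus 4K, Slow preset (Frames Per Second): SMT off is faster,
# so the uplift comes out negative.
kvazaar_uplift = smt_uplift_percent(44.01, 40.49)

for name, uplift in [("Embree", embree_uplift),
                     ("Blender", blender_uplift),
                     ("Kvazaar", kvazaar_uplift)]:
    print(f"{name}: {uplift:+.1f}% with SMT enabled")
```

The input values are the means reported in this result file. The sketch ignores the standard errors, so small percentages should be read with the SE figures in mind.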

NAS Parallel Benchmarks

NPB, NAS Parallel Benchmarks, is a benchmark developed by NASA for high-end computer systems. This test profile currently uses the MPI version of NPB. This test profile offers selecting the different NPB tests/problems and varying problem sizes. Learn more via the OpenBenchmarking.org test page.

NAS Parallel Benchmarks 3.4 - Test / Class: SP.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 147801.32 (SE +/- 194.21, N = 5)
SMT Disabled: 151971.93 (SE +/- 133.32, N = 5)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Llama.cpp

Llama.cpp b4397 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
SMT Disabled: 52.86 (SE +/- 0.05, N = 4)
SMT Enabled - Default: 52.93 (SE +/- 0.04, N = 4)
1. (CXX) g++ options: -O3

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
SMT Enabled - Default: 199.94 (SE +/- 0.95, N = 4)
SMT Disabled: 211.07 (SE +/- 0.89, N = 4)
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

srsRAN Project

srsRAN Project 24.10 - Test: PDSCH Processor Benchmark, Throughput Total (Mbps, More Is Better)
SMT Disabled: 108233.2 (SE +/- 524.68, N = 6)
SMT Enabled - Default: 118394.6 (SE +/- 550.27, N = 4)
1. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl

uvg266

uvg266 0.8.0 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
SMT Enabled - Default: 76.47 (SE +/- 0.06, N = 6)
SMT Disabled: 76.71 (SE +/- 0.04, N = 6)

Embree

Embree 4.3 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better)
SMT Disabled: 71.42 (SE +/- 0.03, N = 5; MIN: 70.44 / MAX: 72.43)
SMT Enabled - Default: 111.96 (SE +/- 0.04, N = 6; MIN: 110.19 / MAX: 114.2)

ACES DGEMM

ACES DGEMM 1.0 - Sustained Floating-Point Rate (GFLOP/s, More Is Better)
SMT Enabled - Default: 5217.65 (SE +/- 4.91, N = 5)
SMT Disabled: 5484.79 (SE +/- 8.97, N = 5)
1. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

LuxCoreRender

LuxCoreRender is an open-source 3D physically based renderer formerly known as LuxRender. LuxCoreRender supports CPU-based rendering as well as GPU acceleration via OpenCL, NVIDIA CUDA, and NVIDIA OptiX interfaces. Learn more via the OpenBenchmarking.org test page.

LuxCoreRender 2.6 - Scene: Rainbow Colors and Prism - Acceleration: CPU (M samples/sec, More Is Better)
SMT Disabled: 22.36 (SE +/- 0.06, N = 5; MIN: 20.03 / MAX: 22.83)
SMT Enabled - Default: 31.54 (SE +/- 0.10, N = 6; MIN: 27.49 / MAX: 32.55)

uvg266

uvg266 0.8.0 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
SMT Disabled: 70.59 (SE +/- 0.06, N = 5)
SMT Enabled - Default: 74.75 (SE +/- 0.10, N = 6)

uvg266 0.8.0 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
SMT Enabled - Default: 78.36 (SE +/- 0.06, N = 6)
SMT Disabled: 81.97 (SE +/- 0.08, N = 6)

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 149810.23 (SE +/- 2690.96, N = 15)
SMT Disabled: 171779.61 (SE +/- 76.78, N = 9)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks 3.4 - Test / Class: IS.D (Total Mop/s, More Is Better)
SMT Enabled - Default: 7000.73 (SE +/- 68.21, N = 6)
SMT Disabled: 7078.61 (SE +/- 15.47, N = 6)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Embree

Embree 4.3 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better)
SMT Disabled: 88.34 (SE +/- 0.04, N = 5; MIN: 87.7 / MAX: 89.3)
SMT Enabled - Default: 137.99 (SE +/- 0.05, N = 7; MIN: 136.67 / MAX: 139.92)

NAMD

NAMD 3.0 - Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
SMT Disabled: 11.37 (SE +/- 0.00, N = 6)
SMT Enabled - Default: 12.97 (SE +/- 0.09, N = 7)

Llama.cpp

Llama.cpp b4397 - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
SMT Enabled - Default: 117.03 (SE +/- 0.46, N = 7)
SMT Disabled: 119.56 (SE +/- 0.84, N = 7)
1. (CXX) g++ options: -O3

Kvazaar

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better)
SMT Disabled: 87.85 (SE +/- 0.05, N = 6)
SMT Enabled - Default: 93.25 (SE +/- 0.04, N = 6)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better)
SMT Enabled - Default: 185256.17 (SE +/- 1833.56, N = 15)
SMT Disabled: 227311.90 (SE +/- 1995.83, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Kvazaar

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better)
SMT Enabled - Default: 108.96 (SE +/- 0.18, N = 7)
SMT Disabled: 112.56 (SE +/- 0.04, N = 7)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 62539.00 (SE +/- 1042.31, N = 15)
SMT Disabled: 72412.04 (SE +/- 301.36, N = 10)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

Kvazaar

Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better)
SMT Enabled - Default: 112.00 (SE +/- 0.28, N = 7)
SMT Disabled: 114.50 (SE +/- 0.24, N = 7)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

NAS Parallel Benchmarks

NAS Parallel Benchmarks 3.4 - Test / Class: EP.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 9641.86 (SE +/- 94.51, N = 15)
SMT Disabled: 11316.88 (SE +/- 313.55, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better)
SMT Enabled - Default: 159653.78 (SE +/- 463.56, N = 11)
SMT Disabled: 178843.53 (SE +/- 337.13, N = 11)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.6

156 Results Shown
