eoy2024

Benchmarks for a future article. AMD EPYC 4484PX 12-Core testing with a Supermicro AS-3015A-I H13SAE-MF v1.00 (2.1 BIOS) and ASPEED on Ubuntu 24.04 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2412086-NE-EOY20243255
Run Management

Result Identifier    Date           Test Run Duration
a                    December 05    6 Hours, 48 Minutes
4484PX               December 07    7 Hours, 3 Minutes
px                   December 07    7 Hours, 3 Minutes
Average                             6 Hours, 58 Minutes



eoy2024 System Details

a:
  Processor: AMD EPYC 4564P 16-Core @ 5.88GHz (16 Cores / 32 Threads)
  Motherboard: Supermicro AS-3015A-I H13SAE-MF v1.00 (2.1 BIOS)
  Chipset: AMD Device 14d8
  Memory: 2 x 32GB DRAM-4800MT/s Micron MTC20C2085S1EC48BA1 BC
  Disk: 3201GB Micron_7450_MTFDKCC3T2TFS + 960GB SAMSUNG MZ1L2960HCJR-00A07
  Graphics: ASPEED
  Audio: AMD Rembrandt Radeon HD Audio
  Monitor: VA2431
  Network: 2 x Intel I210
  OS: Ubuntu 24.04
  Kernel: 6.8.0-11-generic (x86_64)
  Desktop: GNOME Shell 45.3
  Display Server: X Server 1.21.1.11
  Compiler: GCC 13.2.0
  File-System: ext4
  Screen Resolution: 1024x768

4484PX (only the entries that differ from a are listed):
  Processor: AMD EPYC 4484PX 12-Core @ 5.66GHz (12 Cores / 24 Threads)
  Kernel: 6.12.2-061202-generic (x86_64)

px: no differing entries listed.

OpenBenchmarking.org

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-fxIygj/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- a: Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa601209
- 4484PX: Scaling Governor: amd-pstate-epp performance (Boost: Enabled EPP: performance) - CPU Microcode: 0xa601209
- px: Scaling Governor: amd-pstate-epp performance (Boost: Enabled EPP: performance) - CPU Microcode: 0xa601209

Java Details
- OpenJDK Runtime Environment (build 21.0.2+13-Ubuntu-2)

Python Details
- Python 3.12.3

Security Details
- a: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- 4484PX: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
- px: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Result Overview (Phoronix Test Suite)

[Chart: normalized results for runs a, 4484PX, and px; axis from 100% to 142%. Tests shown: Apache Cassandra, BYTE Unix Benchmark, ASTC Encoder, Primesieve, Etcpak, POV-Ray, OpenSSL, Blender, ACES DGEMM, OSPRay, Stockfish, RELION, 7-Zip Compression, Build2, Rustls, LiteRT, NAMD, x265, SVT-AV1, Timed Eigen Compilation, Whisperfile, oneDNN, Numpy Benchmark, simdjson, Apache CouchDB, Whisper.cpp, QuantLib, Llama.cpp, GROMACS, XNNPACK, Gcrypt Library, CP2K Molecular Dynamics, Y-Cruncher, Llamafile, PyPerformance, ONNX Runtime, OpenVINO GenAI, Renaissance, FinanceBench.]
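The Result Overview chart compares the runs on a normalized scale. As a rough illustration of how such a comparison can be computed by hand, the Python sketch below takes three results from this file (Primesieve 1e12, ASTC Encoder Thorough, Blender BMW27), converts each to a score relative to a chosen baseline run, and combines them with a geometric mean. The function names, the choice of px as the baseline, and the selection of tests are assumptions made for illustration; this is not necessarily the calculation the Phoronix Test Suite performs.

```python
from math import prod

# (test name, higher_is_better, {run: value}); values copied from the
# per-test results in this file. The selection of tests is arbitrary.
RESULTS = [
    ("Primesieve 12.6, Length: 1e12 (Seconds)", False,
     {"a": 6.347, "4484PX": 9.116, "px": 9.147}),
    ("ASTC Encoder 5.0, Preset: Thorough (MT/s)", True,
     {"a": 20.30, "4484PX": 14.17, "px": 14.15}),
    ("Blender 4.3, BMW27 CPU-Only (Seconds)", False,
     {"a": 53.55, "4484PX": 74.08, "px": 73.16}),
]


def relative_scores(values, higher_is_better, baseline):
    """Score each run against `baseline`: 1.0 means equal, above 1.0 means faster."""
    base = values[baseline]
    if higher_is_better:
        return {run: v / base for run, v in values.items()}
    # For "fewer is better" metrics, invert the ratio so larger still means faster.
    return {run: base / v for run, v in values.items()}


def geometric_mean(xs):
    """Geometric mean of a non-empty sequence of positive numbers."""
    xs = list(xs)
    return prod(xs) ** (1.0 / len(xs))


def overall(results, baseline):
    """Geometric mean of the per-test relative scores for each run."""
    per_run = {}
    for _name, higher_is_better, values in results:
        scores = relative_scores(values, higher_is_better, baseline)
        for run, score in scores.items():
            per_run.setdefault(run, []).append(score)
    return {run: geometric_mean(scores) for run, scores in per_run.items()}


if __name__ == "__main__":
    for run, score in overall(RESULTS, baseline="px").items():
        print(f"{run}: {score:.3f}x relative to px")
```

Inverting the ratio for "fewer is better" metrics keeps every score on the same scale, so timings and throughput figures can be averaged together.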

eoy2024litert: NASNet Mobileonednn: IP Shapes 1D - CPUonednn: Convolution Batch Shapes Auto - CPUbyte: System Callcassandra: Writesllama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 1024litert: DeepLab V3litert: Quantized COCO SSD MobileNet v1onednn: IP Shapes 3D - CPUonnx: CaffeNet 12-int8 - CPU - Standardbyte: Pipeonednn: Deconvolution Batch shapes_3d - CPUprimesieve: 1e12astcenc: Thoroughastcenc: Mediumastcenc: Fastastcenc: Exhaustiveastcenc: Very Thoroughprimesieve: 1e13etcpak: Multi-Threaded - ETC2byte: Whetstone Doublellama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 512ospray: particle_volume/scivis/real_timerustls: handshake - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384byte: Dhrystone 2onednn: Recurrent Neural Network Training - CPUblender: BMW27 - CPU-Onlyospray: particle_volume/ao/real_timestockfish: Chess Benchmarkonednn: Recurrent Neural Network Inference - CPUblender: Classroom - CPU-Onlyospray: gravity_spheres_volume/dim_512/pathtracer/real_timeopenssl: AES-128-GCMopenssl: AES-256-GCMospray: gravity_spheres_volume/dim_512/scivis/real_timepovray: Trace Timeblender: Pabellon Barcelona - CPU-Onlyblender: Fishy Cat - CPU-Onlyrustls: handshake - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256ospray: gravity_spheres_volume/dim_512/ao/real_timemt-dgemm: Sustained Floating-Point Rateopenssl: ChaCha20openssl: ChaCha20-Poly1305blender: Barbershop - CPU-Onlyllama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048onnx: T5 Encoder - CPU - Standardrustls: handshake - TLS13_CHACHA20_POLY1305_SHA256compress-7zip: Decompression Ratingblender: Junkshop - CPU-Onlyonnx: ResNet101_DUC_HDC-12 - CPU - Standardrelion: Basic - CPUstockfish: Chess Benchmarkrenaissance: Genetic Algorithm Using Jenetics + Futuressvt-av1: Preset 3 - Bosphorus 4Kbuild2: Time To Compilexnnpack: FP16MobileNetV1xnnpack: FP32MobileNetV3Smallsimdjson: PartialTweetsx265: Bosphorus 4Ksvt-av1: Preset 3 - Beauty 4K 10-bitsvt-av1: Preset 8 - 
Bosphorus 4Ksimdjson: DistinctUserIDsvt-av1: Preset 5 - Bosphorus 4Kospray: particle_volume/pathtracer/real_timexnnpack: FP32MobileNetV3Largenamd: ATPase with 327,506 Atomsonnx: GPT-2 - CPU - Standardsvt-av1: Preset 8 - Bosphorus 1080pxnnpack: FP16MobileNetV3Smallrustls: handshake-ticket - TLS13_CHACHA20_POLY1305_SHA256xnnpack: QS8MobileNetV2rustls: handshake-resume - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256onnx: Faster R-CNN R-50-FPN-int8 - CPU - Standardsvt-av1: Preset 5 - Beauty 4K 10-bitrustls: handshake-ticket - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384whisperfile: Smallrustls: handshake-resume - TLS13_CHACHA20_POLY1305_SHA256svt-av1: Preset 3 - Bosphorus 1080pnamd: STMV with 1,066,628 Atomscompress-7zip: Compression Ratingrustls: handshake-resume - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384whisper-cpp: ggml-medium.en - 2016 State of the Unionsvt-av1: Preset 5 - Bosphorus 1080ppyperformance: async_tree_ioonnx: fcn-resnet101-11 - CPU - Standardsvt-av1: Preset 8 - Beauty 4K 10-bitbuild-eigen: Time To Compilerustls: handshake-ticket - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256onednn: Deconvolution Batch shapes_1d - CPUonnx: ArcFace ResNet-100 - CPU - Standardrenaissance: Gaussian Mixture Modelx265: Bosphorus 1080pwhisperfile: Mediumonnx: super-resolution-10 - CPU - Standardrenaissance: Apache Spark PageRankcouchdb: 300 - 1000 - 30whisperfile: Tinysimdjson: Kostyanumpy: couchdb: 500 - 1000 - 30renaissance: Scala Dottycouchdb: 300 - 3000 - 30quantlib: XXScp2k: H20-64renaissance: Akka Unbalanced Cobwebbed Treellama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128couchdb: 100 - 3000 - 30onnx: ResNet50 v1-12-int8 - CPU - Standardcouchdb: 500 - 3000 - 30svt-av1: Preset 13 - Bosphorus 4Kxnnpack: FP32MobileNetV2whisper-cpp: ggml-small.en - 2016 State of the Unionsvt-av1: Preset 13 - Bosphorus 1080prenaissance: Rand Forestpyperformance: asyncio_tcp_sslcouchdb: 100 - 1000 - 30onnx: ZFNet-512 - CPU - Standardrenaissance: Apache Spark Bayesquantlib: Srenaissance: 
Finagle HTTP Requestsgromacs: water_GMX50_bareonnx: bertsquad-12 - CPU - Standardsvt-av1: Preset 13 - Beauty 4K 10-bitwhisper-cpp: ggml-base.en - 2016 State of the Unionllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 1024cp2k: H20-256litert: Inception V4llamafile: TinyLlama-1.1B-Chat-v1.0.BF16 - Text Generation 128renaissance: ALS Movie Lensfinancebench: Bonds OpenMPpyperformance: python_startupllamafile: TinyLlama-1.1B-Chat-v1.0.BF16 - Text Generation 16gcrypt: openvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPUxnnpack: FP16MobileNetV2renaissance: Savina Reactors.IOllamafile: mistral-7b-instruct-v0.2.Q5_K_M - Text Generation 128pyperformance: gc_collectfinancebench: Repo OpenMPopenvino-genai: Gemma-7b-int4-ov - CPUllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 512llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 1024xnnpack: FP16MobileNetV3Largepyperformance: raytracepyperformance: chaospyperformance: regex_compilepyperformance: crypto_pyaesopenvino-genai: Falcon-7b-instruct-int4-ov - CPUllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128simdjson: TopTweetllamafile: wizardcoder-python-34b-v1.0.Q6_K - Text Generation 16pyperformance: json_loadsonnx: yolov4 - CPU - Standardlitert: Mobilenet Quantllamafile: wizardcoder-python-34b-v1.0.Q6_K - Text Generation 128cp2k: Fayalite-FISTpyperformance: xml_etreellama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128litert: Mobilenet Floatrenaissance: In-Memory Database Shootoutllamafile: Llama-3.2-3B-Instruct.Q6_K - Text Generation 16pyperformance: pickle_pure_pythonpyperformance: django_templatellamafile: mistral-7b-instruct-v0.2.Q5_K_M - Text Generation 16pyperformance: asyncio_websocketspyperformance: gollamafile: Llama-3.2-3B-Instruct.Q6_K - Text Generation 128y-cruncher: 500Mxnnpack: FP32MobileNetV1litert: SqueezeNetpyperformance: pathlibpyperformance: floatllama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 
2048llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 512pyperformance: nbodyy-cruncher: 1Bsimdjson: LargeRandlitert: Inception ResNet V2llamafile: wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 2048llamafile: wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 1024llamafile: wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 512llamafile: wizardcoder-python-34b-v1.0.Q6_K - Prompt Processing 256llamafile: mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 2048llamafile: mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 1024llamafile: mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 512llamafile: mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 256llamafile: TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 2048llamafile: TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 1024llamafile: TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 512llamafile: TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 256llamafile: Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 2048llamafile: Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 1024llamafile: Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 512llamafile: Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 256openvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPU - Time Per Output Tokenopenvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPU - Time To First Tokenopenvino-genai: Falcon-7b-instruct-int4-ov - CPU - Time Per Output Tokenopenvino-genai: Falcon-7b-instruct-int4-ov - CPU - Time To First Tokenopenvino-genai: Gemma-7b-int4-ov - CPU - Time Per Output Tokenopenvino-genai: Gemma-7b-int4-ov - CPU - Time To First Tokenonnx: Faster R-CNN R-50-FPN-int8 - CPU - Standardonnx: ResNet101_DUC_HDC-12 - CPU - Standardonnx: super-resolution-10 - CPU - Standardonnx: ResNet50 v1-12-int8 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Standardonnx: fcn-resnet101-11 - CPU - Standardonnx: CaffeNet 12-int8 - CPU - Standardonnx: 
bertsquad-12 - CPU - Standardonnx: T5 Encoder - CPU - Standardonnx: ZFNet-512 - CPU - Standardonnx: yolov4 - CPU - Standardonnx: GPT-2 - CPU - Standardrenaissance: Apache Spark ALSa4484PXpx169361.125736.6728749140426.6271333355.093579.672129.524.058636.31848806257.12.412946.34720.3025156.2217396.64951.68442.74178.498577.817343491.9327.38.98486423535.681866536062.71372.0353.559.0091746507038700.859143.368.82093104784522170971727517007.5878918.542166.1271.3580462.67.639441141.19410413058849505092393529340506.2279.04156.45376454.4516591673.561.54196944.2754752796732.89.5992.05311439799.7632.571.422102.00510.4634.538236.24518102.79632134.596339.023920404263.458443563852.5747.06916.5041553632.14195.41642388077.6929.5730.756561638591820810.21700.91101.9717553.216712.46858.65526203322.9761242.45373399.5114.45534.919141.1172412.2106.1341.709355.97775.75148.049477.0367.8313.43258.1914403.847.72232.188390.597511.775212.521495245.07838842.558414.464569.929102.331490.012.74762319.41.69215.589918.58887.4897370.85592.85721477.826.289805.733061.218755.7724.59162.12519.2811903506.410.4767721418.4453129.8370.7669.26149817538.269.841.712.936.8810.461.7812.111.0552823.171.9994.03235.87.241211.483256.119.0316520.710.2231577.820.138.77212521794.1114.250.763.0962.9768.45918.4851.8319530.21228861443072153632768163848192409632768163848192409632768163848192409651.8655.9377.3486.06101.72106.6221.2429648.5227.086012.5589823.553310.8751.5708464.1416.391129.7698590.45237.427768057.561.938064.1155130761218.9174960232.262343.381420.152.73072941.40133443359.23.50849.11614.17109.0265278.24451.18871.9412110.608410.726244075.3243.146.44913306153.21346521770.31898.3674.086.5277633702298965.015197.26.4119876496336760711602918705.5488825.264226.3496.6759308.755.63122842.7306429710523569068816544020679.34222.75208.17457716.6412569897.011.17627729.445267546904.07.684111.651138380910.127.161.18885.20110.7629.094199.02315152.38124159.71287.047779344296.247173035330.2140.09355.6021329363.1173.38197333882.922
5.4460.651191412631586292.42809.7896988.4156662.8109310.96767.3642282729.643.4029337.38323860.6101.37473.55091125.1722138.1117.56637.134626.11745.59164.468428.6406.1212.116953.0054038.452.3253.99356.409559.346198.1121365268.23891776.115422.059075.901110.94513.211.86472492.21.57714.512217.40692.7093366.57628.10422083.327.599378.834600.7734386.0825.86171.02320.2812173655.810.9169922320.33203110.2369.1166.85146718239.771.743.113.47.1110.821.8312.410.7338848.9432.0592.2136.87.411244.73241.519.491692110.4532178.620.398.68812571809.1814.451.363.863.6168.259.518.3791.8419477.81228861443072153632768163848192409632768163848192409632768163848192409649.3158.9174.6593.0197.79121.4824.9402850.1417.988732.8054426.7478355.7511.0618868.90514.802879.0132293.16056.258157931.641.939134.1332130701622.8173946244.772359.991417.352.72942937.77833381363.13.512439.14714.1464108.8588277.29941.18621.9391110.709409.875244131232.866.52304304060.281340340196.61895.6873.166.5220633871595966.013197.536.407476184405610709026564805.614725.328224.6497.0959206.345.71084842.0128319701989745068678955550678.4208.99206.09157688.0812560597.11.1705733.0242973396920.77.646113.7813868378.3526.941.18484.9988.9728.824197.215742.35379157.893286.962798342775.297233038723.4843.3625.5511340712.85167.89219333574.325.4470.654481422131572010.68809.48988.276562.7963810.85567.0762292879.443.4062837.10483815.2101.25475.51084125.0762229.7119.34938.718285.45831.42164.812436.2408.48312.105752.7244002.352.37254.733356.194560.7194.0241368266.81425769.818453.259076.389110.892474.911.8392483.11.57514.574717.35593.4546366.35631.3122752.427.89275.734896.8359386.0925.94163.83920.2912483676.010.9370622318.73828110.2467.9566.52152718239.472.543.313.417.1210.511.8412.510.7127849.2092.0594.89636.57.441244.513175.619.516821.210.4532279.420.518.62312721821.3514.450.863.7963.4168.8159.218.3651.8419490.71228861443072153632768163848192409632768163848192409632768163848192409649.2858.8674.549397.61122.323.0604854.3347.994862.8069526.9485357.
6021.06668.61044.851429.0168793.34416.33034OpenBenchmarking.org

LiteRT

LiteRT 2024-10-15 (Microseconds, Fewer Is Better)
Model: NASNet Mobile
px: 7931.64; 4484PX: 8057.56; a: 16936.00

oneDNN

oneDNN 3.6 (ms, Fewer Is Better)
Harness: IP Shapes 1D - Engine: CPU
px: 1.93913 (MIN: 1.91); 4484PX: 1.93806 (MIN: 1.92); a: 1.12573 (MIN: 1.03)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

oneDNN 3.6 (ms, Fewer Is Better)
Harness: Convolution Batch Shapes Auto - Engine: CPU
px: 4.13321 (MIN: 4.07); 4484PX: 4.11551 (MIN: 4.05); a: 6.67287 (MIN: 6.2)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

BYTE Unix Benchmark

BYTE Unix Benchmark 5.1.3-git (LPS, More Is Better)
Computational Test: System Call
px: 30701622.8; 4484PX: 30761218.9; a: 49140426.6
1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm

Apache Cassandra

Apache Cassandra 5.0 (Op/s, More Is Better)
Test: Writes
px: 173946; 4484PX: 174960; a: 271333

Llama.cpp

Llama.cpp b4154 (Tokens Per Second, More Is Better)
Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024
px: 244.77; 4484PX: 232.26; a: 355.09
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

LiteRT

LiteRT 2024-10-15 (Microseconds, Fewer Is Better)
Model: DeepLab V3
px: 2359.99; 4484PX: 2343.38; a: 3579.67

LiteRT 2024-10-15 (Microseconds, Fewer Is Better)
Model: Quantized COCO SSD MobileNet v1
px: 1417.35; 4484PX: 1420.15; a: 2129.52

oneDNN

oneDNN 3.6 (ms, Fewer Is Better)
Harness: IP Shapes 3D - Engine: CPU
px: 2.72942 (MIN: 2.7); 4484PX: 2.73072 (MIN: 2.7); a: 4.05800 (MIN: 3.75)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

ONNX Runtime

ONNX Runtime 1.19 (Inferences Per Second, More Is Better)
Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard
px: 937.78; 4484PX: 941.40; a: 636.32
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

BYTE Unix Benchmark

BYTE Unix Benchmark 5.1.3-git (LPS, More Is Better)
Computational Test: Pipe
px: 33381363.1; 4484PX: 33443359.2; a: 48806257.1
1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm

oneDNN

oneDNN 3.6 (ms, Fewer Is Better)
Harness: Deconvolution Batch shapes_3d - Engine: CPU
px: 3.51243 (MIN: 3.47); 4484PX: 3.50840 (MIN: 3.46); a: 2.41294 (MIN: 2.34)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

Primesieve

Primesieve 12.6 (Seconds, Fewer Is Better)
Length: 1e12
px: 9.147; 4484PX: 9.116; a: 6.347
1. (CXX) g++ options: -O3

ASTC Encoder

ASTC Encoder 5.0 (MT/s, More Is Better)
Preset: Thorough
px: 14.15; 4484PX: 14.17; a: 20.30
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder 5.0 (MT/s, More Is Better)
Preset: Medium
px: 108.86; 4484PX: 109.03; a: 156.22
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder 5.0 (MT/s, More Is Better)
Preset: Fast
px: 277.30; 4484PX: 278.24; a: 396.65
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder 5.0 (MT/s, More Is Better)
Preset: Exhaustive
px: 1.1862; 4484PX: 1.1887; a: 1.6844
1. (CXX) g++ options: -O3 -flto -pthread

ASTC Encoder 5.0 (MT/s, More Is Better)
Preset: Very Thorough
px: 1.9391; 4484PX: 1.9412; a: 2.7410
1. (CXX) g++ options: -O3 -flto -pthread

Primesieve

Primesieve 12.6 (Seconds, Fewer Is Better)
Length: 1e13
px: 110.71; 4484PX: 110.61; a: 78.50
1. (CXX) g++ options: -O3

Etcpak

Etcpak 2.0 (Mpx/s, More Is Better)
Benchmark: Multi-Threaded - Configuration: ETC2
px: 409.88; 4484PX: 410.73; a: 577.82
1. (CXX) g++ options: -flto -pthread

BYTE Unix Benchmark

BYTE Unix Benchmark 5.1.3-git (MWIPS, More Is Better)
Computational Test: Whetstone Double
px: 244131.0; 4484PX: 244075.3; a: 343491.9
1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm

Llama.cpp

Llama.cpp b4154 (Tokens Per Second, More Is Better)
Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512
px: 232.86; 4484PX: 243.14; a: 327.30
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

OSPRay

OSPRay 3.2 (Items Per Second, More Is Better)
Benchmark: particle_volume/scivis/real_time
px: 6.52304; 4484PX: 6.44913; a: 8.98486

Rustls

Rustls 0.23.17 (handshakes/s, More Is Better)
Benchmark: handshake - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384
px: 304060.28; 4484PX: 306153.20; a: 423535.68
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

BYTE Unix Benchmark

BYTE Unix Benchmark 5.1.3-git (LPS, More Is Better)
Computational Test: Dhrystone 2
px: 1340340196.6; 4484PX: 1346521770.3; a: 1866536062.7
1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm

oneDNN

oneDNN 3.6 (ms, Fewer Is Better)
Harness: Recurrent Neural Network Training - Engine: CPU
px: 1895.68 (MIN: 1892.59); 4484PX: 1898.36 (MIN: 1894.26); a: 1372.03 (MIN: 1342.06)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

Blender

Blender 4.3 (Seconds, Fewer Is Better)
Blend File: BMW27 - Compute: CPU-Only
px: 73.16; 4484PX: 74.08; a: 53.55

OSPRay

OSPRay 3.2 (Items Per Second, More Is Better)
Benchmark: particle_volume/ao/real_time
px: 6.52206; 4484PX: 6.52776; a: 9.00917

Stockfish

Stockfish (Nodes Per Second, More Is Better)
Chess Benchmark
px: 33871595; 4484PX: 33702298; a: 46507038
1. Stockfish 16 by the Stockfish developers (see AUTHORS file)

oneDNN

oneDNN 3.6 (ms, Fewer Is Better)
Harness: Recurrent Neural Network Inference - Engine: CPU
px: 966.01 (MIN: 963.43); 4484PX: 965.02 (MIN: 963.27); a: 700.86 (MIN: 679.89)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

Blender

Blender 4.3 (Seconds, Fewer Is Better)
Blend File: Classroom - Compute: CPU-Only
px: 197.53; 4484PX: 197.20; a: 143.36

OSPRay

OSPRay 3.2 (Items Per Second, More Is Better)
Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time
px: 6.40740; 4484PX: 6.41198; a: 8.82093

OpenSSL

OpenSSL (byte/s, More Is Better)
Algorithm: AES-128-GCM
px: 76184405610; 4484PX: 76496336760; a: 104784522170
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL (byte/s, More Is Better)
Algorithm: AES-256-GCM
px: 70902656480; 4484PX: 71160291870; a: 97172751700
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OSPRay

OSPRay 3.2 (Items Per Second, More Is Better)
Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
px: 5.61470; 4484PX: 5.54888; a: 7.58789

POV-Ray

POV-Ray (Seconds, Fewer Is Better)
Trace Time
px: 25.33; 4484PX: 25.26; a: 18.54
1. POV-Ray 3.7.0.10.unofficial

Blender

Blender 4.3 (Seconds, Fewer Is Better)
Blend File: Pabellon Barcelona - Compute: CPU-Only
px: 224.64; 4484PX: 226.34; a: 166.12

Blender 4.3 (Seconds, Fewer Is Better)
Blend File: Fishy Cat - Compute: CPU-Only
px: 97.09; 4484PX: 96.67; a: 71.35

Rustls

Rustls 0.23.17 (handshakes/s, More Is Better)
Benchmark: handshake - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
px: 59206.34; 4484PX: 59308.75; a: 80462.60
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

OSPRay

OSPRay 3.2 (Items Per Second, More Is Better)
Benchmark: gravity_spheres_volume/dim_512/ao/real_time
px: 5.71084; 4484PX: 5.63122; a: 7.63944

ACES DGEMM

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
Sustained Floating-Point Rate
px: 842.01; 4484PX: 842.73; a: 1141.19
1. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas

OpenSSL

OpenSSL (byte/s, More Is Better)
Algorithm: ChaCha20
px: 97019897450; 4484PX: 97105235690; a: 130588495050
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

OpenSSL (byte/s, More Is Better)
Algorithm: ChaCha20-Poly1305
px: 68678955550; 4484PX: 68816544020; a: 92393529340
1. OpenSSL 3.0.13 30 Jan 2024 (Library: OpenSSL 3.0.13 30 Jan 2024) - Additional Parameters: -engine qatengine -async_jobs 8

Blender

Blender 4.3 (Seconds, Fewer Is Better)
Blend File: Barbershop - Compute: CPU-Only
px: 678.40; 4484PX: 679.34; a: 506.20

Llama.cpp

Llama.cpp b4154 (Tokens Per Second, More Is Better)
Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048
px: 208.99; 4484PX: 222.75; a: 279.04
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

ONNX Runtime

ONNX Runtime 1.19 (Inferences Per Second, More Is Better)
Model: T5 Encoder - Device: CPU - Executor: Standard
px: 206.09; 4484PX: 208.17; a: 156.45
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Rustls

Rustls 0.23.17 (handshakes/s, More Is Better)
Benchmark: handshake - Suite: TLS13_CHACHA20_POLY1305_SHA256
px: 57688.08; 4484PX: 57716.64; a: 76454.45
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

7-Zip Compression

7-Zip Compression (MIPS, More Is Better)
Test: Decompression Rating
px: 125605; 4484PX: 125698; a: 165916
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Blender

Blender 4.3 (Seconds, Fewer Is Better)
Blend File: Junkshop - Compute: CPU-Only
px: 97.10; 4484PX: 97.01; a: 73.56

ONNX Runtime

ONNX Runtime 1.19 (Inferences Per Second, More Is Better)
Model: ResNet101_DUC_HDC-12 - Device: CPU - Executor: Standard
px: 1.17050; 4484PX: 1.17627; a: 1.54196
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

RELION

RELION 5.0 (Seconds, Fewer Is Better)
Test: Basic - Device: CPU
px: 733.02; 4484PX: 729.40; a: 944.27
1. (CXX) g++ options: -fPIC -std=c++14 -fopenmp -O3 -rdynamic -lfftw3f -lfftw3 -ldl -ltiff -lpng -ljpeg -lmpi_cxx -lmpi

Stockfish

Stockfish 17 (Nodes Per Second, More Is Better)
Chess Benchmark
px: 42973396; 4484PX: 45267546; a: 54752796
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver

Renaissance

Renaissance 0.16 (ms, Fewer Is Better)
Test: Genetic Algorithm Using Jenetics + Futures
px: 920.7 (MIN: 888.75 / MAX: 934.44); 4484PX: 904.0 (MIN: 886.83 / MAX: 919.31); a: 732.8 (MIN: 713.67 / MAX: 813.49)

SVT-AV1

SVT-AV1 2.3 (Frames Per Second, More Is Better)
Encoder Mode: Preset 3 - Input: Bosphorus 4K
px: 7.646; 4484PX: 7.684; a: 9.590
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Build2

Build2 0.17 (Seconds, Fewer Is Better)
Time To Compile
px: 113.78; 4484PX: 111.65; a: 92.05

XNNPACK

XNNPACK b7b048 (us, Fewer Is Better)
Model: FP16MobileNetV1
px: 1386; 4484PX: 1383; a: 1143
1. (CXX) g++ options: -O3 -lrt -lm

XNNPACK b7b048 (us, Fewer Is Better)
Model: FP32MobileNetV3Small
px: 837; 4484PX: 809; a: 979
1. (CXX) g++ options: -O3 -lrt -lm

simdjson

simdjson 3.10 (GB/s, More Is Better)
Throughput Test: PartialTweets
px: 8.35; 4484PX: 10.10; a: 9.76
1. (CXX) g++ options: -O3 -lrt

x265

x265 (Frames Per Second, More Is Better)
Video Input: Bosphorus 4K
px: 26.94; 4484PX: 27.16; a: 32.57
1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

SVT-AV1

SVT-AV1 2.3 (Frames Per Second, More Is Better)
Encoder Mode: Preset 3 - Input: Beauty 4K 10-bit
px: 1.184; 4484PX: 1.188; a: 1.422
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1 2.3 (Frames Per Second, More Is Better)
Encoder Mode: Preset 8 - Input: Bosphorus 4K
px: 85.00; 4484PX: 85.20; a: 102.01
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

simdjson

simdjson 3.10 (GB/s, More Is Better)
Throughput Test: DistinctUserID
px: 8.97; 4484PX: 10.76; a: 10.46
1. (CXX) g++ options: -O3 -lrt

SVT-AV1

SVT-AV1 2.3 (Frames Per Second, More Is Better)
Encoder Mode: Preset 5 - Input: Bosphorus 4K
px: 28.82; 4484PX: 29.09; a: 34.54
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OSPRay

OSPRay 3.2 (Items Per Second, More Is Better)
Benchmark: particle_volume/pathtracer/real_time
px: 197.20; 4484PX: 199.02; a: 236.25

XNNPACK

XNNPACK b7b048 (us, Fewer Is Better)
Model: FP32MobileNetV3Large
px: 1574; 4484PX: 1515; a: 1810
1. (CXX) g++ options: -O3 -lrt -lm

NAMD

NAMD 3.0 - Input: ATPase with 327,506 Atoms (ns/day, More Is Better)
px: 2.35379 | 4484PX: 2.38124 | a: 2.79632

ONNX Runtime

ONNX Runtime 1.19 - Model: GPT-2 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 157.89 | 4484PX: 159.71 | a: 134.60
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 8 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
px: 286.96 | 4484PX: 287.05 | a: 339.02
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

XNNPACK

XNNPACK b7b048 - Model: FP16MobileNetV3Small (us, Fewer Is Better)
px: 798 | 4484PX: 779 | a: 920
1. (CXX) g++ options: -O3 -lrt -lm

Rustls

Rustls 0.23.17 - Benchmark: handshake-ticket - Suite: TLS13_CHACHA20_POLY1305_SHA256 (handshakes/s, More Is Better)
px: 342775.29 | 4484PX: 344296.24 | a: 404263.45
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

XNNPACK

XNNPACK b7b048 - Model: QS8MobileNetV2 (us, Fewer Is Better)
px: 723 | 4484PX: 717 | a: 844
1. (CXX) g++ options: -O3 -lrt -lm

Rustls

Rustls 0.23.17 - Benchmark: handshake-resume - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (handshakes/s, More Is Better)
px: 3038723.48 | 4484PX: 3035330.21 | a: 3563852.57
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

ONNX Runtime

ONNX Runtime 1.19 - Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 43.36 | 4484PX: 40.09 | a: 47.07
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 5 - Input: Beauty 4K 10-bit (Frames Per Second, More Is Better)
px: 5.551 | 4484PX: 5.602 | a: 6.504
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Rustls

Rustls 0.23.17 - Benchmark: handshake-ticket - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 (handshakes/s, More Is Better)
px: 1340712.85 | 4484PX: 1329363.10 | a: 1553632.14
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Whisperfile

Whisperfile 20Aug24 - Model Size: Small (Seconds, Fewer Is Better)
px: 167.89 | 4484PX: 173.38 | a: 195.42

Rustls

Rustls 0.23.17 - Benchmark: handshake-resume - Suite: TLS13_CHACHA20_POLY1305_SHA256 (handshakes/s, More Is Better)
px: 333574.30 | 4484PX: 333882.92 | a: 388077.69
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 3 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
px: 25.45 | 4484PX: 25.45 | a: 29.57
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

NAMD

NAMD 3.0 - Input: STMV with 1,066,628 Atoms (ns/day, More Is Better)
px: 0.65448 | 4484PX: 0.65119 | a: 0.75656

7-Zip Compression

7-Zip Compression - Test: Compression Rating (MIPS, More Is Better)
px: 142213 | 4484PX: 141263 | a: 163859
1. 7-Zip 23.01 (x64) : Copyright (c) 1999-2023 Igor Pavlov : 2023-06-20

Rustls

Rustls 0.23.17 - Benchmark: handshake-resume - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 (handshakes/s, More Is Better)
px: 1572010.68 | 4484PX: 1586292.42 | a: 1820810.21
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

Whisper.cpp

Whisper.cpp 1.6.2 - Model: ggml-medium.en - Input: 2016 State of the Union (Seconds, Fewer Is Better)
px: 809.49 | 4484PX: 809.79 | a: 700.91
1. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread -msse3 -mssse3 -mavx -mf16c -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512dq -mavx512bw -mavx512vbmi -mavx512vnni

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 5 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
px: 88.27 | 4484PX: 88.42 | a: 101.97
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

PyPerformance

PyPerformance 1.11 - Benchmark: async_tree_io (Milliseconds, Fewer Is Better)
px: 656 | 4484PX: 666 | a: 755

ONNX Runtime

ONNX Runtime 1.19 - Model: fcn-resnet101-11 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 2.79638 | 4484PX: 2.81093 | a: 3.21670
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 8 - Input: Beauty 4K 10-bit (Frames Per Second, More Is Better)
px: 10.86 | 4484PX: 10.97 | a: 12.47
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Timed Eigen Compilation

Timed Eigen Compilation 3.4.0 - Time To Compile (Seconds, Fewer Is Better)
px: 67.08 | 4484PX: 67.36 | a: 58.66

Rustls

Rustls 0.23.17 - Benchmark: handshake-ticket - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (handshakes/s, More Is Better)
px: 2292879.44 | 4484PX: 2282729.64 | a: 2620332.00
1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs

oneDNN

oneDNN 3.6 - Harness: Deconvolution Batch shapes_1d - Engine: CPU (ms, Fewer Is Better)
px: 3.40628 (MIN: 3.03) | 4484PX: 3.40293 (MIN: 3.03) | a: 2.97612 (MIN: 2.42)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -fcf-protection=full -pie -ldl

ONNX Runtime

ONNX Runtime 1.19 - Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 37.10 | 4484PX: 37.38 | a: 42.45
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Renaissance

Renaissance 0.16 - Test: Gaussian Mixture Model (ms, Fewer Is Better)
px: 3815.2 (MIN: 2749.56 / MAX: 3815.24) | 4484PX: 3860.6 (MIN: 2758.89 / MAX: 3860.61) | a: 3399.5 (MIN: 2471.52)

x265

x265 - Video Input: Bosphorus 1080p (Frames Per Second, More Is Better)
px: 101.25 | 4484PX: 101.37 | a: 114.45
1. x265 [info]: HEVC encoder version 3.5+1-f0c1022b6

Whisperfile

Whisperfile 20Aug24 - Model Size: Medium (Seconds, Fewer Is Better)
px: 475.51 | 4484PX: 473.55 | a: 534.92

ONNX Runtime

ONNX Runtime 1.19 - Model: super-resolution-10 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 125.08 | 4484PX: 125.17 | a: 141.12
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Renaissance

Renaissance 0.16 - Test: Apache Spark PageRank (ms, Fewer Is Better)
px: 2229.7 (MIN: 1612.96 / MAX: 2229.74) | 4484PX: 2138.1 (MIN: 1499.64) | a: 2412.2 (MIN: 1691.04)

Apache CouchDB

Apache CouchDB 3.4.1 - Bulk Size: 300 - Inserts: 1000 - Rounds: 30 (Seconds, Fewer Is Better)
px: 119.35 | 4484PX: 117.57 | a: 106.13
1. (CXX) g++ options: -flto -lstdc++ -shared -lei

Whisperfile

Whisperfile 20Aug24 - Model Size: Tiny (Seconds, Fewer Is Better)
px: 38.72 | 4484PX: 37.13 | a: 41.71

simdjson

simdjson 3.10 - Throughput Test: Kostya (GB/s, More Is Better)
px: 5.45 | 4484PX: 6.11 | a: 5.97
1. (CXX) g++ options: -O3 -lrt

Numpy Benchmark

This test measures general NumPy performance. Learn more via the OpenBenchmarking.org test page.

Numpy Benchmark (Score, More Is Better)
px: 831.42 | 4484PX: 745.59 | a: 775.75

Apache CouchDB

Apache CouchDB 3.4.1 - Bulk Size: 500 - Inserts: 1000 - Rounds: 30 (Seconds, Fewer Is Better)
px: 164.81 | 4484PX: 164.47 | a: 148.05
1. (CXX) g++ options: -flto -lstdc++ -shared -lei

Renaissance

Renaissance 0.16 - Test: Scala Dotty (ms, Fewer Is Better)
px: 436.2 (MIN: 380.62 / MAX: 721.56) | 4484PX: 428.6 (MIN: 378.22 / MAX: 628.77) | a: 477.0 (MIN: 371.54 / MAX: 736.5)

Apache CouchDB

Apache CouchDB 3.4.1 - Bulk Size: 300 - Inserts: 3000 - Rounds: 30 (Seconds, Fewer Is Better)
px: 408.48 | 4484PX: 406.12 | a: 367.83
1. (CXX) g++ options: -flto -lstdc++ -shared -lei

QuantLib

QuantLib 1.35-dev - Size: XXS (tasks/s, More Is Better)
px: 12.11 | 4484PX: 12.12 | a: 13.43
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

CP2K Molecular Dynamics

CP2K Molecular Dynamics 2024.3 - Input: H20-64 (Seconds, Fewer Is Better)
px: 52.72 | 4484PX: 53.01 | a: 58.19
1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

Renaissance

Renaissance 0.16 - Test: Akka Unbalanced Cobwebbed Tree (ms, Fewer Is Better)
px: 4002.3 (MIN: 4002.27 / MAX: 4983.72) | 4484PX: 4038.4 (MIN: 4038.36 / MAX: 5089.28) | a: 4403.8 (MAX: 5719.11)

Llama.cpp

Llama.cpp b4154 - Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 52.37 | 4484PX: 52.30 | a: 47.72
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

Apache CouchDB

Apache CouchDB 3.4.1 - Bulk Size: 100 - Inserts: 3000 - Rounds: 30 (Seconds, Fewer Is Better)
px: 254.73 | 4484PX: 253.99 | a: 232.19
1. (CXX) g++ options: -flto -lstdc++ -shared -lei

ONNX Runtime

ONNX Runtime 1.19 - Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 356.19 | 4484PX: 356.41 | a: 390.60
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Apache CouchDB

Apache CouchDB 3.4.1 - Bulk Size: 500 - Inserts: 3000 - Rounds: 30 (Seconds, Fewer Is Better)
px: 560.70 | 4484PX: 559.35 | a: 511.78
1. (CXX) g++ options: -flto -lstdc++ -shared -lei

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better)
px: 194.02 | 4484PX: 198.11 | a: 212.52
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

XNNPACK

XNNPACK b7b048 - Model: FP32MobileNetV2 (us, Fewer Is Better)
px: 1368 | 4484PX: 1365 | a: 1495
1. (CXX) g++ options: -O3 -lrt -lm

Whisper.cpp

Whisper.cpp 1.6.2 - Model: ggml-small.en - Input: 2016 State of the Union (Seconds, Fewer Is Better)
px: 266.81 | 4484PX: 268.24 | a: 245.08
1. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread -msse3 -mssse3 -mavx -mf16c -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512dq -mavx512bw -mavx512vbmi -mavx512vnni

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 13 - Input: Bosphorus 1080p (Frames Per Second, More Is Better)
px: 769.82 | 4484PX: 776.12 | a: 842.56
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Renaissance

Renaissance 0.16 - Test: Random Forest (ms, Fewer Is Better)
px: 453.2 (MIN: 352.31 / MAX: 513.31) | 4484PX: 422.0 (MIN: 357.91 / MAX: 497.55) | a: 414.4 (MIN: 322.79 / MAX: 466.1)

PyPerformance

PyPerformance 1.11 - Benchmark: asyncio_tcp_ssl (Milliseconds, Fewer Is Better)
px: 590 | 4484PX: 590 | a: 645

Apache CouchDB

Apache CouchDB 3.4.1 - Bulk Size: 100 - Inserts: 1000 - Rounds: 30 (Seconds, Fewer Is Better)
px: 76.39 | 4484PX: 75.90 | a: 69.93
1. (CXX) g++ options: -flto -lstdc++ -shared -lei

ONNX Runtime

ONNX Runtime 1.19 - Model: ZFNet-512 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 110.89 | 4484PX: 110.94 | a: 102.33
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

Renaissance

Renaissance 0.16 - Test: Apache Spark Bayes (ms, Fewer Is Better)
px: 474.9 (MIN: 454.77 / MAX: 514.32) | 4484PX: 513.2 (MIN: 453.66 / MAX: 554.7) | a: 490.0 (MIN: 459.29 / MAX: 580.9)

QuantLib

QuantLib 1.35-dev - Size: S (tasks/s, More Is Better)
px: 11.84 | 4484PX: 11.86 | a: 12.75
1. (CXX) g++ options: -O3 -march=native -fPIE -pie

Renaissance

Renaissance 0.16 - Test: Finagle HTTP Requests (ms, Fewer Is Better)
px: 2483.1 (MIN: 1933.43) | 4484PX: 2492.2 (MIN: 1947.63) | a: 2319.4 (MIN: 1832.84)

GROMACS

GROMACS - Input: water_GMX50_bare (Ns Per Day, More Is Better)
px: 1.575 | 4484PX: 1.577 | a: 1.692
1. GROMACS version: 2023.3-Ubuntu_2023.3_1ubuntu3

ONNX Runtime

ONNX Runtime 1.19 - Model: bertsquad-12 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 14.57 | 4484PX: 14.51 | a: 15.59
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

SVT-AV1

SVT-AV1 2.3 - Encoder Mode: Preset 13 - Input: Beauty 4K 10-bit (Frames Per Second, More Is Better)
px: 17.36 | 4484PX: 17.41 | a: 18.59
1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Whisper.cpp

Whisper.cpp 1.6.2 - Model: ggml-base.en - Input: 2016 State of the Union (Seconds, Fewer Is Better)
px: 93.45 | 4484PX: 92.71 | a: 87.49
1. (CXX) g++ options: -O3 -std=c++11 -fPIC -pthread -msse3 -mssse3 -mavx -mf16c -mfma -mavx2 -mavx512f -mavx512cd -mavx512vl -mavx512dq -mavx512bw -mavx512vbmi -mavx512vnni

Llama.cpp

Llama.cpp b4154 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024 (Tokens Per Second, More Is Better)
px: 66.35 | 4484PX: 66.57 | a: 70.85
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

CP2K Molecular Dynamics

CP2K Molecular Dynamics 2024.3 - Input: H20-256 (Seconds, Fewer Is Better)
px: 631.31 | 4484PX: 628.10 | a: 592.86
1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

LiteRT

LiteRT 2024-10-15 - Model: Inception V4 (Microseconds, Fewer Is Better)
px: 22752.4 | 4484PX: 22083.3 | a: 21477.8

Llamafile

Llamafile 0.8.16 - Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 27.80 | 4484PX: 27.59 | a: 26.28

Renaissance

Renaissance 0.16 - Test: ALS Movie Lens (ms, Fewer Is Better)
px: 9275.7 (MIN: 8821.09 / MAX: 9495.91) | 4484PX: 9378.8 (MIN: 8718.36 / MAX: 9413.7) | a: 9805.7 (MIN: 9253.4 / MAX: 10057.61)

FinanceBench

FinanceBench is a collection of financial program benchmarks with support for benchmarking on the GPU via OpenCL and on the CPU via OpenMP. The FinanceBench test cases cover the Black-Scholes-Merton process with an analytic European option engine, the QMC (Sobol) Monte-Carlo method (equity option example), a fixed-rate bond with a flat forward curve (Bonds), and a securities repurchase agreement (Repo). FinanceBench was originally written by the Cavazos Lab at the University of Delaware. Learn more via the OpenBenchmarking.org test page.

FinanceBench 2016-07-25 - Benchmark: Bonds OpenMP (ms, Fewer Is Better)
px: 34896.84 | 4484PX: 34600.77 | a: 33061.22
1. (CXX) g++ options: -O3 -march=native -fopenmp
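
As a rough illustration of the first FinanceBench test case named in the description above (the Black-Scholes-Merton process with an analytic European option engine), the closed-form price of a European call can be sketched in a few lines of Python. This is not FinanceBench's own code, and it is unrelated to the Bonds and Repo timings reported here; the function names `norm_cdf` and `bs_call_price` and the sample inputs are assumptions chosen only for illustration.

```python
import math


def norm_cdf(x: float) -> float:
    """Standard normal cumulative distribution, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bs_call_price(spot: float, strike: float, rate: float,
                  vol: float, years: float) -> float:
    """Black-Scholes-Merton price of a European call with no dividends.

    Hypothetical helper for illustration; all parameter names are assumed.
    rate and vol are annualised and expressed as decimals (0.05 = 5%).
    """
    sqrt_t = math.sqrt(years)
    d1 = (math.log(spot / strike) + (rate + 0.5 * vol * vol) * years) / (vol * sqrt_t)
    d2 = d1 - vol * sqrt_t
    discounted_strike = strike * math.exp(-rate * years)
    return spot * norm_cdf(d1) - discounted_strike * norm_cdf(d2)


if __name__ == "__main__":
    # Sample inputs (assumed): at-the-money call, 5% rate, 20% volatility, 1 year.
    print(f"call price: {bs_call_price(100.0, 100.0, 0.05, 0.20, 1.0):.4f}")
```

A benchmark of this kind typically evaluates such a formula over a large batch of options, which is why it responds well to OpenMP threading across cores.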

PyPerformance

PyPerformance 1.11 - Benchmark: python_startup (Milliseconds, Fewer Is Better)
px: 6.09 | 4484PX: 6.08 | a: 5.77

Llamafile

Llamafile 0.8.16 - Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Text Generation 16 (Tokens Per Second, More Is Better)
px: 25.94 | 4484PX: 25.86 | a: 24.59

Gcrypt Library

Gcrypt Library 1.10.3 (Seconds, Fewer Is Better)
px: 163.84 | 4484PX: 171.02 | a: 162.13
1. (CC) gcc options: -O2 -fvisibility=hidden

OpenVINO GenAI

OpenVINO GenAI 2024.5 - Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU (tokens/s, More Is Better)
px: 20.29 | 4484PX: 20.28 | a: 19.28

XNNPACK

XNNPACK b7b048 - Model: FP16MobileNetV2 (us, Fewer Is Better)
px: 1248 | 4484PX: 1217 | a: 1190
1. (CXX) g++ options: -O3 -lrt -lm

Renaissance

Renaissance 0.16 - Test: Savina Reactors.IO (ms, Fewer Is Better)
px: 3676.0 (MAX: 4536.84) | 4484PX: 3655.8 (MIN: 3655.76 / MAX: 4484.97) | a: 3506.4 (MIN: 3506.38 / MAX: 4329.37)

Llamafile

Llamafile 0.8.16 - Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 10.93 | 4484PX: 10.91 | a: 10.47

PyPerformance

PyPerformance 1.11 - Benchmark: gc_collect (Milliseconds, Fewer Is Better)
px: 706 | 4484PX: 699 | a: 677

FinanceBench

FinanceBench 2016-07-25 - Benchmark: Repo OpenMP (ms, Fewer Is Better)
px: 22318.74 | 4484PX: 22320.33 | a: 21418.45
1. (CXX) g++ options: -O3 -march=native -fopenmp

OpenVINO GenAI

OpenVINO GenAI 2024.5 - Model: Gemma-7b-int4-ov - Device: CPU (tokens/s, More Is Better)
px: 10.24 | 4484PX: 10.23 | a: 9.83

Llama.cpp

Llama.cpp b4154 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512 (Tokens Per Second, More Is Better)
px: 67.95 | 4484PX: 69.11 | a: 70.76
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

Llama.cpp b4154 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024 (Tokens Per Second, More Is Better)
px: 66.52 | 4484PX: 66.85 | a: 69.26
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

XNNPACK

XNNPACK b7b048 - Model: FP16MobileNetV3Large (us, Fewer Is Better)
px: 1527 | 4484PX: 1467 | a: 1498
1. (CXX) g++ options: -O3 -lrt -lm

PyPerformance

PyPerformance 1.11 - Benchmark: raytrace (Milliseconds, Fewer Is Better)
px: 182 | 4484PX: 182 | a: 175

PyPerformance 1.11 - Benchmark: chaos (Milliseconds, Fewer Is Better)
px: 39.4 | 4484PX: 39.7 | a: 38.2

PyPerformance 1.11 - Benchmark: regex_compile (Milliseconds, Fewer Is Better)
px: 72.5 | 4484PX: 71.7 | a: 69.8

PyPerformance 1.11 - Benchmark: crypto_pyaes (Milliseconds, Fewer Is Better)
px: 43.3 | 4484PX: 43.1 | a: 41.7

OpenVINO GenAI

OpenVINO GenAI 2024.5 - Model: Falcon-7b-instruct-int4-ov - Device: CPU (tokens/s, More Is Better)
px: 13.41 | 4484PX: 13.40 | a: 12.93

Llama.cpp

Llama.cpp b4154 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 7.12 | 4484PX: 7.11 | a: 6.88
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

simdjson

simdjson 3.10 - Throughput Test: TopTweet (GB/s, More Is Better)
px: 10.51 | 4484PX: 10.82 | a: 10.46
1. (CXX) g++ options: -O3 -lrt

Llamafile

Llamafile 0.8.16 - Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Text Generation 16 (Tokens Per Second, More Is Better)
px: 1.84 | 4484PX: 1.83 | a: 1.78

PyPerformance

PyPerformance 1.11 - Benchmark: json_loads (Milliseconds, Fewer Is Better)
px: 12.5 | 4484PX: 12.4 | a: 12.1

ONNX Runtime

ONNX Runtime 1.19 - Model: yolov4 - Device: CPU - Executor: Standard (Inferences Per Second, More Is Better)
px: 10.71 | 4484PX: 10.73 | a: 11.06
1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

LiteRT

LiteRT 2024-10-15 - Model: Mobilenet Quant (Microseconds, Fewer Is Better)
px: 849.21 | 4484PX: 848.94 | a: 823.17

Llamafile

Llamafile 0.8.16 - Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 2.05 | 4484PX: 2.05 | a: 1.99

CP2K Molecular Dynamics

CP2K Molecular Dynamics 2024.3 - Input: Fayalite-FIST (Seconds, Fewer Is Better)
px: 94.90 | 4484PX: 92.21 | a: 94.03
1. (F9X) gfortran options: -fopenmp -march=native -mtune=native -O3 -funroll-loops -fbacktrace -ffree-form -fimplicit-none -std=f2008 -lcp2kstart -lcp2kmc -lcp2kswarm -lcp2kmotion -lcp2kthermostat -lcp2kemd -lcp2ktmc -lcp2kmain -lcp2kdbt -lcp2ktas -lcp2kgrid -lcp2kgriddgemm -lcp2kgridcpu -lcp2kgridref -lcp2kgridcommon -ldbcsrarnoldi -ldbcsrx -lcp2kdbx -lcp2kdbm -lcp2kshg_int -lcp2keri_mme -lcp2kminimax -lcp2khfxbase -lcp2ksubsys -lcp2kxc -lcp2kao -lcp2kpw_env -lcp2kinput -lcp2kpw -lcp2kgpu -lcp2kfft -lcp2kfpga -lcp2kfm -lcp2kcommon -lcp2koffload -lcp2kmpiwrap -lcp2kbase -ldbcsr -lsirius -lspla -lspfft -lsymspg -lvdwxc -l:libhdf5_fortran.a -l:libhdf5.a -lz -lgsl -lelpa_openmp -lcosma -lcosta -lscalapack -lxsmmf -lxsmm -ldl -lpthread -llibgrpp -lxcf03 -lxc -lint2 -lfftw3_mpi -lfftw3 -lfftw3_omp -lmpi_cxx -lmpi -l:libopenblas.a -lvori -lstdc++ -lmpi_usempif08 -lmpi_mpifh -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm

PyPerformance

PyPerformance 1.11 - Benchmark: xml_etree (Milliseconds, Fewer Is Better)
px: 36.5 | 4484PX: 36.8 | a: 35.8

Llama.cpp

Llama.cpp b4154 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 7.44 | 4484PX: 7.41 | a: 7.24
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

LiteRT

LiteRT 2024-10-15 - Model: Mobilenet Float (Microseconds, Fewer Is Better)
px: 1244.51 | 4484PX: 1244.70 | a: 1211.48

Renaissance

Renaissance 0.16 - Test: In-Memory Database Shootout (ms, Fewer Is Better)
px: 3175.6 (MIN: 2896.06 / MAX: 3367.44) | 4484PX: 3241.5 (MIN: 3037.03 / MAX: 3491.91) | a: 3256.1 (MIN: 3019.89 / MAX: 3599.5)

Llamafile

Llamafile 0.8.16 - Model: Llama-3.2-3B-Instruct.Q6_K - Test: Text Generation 16 (Tokens Per Second, More Is Better)
px: 19.50 | 4484PX: 19.49 | a: 19.03

PyPerformance

PyPerformance 1.11 - Benchmark: pickle_pure_python (Milliseconds, Fewer Is Better)
px: 168 | 4484PX: 169 | a: 165

PyPerformance 1.11 - Benchmark: django_template (Milliseconds, Fewer Is Better)
px: 21.2 | 4484PX: 21.0 | a: 20.7

Llamafile

Llamafile 0.8.16 - Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Text Generation 16 (Tokens Per Second, More Is Better)
px: 10.45 | 4484PX: 10.45 | a: 10.22

PyPerformance

PyPerformance 1.11 - Benchmark: asyncio_websockets (Milliseconds, Fewer Is Better)
px: 322 | 4484PX: 321 | a: 315

PyPerformance 1.11 - Benchmark: go (Milliseconds, Fewer Is Better)
px: 79.4 | 4484PX: 78.6 | a: 77.8

Llamafile

Llamafile 0.8.16 - Model: Llama-3.2-3B-Instruct.Q6_K - Test: Text Generation 128 (Tokens Per Second, More Is Better)
px: 20.51 | 4484PX: 20.39 | a: 20.13

Y-Cruncher

Y-Cruncher 0.8.5 - Pi Digits To Calculate: 500M (Seconds, Fewer Is Better)
px: 8.623 | 4484PX: 8.688 | a: 8.772

XNNPACK

XNNPACK b7b048 - Model: FP32MobileNetV1 (us, Fewer Is Better)
px: 1272 | 4484PX: 1257 | a: 1252
1. (CXX) g++ options: -O3 -lrt -lm

LiteRT

LiteRT 2024-10-15 - Model: SqueezeNet (Microseconds, Fewer Is Better)
px: 1821.35 | 4484PX: 1809.18 | a: 1794.11

PyPerformance

PyPerformance 1.11 - Benchmark: pathlib (Milliseconds, Fewer Is Better)
px: 14.4 | 4484PX: 14.4 | a: 14.2

PyPerformance 1.11 - Benchmark: float (Milliseconds, Fewer Is Better)
px: 50.8 | 4484PX: 51.3 | a: 50.7

Llama.cpp

Llama.cpp b4154 - Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
px: 63.79 | 4484PX: 63.80 | a: 63.09
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

Llama.cpp b4154 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
px: 63.41 | 4484PX: 63.61 | a: 62.97
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

Llama.cpp b4154 - Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512 (Tokens Per Second, More Is Better)
px: 68.81 | 4484PX: 68.20 | a: 68.40
1. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -fopenmp -march=native -mtune=native -lopenblas

PyPerformance

PyPerformance 1.11 - Benchmark: nbody (Milliseconds, Fewer Is Better)
px: 59.2 | 4484PX: 59.5 | a: 59.0

Y-Cruncher

Y-Cruncher 0.8.5 - Pi Digits To Calculate: 1B (Seconds, Fewer Is Better)
px: 18.37 | 4484PX: 18.38 | a: 18.49

simdjson

simdjson 3.10 - Throughput Test: LargeRandom (GB/s, More Is Better)
px: 1.84 | 4484PX: 1.84 | a: 1.83
1. (CXX) g++ options: -O3 -lrt

LiteRT

LiteRT 2024-10-15 - Model: Inception ResNet V2 (Microseconds, Fewer Is Better)
px: 19490.7 | 4484PX: 19477.8 | a: 19530.2

Llamafile

Llamafile 0.8.16 - Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
px: 12288 | 4484PX: 12288 | a: 12288

Llamafile 0.8.16 - Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 1024 (Tokens Per Second, More Is Better)
px: 6144 | 4484PX: 6144 | a: 6144

Llamafile 0.8.16 - Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 512 (Tokens Per Second, More Is Better)
px: 3072 | 4484PX: 3072 | a: 3072

Llamafile 0.8.16 - Model: wizardcoder-python-34b-v1.0.Q6_K - Test: Prompt Processing 256 (Tokens Per Second, More Is Better)
px: 1536 | 4484PX: 1536 | a: 1536

Llamafile 0.8.16 - Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
px: 32768 | 4484PX: 32768 | a: 32768

Llamafile 0.8.16 - Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 1024 (Tokens Per Second, More Is Better)
px: 16384 | 4484PX: 16384 | a: 16384

Llamafile 0.8.16 - Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 512 (Tokens Per Second, More Is Better)
px: 8192 | 4484PX: 8192 | a: 8192

Llamafile 0.8.16 - Model: mistral-7b-instruct-v0.2.Q5_K_M - Test: Prompt Processing 256 (Tokens Per Second, More Is Better)
px: 4096 | 4484PX: 4096 | a: 4096

Llamafile 0.8.16 - Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
px: 32768 | 4484PX: 32768 | a: 32768

Llamafile 0.8.16 - Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 1024 (Tokens Per Second, More Is Better)
px: 16384 | 4484PX: 16384 | a: 16384

Llamafile 0.8.16 - Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 512 (Tokens Per Second, More Is Better)
px: 8192 | 4484PX: 8192 | a: 8192

Llamafile 0.8.16 - Model: TinyLlama-1.1B-Chat-v1.0.BF16 - Test: Prompt Processing 256 (Tokens Per Second, More Is Better)
px: 4096 | 4484PX: 4096 | a: 4096

Llamafile 0.8.16 - Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 2048 (Tokens Per Second, More Is Better)
px: 32768 | 4484PX: 32768 | a: 32768

Llamafile 0.8.16 - Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 1024 (Tokens Per Second, More Is Better)
px: 16384 | 4484PX: 16384 | a: 16384

Llamafile 0.8.16 - Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 512 (Tokens Per Second, More Is Better)
px: 8192 | 4484PX: 8192 | a: 8192

Llamafile 0.8.16 - Model: Llama-3.2-3B-Instruct.Q6_K - Test: Prompt Processing 256 (Tokens Per Second, More Is Better)
px: 4096 | 4484PX: 4096 | a: 4096

OpenVINO GenAI

Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU

a: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

4484PX: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

px: The test quit with a non-zero exit status. E: RuntimeError: Exception from src/inference/src/cpp/core.cpp:90:

OpenSSL

Algorithm: RSA4096

a: The test quit with a non-zero exit status.

4484PX: The test quit with a non-zero exit status.

px: The test quit with a non-zero exit status.

Algorithm: SHA512

a: The test quit with a non-zero exit status.

4484PX: The test quit with a non-zero exit status.

px: The test quit with a non-zero exit status.

Algorithm: SHA256

a: The test quit with a non-zero exit status.

4484PX: The test quit with a non-zero exit status.

px: The test quit with a non-zero exit status.

Renaissance

Test: Apache Spark ALS

a: The test quit with a non-zero exit status.

4484PX: The test quit with a non-zero exit status.

px: The test quit with a non-zero exit status.

195 Results Shown

  mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 2048
  mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 1024
  mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 512
  mistral-7b-instruct-v0.2.Q5_K_M - Prompt Processing 256
  TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 2048
  TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 1024
  TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 512
  TinyLlama-1.1B-Chat-v1.0.BF16 - Prompt Processing 256
  Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 2048
  Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 1024
  Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 512
  Llama-3.2-3B-Instruct.Q6_K - Prompt Processing 256