9684X EOY2024 AMD Linux Tests for a future article. 2 x AMD EPYC 9684X 96-Core testing with a AMD Titanite_4G (RTI1007B BIOS) and ASPEED on Ubuntu 24.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2412305-NE-9684XEOY256&grs .
9684X EOY2024 AMD Linux Processor Motherboard Chipset Memory Disk Graphics Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution a b c 2 x AMD EPYC 9684X 96-Core @ 2.55GHz (192 Cores / 384 Threads) AMD Titanite_4G (RTI1007B BIOS) AMD Device 14a4 1520GB 3201GB Micron_7450_MTFDKCB3T2TFS ASPEED Broadcom NetXtreme BCM5720 PCIe Ubuntu 24.10 6.11.0-13-generic (x86_64) GNOME Shell 47.0 X Server GCC 14.2.0 ext4 1920x1200 OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2,rust --enable-libphobos-checking=release --enable-libstdcxx-backtrace --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-14-zdkDXv/gcc-14-14.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa101148 Java Details - OpenJDK Runtime Environment (build 21.0.5+11-Ubuntu-1ubuntu124.10) Python Details - Python 3.12.7 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + reg_file_data_sampling: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of Safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS; IBPB: conditional; STIBP: always-on; RSB filling; PBRSB-eIBRS: Not affected; BHI: Not affected + srbds: Not affected + tsx_async_abort: Not affected
9684X EOY2024 AMD Linux xnnpack: FP32MobileNetV1 litert: DeepLab V3 litert: Mobilenet Quant litert: Mobilenet Float xnnpack: FP32MobileNetV2 minife: Small svt-av1: Preset 13 - Bosphorus 1080p xnnpack: QS8MobileNetV2 xnnpack: FP16MobileNetV1 litert: SqueezeNet xnnpack: FP16MobileNetV3Small ncnn: CPU - yolov4-tiny ncnn: CPUv2-yolov3v2-yolov3 - mobilenetv2-yolov3 ncnn: CPU - mobilenet ncnn: CPU - vision_transformer ncnn: CPU - FastestDet litert: Quantized COCO SSD MobileNet v1 ncnn: CPU - vgg16 ncnn: CPU - mnasnet svt-av1: Preset 13 - Bosphorus 4K ncnn: CPU - squeezenet_ssd ncnn: CPU - shufflenet-v2 ncnn: CPU - alexnet ncnn: CPU - googlenet ncnn: CPU - resnet50 ncnn: CPU - resnet18 stockfish: Chess Benchmark litert: Inception V4 llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Text Generation 128 ncnn: CPU - blazeface ncnn: CPU - efficientnet-b0 ncnn: CPU-v3-v3 - mobilenet-v3 xnnpack: FP16MobileNetV3Large litert: Inception ResNet V2 litert: NASNet Mobile xnnpack: FP32MobileNetV3Small xnnpack: FP32MobileNetV3Large ncnn: CPU - regnety_400m cassandra: Writes svt-av1: Preset 8 - Bosphorus 4K incompact3d: input.i3d 129 Cells Per Direction c-ray: 1080p - 16 llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 512 llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 512 llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 1024 llama-cpp: CPU BLAS - Llama-3.1-Tulu-3-8B-Q8_0 - Prompt Processing 2048 primesieve: 1e12 srsran: PUSCH Processor Benchmark, Throughput Total openvino-genai: Gemma-7b-int4-ov - CPU llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 512 warpx: Uniform Plasma openvino-genai: Falcon-7b-instruct-int4-ov - CPU graph500: 26 llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Text Generation 128 xnnpack: FP16MobileNetV2 gromacs: water_GMX50_bare graph500: 26 openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU incompact3d: input.i3d 193 Cells Per Direction rustls: handshake-resume - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 ncnn: CPU-v2-v2 - mobilenet-v2 openvino: Noise Suppression Poconet-Like FP16 - CPU compress-7zip: Decompression Rating llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 2048 warpx: Plasma Acceleration svt-av1: Preset 8 - Bosphorus 1080p graph500: 26 srsran: PDSCH Processor Benchmark, Throughput Total llama-cpp: CPU BLAS - granite-3.0-3b-a800m-instruct-Q8_0 - Prompt Processing 1024 llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 1024 x265: Bosphorus 1080p svt-av1: Preset 5 - Bosphorus 4K openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU etcpak: Multi-Threaded - ETC2 openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Noise Suppression Poconet-Like FP16 - CPU relion: Basic - CPU incompact3d: X3D-benchmarking input.i3d openvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPU llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Prompt Processing 2048 blender: Fishy Cat - CPU-Only graph500: 26 openssl: RSA4096 rustls: handshake-ticket - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 compress-7zip: Compression Rating rustls: handshake-resume - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 oidn: RT.hdr_alb_nrm.3840x2160 - CPU-Only rustls: handshake-ticket - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 laghos: Sedov Blast Wave, ube_922_hex.mesh openvino: Age Gender Recognition Retail 0013 FP16 - CPU x265: Bosphorus 1080p openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU blender: BMW27 - CPU-Only c-ray: 4K - 16 build2: Time To Compile gromacs: MPI CPU - water_GMX50_bare svt-av1: Preset 5 - Bosphorus 1080p compress-7zip: Compression Rating build-ffmpeg: Time To Compile ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU epoch: Cone oidn: RTLightmap.hdr.4096x4096 - CPU-Only ospray: particle_volume/pathtracer/real_time build-eigen: Time To Compile compress-7zip: Decompression Rating blender: Junkshop - CPU-Only webp: Quality 100, Highest Compression svt-av1: Preset 3 - Bosphorus 4K build-linux-kernel: defconfig ospray-studio: 3 - 4K - 16 - Path Tracer - CPU y-cruncher: 500M namd: STMV with 1,066,628 Atoms ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU blender: Barbershop - CPU-Only laghos: Triple Point Problem build-linux-kernel: allmodconfig openvino: Weld Porosity Detection FP16-INT8 - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU rustls: handshake - TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 ospray-studio: 3 - 4K - 32 - Path Tracer - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU palabos: 100 openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU c-ray: 5K - 16 webp: Default llama-cpp: CPU BLAS - Mistral-7B-Instruct-v0.3-Q8_0 - Text Generation 128 openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU x265: Bosphorus 4K ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU ospray-studio: 2 - 4K - 1 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU x265: Bosphorus 4K ospray-studio: 2 - 4K - 16 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/ao/real_time ospray-studio: 1 - 4K - 16 - Path Tracer - CPU ospray-studio: 1 - 4K - 1 - Path Tracer - CPU palabos: 400 namd: ATPase with 327,506 Atoms webp: Quality 100 build-nodejs: Time To Compile openvino: Person Vehicle Bike Detection FP16 - CPU ospray: particle_volume/ao/real_time openvino: Person Vehicle Bike Detection FP16 - CPU mt-dgemm: Sustained Floating-Point Rate blender: Pabellon Barcelona - CPU-Only openvino: Face Detection Retail FP16 - CPU ospray: particle_volume/scivis/real_time openvino: Handwritten English Recognition FP16-INT8 - CPU oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only ospray: gravity_spheres_volume/dim_512/pathtracer/real_time palabos: 500 ospray-studio: 1 - 4K - 32 - Path Tracer - CPU blender: Classroom - CPU-Only ospray: gravity_spheres_volume/dim_512/scivis/real_time palabos: 1000 openvino: Handwritten English Recognition FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU y-cruncher: 5B build-php: Time To Compile openssl: SHA256 openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openssl: ChaCha20 openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Person Re-Identification Retail FP16 - CPU openssl: AES-256-GCM byte: Dhrystone 2 openvino: Face Detection Retail FP16-INT8 - CPU primesieve: 1e13 openvino: Vehicle Detection FP16 - CPU rustls: handshake - TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 openvino: Handwritten English Recognition FP16 - CPU openvino: Face Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU svt-av1: Preset 3 - Bosphorus 1080p openssl: SHA512 openvino: Vehicle Detection FP16 - CPU openssl: RSA4096 openssl: ChaCha20-Poly1305 openvino: Road Segmentation ADAS FP16 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openssl: AES-128-GCM openvino: Road Segmentation ADAS FP16 - CPU y-cruncher: 1B byte: Pipe byte: Whetstone Double byte: System Call openvino: Face Detection FP16 - CPU openvino: Person Re-Identification Retail FP16 - CPU openvino: Face Detection Retail FP16 - CPU webp: Quality 100, Lossless, Highest Compression webp: Quality 100, Lossless openvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPU - Time Per Output Token openvino-genai: Phi-3-mini-128k-instruct-int4-ov - CPU - Time To First Token openvino-genai: Falcon-7b-instruct-int4-ov - CPU - Time Per Output Token openvino-genai: Falcon-7b-instruct-int4-ov - CPU - Time To First Token openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time Per Output Token openvino-genai: TinyLlama-1.1B-Chat-v1.0 - CPU - Time To First Token openvino-genai: Gemma-7b-int4-ov - CPU - Time Per Output Token openvino-genai: Gemma-7b-int4-ov - CPU - Time To First Token quantlib: S a b c 24800 52254.8 20120 20933.9 33117 65310.9 551.636 48535 20169 31218 48621 41.1 32.32 32.32 62.7 33.63 29419.1 57.29 21.21 154.085 48.87 28.54 10.05 38.09 40.59 23.91 512969802 162456 26.13 15.23 34.25 27.96 61636 254071 581416 59877 61147 130.33 296920 110.454 1.29827499 5.041 69.99 75.67 74.92 75.93 1.147 7635.4 33.63 158.95 39.29356258 50.58 1653580000 54.03 60203 12.691 879749000 164808.53 3.97435308 2839244.84 22.98 6669.51 1155891 160.28 37.69209741 316.396 1540820000 129795.7 158.71 75.97 100.86 39.596 0.36 875.752 134020.93 25.72 76.042 160.910339 57.19 76.4 11.74 615929000 2833287.3 1390278.58 804522 1698216.97 3.29 2073385.56 404.654108843 0.56 97.18 68.29 9 18.262 73.151 19.994 95.439 784684 19.901 2603 455.5 1.57 202.362 40.544 1178953 12.3 3.57 11.022 28.842 11987 4.607 5.65889 2573 79.23 202.39 206.651 17881.36 6054 399853.9 23858 749 734.581 10.61 47.76 1004.23 32.096 17.92 30.83 47.76 1004.04 35.86 5169 3042 642 162 164 5158 20412 190 34.9 10279 43.3925 10152 638 530.001 17.85166 11.18 111.738 9438.66 46.8907 5.07 3788.093096 26.74 20737.16 46.8538 37.62 3.25 50.4084 556.406 20328 22.54 42.9096 659.936 39.29 20.3 32.809 41.191 242548530680 13.51 9365.24 179.06 3548.57 4.62 983205666540 5096.13 31408.22 11355.93 1529363548300 12634644588.3 6.09 12.83 6.42 82349.22 4858.25 95.79 267.41 28.759 78840468670 7461.75 74941.9 686377188830 3190.93 10334.31 1787346309360 15.03 9.057 302008825.4 2713226.7 247897646.3 499.71 4.21 2.3 0.56 1.44 17.49 27.52 19.77 34.43 14.64 17.76 29.74 46.08 43522 52731.5 18899.6 16633.9 33453 67739.3 480.526 46034 21882 30070.9 38085 48.37 37.78 37.78 75.85 37.5 27302.5 67.68 23.79 187.688 55.33 34.31 11.55 43.26 45.84 26.78 477502351 189474 30.39 16.49 39.4 28.66 57237 258097 615123 53472 63696 145.07 283197 121.817 1.28448296 5.323 65.87 72.24 79.85 71.43 1.215 8082.6 31.8 158.65 41.4413694 48 1641600000 54.25 62409 12.131 842062000 158277.32 4.15029907 2963208.68 23.98 6771.73 1180405 154.18 39.11805274 328.083 1524900000 132681.9 159.58 78.14 99.95 38.524 0.37 853.032 137459.28 26.04 74.219 163.125946 55.87 75.34 12 610704000 2820653.3 1365832.45 789248 1715553.24 3.25 2060813.4 397.50 0.57 98.77 68.45 9.02 18.502 73.584 19.7 94.715 773934 20.029 2585 461.42 1.55 199.846 40.093 1192798 12.44 3.59 10.927 29.001 12038 4.656 5.65973 2575 79.31 203.72 207.598 17864.8 6085 403088.18 24050 751 729.043 10.61 47.84 1002.63 32.067 18.05 31.05 47.71 1005.17 35.78 5196 3035 641 163 163 5143 20522 190 34.83 10233 43.3994 10163 640 529.249 17.85590 11.13 111.629 9405.62 46.9915 5.09 3796.85965 26.65 20684.11 46.9547 37.5 3.26 50.5096 556.414 20294 22.59 42.8705 659.837 39.36 20.29 32.889 41.222 242399788540 13.54 9360.81 179.29 3541.66 4.63 983433981080 5092.6 31395.35 11377.2 1529285362960 12655563306.7 6.09 12.809 6.41 82232.23 4855.51 95.78 267.41 28.792 78904969300 7462.48 75010.2 685952939270 3188.25 10339.33 1788600410900 15.04 9.057 302075584.9 2714473.2 247922303.2 499.92 4.21 2.3 0.56 1.44 17.9 27.83 20.84 35.01 14.61 17.54 31.44 48.41 24895 91444.5 31618.4 25812.1 45462 50712.3 636.984 60593 26390 38557.3 47611 52.04 40.6 40.6 78.66 42.04 34124.3 71.29 26.06 184.333 58.88 32.46 11.91 45.04 47.57 27.99 557325172 187012 30.19 17.7 38.28 32.02 54309 288314 543452 53286 56805 146.13 267019 115.683 1.41423202 4.865 72.06 78.73 73.96 75.24 1.145 7631.4 32.01 167.59 41.02977149 49.8 1727360000 56.6 59616 12.292 880729000 157712.09 4.05741787 2869856.83 23.6 6951.71 1203970 155.34 37.78828715 321.118 1580110000 128169.6 163.84 76.26 98.1 39.035 0.36 856.221 136174.8 25.39 74.58 159.310562 56.84 77.09 11.92 623794000 2877947.2 1392293.46 791157 1729925.11 3.23 2035952.96 403.45 0.57 97.06 69.4 9.14 18.223 72.487 19.871 96.098 776559 19.759 2620 458.47 1.56 201.155 40.064 1181592 12.41 3.61 11.047 28.69 11909 4.656 5.71674 2599 79.94 204.16 205.872 17734.15 6104 402383.42 23921 755 734.145 10.69 48.12 996.74 32.305 17.96 30.94 48.05 998.12 36.03 5202 3054 645 163 164 5174 20511 191 35.01 10227 43.1862 10201 641 531.729 17.93413 11.14 111.247 9401.19 47.0772 5.09 3782.565125 26.64 20660.92 47.0238 37.54 3.25 50.3549 558.061 20349 22.6 42.7973 661.554 39.26 20.34 32.825 41.29 242005934470 13.54 9344.75 179.45 3540.87 4.63 981347683260 5085.52 31348.09 11360.69 1531952540600 12636880431.6 6.1 12.811 6.42 82326.54 4862.29 95.91 267.08 28.782 78929554520 7455.49 74959.3 686536891250 3190.21 10331.07 1788351156160 15.03 9.052 302154327.2 2714351.2 248008992.6 499.76 4.21 2.3 0.56 1.44 17.59 27.62 20.08 35.17 14.41 17.56 31.24 47.62 OpenBenchmarking.org
XNNPACK Model: FP32MobileNetV1 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV1 a b c 9K 18K 27K 36K 45K 24800 43522 24895 1. (CXX) g++ options: -O3 -lrt -lm
LiteRT Model: DeepLab V3 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: DeepLab V3 a b c 20K 40K 60K 80K 100K 52254.8 52731.5 91444.5
LiteRT Model: Mobilenet Quant OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Mobilenet Quant a b c 7K 14K 21K 28K 35K 20120.0 18899.6 31618.4
LiteRT Model: Mobilenet Float OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Mobilenet Float a b c 6K 12K 18K 24K 30K 20933.9 16633.9 25812.1
XNNPACK Model: FP32MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV2 a b c 10K 20K 30K 40K 50K 33117 33453 45462 1. (CXX) g++ options: -O3 -lrt -lm
miniFE Problem Size: Small OpenBenchmarking.org CG Mflops, More Is Better miniFE 2.2 Problem Size: Small a b c 15K 30K 45K 60K 75K 65310.9 67739.3 50712.3 1. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 13 - Input: Bosphorus 1080p a b c 140 280 420 560 700 551.64 480.53 636.98 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
XNNPACK Model: QS8MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: QS8MobileNetV2 a b c 13K 26K 39K 52K 65K 48535 46034 60593 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP16MobileNetV1 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV1 a b c 6K 12K 18K 24K 30K 20169 21882 26390 1. (CXX) g++ options: -O3 -lrt -lm
LiteRT Model: SqueezeNet OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: SqueezeNet a b c 8K 16K 24K 32K 40K 31218.0 30070.9 38557.3
XNNPACK Model: FP16MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV3Small a b c 10K 20K 30K 40K 50K 48621 38085 47611 1. (CXX) g++ options: -O3 -lrt -lm
NCNN Target: CPU - Model: yolov4-tiny OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: yolov4-tiny a b c 12 24 36 48 60 41.10 48.37 52.04 MIN: 40.13 / MAX: 48.66 MIN: 44.62 / MAX: 227.83 MIN: 47.36 / MAX: 54.47 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPUv2-yolov3v2-yolov3 - Model: mobilenetv2-yolov3 a b c 9 18 27 36 45 32.32 37.78 40.60 MIN: 32.09 / MAX: 37.05 MIN: 37.49 / MAX: 49.72 MIN: 40.3 / MAX: 42.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mobilenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: mobilenet a b c 9 18 27 36 45 32.32 37.78 40.60 MIN: 32.09 / MAX: 37.05 MIN: 37.49 / MAX: 49.72 MIN: 40.3 / MAX: 42.33 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: vision_transformer OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: vision_transformer a b c 20 40 60 80 100 62.70 75.85 78.66 MIN: 61.66 / MAX: 71.71 MIN: 71.67 / MAX: 100.04 MIN: 74.66 / MAX: 141.44 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: FastestDet OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: FastestDet a b c 10 20 30 40 50 33.63 37.50 42.04 MIN: 33.37 / MAX: 41.47 MIN: 36.47 / MAX: 39.01 MIN: 40.1 / MAX: 43.88 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
LiteRT Model: Quantized COCO SSD MobileNet v1 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Quantized COCO SSD MobileNet v1 a b c 7K 14K 21K 28K 35K 29419.1 27302.5 34124.3
NCNN Target: CPU - Model: vgg16 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: vgg16 a b c 16 32 48 64 80 57.29 67.68 71.29 MIN: 56.41 / MAX: 59.7 MIN: 65.28 / MAX: 71.33 MIN: 69.89 / MAX: 75.28 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: mnasnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: mnasnet a b c 6 12 18 24 30 21.21 23.79 26.06 MIN: 20.88 / MAX: 61.55 MIN: 23.02 / MAX: 25.23 MIN: 24 / MAX: 282.22 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 13 - Input: Bosphorus 4K a b c 40 80 120 160 200 154.09 187.69 184.33 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
NCNN Target: CPU - Model: squeezenet_ssd OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: squeezenet_ssd a b c 13 26 39 52 65 48.87 55.33 58.88 MIN: 48.32 / MAX: 57.97 MIN: 54.25 / MAX: 188.52 MIN: 58.23 / MAX: 62.18 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: shufflenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: shufflenet-v2 a b c 8 16 24 32 40 28.54 34.31 32.46 MIN: 28.22 / MAX: 31 MIN: 29.7 / MAX: 325.64 MIN: 30.3 / MAX: 287 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: alexnet OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: alexnet a b c 3 6 9 12 15 10.05 11.55 11.91 MIN: 9.71 / MAX: 11.49 MIN: 11.26 / MAX: 13 MIN: 11.57 / MAX: 13.56 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: googlenet OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: googlenet a b c 10 20 30 40 50 38.09 43.26 45.04 MIN: 37.88 / MAX: 45.5 MIN: 43.04 / MAX: 44.82 MIN: 44.77 / MAX: 52.98 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet50 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: resnet50 a b c 11 22 33 44 55 40.59 45.84 47.57 MIN: 40.35 / MAX: 42.4 MIN: 45.14 / MAX: 142.04 MIN: 47.3 / MAX: 49.21 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: resnet18 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: resnet18 a b c 7 14 21 28 35 23.91 26.78 27.99 MIN: 23.77 / MAX: 25.2 MIN: 26.55 / MAX: 34.81 MIN: 27.8 / MAX: 29.37 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Stockfish Chess Benchmark OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 17 Chess Benchmark a b c 120M 240M 360M 480M 600M 512969802 477502351 557325172 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -funroll-loops -msse -msse3 -mpopcnt -mavx2 -mbmi -mavx512f -mavx512bw -mavx512vnni -mavx512dq -mavx512vl -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto-partition=one -flto=jobserver
LiteRT Model: Inception V4 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Inception V4 a b c 40K 80K 120K 160K 200K 162456 189474 187012
Llama.cpp Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Text Generation 128 a b c 7 14 21 28 35 26.13 30.39 30.19 1. (CXX) g++ options: -O3
NCNN Target: CPU - Model: blazeface OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: blazeface a b c 4 8 12 16 20 15.23 16.49 17.70 MIN: 15.12 / MAX: 16.82 MIN: 16.39 / MAX: 17.8 MIN: 17.07 / MAX: 21.55 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU - Model: efficientnet-b0 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: efficientnet-b0 a b c 9 18 27 36 45 34.25 39.40 38.28 MIN: 32.27 / MAX: 339.49 MIN: 36.7 / MAX: 209.1 MIN: 37.89 / MAX: 41.58 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
NCNN Target: CPU-v3-v3 - Model: mobilenet-v3 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU-v3-v3 - Model: mobilenet-v3 a b c 7 14 21 28 35 27.96 28.66 32.02 MIN: 25.07 / MAX: 299.42 MIN: 25.96 / MAX: 214.05 MIN: 26.22 / MAX: 360.82 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
XNNPACK Model: FP16MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV3Large a b c 13K 26K 39K 52K 65K 61636 57237 54309 1. (CXX) g++ options: -O3 -lrt -lm
LiteRT Model: Inception ResNet V2 OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: Inception ResNet V2 a b c 60K 120K 180K 240K 300K 254071 258097 288314
LiteRT Model: NASNet Mobile OpenBenchmarking.org Microseconds, Fewer Is Better LiteRT 2024-10-15 Model: NASNet Mobile a b c 130K 260K 390K 520K 650K 581416 615123 543452
XNNPACK Model: FP32MobileNetV3Small OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV3Small a b c 13K 26K 39K 52K 65K 59877 53472 53286 1. (CXX) g++ options: -O3 -lrt -lm
XNNPACK Model: FP32MobileNetV3Large OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP32MobileNetV3Large a b c 14K 28K 42K 56K 70K 61147 63696 56805 1. (CXX) g++ options: -O3 -lrt -lm
NCNN Target: CPU - Model: regnety_400m OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU - Model: regnety_400m a b c 30 60 90 120 150 130.33 145.07 146.13 MIN: 129.83 / MAX: 134.5 MIN: 142 / MAX: 364.83 MIN: 144.99 / MAX: 148.12 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
Apache Cassandra Test: Writes OpenBenchmarking.org Op/s, More Is Better Apache Cassandra 5.0 Test: Writes a b c 60K 120K 180K 240K 300K 296920 283197 267019
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 8 - Input: Bosphorus 4K a b c 30 60 90 120 150 110.45 121.82 115.68 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Xcompact3d Incompact3d Input: input.i3d 129 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 129 Cells Per Direction a b c 0.3182 0.6364 0.9546 1.2728 1.591 1.29827499 1.28448296 1.41423202 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
C-Ray Resolution: 1080p - Rays Per Pixel: 16 OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 2.0 Resolution: 1080p - Rays Per Pixel: 16 a b c 1.1977 2.3954 3.5931 4.7908 5.9885 5.041 5.323 4.865 1. (CC) gcc options: -lpthread -lm
Llama.cpp Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 512 a b c 16 32 48 64 80 69.99 65.87 72.06 1. (CXX) g++ options: -O3
Llama.cpp Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 512 a b c 20 40 60 80 100 75.67 72.24 78.73 1. (CXX) g++ options: -O3
Llama.cpp Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 1024 a b c 20 40 60 80 100 74.92 79.85 73.96 1. (CXX) g++ options: -O3
Llama.cpp Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Llama-3.1-Tulu-3-8B-Q8_0 - Test: Prompt Processing 2048 a b c 20 40 60 80 100 75.93 71.43 75.24 1. (CXX) g++ options: -O3
Primesieve Length: 1e12 OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 12.6 Length: 1e12 a b c 0.2734 0.5468 0.8202 1.0936 1.367 1.147 1.215 1.145 1. (CXX) g++ options: -O3
srsRAN Project Test: PUSCH Processor Benchmark, Throughput Total OpenBenchmarking.org Mbps, More Is Better srsRAN Project 24.10 Test: PUSCH Processor Benchmark, Throughput Total a b c 2K 4K 6K 8K 10K 7635.4 8082.6 7631.4 1. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl
OpenVINO GenAI Model: Gemma-7b-int4-ov - Device: CPU OpenBenchmarking.org tokens/s, More Is Better OpenVINO GenAI 2024.5 Model: Gemma-7b-int4-ov - Device: CPU a b c 8 16 24 32 40 33.63 31.80 32.01
Llama.cpp Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 512 a b c 40 80 120 160 200 158.95 158.65 167.59 1. (CXX) g++ options: -O3
WarpX Input: Uniform Plasma OpenBenchmarking.org Seconds, Fewer Is Better WarpX 24.10 Input: Uniform Plasma a b c 9 18 27 36 45 39.29 41.44 41.03 1. (CXX) g++ options: -O3 -lm
OpenVINO GenAI Model: Falcon-7b-instruct-int4-ov - Device: CPU OpenBenchmarking.org tokens/s, More Is Better OpenVINO GenAI 2024.5 Model: Falcon-7b-instruct-int4-ov - Device: CPU a b c 11 22 33 44 55 50.58 48.00 49.80
Graph500 Scale: 26 OpenBenchmarking.org bfs max_TEPS, More Is Better Graph500 3.0 Scale: 26 a b c 400M 800M 1200M 1600M 2000M 1653580000 1641600000 1727360000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Llama.cpp Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Text Generation 128 a b c 13 26 39 52 65 54.03 54.25 56.60 1. (CXX) g++ options: -O3
XNNPACK Model: FP16MobileNetV2 OpenBenchmarking.org us, Fewer Is Better XNNPACK b7b048 Model: FP16MobileNetV2 a b c 13K 26K 39K 52K 65K 60203 62409 59616 1. (CXX) g++ options: -O3 -lrt -lm
GROMACS Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS Input: water_GMX50_bare a b c 3 6 9 12 15 12.69 12.13 12.29 1. GROMACS version: 2024.2-Ubuntu_2024.2_1
Graph500 Scale: 26 OpenBenchmarking.org sssp max_TEPS, More Is Better Graph500 3.0 Scale: 26 a b c 200M 400M 600M 800M 1000M 879749000 842062000 880729000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b c 40K 80K 120K 160K 200K 164808.53 158277.32 157712.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Xcompact3d Incompact3d Input: input.i3d 193 Cells Per Direction OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: input.i3d 193 Cells Per Direction a b c 0.9338 1.8676 2.8014 3.7352 4.669 3.97435308 4.15029907 4.05741787 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Rustls Benchmark: handshake-resume - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 OpenBenchmarking.org handshakes/s, More Is Better Rustls 0.23.17 Benchmark: handshake-resume - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 a b c 600K 1200K 1800K 2400K 3000K 2839244.84 2963208.68 2869856.83 1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs
NCNN Target: CPU-v2-v2 - Model: mobilenet-v2 OpenBenchmarking.org ms, Fewer Is Better NCNN 20241226 Target: CPU-v2-v2 - Model: mobilenet-v2 a b c 6 12 18 24 30 22.98 23.98 23.60 MIN: 22.48 / MAX: 24.91 MIN: 23.73 / MAX: 32.78 MIN: 23.33 / MAX: 31.25 1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread
OpenVINO Model: Noise Suppression Poconet-Like FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Noise Suppression Poconet-Like FP16 - Device: CPU a b c 1500 3000 4500 6000 7500 6669.51 6771.73 6951.71 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 24.05 Test: Decompression Rating a b c 300K 600K 900K 1200K 1500K 1155891 1180405 1203970 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Llama.cpp Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 2048 a b c 40 80 120 160 200 160.28 154.18 155.34 1. (CXX) g++ options: -O3
WarpX Input: Plasma Acceleration OpenBenchmarking.org Seconds, Fewer Is Better WarpX 24.10 Input: Plasma Acceleration a b c 9 18 27 36 45 37.69 39.12 37.79 1. (CXX) g++ options: -O3 -lm
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 8 - Input: Bosphorus 1080p a b c 70 140 210 280 350 316.40 328.08 321.12 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Graph500 Scale: 26 OpenBenchmarking.org bfs median_TEPS, More Is Better Graph500 3.0 Scale: 26 a b c 300M 600M 900M 1200M 1500M 1540820000 1524900000 1580110000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
srsRAN Project Test: PDSCH Processor Benchmark, Throughput Total OpenBenchmarking.org Mbps, More Is Better srsRAN Project 24.10 Test: PDSCH Processor Benchmark, Throughput Total a b c 30K 60K 90K 120K 150K 129795.7 132681.9 128169.6 1. (CXX) g++ options: -O3 -march=native -mtune=generic -fno-trapping-math -fno-math-errno -ldl
Llama.cpp Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: granite-3.0-3b-a800m-instruct-Q8_0 - Test: Prompt Processing 1024 a b c 40 80 120 160 200 158.71 159.58 163.84 1. (CXX) g++ options: -O3
Llama.cpp Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 1024 a b c 20 40 60 80 100 75.97 78.14 76.26 1. (CXX) g++ options: -O3
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 4.1 Video Input: Bosphorus 1080p a b c 20 40 60 80 100 100.86 99.95 98.10 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
SVT-AV1 Encoder Mode: Preset 5 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 5 - Input: Bosphorus 4K a b c 9 18 27 36 45 39.60 38.52 39.04 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU a b c 0.0833 0.1666 0.2499 0.3332 0.4165 0.36 0.37 0.36 MIN: 0.33 / MAX: 45.82 MIN: 0.33 / MAX: 45.02 MIN: 0.33 / MAX: 47.54 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Etcpak Benchmark: Multi-Threaded - Configuration: ETC2 OpenBenchmarking.org Mpx/s, More Is Better Etcpak 2.0 Benchmark: Multi-Threaded - Configuration: ETC2 a b c 200 400 600 800 1000 875.75 853.03 856.22 1. (CXX) g++ options: -flto -pthread
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU a b c 30K 60K 90K 120K 150K 134020.93 137459.28 136174.80 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Noise Suppression Poconet-Like FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Noise Suppression Poconet-Like FP16 - Device: CPU a b c 6 12 18 24 30 25.72 26.04 25.39 MIN: 10.36 / MAX: 59.49 MIN: 9.89 / MAX: 73.93 MIN: 10.78 / MAX: 87.72 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
RELION Test: Basic - Device: CPU OpenBenchmarking.org Seconds, Fewer Is Better RELION 5.0 Test: Basic - Device: CPU a b c 20 40 60 80 100 76.04 74.22 74.58 1. (CXX) g++ options: -fPIC -std=c++14 -fopenmp -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -ljpeg -lmpi_cxx -lmpi
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d a b c 40 80 120 160 200 160.91 163.13 159.31 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OpenVINO GenAI Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU OpenBenchmarking.org tokens/s, More Is Better OpenVINO GenAI 2024.5 Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU a b c 13 26 39 52 65 57.19 55.87 56.84
Llama.cpp Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Prompt Processing 2048 a b c 20 40 60 80 100 76.40 75.34 77.09 1. (CXX) g++ options: -O3
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.3 Blend File: Fishy Cat - Compute: CPU-Only a b c 3 6 9 12 15 11.74 12.00 11.92
Graph500 Scale: 26 OpenBenchmarking.org sssp median_TEPS, More Is Better Graph500 3.0 Scale: 26 a b c 130M 260M 390M 520M 650M 615929000 610704000 623794000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.3 Algorithm: RSA4096 a b c 600K 1200K 1800K 2400K 3000K 2833287.3 2820653.3 2877947.2 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Rustls Benchmark: handshake-ticket - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 OpenBenchmarking.org handshakes/s, More Is Better Rustls 0.23.17 Benchmark: handshake-ticket - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 a b c 300K 600K 900K 1200K 1500K 1390278.58 1365832.45 1392293.46 1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 24.05 Test: Compression Rating a b c 200K 400K 600K 800K 1000K 804522 789248 791157 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Rustls Benchmark: handshake-resume - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 OpenBenchmarking.org handshakes/s, More Is Better Rustls 0.23.17 Benchmark: handshake-resume - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 a b c 400K 800K 1200K 1600K 2000K 1698216.97 1715553.24 1729925.11 1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.3 Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only a b c 0.7403 1.4806 2.2209 2.9612 3.7015 3.29 3.25 3.23
Rustls Benchmark: handshake-ticket - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 OpenBenchmarking.org handshakes/s, More Is Better Rustls 0.23.17 Benchmark: handshake-ticket - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 a b c 400K 800K 1200K 1600K 2000K 2073385.56 2060813.40 2035952.96 1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs
Laghos Test: Sedov Blast Wave, ube_922_hex.mesh OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Sedov Blast Wave, ube_922_hex.mesh a b c 90 180 270 360 450 404.65 397.50 403.45 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU a b c 0.1283 0.2566 0.3849 0.5132 0.6415 0.56 0.57 0.57 MIN: 0.51 / MAX: 20.56 MIN: 0.51 / MAX: 45.57 MIN: 0.51 / MAX: 18.65 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
x265 Video Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 1080p a b c 20 40 60 80 100 97.18 98.77 97.06 1. x265 [info]: HEVC encoder version 3.6+1-aa7f602f7
OpenVINO GenAI Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU OpenBenchmarking.org tokens/s, More Is Better OpenVINO GenAI 2024.5 Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU a b c 15 30 45 60 75 68.29 68.45 69.40
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.3 Blend File: BMW27 - Compute: CPU-Only a b c 3 6 9 12 15 9.00 9.02 9.14
C-Ray Resolution: 4K - Rays Per Pixel: 16 OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 2.0 Resolution: 4K - Rays Per Pixel: 16 a b c 5 10 15 20 25 18.26 18.50 18.22 1. (CC) gcc options: -lpthread -lm
Build2 Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Build2 0.17 Time To Compile a b c 16 32 48 64 80 73.15 73.58 72.49
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare a b c 5 10 15 20 25 19.99 19.70 19.87 1. (CXX) g++ options: -O3 -lm
SVT-AV1 Encoder Mode: Preset 5 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 5 - Input: Bosphorus 1080p a b c 20 40 60 80 100 95.44 94.72 96.10 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Compression Rating a b c 200K 400K 600K 800K 1000K 784684 773934 776559 1. 7-Zip 24.08 (x64) : Copyright (c) 1999-2024 Igor Pavlov : 2024-08-11
Timed FFmpeg Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed FFmpeg Compilation 7.0 Time To Compile a b c 5 10 15 20 25 19.90 20.03 19.76
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b c 600 1200 1800 2400 3000 2603 2585 2620
Epoch Epoch3D Deck: Cone OpenBenchmarking.org Seconds, Fewer Is Better Epoch 4.19.4 Epoch3D Deck: Cone a b c 100 200 300 400 500 455.50 461.42 458.47 1. (F9X) gfortran options: -O3 -std=f2003 -Jobj -lsdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.3 Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only a b c 0.3533 0.7066 1.0599 1.4132 1.7665 1.57 1.55 1.56
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.2 Benchmark: particle_volume/pathtracer/real_time a b c 40 80 120 160 200 202.36 199.85 201.16
Timed Eigen Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Eigen Compilation 3.4.0 Time To Compile a b c 9 18 27 36 45 40.54 40.09 40.06
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression Test: Decompression Rating a b c 300K 600K 900K 1200K 1500K 1178953 1192798 1181592 1. 7-Zip 24.08 (x64) : Copyright (c) 1999-2024 Igor Pavlov : 2024-08-11
Blender Blend File: Junkshop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.3 Blend File: Junkshop - Compute: CPU-Only a b c 3 6 9 12 15 12.30 12.44 12.41
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.4 Encode Settings: Quality 100, Highest Compression a b c 0.8123 1.6246 2.4369 3.2492 4.0615 3.57 3.59 3.61 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
SVT-AV1 Encoder Mode: Preset 3 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 3 - Input: Bosphorus 4K a b c 3 6 9 12 15 11.02 10.93 11.05 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: defconfig a b c 7 14 21 28 35 28.84 29.00 28.69
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b c 3K 6K 9K 12K 15K 11987 12038 11909
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 500M a b c 1.0476 2.0952 3.1428 4.1904 5.238 4.607 4.656 4.656
NAMD Input: STMV with 1,066,628 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: STMV with 1,066,628 Atoms a b c 1.2863 2.5726 3.8589 5.1452 6.4315 5.65889 5.65973 5.71674
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b c 600 1200 1800 2400 3000 2573 2575 2599
Blender Blend File: Barbershop - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.3 Blend File: Barbershop - Compute: CPU-Only a b c 20 40 60 80 100 79.23 79.31 79.94
Laghos Test: Triple Point Problem OpenBenchmarking.org Major Kernels Total Rate, More Is Better Laghos 3.1 Test: Triple Point Problem a b c 40 80 120 160 200 202.39 203.72 204.16 1. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.8 Build: allmodconfig a b c 50 100 150 200 250 206.65 207.60 205.87
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Weld Porosity Detection FP16-INT8 - Device: CPU a b c 4K 8K 12K 16K 20K 17881.36 17864.80 17734.15 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b c 1300 2600 3900 5200 6500 6054 6085 6104
Rustls Benchmark: handshake - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 OpenBenchmarking.org handshakes/s, More Is Better Rustls 0.23.17 Benchmark: handshake - Suite: TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 a b c 90K 180K 270K 360K 450K 399853.90 403088.18 402383.42 1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b c 5K 10K 15K 20K 25K 23858 24050 23921
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b c 160 320 480 640 800 749 751 755
Palabos Grid Size: 100 OpenBenchmarking.org Mega Site Updates Per Second, More Is Better Palabos 2.3 Grid Size: 100 a b c 160 320 480 640 800 734.58 729.04 734.15 1. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Weld Porosity Detection FP16-INT8 - Device: CPU a b c 3 6 9 12 15 10.61 10.61 10.69 MIN: 8.51 / MAX: 44.19 MIN: 8.75 / MAX: 49.94 MIN: 8.86 / MAX: 55.26 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Person Detection FP16 - Device: CPU a b c 11 22 33 44 55 47.76 47.84 48.12 MIN: 42.27 / MAX: 95.66 MIN: 40.4 / MAX: 93.82 MIN: 43.74 / MAX: 95.68 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Person Detection FP16 - Device: CPU a b c 200 400 600 800 1000 1004.23 1002.63 996.74 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
C-Ray Resolution: 5K - Rays Per Pixel: 16 OpenBenchmarking.org Seconds, Fewer Is Better C-Ray 2.0 Resolution: 5K - Rays Per Pixel: 16 a b c 8 16 24 32 40 32.10 32.07 32.31 1. (CC) gcc options: -lpthread -lm
WebP Image Encode Encode Settings: Default OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.4 Encode Settings: Default a b c 4 8 12 16 20 17.92 18.05 17.96 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
Llama.cpp Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128 OpenBenchmarking.org Tokens Per Second, More Is Better Llama.cpp b4397 Backend: CPU BLAS - Model: Mistral-7B-Instruct-v0.3-Q8_0 - Test: Text Generation 128 a b c 7 14 21 28 35 30.83 31.05 30.94 1. (CXX) g++ options: -O3
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Machine Translation EN To DE FP16 - Device: CPU a b c 11 22 33 44 55 47.76 47.71 48.05 MIN: 36.58 / MAX: 94.91 MIN: 39.39 / MAX: 104.16 MIN: 35.89 / MAX: 151.58 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Machine Translation EN To DE FP16 - Device: CPU a b c 200 400 600 800 1000 1004.04 1005.17 998.12 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 4.1 Video Input: Bosphorus 4K a b c 8 16 24 32 40 35.86 35.78 36.03 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b c 1100 2200 3300 4400 5500 5169 5196 5202
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b c 700 1400 2100 2800 3500 3042 3035 3054
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b c 140 280 420 560 700 642 641 645
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b c 40 80 120 160 200 162 163 163
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b c 40 80 120 160 200 164 163 164
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b c 1100 2200 3300 4400 5500 5158 5143 5174
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b c 4K 8K 12K 16K 20K 20412 20522 20511
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b c 40 80 120 160 200 190 190 191
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 Video Input: Bosphorus 4K a b c 8 16 24 32 40 34.90 34.83 35.01 1. x265 [info]: HEVC encoder version 3.6+1-aa7f602f7
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b c 2K 4K 6K 8K 10K 10279 10233 10227
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.2 Benchmark: gravity_spheres_volume/dim_512/ao/real_time a b c 10 20 30 40 50 43.39 43.40 43.19
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b c 2K 4K 6K 8K 10K 10152 10163 10201
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b c 140 280 420 560 700 638 640 641
Palabos Grid Size: 400 OpenBenchmarking.org Mega Site Updates Per Second, More Is Better Palabos 2.3 Grid Size: 400 a b c 110 220 330 440 550 530.00 529.25 531.73 1. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm
NAMD Input: ATPase with 327,506 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0 Input: ATPase with 327,506 Atoms a b c 4 8 12 16 20 17.85 17.86 17.93
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.4 Encode Settings: Quality 100 a b c 3 6 9 12 15 11.18 11.13 11.14 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 21.7.2 Time To Compile a b c 30 60 90 120 150 111.74 111.63 111.25
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Person Vehicle Bike Detection FP16 - Device: CPU a b c 2K 4K 6K 8K 10K 9438.66 9405.62 9401.19 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.2 Benchmark: particle_volume/ao/real_time a b c 11 22 33 44 55 46.89 46.99 47.08
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Person Vehicle Bike Detection FP16 - Device: CPU a b c 1.1453 2.2906 3.4359 4.5812 5.7265 5.07 5.09 5.09 MIN: 4.31 / MAX: 24 MIN: 4.4 / MAX: 27.98 MIN: 4.35 / MAX: 26.38 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate a b c 800 1600 2400 3200 4000 3788.09 3796.86 3782.57 1. (CC) gcc options: -ffast-math -mavx2 -O3 -fopenmp -lopenblas
Blender Blend File: Pabellon Barcelona - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.3 Blend File: Pabellon Barcelona - Compute: CPU-Only a b c 6 12 18 24 30 26.74 26.65 26.64
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Face Detection Retail FP16 - Device: CPU a b c 4K 8K 12K 16K 20K 20737.16 20684.11 20660.92 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.2 Benchmark: particle_volume/scivis/real_time a b c 11 22 33 44 55 46.85 46.95 47.02
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b c 9 18 27 36 45 37.62 37.50 37.54 MIN: 31.29 / MAX: 56.35 MIN: 29.82 / MAX: 58.47 MIN: 32.61 / MAX: 58.23 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.3 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only a b c 0.7335 1.467 2.2005 2.934 3.6675 3.25 3.26 3.25
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.2 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time a b c 11 22 33 44 55 50.41 50.51 50.35
Palabos Grid Size: 500 OpenBenchmarking.org Mega Site Updates Per Second, More Is Better Palabos 2.3 Grid Size: 500 a b c 120 240 360 480 600 556.41 556.41 558.06 1. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b c 4K 8K 12K 16K 20K 20328 20294 20349
Blender Blend File: Classroom - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 4.3 Blend File: Classroom - Compute: CPU-Only a b c 5 10 15 20 25 22.54 22.59 22.60
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.2 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time a b c 10 20 30 40 50 42.91 42.87 42.80
Palabos Grid Size: 1000 OpenBenchmarking.org Mega Site Updates Per Second, More Is Better Palabos 2.3 Grid Size: 1000 a b c 140 280 420 560 700 659.94 659.84 661.55 1. (CXX) g++ options: -std=c++17 -pedantic -O3 -rdynamic -lcrypto -lcurl -lsz -lz -ldl -lm
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Handwritten English Recognition FP16 - Device: CPU a b c 9 18 27 36 45 39.29 39.36 39.26 MIN: 32.59 / MAX: 69.46 MIN: 33.81 / MAX: 75.48 MIN: 31.98 / MAX: 70.5 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Weld Porosity Detection FP16 - Device: CPU a b c 5 10 15 20 25 20.30 20.29 20.34 MIN: 18.42 / MAX: 53.61 MIN: 17.92 / MAX: 98.47 MIN: 17.23 / MAX: 77.56 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Y-Cruncher Pi Digits To Calculate: 5B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 5B a b c 8 16 24 32 40 32.81 32.89 32.83
Timed PHP Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed PHP Compilation 8.3.4 Time To Compile a b c 9 18 27 36 45 41.19 41.22 41.29
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: SHA256 a b c 50000M 100000M 150000M 200000M 250000M 242548530680 242399788540 242005934470 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b c 3 6 9 12 15 13.51 13.54 13.54 MIN: 11.72 / MAX: 45 MIN: 12.3 / MAX: 45.77 MIN: 11.78 / MAX: 41.47 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Weld Porosity Detection FP16 - Device: CPU a b c 2K 4K 6K 8K 10K 9365.24 9360.81 9344.75 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Face Detection FP16-INT8 - Device: CPU a b c 40 80 120 160 200 179.06 179.29 179.45 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Road Segmentation ADAS FP16-INT8 - Device: CPU a b c 800 1600 2400 3200 4000 3548.57 3541.66 3540.87 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Vehicle Detection FP16-INT8 - Device: CPU a b c 1.0418 2.0836 3.1254 4.1672 5.209 4.62 4.63 4.63 MIN: 3.61 / MAX: 26.86 MIN: 3.64 / MAX: 25.83 MIN: 3.65 / MAX: 27.38 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: ChaCha20 a b c 200000M 400000M 600000M 800000M 1000000M 983205666540 983433981080 981347683260 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Handwritten English Recognition FP16-INT8 - Device: CPU a b c 1100 2200 3300 4400 5500 5096.13 5092.60 5085.52 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Face Detection Retail FP16-INT8 - Device: CPU a b c 7K 14K 21K 28K 35K 31408.22 31395.35 31348.09 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Person Re-Identification Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Person Re-Identification Retail FP16 - Device: CPU a b c 2K 4K 6K 8K 10K 11355.93 11377.20 11360.69 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: AES-256-GCM a b c 300000M 600000M 900000M 1200000M 1500000M 1529363548300 1529285362960 1531952540600 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
BYTE Unix Benchmark Computational Test: Dhrystone 2 OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: Dhrystone 2 a b c 3000M 6000M 9000M 12000M 15000M 12634644588.3 12655563306.7 12636880431.6 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Face Detection Retail FP16-INT8 - Device: CPU a b c 2 4 6 8 10 6.09 6.09 6.10 MIN: 5.01 / MAX: 32 MIN: 4.9 / MAX: 31.14 MIN: 4.96 / MAX: 28.29 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Primesieve Length: 1e13 OpenBenchmarking.org Seconds, Fewer Is Better Primesieve 12.6 Length: 1e13 a b c 3 6 9 12 15 12.83 12.81 12.81 1. (CXX) g++ options: -O3
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Vehicle Detection FP16 - Device: CPU a b c 2 4 6 8 10 6.42 6.41 6.42 MIN: 4.91 / MAX: 29.23 MIN: 4.86 / MAX: 30.37 MIN: 5.39 / MAX: 28.39 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Rustls Benchmark: handshake - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 OpenBenchmarking.org handshakes/s, More Is Better Rustls 0.23.17 Benchmark: handshake - Suite: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 a b c 20K 40K 60K 80K 100K 82349.22 82232.23 82326.54 1. (CC) gcc options: -m64 -lgcc_s -lutil -lrt -lpthread -lm -ldl -lc -pie -nodefaultlibs
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Handwritten English Recognition FP16 - Device: CPU a b c 1000 2000 3000 4000 5000 4858.25 4855.51 4862.29 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Face Detection FP16 - Device: CPU a b c 20 40 60 80 100 95.79 95.78 95.91 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Face Detection FP16-INT8 - Device: CPU a b c 60 120 180 240 300 267.41 267.41 267.08 MIN: 216.32 / MAX: 362.62 MIN: 244.77 / MAX: 328.68 MIN: 243.4 / MAX: 286.84 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
SVT-AV1 Encoder Mode: Preset 3 - Input: Bosphorus 1080p OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 2.3 Encoder Mode: Preset 3 - Input: Bosphorus 1080p a b c 7 14 21 28 35 28.76 28.79 28.78 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: SHA512 a b c 20000M 40000M 60000M 80000M 100000M 78840468670 78904969300 78929554520 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Vehicle Detection FP16 - Device: CPU a b c 1600 3200 4800 6400 8000 7461.75 7462.48 7455.49 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.3 Algorithm: RSA4096 a b c 16K 32K 48K 64K 80K 74941.9 75010.2 74959.3 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: ChaCha20-Poly1305 a b c 150000M 300000M 450000M 600000M 750000M 686377188830 685952939270 686536891250 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Road Segmentation ADAS FP16 - Device: CPU a b c 700 1400 2100 2800 3500 3190.93 3188.25 3190.21 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2024.5 Model: Vehicle Detection FP16-INT8 - Device: CPU a b c 2K 4K 6K 8K 10K 10334.31 10339.33 10331.07 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.3 Algorithm: AES-128-GCM a b c 400000M 800000M 1200000M 1600000M 2000000M 1787346309360 1788600410900 1788351156160 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Road Segmentation ADAS FP16 - Device: CPU a b c 4 8 12 16 20 15.03 15.04 15.03 MIN: 13.37 / MAX: 48.6 MIN: 13.62 / MAX: 54.33 MIN: 13.1 / MAX: 58.58 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
Y-Cruncher Pi Digits To Calculate: 1B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.5 Pi Digits To Calculate: 1B a b c 3 6 9 12 15 9.057 9.057 9.052
BYTE Unix Benchmark Computational Test: Pipe OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: Pipe a b c 60M 120M 180M 240M 300M 302008825.4 302075584.9 302154327.2 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
BYTE Unix Benchmark Computational Test: Whetstone Double OpenBenchmarking.org MWIPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: Whetstone Double a b c 600K 1200K 1800K 2400K 3000K 2713226.7 2714473.2 2714351.2 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
BYTE Unix Benchmark Computational Test: System Call OpenBenchmarking.org LPS, More Is Better BYTE Unix Benchmark 5.1.3-git Computational Test: System Call a b c 50M 100M 150M 200M 250M 247897646.3 247922303.2 248008992.6 1. (CC) gcc options: -pedantic -O3 -ffast-math -march=native -mtune=native -lm
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Face Detection FP16 - Device: CPU a b c 110 220 330 440 550 499.71 499.92 499.76 MIN: 449.21 / MAX: 571.54 MIN: 479.24 / MAX: 552.53 MIN: 466.23 / MAX: 522.51 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Person Re-Identification Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Person Re-Identification Retail FP16 - Device: CPU a b c 0.9473 1.8946 2.8419 3.7892 4.7365 4.21 4.21 4.21 MIN: 3.52 / MAX: 21.19 MIN: 3.83 / MAX: 23.73 MIN: 3.57 / MAX: 20.4 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2024.5 Model: Face Detection Retail FP16 - Device: CPU a b c 0.5175 1.035 1.5525 2.07 2.5875 2.3 2.3 2.3 MIN: 1.81 / MAX: 19.11 MIN: 1.94 / MAX: 25.03 MIN: 1.84 / MAX: 19.42 1. (CXX) g++ options: -fPIC -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -shared -ldl -lstdc++fs
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.4 Encode Settings: Quality 100, Lossless, Highest Compression a b c 0.126 0.252 0.378 0.504 0.63 0.56 0.56 0.56 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.4 Encode Settings: Quality 100, Lossless a b c 0.324 0.648 0.972 1.296 1.62 1.44 1.44 1.44 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
OpenVINO GenAI Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU - Time Per Output Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU - Time Per Output Token a b c 4 8 12 16 20 17.49 17.90 17.59
OpenVINO GenAI Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU - Time To First Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: Phi-3-mini-128k-instruct-int4-ov - Device: CPU - Time To First Token a b c 7 14 21 28 35 27.52 27.83 27.62
OpenVINO GenAI Model: Falcon-7b-instruct-int4-ov - Device: CPU - Time Per Output Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: Falcon-7b-instruct-int4-ov - Device: CPU - Time Per Output Token a b c 5 10 15 20 25 19.77 20.84 20.08
OpenVINO GenAI Model: Falcon-7b-instruct-int4-ov - Device: CPU - Time To First Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: Falcon-7b-instruct-int4-ov - Device: CPU - Time To First Token a b c 8 16 24 32 40 34.43 35.01 35.17
OpenVINO GenAI Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time Per Output Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time Per Output Token a b c 4 8 12 16 20 14.64 14.61 14.41
OpenVINO GenAI Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time To First Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: TinyLlama-1.1B-Chat-v1.0 - Device: CPU - Time To First Token a b c 4 8 12 16 20 17.76 17.54 17.56
OpenVINO GenAI Model: Gemma-7b-int4-ov - Device: CPU - Time Per Output Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: Gemma-7b-int4-ov - Device: CPU - Time Per Output Token a b c 7 14 21 28 35 29.74 31.44 31.24
OpenVINO GenAI Model: Gemma-7b-int4-ov - Device: CPU - Time To First Token OpenBenchmarking.org ms, Fewer Is Better OpenVINO GenAI 2024.5 Model: Gemma-7b-int4-ov - Device: CPU - Time To First Token a b c 11 22 33 44 55 46.08 48.41 47.62
Phoronix Test Suite v10.8.5