Benchmarks for a future article.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2403039-NE-2403025NE90 Linux Distros Emerald Rapids - Phoronix Test Suite Linux Distros Emerald Rapids Benchmarks for a future article.
HTML result view exported from: https://openbenchmarking.org/result/2403039-NE-2403025NE90&grs&sro .
Linux Distros Emerald Rapids Processor Motherboard Chipset Memory Disk Graphics Network OS Kernel Compiler File-System Screen Resolution Desktop Display Server Ubuntu Linux 23.10 CentOS Stream 9 Fedora Server 39 Arch Linux 2 x INTEL XEON PLATINUM 8592+ @ 3.90GHz (128 Cores / 256 Threads) Quanta Cloud QuantaGrid D54Q-2U S6Q-MB-MPS (3B05.TEL4P1 BIOS) Intel Device 1bce 1008GB 3201GB Micron_7450_MTFDKCB3T2TFS ASPEED 2 x Intel X710 for 10GBASE-T Ubuntu 23.10 6.5.0-17-generic (x86_64) GCC 13.2.0 ext4 1024x768 3201GB Micron_7450_MTFDKCB3T2TFS + 257GB Flash Drive CentOS Stream 9 5.14.0-419.el9.x86_64 (x86_64) GNOME Shell 40.10 X Server GCC 11.4.1 20231218 xfs 1920x1200 Fedora Linux 39 6.7.6-200.fc39.x86_64 (x86_64) GCC 13.2.1 20231205 1024x768 Arch Linux 6.7.6-arch1-2 (x86_64) GCC 13.2.1 20230801 btrfs 1920x1200 OpenBenchmarking.org Kernel Details - Ubuntu Linux 23.10: Transparent Huge Pages: madvise - CentOS Stream 9: Transparent Huge Pages: always - Fedora Server 39: Transparent Huge Pages: madvise - Arch Linux: Transparent Huge Pages: always Compiler Details - Ubuntu Linux 23.10: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - CentOS Stream 9: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl - Fedora Server 39: --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-initfini-array --enable-languages=c,c++,fortran,objc,obj-c++,ada,go,d,m2,lto --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-multilib --enable-offload-defaulted --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=i686 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-libstdcxx-zoneinfo=/usr/share/zoneinfo --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver - Arch Linux: --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,m2,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu Processor Details - Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x21000161 Python Details - Ubuntu Linux 23.10: Python 3.11.6 - CentOS Stream 9: Python 3.9.18 - Fedora Server 39: Python 3.12.2 - Arch Linux: Python 3.11.7 Security Details - Ubuntu Linux 23.10: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected - CentOS Stream 9: SELinux + gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected - Fedora Server 39: SELinux + gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected - Arch Linux: gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Linux Distros Emerald Rapids gpaw: Carbon Nanotube memcached: 1:10 compress-7zip: Compression Rating embree: Pathtracer ISPC - Crown cloverleaf: clover_bm16 cloverleaf: clover_bm64_short rocksdb: Read Rand Write Rand graph500: 26 rawtherapee: Total Benchmark Time graph500: 26 openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU ospray: gravity_spheres_volume/dim_512/ao/real_time oidn: RTLightmap.hdr.4096x4096 - CPU-Only y-cruncher: 1B clickhouse: 100M Rows Hits Dataset, First Run / Cold Cache rocksdb: Read While Writing openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU y-cruncher: 5B oidn: RT.hdr_alb_nrm.3840x2160 - CPU-Only lammps: 20k Atoms ospray: gravity_spheres_volume/dim_512/scivis/real_time incompact3d: X3D-benchmarking input.i3d clickhouse: 100M Rows Hits Dataset, Second Run clickhouse: 100M Rows Hits Dataset, Third Run svt-av1: Preset 12 - Bosphorus 4K quicksilver: CORAL2 P2 svt-av1: Preset 13 - Bosphorus 4K ospray-studio: 3 - 4K - 32 - Path Tracer - CPU graph500: 26 rocksdb: Rand Read openvino: Weld Porosity Detection FP16-INT8 - CPU graph500: 26 oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU ospray-studio: 3 - 4K - 16 - Path Tracer - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU vvenc: Bosphorus 4K - Faster ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU tensorflow: CPU - 512 - ResNet-50 ospray-studio: 1 - 4K - 16 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/pathtracer/real_time ospray-studio: 2 - 4K - 16 - Path Tracer - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray-studio: 2 - 4K - 1 - Path Tracer - CPU ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU quicksilver: CTS2 openvkl: vklBenchmarkCPU ISPC ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU speedb: Rand Read deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU svt-av1: Preset 8 - Bosphorus 4K deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU vvenc: Bosphorus 4K - Fast ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU svt-av1: Preset 4 - Bosphorus 4K deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream rocksdb: Update Rand ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU openvino: Face Detection FP16-INT8 - CPU deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream deepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream openvino: Face Detection FP16-INT8 - CPU deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream deepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Stream hpcg: 104 104 104 - 60 deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream hpcg: 144 144 144 - 60 ospray: particle_volume/pathtracer/real_time deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream ospray: particle_volume/scivis/real_time openvino: Person Vehicle Bike Detection FP16 - CPU deepsparse: ResNet-50, Baseline - Asynchronous Multi-Stream openvino: Person Vehicle Bike Detection FP16 - CPU deepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Stream ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream openvino: Face Detection Retail FP16-INT8 - CPU deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream ospray: particle_volume/ao/real_time openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream deepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream deepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream nwchem: C240 Buckyball lczero: Eigen ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU pgbench: 100 - 1000 - Read Write - Average Latency pgbench: 100 - 1000 - Read Write pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Only gromacs: MPI CPU - water_GMX50_bare redis: SET - 500 redis: GET - 500 memcached: 1:100 compress-7zip: Decompression Rating mt-dgemm: Sustained Floating-Point Rate embree: Pathtracer ISPC - Asian Dragon easywave: e2Asean Grid + BengkuluSept2007 Source - 2400 lammps: Rhodopsin Protein namd: STMV with 1,066,628 Atoms namd: ATPase with 327,506 Atoms quicksilver: CORAL2 P1 lczero: BLAS Ubuntu Linux 23.10 CentOS Stream 9 Fedora Server 39 Arch Linux 38.637 3370014.38 464813 100.3399 279.21 32.10 955461 1226550000 129.079 1289580000 85.06 375.75 28.6088 2.18 7.207 205.20 9775989 61020.16 29.821 4.53 56.683 28.9162 169.178579 214.31 215.62 66.857 6744778 66.430 37285 483534000 599822046 43516.98 664991000 4.54 0.80 14508 2.85 5.611 7321 73.12 12086 32.3212 12243 1079 911 4353 7680222 2821 914 233 600703384 176.5691 3694 21.317 361.2877 276 2.785 234 2.307 1866.2259 34.2545 125560 3718 519.83 34.9884 1826.0447 245.98 155.8178 6448 39.76 3217.84 4547.5395 409.0089 71.1371 14.0573 69.3362 82.9700 11099.6440 1837.1113 20.7725 10033.95 34.7603 12.74 5.7515 6324 2377.90 53.79 133.2063 132.6789 24570.10 477.1362 20.6546 8901.61 14.37 5.20 478.7631 829.5063 77.0180 853.2079 74.8838 1223.1639 52.2774 1750.3 380 33174 32279 27.93 1143.25 25.616 39542 1.489 679317 9.630 2134872.89 1541713.80 3477137.28 455056 37.971018 144.9195 177.754 37.518 1.49004 4.70711 5931500 15 3337339.10 751070 105.6194 314.49 25.01 1464120000 1499600000 97.34 328.37 29.7626 2.51 7.913 183.62 60429.54 32.850 5.04 57.836 29.5104 184.192332 200.18 201.05 69.868 6381000 69.718 466398000 40187.34 640794000 4.90 0.83 3.02 5.808 76.68 33.2938 7398333 2815 173.9515 21.560 366.3831 2.823 2.358 1810.9508 35.2846 528.79 34.2262 1866.5153 241.67 152.5678 40.02 3196.47 4462.5255 416.6816 71.1332 14.3182 69.1728 83.5652 11022.3020 1863.3429 20.4427 10113.47 34.2906 12.65 5.7833 2412.51 53.02 133.2818 132.9101 24295.71 476.5785 20.5021 8908.74 14.36 5.25 478.0315 827.9914 77.1671 853.3374 74.8038 1223.6470 52.2360 52.09 617.45 5.939 1978937.18 1821521.39 2615816.73 434200 63.635840 177.7960 205.005 39.795 0.50251 3.20328 5800833 194.508 5054518.76 496138 142.1675 255.39 31.63 1184837 1195510000 118.228 1253250000 93.74 340.97 28.6637 2.25 8.270 188.71 8567006 56624.98 33.734 4.68 57.677 28.430 168.671819 202.14 206.83 71.518 6759600 70.874 34698 493031000 652854125 42419.89 663412000 4.70 0.83 14422 2.86 5.804 7424 12187 31.6999 12262 1058 897 4377 7367500 2821 902 233 626261826 3655 21.161 276 2.834 235 2.346 123092 3683 529.14 241.70 6314 39.30 3255.16 72.4575 70.3805 82.1380 20.5419 10082.93 12.69 6303 2383.22 53.67 24296.54 20.4442 8862.07 14.43 5.25 27524 28035 32.67 978.70 7.180 2125843.97 1838671.43 8983240.52 428601 37.720076 179.5403 156.237 39.009 1.92977 5.40706 5835818 5462773.67 759279 118.7400 356.23 30.30 932034 1275690000 107.048 1323480000 81.35 392.84 34.0487 2.36 7.561 209.64 9385652 64056.00 33.652 4.75 62.990 31.5793 187.266596 221.80 221.71 73.717 7013889 72.906 33976 509964000 608345894 41472.25 692893000 4.75 0.77 13639 2.94 5.942 7043 77.06 11575 33.3694 11658 1026 867 4173 7723667 2951 874 223 616573385 181.3330 3544 22.046 351.7460 265 2.900 226 2.395 1878.4478 34.0360 127451 3591 531.53 34.7428 1839.9833 240.65 154.3216 6380 40.11 3189.70 4534.8946 413.3838 71.1292 14.0959 70.1210 82.5113 11213.1629 1868.7705 20.7789 10197.22 34.2134 12.54 5.6953 6399 2409.76 53.08 134.9333 134.2065 24355.33 471.8157 20.6673 8956.37 14.28 5.24 474.2419 834.1920 76.6571 857.8494 74.5256 1227.3743 52.1246 23687 23884 29.23 1092.75 54.085 18536 1.440 698734 13.638 2231227.61 1777534.65 9633352.44 482076 43.669234 148.6022 145.713 42.751 1.74784 5.59956 5865500 OpenBenchmarking.org
GPAW Input: Carbon Nanotube OpenBenchmarking.org Seconds, Fewer Is Better GPAW 23.6 Input: Carbon Nanotube Fedora Server 39 Ubuntu Linux 23.10 40 80 120 160 200 SE +/- 0.94, N = 3 SE +/- 0.35, N = 15 194.51 38.64 -fno-strict-overflow -fcf-protection -fexceptions -fPIC -UNDEBUG -std=c99 -fwrapv -O2 1. (CC) gcc options: -shared -lxc -lblas -lmpi
Memcached Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.2M 2.4M 3.6M 4.8M 6M SE +/- 45099.83, N = 8 SE +/- 43085.73, N = 13 SE +/- 65254.44, N = 3 SE +/- 36897.17, N = 5 5462773.67 3337339.10 5054518.76 3370014.38 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 160K 320K 480K 640K 800K SE +/- 9120.21, N = 15 SE +/- 5583.39, N = 15 SE +/- 4508.27, N = 7 SE +/- 4291.24, N = 3 759279 751070 496138 464813 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 30 60 90 120 150 SE +/- 0.13, N = 3 SE +/- 1.24, N = 15 SE +/- 1.32, N = 15 SE +/- 0.25, N = 3 118.74 105.62 142.17 100.34 MIN: 113.06 / MAX: 135.85 MIN: 95.01 / MAX: 132.75 MIN: 115.81 / MAX: 164.58 MIN: 95.92 / MAX: 109.46
CloverLeaf Input: clover_bm16 OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm16 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 80 160 240 320 400 SE +/- 4.10, N = 3 SE +/- 1.51, N = 3 SE +/- 1.01, N = 3 SE +/- 3.60, N = 3 356.23 314.49 255.39 279.21 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
CloverLeaf Input: clover_bm64_short OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm64_short Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 7 14 21 28 35 SE +/- 0.27, N = 3 SE +/- 0.12, N = 3 SE +/- 0.40, N = 3 SE +/- 0.23, N = 11 30.30 25.01 31.63 32.10 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
RocksDB Test: Read Random Write Random OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read Random Write Random Arch Linux Fedora Server 39 Ubuntu Linux 23.10 300K 600K 900K 1200K 1500K SE +/- 9325.25, N = 15 SE +/- 13512.70, N = 15 SE +/- 5328.60, N = 3 932034 1184837 955461 -lpthread 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
Graph500 Scale: 26 OpenBenchmarking.org bfs median_TEPS, More Is Better Graph500 3.0 Scale: 26 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 300M 600M 900M 1200M 1500M 1275690000 1464120000 1195510000 1226550000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
RawTherapee Total Benchmark Time OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time Arch Linux Fedora Server 39 Ubuntu Linux 23.10 30 60 90 120 150 SE +/- 0.34, N = 3 SE +/- 0.45, N = 3 SE +/- 0.65, N = 3 107.05 118.23 129.08 1. Arch Linux: RawTherapee, version 5.10, command line. 2. Fedora Server 39: RawTherapee, version 5.10, command line. 3. Ubuntu Linux 23.10: RawTherapee, version 5.9, command line.
Graph500 Scale: 26 OpenBenchmarking.org bfs max_TEPS, More Is Better Graph500 3.0 Scale: 26 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 300M 600M 900M 1200M 1500M 1323480000 1499600000 1253250000 1289580000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 20 40 60 80 100 SE +/- 0.10, N = 3 SE +/- 0.81, N = 3 SE +/- 0.18, N = 3 SE +/- 0.11, N = 3 81.35 97.34 93.74 85.06 -pie - MIN: 38.9 / MAX: 148.69 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 33.82 / MAX: 325.84 -pie - MIN: 34.75 / MAX: 283.24 -pie - MIN: 39.38 / MAX: 260.96 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 90 180 270 360 450 SE +/- 0.49, N = 3 SE +/- 2.69, N = 3 SE +/- 0.63, N = 3 SE +/- 0.48, N = 3 392.84 328.37 340.97 375.75 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/ao/real_time Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 8 16 24 32 40 SE +/- 0.29, N = 3 SE +/- 0.26, N = 15 SE +/- 0.10, N = 3 SE +/- 0.05, N = 3 34.05 29.76 28.66 28.61
Intel Open Image Denoise Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.2 Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 0.5648 1.1296 1.6944 2.2592 2.824 SE +/- 0.03, N = 3 SE +/- 0.03, N = 4 SE +/- 0.01, N = 15 SE +/- 0.01, N = 3 2.36 2.51 2.25 2.18
Y-Cruncher Pi Digits To Calculate: 1B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.3 Pi Digits To Calculate: 1B Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 2 4 6 8 10 SE +/- 0.109, N = 3 SE +/- 0.045, N = 3 SE +/- 0.088, N = 5 SE +/- 0.037, N = 3 7.561 7.913 8.270 7.207
ClickHouse 100M Rows Hits Dataset, First Run / Cold Cache OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, First Run / Cold Cache Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 50 100 150 200 250 SE +/- 2.35, N = 3 SE +/- 0.84, N = 3 SE +/- 2.54, N = 3 SE +/- 2.70, N = 3 209.64 183.62 188.71 205.20 MIN: 32.84 / MAX: 1764.71 MIN: 34.56 / MAX: 1714.29 MIN: 30.08 / MAX: 1428.57 MIN: 35.4 / MAX: 1818.18
RocksDB Test: Read While Writing OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Read While Writing Arch Linux Fedora Server 39 Ubuntu Linux 23.10 2M 4M 6M 8M 10M SE +/- 87416.61, N = 3 SE +/- 133209.05, N = 14 SE +/- 81611.31, N = 8 9385652 8567006 9775989 -lpthread 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 14K 28K 42K 56K 70K SE +/- 941.42, N = 15 SE +/- 478.67, N = 3 SE +/- 271.02, N = 3 SE +/- 827.51, N = 15 64056.00 60429.54 56624.98 61020.16 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Y-Cruncher Pi Digits To Calculate: 5B OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.8.3 Pi Digits To Calculate: 5B Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 8 16 24 32 40 SE +/- 0.37, N = 5 SE +/- 0.17, N = 3 SE +/- 0.43, N = 3 SE +/- 0.07, N = 3 33.65 32.85 33.73 29.82
Intel Open Image Denoise Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.2 Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.134 2.268 3.402 4.536 5.67 SE +/- 0.01, N = 3 SE +/- 0.07, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 4.75 5.04 4.68 4.53
LAMMPS Molecular Dynamics Simulator Model: 20k Atoms OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: 20k Atoms Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 14 28 42 56 70 SE +/- 0.44, N = 3 SE +/- 0.46, N = 3 SE +/- 0.40, N = 3 SE +/- 0.42, N = 3 62.99 57.84 57.68 56.68 1. (CXX) g++ options: -O3 -lm -ldl
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 7 14 21 28 35 SE +/- 0.48, N = 15 SE +/- 0.17, N = 3 SE +/- 0.17, N = 3 SE +/- 0.11, N = 3 31.58 29.51 28.43 28.92
Xcompact3d Incompact3d Input: X3D-benchmarking input.i3d OpenBenchmarking.org Seconds, Fewer Is Better Xcompact3d Incompact3d 2021-03-11 Input: X3D-benchmarking input.i3d Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 40 80 120 160 200 SE +/- 0.25, N = 3 SE +/- 0.56, N = 3 SE +/- 0.67, N = 3 SE +/- 0.44, N = 3 187.27 184.19 168.67 169.18 -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi
ClickHouse 100M Rows Hits Dataset, Second Run OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Second Run Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 50 100 150 200 250 SE +/- 1.76, N = 3 SE +/- 3.30, N = 3 SE +/- 1.90, N = 3 SE +/- 1.67, N = 3 221.80 200.18 202.14 214.31 MIN: 36.97 / MAX: 1875 MIN: 37.41 / MAX: 2000 MIN: 36.41 / MAX: 1463.41 MIN: 36.39 / MAX: 1764.71
ClickHouse 100M Rows Hits Dataset, Third Run OpenBenchmarking.org Queries Per Minute, Geo Mean, More Is Better ClickHouse 22.12.3.5 100M Rows Hits Dataset, Third Run Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 50 100 150 200 250 SE +/- 0.45, N = 3 SE +/- 2.16, N = 3 SE +/- 3.01, N = 3 SE +/- 3.81, N = 3 221.71 201.05 206.83 215.62 MIN: 35.63 / MAX: 2068.97 MIN: 36.08 / MAX: 1500 MIN: 35.91 / MAX: 1935.48 MIN: 37.43 / MAX: 1714.29
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 12 - Input: Bosphorus 4K Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 16 32 48 64 80 SE +/- 0.15, N = 3 SE +/- 0.39, N = 3 SE +/- 0.99, N = 3 SE +/- 0.48, N = 15 73.72 69.87 71.52 66.86 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Quicksilver Input: CORAL2 P2 OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P2 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.5M 3M 4.5M 6M 7.5M SE +/- 67285.55, N = 9 SE +/- 51868.42, N = 3 SE +/- 74346.22, N = 5 SE +/- 81512.06, N = 9 7013889 6381000 6759600 6744778 1. (CXX) g++ options: -fopenmp -O3 -march=native
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 13 - Input: Bosphorus 4K Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 16 32 48 64 80 SE +/- 0.36, N = 3 SE +/- 0.99, N = 3 SE +/- 0.53, N = 3 SE +/- 0.74, N = 5 72.91 69.72 70.87 66.43 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 8K 16K 24K 32K 40K SE +/- 138.45, N = 3 SE +/- 119.27, N = 3 SE +/- 564.37, N = 15 33976 34698 37285
Graph500 Scale: 26 OpenBenchmarking.org sssp median_TEPS, More Is Better Graph500 3.0 Scale: 26 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 110M 220M 330M 440M 550M 509964000 466398000 493031000 483534000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
RocksDB Test: Random Read OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Random Read Arch Linux Fedora Server 39 Ubuntu Linux 23.10 140M 280M 420M 560M 700M SE +/- 7708164.70, N = 3 SE +/- 3865049.53, N = 3 SE +/- 7527805.83, N = 15 608345894 652854125 599822046 -lpthread 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 9K 18K 27K 36K 45K SE +/- 312.18, N = 3 SE +/- 347.22, N = 3 SE +/- 194.22, N = 3 SE +/- 43.54, N = 3 41472.25 40187.34 42419.89 43516.98 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Graph500 Scale: 26 OpenBenchmarking.org sssp max_TEPS, More Is Better Graph500 3.0 Scale: 26 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 150M 300M 450M 600M 750M 692893000 640794000 663412000 664991000 1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.2 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.1025 2.205 3.3075 4.41 5.5125 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 SE +/- 0.03, N = 15 SE +/- 0.06, N = 3 4.75 4.90 4.70 4.54
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 0.1868 0.3736 0.5604 0.7472 0.934 SE +/- 0.01, N = 15 SE +/- 0.00, N = 3 SE +/- 0.00, N = 3 SE +/- 0.00, N = 15 0.77 0.83 0.83 0.80 -pie - MIN: 0.19 / MAX: 28.17 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 0.21 / MAX: 20.36 -pie - MIN: 0.2 / MAX: 31.65 -pie - MIN: 0.2 / MAX: 28.24 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 3K 6K 9K 12K 15K SE +/- 37.30, N = 3 SE +/- 37.95, N = 3 SE +/- 177.62, N = 4 13639 14422 14508
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 0.6795 1.359 2.0385 2.718 3.3975 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.01, N = 3 SE +/- 0.00, N = 3 2.94 3.02 2.86 2.85 -pie - MIN: 2.09 / MAX: 30.51 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 2.07 / MAX: 29.73 -pie - MIN: 1.94 / MAX: 41.9 -pie - MIN: 2.08 / MAX: 32.04 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Faster Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.337 2.674 4.011 5.348 6.685 SE +/- 0.050, N = 3 SE +/- 0.013, N = 3 SE +/- 0.035, N = 3 SE +/- 0.061, N = 4 5.942 5.808 5.804 5.611 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 1600 3200 4800 6400 8000 SE +/- 29.16, N = 3 SE +/- 91.12, N = 4 SE +/- 31.90, N = 3 7043 7424 7321
TensorFlow Device: CPU - Batch Size: 512 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 512 - Model: ResNet-50 Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 20 40 60 80 100 SE +/- 0.61, N = 3 SE +/- 0.83, N = 4 SE +/- 0.84, N = 3 77.06 76.68 73.12
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 3K 6K 9K 12K 15K SE +/- 32.00, N = 3 SE +/- 22.15, N = 3 SE +/- 3.51, N = 3 11575 12187 12086
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 8 16 24 32 40 SE +/- 0.10, N = 3 SE +/- 0.48, N = 15 SE +/- 0.12, N = 3 SE +/- 0.05, N = 3 33.37 33.29 31.70 32.32
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 3K 6K 9K 12K 15K SE +/- 13.00, N = 3 SE +/- 36.03, N = 3 SE +/- 25.12, N = 3 11658 12262 12243
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 200 400 600 800 1000 SE +/- 2.33, N = 3 SE +/- 2.96, N = 3 SE +/- 2.40, N = 3 1026 1058 1079
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 200 400 600 800 1000 SE +/- 3.18, N = 3 SE +/- 2.03, N = 3 SE +/- 3.53, N = 3 867 897 911
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 900 1800 2700 3600 4500 SE +/- 13.25, N = 3 SE +/- 21.55, N = 3 SE +/- 47.59, N = 4 4173 4377 4353
Quicksilver Input: CTS2 OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CTS2 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.7M 3.4M 5.1M 6.8M 8.5M SE +/- 67222.68, N = 9 SE +/- 121572.66, N = 6 SE +/- 90618.71, N = 4 SE +/- 80526.35, N = 9 7723667 7398333 7367500 7680222 1. (CXX) g++ options: -fopenmp -O3 -march=native
OpenVKL Benchmark: vklBenchmarkCPU ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 600 1200 1800 2400 3000 SE +/- 20.04, N = 3 SE +/- 33.05, N = 3 SE +/- 22.66, N = 3 SE +/- 20.99, N = 3 2951 2815 2821 2821 MIN: 193 / MAX: 35719 MIN: 186 / MAX: 35204 MIN: 178 / MAX: 36193 MIN: 175 / MAX: 34505
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 200 400 600 800 1000 SE +/- 3.93, N = 3 SE +/- 9.82, N = 4 SE +/- 3.48, N = 3 874 902 914
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 50 100 150 200 250 SE +/- 0.00, N = 3 SE +/- 0.33, N = 3 SE +/- 0.33, N = 3 223 233 233
Speedb Test: Random Read OpenBenchmarking.org Op/s, More Is Better Speedb 2.7 Test: Random Read Arch Linux Fedora Server 39 Ubuntu Linux 23.10 130M 260M 390M 520M 650M SE +/- 6923132.62, N = 4 SE +/- 8524314.28, N = 3 SE +/- 5282775.06, N = 8 616573385 626261826 600703384 -lpthread 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
Neural Magic DeepSparse Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 40 80 120 160 200 SE +/- 2.15, N = 4 SE +/- 0.21, N = 3 SE +/- 1.16, N = 3 181.33 173.95 176.57
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 800 1600 2400 3200 4000 SE +/- 5.13, N = 3 SE +/- 43.59, N = 3 SE +/- 7.45, N = 3 3544 3655 3694
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 8 - Input: Bosphorus 4K Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 5 10 15 20 25 SE +/- 0.08, N = 3 SE +/- 0.21, N = 3 SE +/- 0.10, N = 3 SE +/- 0.19, N = 3 22.05 21.56 21.16 21.32 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Neural Magic DeepSparse Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 80 160 240 320 400 SE +/- 4.23, N = 4 SE +/- 0.14, N = 3 SE +/- 2.37, N = 3 351.75 366.38 361.29
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 60 120 180 240 300 SE +/- 0.67, N = 3 SE +/- 1.45, N = 3 SE +/- 0.00, N = 3 265 276 276
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.11 Video Input: Bosphorus 4K - Video Preset: Fast Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 0.6525 1.305 1.9575 2.61 3.2625 SE +/- 0.007, N = 3 SE +/- 0.026, N = 3 SE +/- 0.017, N = 3 SE +/- 0.035, N = 3 2.900 2.823 2.834 2.785 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 50 100 150 200 250 SE +/- 0.33, N = 3 SE +/- 0.88, N = 3 SE +/- 0.33, N = 3 226 235 234
SVT-AV1 Encoder Mode: Preset 4 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.8 Encoder Mode: Preset 4 - Input: Bosphorus 4K Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 0.5389 1.0778 1.6167 2.1556 2.6945 SE +/- 0.009, N = 3 SE +/- 0.009, N = 3 SE +/- 0.008, N = 3 SE +/- 0.007, N = 3 2.395 2.358 2.346 2.307 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 400 800 1200 1600 2000 SE +/- 22.85, N = 3 SE +/- 16.06, N = 3 SE +/- 20.52, N = 3 1878.45 1810.95 1866.23
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 8 16 24 32 40 SE +/- 0.42, N = 3 SE +/- 0.32, N = 3 SE +/- 0.38, N = 3 34.04 35.28 34.25
RocksDB Test: Update Random OpenBenchmarking.org Op/s, More Is Better RocksDB 8.0 Test: Update Random Arch Linux Fedora Server 39 Ubuntu Linux 23.10 30K 60K 90K 120K 150K SE +/- 1143.37, N = 3 SE +/- 1187.74, N = 3 SE +/- 440.29, N = 3 127451 123092 125560 -lpthread 1. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 800 1600 2400 3200 4000 SE +/- 21.20, N = 3 SE +/- 16.22, N = 3 SE +/- 3.21, N = 3 3591 3683 3718
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 110 220 330 440 550 SE +/- 6.17, N = 4 SE +/- 1.20, N = 3 SE +/- 6.06, N = 4 SE +/- 5.90, N = 4 531.53 528.79 529.14 519.83 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 8 16 24 32 40 SE +/- 0.14, N = 3 SE +/- 0.07, N = 3 SE +/- 0.31, N = 3 34.74 34.23 34.99
Neural Magic DeepSparse Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 400 800 1200 1600 2000 SE +/- 7.12, N = 3 SE +/- 3.56, N = 3 SE +/- 16.26, N = 3 1839.98 1866.52 1826.04
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 50 100 150 200 250 SE +/- 2.90, N = 4 SE +/- 0.55, N = 3 SE +/- 2.84, N = 4 SE +/- 2.84, N = 4 240.65 241.67 241.70 245.98 -pie - MIN: 175.12 / MAX: 455.8 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 160.04 / MAX: 556.9 -pie - MIN: 163.92 / MAX: 559.83 -pie - MIN: 176.44 / MAX: 390.61 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 30 60 90 120 150 SE +/- 1.36, N = 3 SE +/- 0.32, N = 3 SE +/- 0.87, N = 3 154.32 152.57 155.82
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 1400 2800 4200 5600 7000 SE +/- 40.01, N = 3 SE +/- 13.35, N = 3 SE +/- 77.79, N = 4 6380 6314 6448
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 9 18 27 36 45 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.02, N = 3 40.11 40.02 39.30 39.76 -pie - MIN: 38.25 / MAX: 97.25 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 33.6 / MAX: 102.02 -pie - MIN: 33.26 / MAX: 101.52 -pie - MIN: 37.84 / MAX: 104.98 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 700 1400 2100 2800 3500 SE +/- 3.83, N = 3 SE +/- 1.03, N = 3 SE +/- 4.78, N = 3 SE +/- 1.65, N = 3 3189.70 3196.47 3255.16 3217.84 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 1000 2000 3000 4000 5000 SE +/- 55.62, N = 3 SE +/- 40.12, N = 7 SE +/- 40.60, N = 7 4534.89 4462.53 4547.54
Neural Magic DeepSparse Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 90 180 270 360 450 SE +/- 3.56, N = 3 SE +/- 0.86, N = 3 SE +/- 2.51, N = 3 413.38 416.68 409.01
High Performance Conjugate Gradient X Y Z: 104 104 104 - RT: 60 OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 16 32 48 64 80 SE +/- 0.32, N = 3 SE +/- 0.12, N = 3 SE +/- 0.14, N = 3 SE +/- 0.19, N = 3 71.13 71.13 72.46 71.14 -lmpi_cxx -lmpi_cxx -lmpi_cxx 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi
Neural Magic DeepSparse Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 4 8 12 16 20 SE +/- 0.18, N = 3 SE +/- 0.13, N = 7 SE +/- 0.13, N = 7 14.10 14.32 14.06
High Performance Conjugate Gradient X Y Z: 144 144 144 - RT: 60 OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 144 144 144 - RT: 60 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 16 32 48 64 80 SE +/- 0.59, N = 3 SE +/- 0.83, N = 4 SE +/- 0.27, N = 3 SE +/- 0.52, N = 3 70.12 69.17 70.38 69.34 -lmpi_cxx -lmpi_cxx -lmpi_cxx 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/pathtracer/real_time Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 20 40 60 80 100 SE +/- 0.87, N = 4 SE +/- 0.23, N = 3 SE +/- 0.58, N = 3 SE +/- 0.35, N = 3 82.51 83.57 82.14 82.97
Neural Magic DeepSparse Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 2K 4K 6K 8K 10K SE +/- 102.84, N = 7 SE +/- 43.63, N = 3 SE +/- 102.73, N = 7 11213.16 11022.30 11099.64
Neural Magic DeepSparse Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 400 800 1200 1600 2000 SE +/- 9.98, N = 3 SE +/- 4.86, N = 3 SE +/- 7.36, N = 3 1868.77 1863.34 1837.11
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/scivis/real_time Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 5 10 15 20 25 SE +/- 0.04, N = 3 SE +/- 0.02, N = 3 SE +/- 0.02, N = 3 SE +/- 0.08, N = 3 20.78 20.44 20.54 20.77
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 2K 4K 6K 8K 10K SE +/- 125.31, N = 3 SE +/- 104.96, N = 5 SE +/- 124.93, N = 4 SE +/- 140.94, N = 3 10197.22 10113.47 10082.93 10033.95 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 8 16 24 32 40 SE +/- 0.18, N = 3 SE +/- 0.09, N = 3 SE +/- 0.12, N = 3 34.21 34.29 34.76
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 3 6 9 12 15 SE +/- 0.16, N = 3 SE +/- 0.14, N = 5 SE +/- 0.16, N = 4 SE +/- 0.18, N = 3 12.54 12.65 12.69 12.74 -pie - MIN: 10.74 / MAX: 50.52 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 10.9 / MAX: 74.36 -pie - MIN: 10.9 / MAX: 75.69 -pie - MIN: 10.88 / MAX: 73.62 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 1.3012 2.6024 3.9036 5.2048 6.506 SE +/- 0.0549, N = 7 SE +/- 0.0226, N = 3 SE +/- 0.0557, N = 7 5.6953 5.7833 5.7515
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 1400 2800 4200 5600 7000 SE +/- 44.43, N = 3 SE +/- 28.01, N = 3 SE +/- 60.98, N = 6 6399 6303 6324
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 500 1000 1500 2000 2500 SE +/- 6.39, N = 3 SE +/- 10.50, N = 3 SE +/- 4.46, N = 3 SE +/- 15.67, N = 3 2409.76 2412.51 2383.22 2377.90 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 12 24 36 48 60 SE +/- 0.14, N = 3 SE +/- 0.23, N = 3 SE +/- 0.10, N = 3 SE +/- 0.36, N = 3 53.08 53.02 53.67 53.79 -pie - MIN: 44.51 / MAX: 131.9 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 42.98 / MAX: 178.63 -pie - MIN: 43.29 / MAX: 161.25 -pie - MIN: 44.62 / MAX: 198.12 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 30 60 90 120 150 SE +/- 0.39, N = 3 SE +/- 0.09, N = 3 SE +/- 0.24, N = 3 134.93 133.28 133.21
Neural Magic DeepSparse Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 30 60 90 120 150 SE +/- 0.47, N = 3 SE +/- 0.30, N = 3 SE +/- 0.29, N = 3 134.21 132.91 132.68
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 5K 10K 15K 20K 25K SE +/- 269.84, N = 3 SE +/- 36.89, N = 3 SE +/- 130.19, N = 3 SE +/- 208.25, N = 3 24355.33 24295.71 24296.54 24570.10 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 100 200 300 400 500 SE +/- 1.04, N = 3 SE +/- 0.34, N = 3 SE +/- 0.78, N = 3 471.82 476.58 477.14
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 3.1 Benchmark: particle_volume/ao/real_time Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 5 10 15 20 25 SE +/- 0.05, N = 3 SE +/- 0.01, N = 3 SE +/- 0.06, N = 3 SE +/- 0.04, N = 3 20.67 20.50 20.44 20.65
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 2K 4K 6K 8K 10K SE +/- 34.95, N = 3 SE +/- 7.04, N = 3 SE +/- 9.74, N = 3 SE +/- 20.48, N = 3 8956.37 8908.74 8862.07 8901.61 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 4 8 12 16 20 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.02, N = 3 SE +/- 0.03, N = 3 14.28 14.36 14.43 14.37 -pie - MIN: 12.05 / MAX: 46.92 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 12.55 / MAX: 76.83 -pie - MIN: 12.18 / MAX: 71.66 -pie - MIN: 12.05 / MAX: 72.49 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.1813 2.3626 3.5439 4.7252 5.9065 SE +/- 0.06, N = 3 SE +/- 0.01, N = 3 SE +/- 0.03, N = 3 SE +/- 0.05, N = 3 5.24 5.25 5.25 5.20 -pie - MIN: 4.62 / MAX: 25.2 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 4.69 / MAX: 58.72 -pie - MIN: 4.62 / MAX: 31.26 -pie - MIN: 4.59 / MAX: 31.78 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
Neural Magic DeepSparse Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 100 200 300 400 500 SE +/- 1.54, N = 3 SE +/- 0.62, N = 3 SE +/- 0.91, N = 3 474.24 478.03 478.76
Neural Magic DeepSparse Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 200 400 600 800 1000 SE +/- 2.18, N = 3 SE +/- 4.43, N = 3 SE +/- 5.31, N = 3 834.19 827.99 829.51
Neural Magic DeepSparse Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 20 40 60 80 100 SE +/- 0.21, N = 3 SE +/- 0.41, N = 3 SE +/- 0.46, N = 3 76.66 77.17 77.02
Neural Magic DeepSparse Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 200 400 600 800 1000 SE +/- 8.02, N = 3 SE +/- 8.06, N = 3 SE +/- 6.05, N = 3 857.85 853.34 853.21
Neural Magic DeepSparse Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 20 40 60 80 100 SE +/- 0.70, N = 3 SE +/- 0.70, N = 3 SE +/- 0.50, N = 3 74.53 74.80 74.88
Neural Magic DeepSparse Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org items/sec, More Is Better Neural Magic DeepSparse 1.6 Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 300 600 900 1200 1500 SE +/- 12.68, N = 3 SE +/- 15.64, N = 3 SE +/- 14.85, N = 3 1227.37 1223.65 1223.16
Neural Magic DeepSparse Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream OpenBenchmarking.org ms/batch, Fewer Is Better Neural Magic DeepSparse 1.6 Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream Arch Linux CentOS Stream 9 Ubuntu Linux 23.10 12 24 36 48 60 SE +/- 0.55, N = 3 SE +/- 0.70, N = 3 SE +/- 0.63, N = 3 52.12 52.24 52.28
NWChem Input: C240 Buckyball OpenBenchmarking.org Seconds, Fewer Is Better NWChem 7.0.2 Input: C240 Buckyball Ubuntu Linux 23.10 400 800 1200 1600 2000 1750.3 1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2
LeelaChessZero Backend: Eigen OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.30 Backend: Eigen Ubuntu Linux 23.10 80 160 240 320 400 SE +/- 4.52, N = 4 380 1. (CXX) g++ options: -flto -pthread
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 7K 14K 21K 28K 35K SE +/- 467.39, N = 15 SE +/- 1255.46, N = 15 SE +/- 600.94, N = 15 23687 27524 33174
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 1.0 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU Arch Linux Fedora Server 39 Ubuntu Linux 23.10 7K 14K 21K 28K 35K SE +/- 740.65, N = 15 SE +/- 1257.60, N = 15 SE +/- 481.26, N = 15 23884 28035 32279
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 12 24 36 48 60 SE +/- 0.30, N = 3 SE +/- 1.14, N = 15 SE +/- 0.27, N = 9 SE +/- 0.11, N = 3 29.23 52.09 32.67 27.93 -pie - MIN: 21.95 / MAX: 117.93 -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF - MIN: 17.25 / MAX: 578.83 -pie - MIN: 19.33 / MAX: 484.58 -pie - MIN: 19.99 / MAX: 261.46 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 200 400 600 800 1000 SE +/- 11.59, N = 3 SE +/- 13.25, N = 15 SE +/- 8.04, N = 9 SE +/- 4.60, N = 3 1092.75 617.45 978.70 1143.25 -pie -isystem -std=c++11 -fPIC -fvisibility=hidden -MD -MT -MF -pie -pie 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency Arch Linux Ubuntu Linux 23.10 12 24 36 48 60 SE +/- 0.82, N = 12 SE +/- 0.97, N = 9 54.09 25.62 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write Arch Linux Ubuntu Linux 23.10 8K 16K 24K 32K 40K SE +/- 280.04, N = 12 SE +/- 1662.84, N = 9 18536 39542 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency Arch Linux Ubuntu Linux 23.10 0.335 0.67 1.005 1.34 1.675 SE +/- 0.035, N = 12 SE +/- 0.047, N = 12 1.440 1.489 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only Arch Linux Ubuntu Linux 23.10 150K 300K 450K 600K 750K SE +/- 16477.93, N = 12 SE +/- 22056.84, N = 12 698734 679317 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2024 Implementation: MPI CPU - Input: water_GMX50_bare Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 4 8 12 16 20 SE +/- 1.257, N = 15 SE +/- 0.323, N = 12 SE +/- 0.850, N = 12 SE +/- 1.403, N = 7 13.638 5.939 7.180 9.630 1. (CXX) g++ options: -O3 -lm
Redis Test: SET - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: SET - Parallel Connections: 500 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 500K 1000K 1500K 2000K 2500K SE +/- 46614.18, N = 15 SE +/- 47416.25, N = 15 SE +/- 73166.31, N = 15 SE +/- 48178.71, N = 15 2231227.61 1978937.18 2125843.97 2134872.89 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Redis Test: GET - Parallel Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better Redis 7.0.4 Test: GET - Parallel Connections: 500 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 400K 800K 1200K 1600K 2000K SE +/- 19370.21, N = 15 SE +/- 19161.91, N = 15 SE +/- 30474.05, N = 15 SE +/- 15103.72, N = 15 1777534.65 1821521.39 1838671.43 1541713.80 1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 2M 4M 6M 8M 10M SE +/- 121500.81, N = 15 SE +/- 20448.57, N = 10 SE +/- 380886.14, N = 12 SE +/- 20632.12, N = 3 9633352.44 2615816.73 8983240.52 3477137.28 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 100K 200K 300K 400K 500K SE +/- 19128.23, N = 15 SE +/- 2615.74, N = 15 SE +/- 14994.49, N = 7 SE +/- 39319.74, N = 3 482076 434200 428601 455056 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
ACES DGEMM Sustained Floating-Point Rate OpenBenchmarking.org GFLOP/s, More Is Better ACES DGEMM 1.0 Sustained Floating-Point Rate Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 14 28 42 56 70 SE +/- 0.91, N = 13 SE +/- 0.51, N = 3 SE +/- 0.04, N = 3 SE +/- 0.09, N = 3 43.67 63.64 37.72 37.97 1. (CC) gcc options: -O3 -march=native -fopenmp
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 40 80 120 160 200 SE +/- 0.19, N = 3 SE +/- 3.90, N = 15 SE +/- 0.73, N = 3 SE +/- 0.50, N = 3 148.60 177.80 179.54 144.92 MIN: 140.37 / MAX: 169.21 MIN: 142.88 / MAX: 210.57 MIN: 145.33 / MAX: 207.47 MIN: 137.85 / MAX: 155.04
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 2400 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 40 80 120 160 200 SE +/- 7.11, N = 12 SE +/- 10.40, N = 12 SE +/- 15.88, N = 12 SE +/- 8.98, N = 12 145.71 205.01 156.24 177.75 1. (CXX) g++ options: -O3 -fopenmp
LAMMPS Molecular Dynamics Simulator Model: Rhodopsin Protein OpenBenchmarking.org ns/day, More Is Better LAMMPS Molecular Dynamics Simulator 23Jun2022 Model: Rhodopsin Protein Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 10 20 30 40 50 SE +/- 0.85, N = 12 SE +/- 0.76, N = 15 SE +/- 0.95, N = 15 SE +/- 0.93, N = 15 42.75 39.80 39.01 37.52 1. (CXX) g++ options: -O3 -lm -ldl
NAMD Input: STMV with 1,066,628 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: STMV with 1,066,628 Atoms Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 0.4342 0.8684 1.3026 1.7368 2.171 SE +/- 0.07631, N = 15 SE +/- 0.04889, N = 12 SE +/- 0.05361, N = 15 SE +/- 0.14829, N = 15 1.74784 0.50251 1.92977 1.49004
NAMD Input: ATPase with 327,506 Atoms OpenBenchmarking.org ns/day, More Is Better NAMD 3.0b6 Input: ATPase with 327,506 Atoms Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.2599 2.5198 3.7797 5.0396 6.2995 SE +/- 0.21129, N = 12 SE +/- 0.30866, N = 12 SE +/- 0.33952, N = 15 SE +/- 0.33618, N = 15 5.59956 3.20328 5.40706 4.70711
Quicksilver Input: CORAL2 P1 OpenBenchmarking.org Figure Of Merit, More Is Better Quicksilver 20230818 Input: CORAL2 P1 Arch Linux CentOS Stream 9 Fedora Server 39 Ubuntu Linux 23.10 1.3M 2.6M 3.9M 5.2M 6.5M SE +/- 55910.49, N = 6 SE +/- 105910.76, N = 12 SE +/- 42699.12, N = 11 SE +/- 65941.38, N = 12 5865500 5800833 5835818 5931500 1. (CXX) g++ options: -fopenmp -O3 -march=native
LeelaChessZero Backend: BLAS OpenBenchmarking.org Nodes Per Second, More Is Better LeelaChessZero 0.30 Backend: BLAS Ubuntu Linux 23.10 4 8 12 16 20 SE +/- 1.18, N = 9 15 1. (CXX) g++ options: -flto -pthread
Phoronix Test Suite v10.8.4