new tests Tests for a future article. Intel Core i9-14900K testing with a ASUS PRIME Z790-P WIFI (1402 BIOS) and AMD Radeon RX 7900 XTX 24GB on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310314-PTS-NEWTESTS84&grs .
new tests Processor Motherboard Chipset Memory Disk Graphics Audio Monitor OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution a b c d e f Intel Core i5-13600K @ 5.10GHz (14 Cores / 20 Threads) ASUS PRIME Z790-P WIFI (1402 BIOS) Intel Device 7a27 32GB 1000GB Western Digital WDS100T1X0E-00AFY0 AMD Radeon RX 7900 XTX 24GB (2304/1249MHz) Realtek ALC897 ASUS VP28U Ubuntu 23.10 6.5.0-9-generic (x86_64) GNOME Shell 45.0 X Server + Wayland 4.6 Mesa 23.2.1-1ubuntu3 (LLVM 15.0.7 DRM 3.54) GCC 13.2.0 ext4 3840x2160 Intel Core i5-14600K @ 5.30GHz (14 Cores / 20 Threads) Intel Core i9-14900K @ 5.70GHz (24 Cores / 32 Threads) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x11d - Thermald 2.5.4 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected Python Details - b, c, d, e, f: Python 3.11.6
new tests heffte: r2c - Stock - double-long - 128 rodinia: OpenMP Streamcluster heffte: c2c - FFTW - float-long - 128 pgbench: 100 - 1000 - Read Only - Average Latency heffte: c2c - Stock - float - 128 heffte: c2c - Stock - float-long - 128 pgbench: 100 - 1000 - Read Only npb: EP.C heffte: c2c - FFTW - float - 128 cpuminer-opt: LBC, LBRY Credits john-the-ripper: Blowfish easywave: e2Asean Grid + BengkuluSept2007 Source - 240 cpuminer-opt: Skeincoin cpuminer-opt: Blake-2 S heffte: r2c - FFTW - float - 128 heffte: r2c - Stock - float-long - 128 heffte: r2c - Stock - double - 128 minibude: OpenMP - BM1 minibude: OpenMP - BM1 cpuminer-opt: Magi cpuminer-opt: Ringcoin cpuminer-opt: Quad SHA-256, Pyrite pgbench: 100 - 1000 - Read Only - Average Latency cpuminer-opt: Deepcoin stockfish: Total Time rodinia: OpenMP CFD Solver specfem3d: Tomographic Model cpuminer-opt: Triple SHA-256, Onecoin specfem3d: Homogeneous Halfspace darktable: Server Rack - CPU-only compress-7zip: Decompression Rating john-the-ripper: WPA PSK cpuminer-opt: scrypt specfem3d: Mount St. Helens cpuminer-opt: Myriad-Groestl heffte: r2c - Stock - float - 128 openssl: SHA256 john-the-ripper: MD5 openvino: Face Detection Retail FP16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU darktable: Boat - CPU-only quantlib: Multi-Threaded astcenc: Medium onednn: Recurrent Neural Network Inference - f32 - CPU openvino: Face Detection FP16 - CPU indigobench: CPU - Supercar openssl: AES-256-GCM astcenc: Thorough openssl: AES-128-GCM openssl: SHA512 minibude: OpenMP - BM2 minibude: OpenMP - BM2 john-the-ripper: HMAC-SHA512 ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU onednn: Recurrent Neural Network Training - f32 - CPU luxcorerender: Danish Mood - CPU openssl: ChaCha20 indigobench: CPU - Bedroom openssl: ChaCha20-Poly1305 ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU embree: Pathtracer ISPC - Crown onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU v-ray: CPU specfem3d: Layered Halfspace john-the-ripper: bcrypt openradioss: INIVOL and Fluid Structure Interaction Drop Container oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only embree: Pathtracer ISPC - Asian Dragon ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU luxcorerender: LuxCore Benchmark - CPU openradioss: Rubber O-Ring Seal Installation npb: LU.C openssl: RSA4096 uvg266: Bosphorus 4K - Medium asmfish: 1024 Hash Memory, 26 Depth nginx: 500 rodinia: OpenMP LavaMD openssl: RSA4096 specfem3d: Water-layered Halfspace blender: BMW27 - CPU-Only openvino: Vehicle Detection FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU build-linux-kernel: allmodconfig onednn: IP Shapes 1D - f32 - CPU kvazaar: Bosphorus 4K - Medium tensorflow: CPU - 16 - ResNet-50 heffte: r2c - FFTW - float-long - 128 nginx: 1000 luxcorerender: DLSC - CPU openradioss: Cell Phone Drop Test heffte: r2c - FFTW - double - 128 astcenc: Exhaustive heffte: r2c - FFTW - double-long - 128 blender: Fishy Cat - CPU-Only uvg266: Bosphorus 4K - Super Fast npb: BT.C compress-7zip: Compression Rating build-linux-kernel: defconfig openvino: Machine Translation EN To DE FP16 - CPU natron: Spaceship darktable: Masskrug - CPU-only xmrig: Wownero - 1M uvg266: Bosphorus 4K - Ultra Fast openvino: Face Detection FP16-INT8 - CPU liquid-dsp: 32 - 256 - 512 tensorflow: CPU - 64 - ResNet-50 openvino: Weld Porosity Detection FP16 - CPU luxcorerender: Rainbow Colors and Prism - CPU build-godot: Time To Compile ospray-studio: 2 - 4K - 1 - Path Tracer - CPU cpuminer-opt: Garlicoin ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU openvino: Person Detection FP16 - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU build-nodejs: Time To Compile build-llvm: Ninja ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/ao/real_time ospray-studio: 2 - 4K - 16 - Path Tracer - CPU luxcorerender: Orange Juice - CPU ospray-studio: 1 - 4K - 16 - Path Tracer - CPU openvino: Person Vehicle Bike Detection FP16 - CPU uvg266: Bosphorus 4K - Very Fast openvkl: vklBenchmarkCPU ISPC ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 3 - 4K - 16 - Path Tracer - CPU ospray-studio: 3 - 4K - 32 - Path Tracer - CPU qmcpack: H4_ae openvino: Face Detection Retail FP16-INT8 - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU openvino: Person Detection FP32 - CPU openradioss: Bird Strike on Windshield ospray: gravity_spheres_volume/dim_512/scivis/real_time qmcpack: Li2_STO_ae namd: ATPase Simulation - 327,506 Atoms kvazaar: Bosphorus 4K - Ultra Fast svt-hevc: 7 - Bosphorus 4K npb: CG.C ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time svt-vp9: Visual Quality Optimized - Bosphorus 4K openfoam: drivaerFastback, Small Mesh Size - Execution Time cloverleaf: clover_bm pgbench: 100 - 1000 - Read Write pgbench: 100 - 1000 - Read Write - Average Latency openvino: Age Gender Recognition Retail 0013 FP16 - CPU avifenc: 6, Lossless ospray: gravity_spheres_volume/dim_512/pathtracer/real_time build-mesa: Time To Compile simdjson: PartialTweets simdjson: DistinctUserID npb: FT.C openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU selenium: WASM collisionDetection - Google Chrome svt-vp9: PSNR/SSIM Optimized - Bosphorus 4K avifenc: 2 kvazaar: Bosphorus 4K - Super Fast npb: SP.B kvazaar: Bosphorus 4K - Very Fast openvino: Handwritten English Recognition FP16 - CPU memcached: 1:10 openradioss: Bumper Beam svt-av1: Preset 12 - Bosphorus 4K svt-hevc: 10 - Bosphorus 4K pgbench: 100 - 1000 - Read Write - Average Latency avifenc: 6 openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU svt-av1: Preset 13 - Bosphorus 4K darktable: Server Room - CPU-only x265: Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K memcached: 1:100 openfoam: drivaerFastback, Small Mesh Size - Mesh Time npb: SP.C y-cruncher: 500M xmrig: Monero - 1M ospray: particle_volume/pathtracer/real_time openvino: Face Detection Retail FP16 - CPU liquid-dsp: 8 - 256 - 512 vvenc: Bosphorus 4K - Fast selenium: Octane - Google Chrome duckdb: IMDB openvino: Face Detection FP16 - CPU selenium: WASM imageConvolute - Google Chrome hpcg: 104 104 104 - 60 openfoam: drivaerFastback, Medium Mesh Size - Mesh Time rodinia: OpenMP HotSpot3D openfoam: drivaerFastback, Medium Mesh Size - Execution Time openvino: Vehicle Detection FP16 - CPU heffte: c2c - Stock - double-long - 128 pybench: Total For Average Test Times heffte: c2c - FFTW - double - 512 heffte: c2c - Stock - double - 128 openvino: Road Segmentation ADAS FP16 - CPU quantlib: Single-Threaded heffte: c2c - FFTW - double-long - 256 liquid-dsp: 16 - 256 - 512 heffte: c2c - FFTW - double - 256 rawtherapee: Total Benchmark Time ngspice: C2670 svt-vp9: VMAF Optimized - Bosphorus 4K vvenc: Bosphorus 4K - Faster ngspice: C7552 rodinia: OpenMP Leukocyte heffte: c2c - FFTW - float-long - 256 heffte: c2c - FFTW - float - 256 heffte: c2c - FFTW - double-long - 512 libraw: Post-Processing Benchmark openvino: Vehicle Detection FP16 - CPU heffte: c2c - FFTW - float - 512 selenium: WASM imageConvolute - Firefox openvino: Road Segmentation ADAS FP16 - CPU heffte: c2c - Stock - double-long - 256 heffte: c2c - FFTW - float-long - 512 phpbench: PHP Benchmark Suite openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16 - CPU heffte: r2c - FFTW - double - 256 simdjson: TopTweet heffte: r2c - FFTW - double-long - 256 pyperformance: crypto_pyaes pyperformance: json_loads heffte: c2c - Stock - double - 256 selenium: Kraken - Firefox selenium: WASM collisionDetection - Firefox webp: Quality 100, Highest Compression simdjson: LargeRand webp: Quality 100 webp: Quality 100, Lossless, Highest Compression pyperformance: regex_compile selenium: Jetstream 2 - Firefox openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU node-web-tooling: webp: Quality 100, Lossless heffte: r2c - FFTW - double-long - 512 openvino: Age Gender Recognition Retail 0013 FP16 - CPU heffte: r2c - FFTW - double - 512 simdjson: Kostya heffte: r2c - Stock - float-long - 512 heffte: c2c - Stock - float - 256 heffte: r2c - Stock - float - 512 selenium: Octane - Firefox openvino: Weld Porosity Detection FP16-INT8 - CPU npb: MG.C selenium: Jetstream 2 - Google Chrome selenium: Kraken - Google Chrome heffte: r2c - Stock - double-long - 256 heffte: r2c - Stock - double - 256 openvino: Machine Translation EN To DE FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU heffte: c2c - Stock - float-long - 256 openvino: Face Detection FP16-INT8 - CPU duckdb: TPC-H Parquet openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Detection FP16 - CPU gimp: rotate selenium: Speedometer - Google Chrome heffte: r2c - FFTW - float-long - 512 heffte: r2c - FFTW - float - 512 selenium: Speedometer - Firefox heffte: c2c - FFTW - double-long - 128 heffte: c2c - FFTW - double - 128 openvino: Person Detection FP32 - CPU heffte: r2c - Stock - double-long - 512 heffte: r2c - Stock - double - 512 gimp: resize heffte: r2c - FFTW - float-long - 256 openvino: Weld Porosity Detection FP16 - CPU pyperformance: python_startup heffte: c2c - Stock - float-long - 512 easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 heffte: r2c - FFTW - float - 256 heffte: c2c - Stock - float - 512 cloverleaf: clover_bm16 gimp: auto-levels heffte: c2c - Stock - double - 512 selenium: PSPDFKit WASM - Google Chrome cloverleaf: clover_bm64_short gimp: unsharp-mask heffte: c2c - Stock - double-long - 512 selenium: PSPDFKit WASM - Firefox qmcpack: O_ae_pyscf_UHF heffte: r2c - Stock - float - 256 qmcpack: simple-H2O onednn: IP Shapes 3D - f32 - CPU heffte: r2c - Stock - float-long - 256 qmcpack: FeCO6_b3lyp_gms qmcpack: LiH_ae_MSD gromacs: MPI CPU - water_GMX50_bare build-gem5: Time To Compile onednn: IP Shapes 1D - bf16bf16bf16 - CPU a b c d e f 31.8111 50.9336 31.2635 31.2292 51.881 9035.2 21370 79320 131.935 66.6684 31.5989 417.24 2345.04 41360 5436.88 61070 185.62 8564.28 68.8833 55799.7 127.305 44.2936 43.71 1893.63 58.98 14.2837 18.1985 14.658 4818.8 16.1534 16.2749 31.3044 31.2736 18.239 35.2152 15.8331 35.2816 28.5862 28.6186 15.7616 31.7325 31.6129 49.7591 29.6502 49.4547 29.6848 29.3894 29.778 58.4188 58.6489 18.1147 18.1124 27.2229 27.1619 55.5018 27.985 55.1029 27.9897 1094.128868103 15.0777 127.74 15.1102 57.5957 57.9832 32.6945 15.894 52.499 1.054 31.9321 31.6351 947197 1773.71 53.1674 9845.56 26232 8.637 22770 85030 104.025 71.0106 32.6174 216.969 8.679 442.28 2508.93 43650 949091 5549.86 46079885 10.118 49.142967323 62870 62.094293176 0.266 92168 36403 195.41 51.618068965 8774.33 70.4153 21234053100 1134000 1011.66 1705.1 4.194 55963.1 81.1968 1705.51 3.45 7.592 91408403390 10.6857 106224257980 6922574030 12.906 322.654 45526000 32575 3319.64 2.22 43683166890 2.689 31517307700 31957 19.033 3320.34 18743 135.10279615 31152 435.68 0.43 23.1049 37384 2.54 165.27 35845.04 231709.8 9.41 47742886 135692.91 131.529 3577.3 126.074892513 78.97 607.54 255 778.31 2.98963 13.07 23.91 134.246 127899.06 2.88 96.02 44.8791 1.0807 44.0648 109.94 29.3 34501.92 127009 65.562 44.66 4.6 2.364 11202.1 35.3 12.75 228820000 24.14 383.74 12.46 230.157 7338 2040.33 62478 38.73 8523 72426 61145 357.104 439.462 2156 1848 7176 3.6958 120716 4.88 118595 503.59 27.74 387 239054 139891 277357 22.73 2125.52 234540 1830 38.75 228.35 3.60869 285.85 0.86311 54.14 71.18 8025.9 7.37745 7.32159 88.07 224.43546 55.82 41736 23.5 12503.2 6.908 5.03922 29.825 5.39 6.04 18953.53 1361.39 269.31 232.97 111.84 44.202 40.71 16564.09 32.39 224.92 4464270.25 151.75 146.571 139.99 42553 4.515 32245.66 149.161 1.833 25.7 58.458 4058394.64 33.876155 12080 13.443 7676.5 169.646 5.89 139440000 6.539 97769 86.571 1730.89 15.95 8.28855 230.3022 55.996 2166.8213 19.33 14.4836 427 18.3225 14.8193 56.41 5016.1 16.4718 204060000 16.4798 40.086 66.392 93.49 13.272 65.896 53.871 31.8188 31.8292 18.2924 61.85 309.56 35.2893 16.3 106.23 16.0466 35.3807 1491682 0.49 59.37 71.08 29.014 7.29 28.9898 37.5 10.2 15.9022 520.2 242.3 4.74 1.77 15.57 0.87 71.6 229.492 23.47 9.84 24.04 2.20 32.0107 1.27 31.9085 4.8 50.8129 30.3369 50.6268 55056 11.74 24092.42 382.668 349 30.042 29.522 134.23 7.52 29.9091 469.85 89.754 11.88 154.78 9.199 486 59.1962 59.3627 406 18.5277 18.4321 154.5 27.7294 27.527 12.625 55.9418 41.65 4.85 28.418 134.185 55.526 28.337 1093.01 9.615 15.2166 2202 127.94 11.12 15.2009 2108 262.57 58.425 32.159 4.36881 58.1474 266.24 169.05 22.1694 15.834 51.9406 0.91 31.74 32.3869 833675 1775.67 52.2532 9996.74 31050 7.127 22780 86620 131.251 69.3558 32.4099 217.632 8.705 450.42 2494.87 43630 1098384 5604.27 41998826 10.121 51.562965731 62850 63.899206362 0.317 92137 36403 194.68 51.749754354 8765.28 71.5008 21897801590 1131000 1028.14 1707.37 4.389 57317.5 81.2623 1705.89 3.45 7.778 94488932900 10.6929 106192145890 7042483840 12.907 322.668 46015000 32538 3319.3 2.27 43658009260 2.6 30264414210 32007 19.1475 3321.37 18885 140.585720491 31110 438.56 0.43 23.1978 37367 2.55 165.49 34481.92 231842.8 9.43 47676136 132783.22 129.977 3578.6 126.108597881 79.19 610.86 255.73 767.219 2.99292 13.09 23.93 131.511 127678.15 2.87 95.46 44.6026 1.0804 44.6157 110.19 29.34 34573.71 127943 64.959 44.39 4.4 2.423 11160.8 35.09 12.75 228970000 24.1 383.97 12.54 234.319 7320 2040.66 62329 38.98 8512 72092 61161 355.827 435.624 2150 1840 7195 3.79562 120664 4.91 118639 502.05 27.74 386 238496 139809 277524 25.09 2170.03 234968 1816 39 234.62 3.65807 284.18 0.86116 54.29 71.18 7422.74 7.3669 7.32078 88.36 223.87156 56.10 41656 23.705 12456.32 6.983 5.04566 29.202 6.51 6.04 18737.09 1359.67 266.96 193.39 112.34 43.999 40.69 16408.43 32.33 224.74 4305589.62 150.19 148.546 140.06 42185 4.581 32241.73 147.091 1.825 26.74 58.184 4149899.46 32.902566 12101.42 13.518 7660.4 169.835 5.81 140020000 6.55 105856 86.555 1733.4 15.95 8.26023 234.39558 58.024 2166.8275 19.09 14.3837 446 18.275 14.4448 56.71 5014.7 16.3572 204100000 16.4311 40.301 66.076 96 13.276 63.909 54.926 31.5524 31.6357 18.2462 63.14 313.32 35.4345 16.1 105.65 15.942 35.1791 1479761 0.49 59.91 71.15 28.8122 7.31 28.7765 37.4 10.1 15.7502 520.4 242.5 4.74 1.77 15.53 0.88 71.4 227.935 23.41 9.79 23.75 2.19 31.8965 1.28 31.9077 4.81 50.9069 29.8126 50.8481 55067 11.76 26540.76 388.309 343.2 29.8789 29.8734 135 7.37 29.8559 470 90.453 11.91 153.73 9.447 471 59.1701 59.223 407 18.3191 18.5488 153.66 27.5481 27.6161 12.394 55.4126 41.63 4.84 28.2076 134.633 55.6015 28.4031 1091.47 9.748 15.2168 2170 128.11 11.352 15.1774 2096 264.99 57.9937 32.718 4.39625 58.467 266.65 168.7 52.5398 7.224 99.7527 0.734 59.735 60.0213 1299800 3342.51 95.578 16610 47385 4.774 38520 141280 183.452 117.845 53.6889 373.696 14.948 714.42 4039.03 70960 1361760 9210.49 71722427 5.935 30.329244698 102200 37.654774238 0.209 142133 60805 310.55 30.83045146 14360 115.198 35393577110 1862000 1564.5 1145.28 2.697 88060 130.3598 1062.21 5.48 12.02 145829960090 16.9563 168593428210 10970736110 20.411 510.266 71981000 20919 2113.83 3.44 68204421390 4.064 47030287560 20752 29.5036 2150.83 27940 91.860925305 47260 285.26 0.65 35.4539 24365 3.82 108.37 52753.11 353220.2 14.25 71318180 196302.45 86.77 5418.1 84.113422384 53.09 886.66 369.15 519.599 2.0115 19.38 35.36 186.551 184950.78 4.15 66.07 64.2029 1.5573 63.2219 75.4 41.03 50512.7 183859 44.92 62.26 6.4 1.694 16204 50.11 18.22 329800000 34.8 550.3 17.96 163.491 5095 2726.85 43450 53.71 6008 50740 43729 251.154 308.209 1547 1304 5114 5.25193 85258 6.93 83823 662.91 38.59 547 169828 99301 196772 18.59 2989.78 167519 1294 54.06 166.94 5.09456 208.7 0.61454 75.87 98.86 10390.33 10.1686 10.1188 122.15 162.53309 42.55 55901 18.249 16901.1 5.097 6.9477 21.8 7.37 8.25 24740.05 1854.34 353.52 211.66 137.97 33.2 54.02 22004.23 42.25 295.35 5693052.64 114.72 194.348 185.07 54797 3.852 41956.06 193.281 1.415 32.51 75.364 5061427.55 26.294883 15263.55 10.784 9559.6 207.158 5.1 168030000 7.829 117684 71.369 1448.07 14.1 9.91323 203.49517 48.845 1816.8935 22.7 16.9435 377 15.8089 16.751 66.49 5665 14.1676 223360000 14.1336 34.446 57.215 107.4 15.438 56.736 48.076 27.7795 27.8481 15.9782 71.16 351.84 30.8378 14.2 120.21 14.0877 31.0588 1692102 0.56 67.85 81.2 25.8193 8.29 25.7995 33 9.18 14.123 487.8 216 5.36 1.99 17.54 0.97 63.6 251.968 21.63 9.01 26.59 2.45 28.6645 1.41 28.806 5.3 54.9549 27.8339 54.5762 60754 12.93 24498.2 419.011 319.4 27.6358 27.6338 128.32 8.02 28.0061 436.86 83.672 12.05 148.7 8.806 498 55.8788 55.6705 433 19.2383 19.2256 147.8 28.4441 28.478 12.516 53.7468 43.58 4.65 28.9081 129.701 53.6358 28.9398 1058.26 9.459 15 2139 124.58 11.042 14.786 2059 258.4 58.9926 31.973 4.30701 58.982 265.33 168.69 1.143 221.237 56.6746 7.176 98.9954 0.615 59.0159 40.5778 1591084 2893.39 97.4361 16950 47590 4.833 38020 140960 184.512 114.123 52.8988 375.175 15.007 710.53 4048.08 70910 1627171 9300.91 64256226 6.101 30.650349799 103680 38.235807506 0.187 155668 61351 311.73 31.651786074 14310 114.341 35045220680 1862000 1569.98 1064.7 2.705 89344 130.2225 1083.23 5.45 12.189 144868800380 16.9459 167529875870 10887827050 20.335 508.378 65609000 20740 2125.71 3.46 67647607600 4.062 47048102160 20746 29.5741 2145.16 29012 91.894740975 47942 291.63 0.66 35.3254 24526 3.89 109.41 52719.71 345908.2 14.29 71944056 200059.86 86.948 5294.3 83.591421969 52.94 894.24 371.16 527.01 2.01217 19.43 35.1 183.834 180058.09 4.17 65.66 64.9886 1.5633 64.0485 75.33 42.91 50229.93 171918 45.816 63.33 6.3 1.671 16088.6 50.81 18.46 331120000 34.83 553.27 17.74 162.567 5127 2605.6 44009 54.88 5932 50458 42719 249.43 307.199 1524 1294 5081 5.27642 84586 6.96 83524 675.24 38.46 545 168775 99388 196983 18.3 3010.6 166105 1305 54.78 170.46 5.03492 203.73 0.61420 73.99 99.72 8593.82 10.2038 10.1229 120.84 162.41459 42.83 56335 17.939 17036.01 5.063 6.92496 21.846 7.37 8.25 25591.23 1855.23 357.15 204.81 150.27 32.79 54.71 21129.21 42.88 298.8 5730258.98 114.07 190.158 178.47 55746 3.472 42386.78 190.914 1.434 33.23 75.647 5241319.8 27.068099 15270.26 10.864 9512.1 209.444 5.08 171400000 7.946 102783 1447.91 16.93 9.88187 204.95889 48.465 1813.292 22.63 16.8437 378 15.8476 16.6687 66.06 5672.2 14.0426 239170000 14.0817 35.193 57.464 107.34 15.297 56.807 47.475 27.5697 27.5778 15.9518 69.03 352.96 30.9894 14.3 120.99 14.0495 30.9595 1685554 0.55 67.16 80.25 25.5679 8.29 25.585 33.6 9.23 14.0404 552.8 214.3 5.28 1.99 17.54 0.97 67 253.624 21.51 8.93 26.44 2.44 28.7669 1.4 28.7749 5.3 54.0622 27.5452 53.8771 60711 12.93 25162.58 421.021 318.5 27.5275 27.4046 126.22 7.96 27.5381 432.96 11.83 145.64 8.854 502 55.5756 55.6246 433 18.9154 18.7925 145.89 28.6623 28.5518 12.111 53.4593 43.34 4.68 29.1118 129.448 53.7069 28.9079 1063.27 9.526 14.9001 2140 124.71 11.12 14.7872 2051 258.48 58.4249 32.185 4.3061 58.05 264.55 168.37 1.148 221.121 53.0212 7.226 96.4633 0.623 60.5548 59.2394 1599780 3340.25 87.4539 16590 47203 4.785 38040 138510 179.454 115.053 54.7998 375.582 15.023 722 3946.27 71070 1604627 9248.07 71115165 6.49 30.471851303 102900 37.994095068 0.217 138934 61337 309.14 32.373449454 14200 115.093 35479543830 1888000 1668.97 1044.98 2.771 89756.1 130.417 1081.46 5.54 11.407 145929235660 16.9808 167740437850 10972703730 20.389 509.73 71931000 21146 2141.47 3.48 68291402680 4.067 47059792960 20598 29.1886 2188.89 28970 91.176640881 47587 290.16 0.66 35.2707 24512 3.85 108.16 52390.75 354387.3 14.38 72527312 201855.19 86.56 5367.5 83.64314069 52.67 910.84 382.3 526.015 2.03456 19.4 35.42 187.72 188170.4 4.22 65.34 63.746 1.5848 62.9035 75.24 42.68 49366.17 185607 45.723 64.57 6.4 1.667 15832.3 50.69 18.26 329870000 34.84 553.49 14.89 162.803 5175 2559.46 43624 55.69 6037 50705 42916 249.686 307.934 1508 1318 5039 5.05305 84845 6.91 83238 715.44 39.46 549 168378 98534 195691 17.7 3005.33 166313 1296 53.9 166.08 4.94223 202.62 0.61266 72.63 98.68 7707.48 10.2334 10.1689 120.89 161.88961 42.77 57567 19.694 17187.3 5.066 6.94737 21.985 7.36 8.24 25290.9 1852.66 363.79 171.17 152.13 33.165 53.99 21905.42 43.18 299.8 5727247.49 115.59 194.228 170.99 50776 3.854 42343.47 191.65 1.405 33.47 75.057 5171670.55 27.222561 15177.55 11.108 9501 204.454 4.78 169900000 7.834 118632 1431.86 14.04 9.84149 195.54508 49.009 1815.051 22.45 16.8575 376 15.5292 17.032 66.42 5666.3 14.0375 239250000 14.144 35.075 56.866 108.94 15.439 56.766 48.276 27.549 27.654 15.8822 70.56 355.77 31.0469 14.3 120.32 14.015 30.9269 1686956 0.55 65.93 79.99 25.449 8.31 25.4696 33 8.98 14.0224 490.7 214.5 5.36 2 17.53 0.98 64.8 256.136 20.89 8.77 26.63 2.45 28.762 1.39 28.7644 5.31 54.6591 27.4897 54.2616 60209 12.95 24381.01 419.3 317.3 27.3438 27.3379 123.75 7.98 27.5919 435.36 11.17 143.52 8.828 504 55.3711 55.782 430 18.8817 19.1865 148.29 27.776 28.2529 12.695 53.406 43.32 4.66 28.4467 131.703 53.5429 28.5277 1062.13 9.526 14.7708 2139 124.75 11.167 15.0325 2055 259.37 58.2473 32.225 4.31754 58.5898 263.94 168.31 1.148 220.533 OpenBenchmarking.org
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 a b c d e f 13 26 39 52 65 31.81 32.69 22.17 52.54 56.67 53.02 1. (CXX) g++ options: -O3
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster b c d e f 4 8 12 16 20 15.894 15.834 7.224 7.176 7.226 1. (CXX) g++ options: -O2 -lOpenCL
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 a b c d e f 20 40 60 80 100 50.93 52.50 51.94 99.75 99.00 96.46 1. (CXX) g++ options: -O3
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency b c d e f 0.27 0.54 0.81 1.08 1.35 1.056 1.200 0.769 0.629 0.625 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 a b c d e f 14 28 42 56 70 31.26 31.93 31.74 59.74 59.02 60.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 a b c d e f 13 26 39 52 65 31.23 31.64 32.39 60.02 40.58 59.24 1. (CXX) g++ options: -O3
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only b c d e f 300K 600K 900K 1200K 1500K 947197 833675 1299800 1591084 1599780 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C b c d e f 700 1400 2100 2800 3500 1773.71 1775.67 3342.51 2893.39 3340.25 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 a b c d e f 20 40 60 80 100 51.88 53.17 52.25 95.58 97.44 87.45 1. (CXX) g++ options: -O3
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: LBC, LBRY Credits a b c d e f 4K 8K 12K 16K 20K 9035.20 9845.56 9996.74 16610.00 16950.00 16590.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish b c d e f 10K 20K 30K 40K 50K 26232 31050 47385 47590 47203 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 b c d e f 2 4 6 8 10 8.637 7.127 4.774 4.833 4.785 1. (CXX) g++ options: -O3 -fopenmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Skeincoin a b c d e f 8K 16K 24K 32K 40K 21370 22770 22780 38520 38020 38040 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Blake-2 S a b c d e f 30K 60K 90K 120K 150K 79320 85030 86620 141280 140960 138510 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 a b c d e f 40 80 120 160 200 131.94 104.03 131.25 183.45 184.51 179.45 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 a b c d e f 30 60 90 120 150 66.67 71.01 69.36 117.85 114.12 115.05 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 a b c d e f 12 24 36 48 60 31.60 32.62 32.41 53.69 52.90 54.80 1. (CXX) g++ options: -O3
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 b c d e f 80 160 240 320 400 216.97 217.63 373.70 375.18 375.58 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 b c d e f 4 8 12 16 20 8.679 8.705 14.948 15.007 15.023 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Magi a b c d e f 160 320 480 640 800 417.24 442.28 450.42 714.42 710.53 722.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Ringcoin a b c d e f 900 1800 2700 3600 4500 2345.04 2508.93 2494.87 4039.03 4048.08 3946.27 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Quad SHA-256, Pyrite a b c d e f 15K 30K 45K 60K 75K 41360 43650 43630 70960 70910 71070 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency b c d e f 300K 600K 900K 1200K 1500K 949091 1098384 1361760 1627171 1604627 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Deepcoin a b c d e f 2K 4K 6K 8K 10K 5436.88 5549.86 5604.27 9210.49 9300.91 9248.07 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time b c d e f 15M 30M 45M 60M 75M 46079885 41998826 71722427 64256226 71115165 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver b c d e f 3 6 9 12 15 10.118 10.121 5.935 6.101 6.490 1. (CXX) g++ options: -O2 -lOpenCL
SPECFEM3D Model: Tomographic Model OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model b c d e f 12 24 36 48 60 49.14 51.56 30.33 30.65 30.47 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Triple SHA-256, Onecoin a b c d e f 20K 40K 60K 80K 100K 61070 62870 62850 102200 103680 102900 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace b c d e f 14 28 42 56 70 62.09 63.90 37.65 38.24 37.99 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Darktable Test: Server Rack - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Rack - Acceleration: CPU-only b c d e f 0.0713 0.1426 0.2139 0.2852 0.3565 0.266 0.317 0.209 0.187 0.217
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating b c d e f 30K 60K 90K 120K 150K 92168 92137 142133 155668 138934 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
John The Ripper Test: WPA PSK OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK b c d e f 13K 26K 39K 52K 65K 36403 36403 60805 61351 61337 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
Cpuminer-Opt Algorithm: scrypt OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: scrypt a b c d e f 70 140 210 280 350 185.62 195.41 194.68 310.55 311.73 309.14 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SPECFEM3D Model: Mount St. Helens OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens b c d e f 12 24 36 48 60 51.62 51.75 30.83 31.65 32.37 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Myriad-Groestl a b c d e f 3K 6K 9K 12K 15K 8564.28 8774.33 8765.28 14360.00 14310.00 14200.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 a b c d e f 30 60 90 120 150 68.88 70.42 71.50 115.20 114.34 115.09 1. (CXX) g++ options: -O3
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 b c d e f 8000M 16000M 24000M 32000M 40000M 21234053100 21897801590 35393577110 35045220680 35479543830 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 b c d e f 400K 800K 1200K 1600K 2000K 1134000 1131000 1862000 1862000 1888000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU b c d e f 400 800 1200 1600 2000 1011.66 1028.14 1564.50 1569.98 1668.97 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU b c d e f 400 800 1200 1600 2000 1705.10 1707.37 1145.28 1064.70 1044.98 MIN: 1702.22 MIN: 1704.07 MIN: 996.9 MIN: 979.41 MIN: 983.94 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Darktable Test: Boat - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Boat - Acceleration: CPU-only b c d e f 0.9875 1.975 2.9625 3.95 4.9375 4.194 4.389 2.697 2.705 2.771
QuantLib Configuration: Multi-Threaded OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Multi-Threaded a b c d e f 20K 40K 60K 80K 100K 55799.7 55963.1 57317.5 88060.0 89344.0 89756.1 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
ASTC Encoder Preset: Medium OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Medium b c d e f 30 60 90 120 150 81.20 81.26 130.36 130.22 130.42 1. (CXX) g++ options: -O3 -flto -pthread
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU b c d e f 400 800 1200 1600 2000 1705.51 1705.89 1062.21 1083.23 1081.46 MIN: 1702.42 MIN: 1703.31 MIN: 982.53 MIN: 981.64 MIN: 999.13 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU b c d e f 1.2465 2.493 3.7395 4.986 6.2325 3.45 3.45 5.48 5.45 5.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar b c d e f 3 6 9 12 15 7.592 7.778 12.020 12.189 11.407
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM b c d e f 30000M 60000M 90000M 120000M 150000M 91408403390 94488932900 145829960090 144868800380 145929235660 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
ASTC Encoder Preset: Thorough OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Thorough b c d e f 4 8 12 16 20 10.69 10.69 16.96 16.95 16.98 1. (CXX) g++ options: -O3 -flto -pthread
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM b c d e f 40000M 80000M 120000M 160000M 200000M 106224257980 106192145890 168593428210 167529875870 167740437850 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 b c d e f 2000M 4000M 6000M 8000M 10000M 6922574030 7042483840 10970736110 10887827050 10972703730 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 b c d e f 5 10 15 20 25 12.91 12.91 20.41 20.34 20.39 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 b c d e f 110 220 330 440 550 322.65 322.67 510.27 508.38 509.73 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
John The Ripper Test: HMAC-SHA512 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 b c d e f 15M 30M 45M 60M 75M 45526000 46015000 71981000 65609000 71931000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 7K 14K 21K 28K 35K 32575 32538 20919 20740 21146
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU b c d e f 700 1400 2100 2800 3500 3319.64 3319.30 2113.83 2125.71 2141.47 MIN: 3316.21 MIN: 3315.49 MIN: 1927.97 MIN: 1925.61 MIN: 1947.18 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU b c d e f 0.783 1.566 2.349 3.132 3.915 2.22 2.27 3.44 3.46 3.48 MIN: 0.78 / MAX: 2.61 MIN: 0.86 / MAX: 2.63 MIN: 1.57 / MAX: 3.87 MIN: 1.38 / MAX: 3.89 MIN: 1.71 / MAX: 3.89
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 b c d e f 15000M 30000M 45000M 60000M 75000M 43683166890 43658009260 68204421390 67647607600 68291402680 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom b c d e f 0.9151 1.8302 2.7453 3.6604 4.5755 2.689 2.600 4.064 4.062 4.067
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 b c d e f 10000M 20000M 30000M 40000M 50000M 31517307700 30264414210 47030287560 47048102160 47059792960 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 7K 14K 21K 28K 35K 31957 32007 20752 20746 20598
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown b c d e f 7 14 21 28 35 19.03 19.15 29.50 29.57 29.19 MIN: 18.79 / MAX: 19.41 MIN: 18.85 / MAX: 19.5 MIN: 28.9 / MAX: 32.01 MIN: 29.06 / MAX: 32.27 MIN: 28.59 / MAX: 31.95
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU b c d e f 700 1400 2100 2800 3500 3320.34 3321.37 2150.83 2145.16 2188.89 MIN: 3316.93 MIN: 3317.95 MIN: 1926.06 MIN: 1934.78 MIN: 1932.77 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5.02 Mode: CPU b c d e f 6K 12K 18K 24K 30K 18743 18885 27940 29012 28970
SPECFEM3D Model: Layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace b c d e f 30 60 90 120 150 135.10 140.59 91.86 91.89 91.18 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
John The Ripper Test: bcrypt OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: bcrypt b c d e f 10K 20K 30K 40K 50K 31152 31110 47260 47942 47587 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container b c d e f 100 200 300 400 500 435.68 438.56 285.26 291.63 290.16
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only b c d e f 0.1485 0.297 0.4455 0.594 0.7425 0.43 0.43 0.65 0.66 0.66
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon b c d e f 8 16 24 32 40 23.10 23.20 35.45 35.33 35.27 MIN: 22.96 / MAX: 23.41 MIN: 23.04 / MAX: 23.45 MIN: 35.02 / MAX: 37.92 MIN: 34.92 / MAX: 37.85 MIN: 34.8 / MAX: 37.69
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 8K 16K 24K 32K 40K 37384 37367 24365 24526 24512
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU b c d e f 0.8753 1.7506 2.6259 3.5012 4.3765 2.54 2.55 3.82 3.89 3.85 MIN: 0.92 / MAX: 2.97 MIN: 0.93 / MAX: 2.98 MIN: 1.47 / MAX: 4.34 MIN: 1.74 / MAX: 4.37 MIN: 1.53 / MAX: 4.35
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation b c d e f 40 80 120 160 200 165.27 165.49 108.37 109.41 108.16
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C b c d e f 11K 22K 33K 44K 55K 35845.04 34481.92 52753.11 52719.71 52390.75 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 b c d e f 80K 160K 240K 320K 400K 231709.8 231842.8 353220.2 345908.2 354387.3 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
uvg266 Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium b c d e f 4 8 12 16 20 9.41 9.43 14.25 14.29 14.38
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth b c d e f 16M 32M 48M 64M 80M 47742886 47676136 71318180 71944056 72527312
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 b c d e f 40K 80K 120K 160K 200K 135692.91 132783.22 196302.45 200059.86 201855.19 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD b c d e f 30 60 90 120 150 131.53 129.98 86.77 86.95 86.56 1. (CXX) g++ options: -O2 -lOpenCL
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 b c d e f 1200 2400 3600 4800 6000 3577.3 3578.6 5418.1 5294.3 5367.5 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
SPECFEM3D Model: Water-layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace b c d e f 30 60 90 120 150 126.07 126.11 84.11 83.59 83.64 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only b c d e f 20 40 60 80 100 78.97 79.19 53.09 52.94 52.67
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU b c d e f 200 400 600 800 1000 607.54 610.86 886.66 894.24 910.84 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU b c d e f 80 160 240 320 400 255.00 255.73 369.15 371.16 382.30 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig b c d e f 200 400 600 800 1000 778.31 767.22 519.60 527.01 526.02
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU b c d e f 0.6734 1.3468 2.0202 2.6936 3.367 2.98963 2.99292 2.01150 2.01217 2.03456 MIN: 2.87 MIN: 2.87 MIN: 1.64 MIN: 1.64 MIN: 1.64 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium b c d e f 5 10 15 20 25 13.07 13.09 19.38 19.43 19.40 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
TensorFlow Device: CPU - Batch Size: 16 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 16 - Model: ResNet-50 b c d e f 8 16 24 32 40 23.91 23.93 35.36 35.10 35.42
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 a b c d e f 40 80 120 160 200 127.31 134.25 131.51 186.55 183.83 187.72 1. (CXX) g++ options: -O3
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 b c d e f 40K 80K 120K 160K 200K 127899.06 127678.15 184950.78 180058.09 188170.40 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU b c d e f 0.9495 1.899 2.8485 3.798 4.7475 2.88 2.87 4.15 4.17 4.22 MIN: 2.81 / MAX: 3 MIN: 2.8 / MAX: 2.98 MIN: 4.05 / MAX: 4.44 MIN: 4 / MAX: 4.47 MIN: 4.08 / MAX: 4.91
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test b c d e f 20 40 60 80 100 96.02 95.46 66.07 65.66 65.34
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 a b c d e f 15 30 45 60 75 44.29 44.88 44.60 64.20 64.99 63.75 1. (CXX) g++ options: -O3
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Exhaustive b c d e f 0.3566 0.7132 1.0698 1.4264 1.783 1.0807 1.0804 1.5573 1.5633 1.5848 1. (CXX) g++ options: -O3 -flto -pthread
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 a b c d e f 14 28 42 56 70 43.71 44.06 44.62 63.22 64.05 62.90 1. (CXX) g++ options: -O3
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only b c d e f 20 40 60 80 100 109.94 110.19 75.40 75.33 75.24
uvg266 Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast b c d e f 10 20 30 40 50 29.30 29.34 41.03 42.91 42.68
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C b c d e f 11K 22K 33K 44K 55K 34501.92 34573.71 50512.70 50229.93 49366.17 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating b c d e f 40K 80K 120K 160K 200K 127009 127943 183859 171918 185607 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig b c d e f 15 30 45 60 75 65.56 64.96 44.92 45.82 45.72
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU b c d e f 14 28 42 56 70 44.66 44.39 62.26 63.33 64.57 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Natron Input: Spaceship OpenBenchmarking.org FPS, More Is Better Natron 2.4.3 Input: Spaceship b c d e f 2 4 6 8 10 4.6 4.4 6.4 6.3 6.4
Darktable Test: Masskrug - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Masskrug - Acceleration: CPU-only b c d e f 0.5452 1.0904 1.6356 2.1808 2.726 2.364 2.423 1.694 1.671 1.667
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.18.1 Variant: Wownero - Hash Count: 1M b c d e f 3K 6K 9K 12K 15K 11202.1 11160.8 16204.0 16088.6 15832.3 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
uvg266 Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast b c d e f 11 22 33 44 55 35.30 35.09 50.11 50.81 50.69
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU b c d e f 5 10 15 20 25 12.75 12.75 18.22 18.46 18.26 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 512 b c d e f 70M 140M 210M 280M 350M 228820000 228970000 329800000 331120000 329870000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
TensorFlow Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 b c d e f 8 16 24 32 40 24.14 24.10 34.80 34.83 34.84
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU b c d e f 120 240 360 480 600 383.74 383.97 550.30 553.27 553.49 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU b c d e f 4 8 12 16 20 12.46 12.54 17.96 17.74 14.89 MIN: 12.32 / MAX: 12.57 MIN: 12.43 / MAX: 12.6 MIN: 17.28 / MAX: 18.69 MIN: 16.79 / MAX: 18.55 MIN: 12.73 / MAX: 16.75
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile b c d e f 50 100 150 200 250 230.16 234.32 163.49 162.57 162.80
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 1600 3200 4800 6400 8000 7338 7320 5095 5127 5175
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Garlicoin a b c d e f 600 1200 1800 2400 3000 1893.63 2040.33 2040.66 2726.85 2605.60 2559.46 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 13K 26K 39K 52K 65K 62478 62329 43450 44009 43624
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU b c d e f 13 26 39 52 65 38.73 38.98 53.71 54.88 55.69 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 2K 4K 6K 8K 10K 8523 8512 6008 5932 6037
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 16K 32K 48K 64K 80K 72426 72092 50740 50458 50705
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 13K 26K 39K 52K 65K 61145 61161 43729 42719 42916
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile b c d e f 80 160 240 320 400 357.10 355.83 251.15 249.43 249.69
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja b c d e f 100 200 300 400 500 439.46 435.62 308.21 307.20 307.93
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 500 1000 1500 2000 2500 2156 2150 1547 1524 1508
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 400 800 1200 1600 2000 1848 1840 1304 1294 1318
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 1500 3000 4500 6000 7500 7176 7195 5114 5081 5039
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time b c d e f 1.1872 2.3744 3.5616 4.7488 5.936 3.69580 3.79562 5.25193 5.27642 5.05305
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 30K 60K 90K 120K 150K 120716 120664 85258 84586 84845
LuxCoreRender Scene: Orange Juice - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU b c d e f 2 4 6 8 10 4.88 4.91 6.93 6.96 6.91 MIN: 4.71 / MAX: 4.93 MIN: 4.72 / MAX: 4.97 MIN: 6.23 / MAX: 7.51 MIN: 6.29 / MAX: 7.55 MIN: 6.28 / MAX: 7.27
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 30K 60K 90K 120K 150K 118595 118639 83823 83524 83238
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU b c d e f 150 300 450 600 750 503.59 502.05 662.91 675.24 715.44 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
uvg266 Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast b c d e f 9 18 27 36 45 27.74 27.74 38.59 38.46 39.46
OpenVKL Benchmark: vklBenchmarkCPU ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC b c d e f 120 240 360 480 600 387 386 547 545 549 MIN: 26 / MAX: 5487 MIN: 26 / MAX: 5478 MIN: 37 / MAX: 8225 MIN: 36 / MAX: 7468 MIN: 37 / MAX: 7452
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 50K 100K 150K 200K 250K 239054 238496 169828 168775 168378
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 30K 60K 90K 120K 150K 139891 139809 99301 99388 98534
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 60K 120K 180K 240K 300K 277357 277524 196772 196983 195691
QMCPACK Input: H4_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: H4_ae b c d e f 6 12 18 24 30 22.73 25.09 18.59 18.30 17.70 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU b c d e f 600 1200 1800 2400 3000 2125.52 2170.03 2989.78 3010.60 3005.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 50K 100K 150K 200K 250K 234540 234968 167519 166105 166313
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 400 800 1200 1600 2000 1830 1816 1294 1305 1296
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU b c d e f 12 24 36 48 60 38.75 39.00 54.06 54.78 53.90 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield b c d e f 50 100 150 200 250 228.35 234.62 166.94 170.46 166.08
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time b c d e f 1.1463 2.2926 3.4389 4.5852 5.7315 3.60869 3.65807 5.09456 5.03492 4.94223
QMCPACK Input: Li2_STO_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: Li2_STO_ae b c d e f 60 120 180 240 300 285.85 284.18 208.70 203.73 202.62 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms b c d e f 0.1942 0.3884 0.5826 0.7768 0.971 0.86311 0.86116 0.61454 0.61420 0.61266
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast b c d e f 20 40 60 80 100 54.14 54.29 75.87 73.99 72.63 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-HEVC Tuning: 7 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K b c d e f 20 40 60 80 100 71.18 71.18 98.86 99.72 98.68 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C b c d e f 2K 4K 6K 8K 10K 8025.90 7422.74 10390.33 8593.82 7707.48 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time b c d e f 3 6 9 12 15 7.37745 7.36690 10.16860 10.20380 10.23340
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time b c d e f 3 6 9 12 15 7.32159 7.32078 10.11880 10.12290 10.16890
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K b c d e f 30 60 90 120 150 88.07 88.36 122.15 120.84 120.89 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenFOAM Input: drivaerFastback, Small Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time b c d e f 50 100 150 200 250 224.44 223.87 162.53 162.41 161.89 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
CloverLeaf Input: clover_bm OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm a b c d e f 13 26 39 52 65 58.98 55.82 56.10 42.55 42.83 42.77 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write b c d e f 12K 24K 36K 48K 60K 41736 41656 55901 56335 57567 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency b c d e f 6 12 18 24 30 23.96 24.01 17.89 17.75 17.37 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU b c d e f 4K 8K 12K 16K 20K 12503.20 12456.32 16901.10 17036.01 17187.30 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless b c d e f 2 4 6 8 10 6.908 6.983 5.097 5.063 5.066 1. (CXX) g++ options: -O3 -fPIC -lm
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time b c d e f 2 4 6 8 10 5.03922 5.04566 6.94770 6.92496 6.94737
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile b c d e f 7 14 21 28 35 29.83 29.20 21.80 21.85 21.99
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: PartialTweets b c d e f 2 4 6 8 10 5.39 6.51 7.37 7.37 7.36 1. (CXX) g++ options: -O3
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID b c d e f 2 4 6 8 10 6.04 6.04 8.25 8.25 8.24 1. (CXX) g++ options: -O3
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C b c d e f 5K 10K 15K 20K 25K 18953.53 18737.09 24740.05 25591.23 25290.90 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU b c d e f 400 800 1200 1600 2000 1361.39 1359.67 1854.34 1855.23 1852.66 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU b c d e f 80 160 240 320 400 269.31 266.96 353.52 357.15 363.79 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome b c d e f 50 100 150 200 250 232.97 193.39 211.66 204.81 171.17 1. chrome 118.0.5993.70
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K b c d e f 30 60 90 120 150 111.84 112.34 137.97 150.27 152.13 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 b c d e f 10 20 30 40 50 44.20 44.00 33.20 32.79 33.17 1. (CXX) g++ options: -O3 -fPIC -lm
Kvazaar Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast b c d e f 12 24 36 48 60 40.71 40.69 54.02 54.71 53.99 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B b c d e f 5K 10K 15K 20K 25K 16564.09 16408.43 22004.23 21129.21 21905.42 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast b c d e f 10 20 30 40 50 32.39 32.33 42.25 42.88 43.18 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU b c d e f 70 140 210 280 350 224.92 224.74 295.35 298.80 299.80 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Memcached Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 b c d e f 1.2M 2.4M 3.6M 4.8M 6M 4464270.25 4305589.62 5693052.64 5730258.98 5727247.49 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam b c d e f 30 60 90 120 150 151.75 150.19 114.72 114.07 115.59
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K b c d e f 40 80 120 160 200 146.57 148.55 194.35 190.16 194.23 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-HEVC Tuning: 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K b c d e f 40 80 120 160 200 139.99 140.06 185.07 178.47 170.99 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency b c d e f 12K 24K 36K 48K 60K 42553 42185 54797 55746 50776 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 b c d e f 1.0307 2.0614 3.0921 4.1228 5.1535 4.515 4.581 3.852 3.472 3.854 1. (CXX) g++ options: -O3 -fPIC -lm
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU b c d e f 9K 18K 27K 36K 45K 32245.66 32241.73 41956.06 42386.78 42343.47 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K b c d e f 40 80 120 160 200 149.16 147.09 193.28 190.91 191.65 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Darktable Test: Server Room - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Room - Acceleration: CPU-only b c d e f 0.4124 0.8248 1.2372 1.6496 2.062 1.833 1.825 1.415 1.434 1.405
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K b c d e f 8 16 24 32 40 25.70 26.74 32.51 33.23 33.47 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K b c d e f 20 40 60 80 100 58.46 58.18 75.36 75.65 75.06 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 b c d e f 1.1M 2.2M 3.3M 4.4M 5.5M 4058394.64 4149899.46 5061427.55 5241319.80 5171670.55 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenFOAM Input: drivaerFastback, Small Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time b c d e f 8 16 24 32 40 33.88 32.90 26.29 27.07 27.22 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C b c d e f 3K 6K 9K 12K 15K 12080.00 12101.42 15263.55 15270.26 15177.55 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.7.10.9513 Pi Digits To Calculate: 500M b c d e f 3 6 9 12 15 13.44 13.52 10.78 10.86 11.11
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.18.1 Variant: Monero - Hash Count: 1M b c d e f 2K 4K 6K 8K 10K 7676.5 7660.4 9559.6 9512.1 9501.0 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time b c d e f 50 100 150 200 250 169.65 169.84 207.16 209.44 204.45
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU b c d e f 1.3253 2.6506 3.9759 5.3012 6.6265 5.89 5.81 5.10 5.08 4.78 MIN: 4.59 / MAX: 9.41 MIN: 2.72 / MAX: 9.48 MIN: 2.36 / MAX: 12.91 MIN: 3.18 / MAX: 17.17 MIN: 2.77 / MAX: 19.3 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 b c d e f 40M 80M 120M 160M 200M 139440000 140020000 168030000 171400000 169900000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast b c d e f 2 4 6 8 10 6.539 6.550 7.829 7.946 7.834 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Selenium Benchmark: Octane - Browser: Google Chrome OpenBenchmarking.org Geometric Mean, More Is Better Selenium Benchmark: Octane - Browser: Google Chrome b c d e f 30K 60K 90K 120K 150K 97769 105856 117684 102783 118632 1. chrome 118.0.5993.70
DuckDB Benchmark: IMDB OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: IMDB b c d 20 40 60 80 100 SE +/- 0.48, N = 3 SE +/- 0.54, N = 3 SE +/- 0.27, N = 3 86.57 86.56 71.37 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU b c d e f 400 800 1200 1600 2000 1730.89 1733.40 1448.07 1447.91 1431.86 MIN: 1541.53 / MAX: 2245.54 MIN: 1541.89 / MAX: 2247.48 MIN: 1089.83 / MAX: 1961.07 MIN: 1094.97 / MAX: 2000.94 MIN: 963.6 / MAX: 2030.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome b c d e f 4 8 12 16 20 15.95 15.95 14.10 16.93 14.04 1. chrome 118.0.5993.70
High Performance Conjugate Gradient X Y Z: 104 104 104 - RT: 60 OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 b c d e f 3 6 9 12 15 8.28855 8.26023 9.91323 9.88187 9.84149 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time b c d e f 50 100 150 200 250 230.30 234.40 203.50 204.96 195.55 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D b c d e f 13 26 39 52 65 56.00 58.02 48.85 48.47 49.01 1. (CXX) g++ options: -O2 -lOpenCL
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time b c d e f 500 1000 1500 2000 2500 2166.82 2166.83 1816.89 1813.29 1815.05 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU b c d e f 5 10 15 20 25 19.33 19.09 22.70 22.63 22.45 MIN: 8.18 / MAX: 34.75 MIN: 15 / MAX: 32.89 MIN: 11.49 / MAX: 48.33 MIN: 11.81 / MAX: 48.47 MIN: 11.6 / MAX: 50.7 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 a b c d e f 4 8 12 16 20 14.28 14.48 14.38 16.94 16.84 16.86 1. (CXX) g++ options: -O3
PyBench Total For Average Test Times OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times b c d e f 100 200 300 400 500 427 446 377 378 376
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 a b c d e f 5 10 15 20 25 18.20 18.32 18.28 15.81 15.85 15.53 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 a b c d e f 4 8 12 16 20 14.66 14.82 14.44 16.75 16.67 17.03 1. (CXX) g++ options: -O3
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU b c d e f 15 30 45 60 75 56.41 56.71 66.49 66.06 66.42 MIN: 41.43 / MAX: 98.24 MIN: 40.93 / MAX: 99.23 MIN: 40.75 / MAX: 125.11 MIN: 32.08 / MAX: 125.5 MIN: 39.7 / MAX: 126.38 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
QuantLib Configuration: Single-Threaded OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Single-Threaded a b c d e f 1200 2400 3600 4800 6000 4818.8 5016.1 5014.7 5665.0 5672.2 5666.3 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 a b c d e f 4 8 12 16 20 16.15 16.47 16.36 14.17 14.04 14.04 1. (CXX) g++ options: -O3
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 b c d e f 50M 100M 150M 200M 250M 204060000 204100000 223360000 239170000 239250000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 a b c d e f 4 8 12 16 20 16.27 16.48 16.43 14.13 14.08 14.14 1. (CXX) g++ options: -O3
RawTherapee Total Benchmark Time OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time b c d e f 9 18 27 36 45 40.09 40.30 34.45 35.19 35.08 1. RawTherapee, version 5.9, command line.
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 b c d e f 15 30 45 60 75 66.39 66.08 57.22 57.46 56.87 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K b c d e f 20 40 60 80 100 93.49 96.00 107.40 107.34 108.94 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster b c d e f 4 8 12 16 20 13.27 13.28 15.44 15.30 15.44 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 b c d e f 15 30 45 60 75 65.90 63.91 56.74 56.81 56.77 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte b c d e f 12 24 36 48 60 53.87 54.93 48.08 47.48 48.28 1. (CXX) g++ options: -O2 -lOpenCL
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 a b c d e f 7 14 21 28 35 31.30 31.82 31.55 27.78 27.57 27.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 a b c d e f 7 14 21 28 35 31.27 31.83 31.64 27.85 27.58 27.65 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 a b c d e f 5 10 15 20 25 18.24 18.29 18.25 15.98 15.95 15.88 1. (CXX) g++ options: -O3
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark b c d e f 16 32 48 64 80 61.85 63.14 71.16 69.03 70.56 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU b c d e f 80 160 240 320 400 309.56 313.32 351.84 352.96 355.77 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 a b c d e f 8 16 24 32 40 35.22 35.29 35.43 30.84 30.99 31.05 1. (CXX) g++ options: -O3
Selenium Benchmark: WASM imageConvolute - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Firefox b c d e f 4 8 12 16 20 16.3 16.1 14.2 14.3 14.3 1. firefox 118.0.2
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU b c d e f 30 60 90 120 150 106.23 105.65 120.21 120.99 120.32 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 a b c d e f 4 8 12 16 20 15.83 16.05 15.94 14.09 14.05 14.02 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 a b c d e f 8 16 24 32 40 35.28 35.38 35.18 31.06 30.96 30.93 1. (CXX) g++ options: -O3
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite b c d e f 400K 800K 1200K 1600K 2000K 1491682 1479761 1692102 1685554 1686956
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU b c d e f 0.126 0.252 0.378 0.504 0.63 0.49 0.49 0.56 0.55 0.55 MIN: 0.29 / MAX: 3.73 MIN: 0.29 / MAX: 3.11 MIN: 0.28 / MAX: 9.03 MIN: 0.28 / MAX: 13.45 MIN: 0.27 / MAX: 5.91 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU b c d e f 15 30 45 60 75 59.37 59.91 67.85 67.16 65.93 MIN: 52.65 / MAX: 80.39 MIN: 30.66 / MAX: 74.01 MIN: 31.98 / MAX: 153.29 MIN: 42.11 / MAX: 149.84 MIN: 31.99 / MAX: 150.16 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU b c d e f 20 40 60 80 100 71.08 71.15 81.20 80.25 79.99 MIN: 55.16 / MAX: 87.27 MIN: 53.9 / MAX: 90.11 MIN: 43.5 / MAX: 176.88 MIN: 63.46 / MAX: 183.13 MIN: 47.77 / MAX: 173.9 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 a b c d e f 7 14 21 28 35 28.59 29.01 28.81 25.82 25.57 25.45 1. (CXX) g++ options: -O3
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: TopTweet b c d e f 2 4 6 8 10 7.29 7.31 8.29 8.29 8.31 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 a b c d e f 7 14 21 28 35 28.62 28.99 28.78 25.80 25.59 25.47 1. (CXX) g++ options: -O3
PyPerformance Benchmark: crypto_pyaes OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: crypto_pyaes b c d e f 9 18 27 36 45 37.5 37.4 33.0 33.6 33.0
PyPerformance Benchmark: json_loads OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: json_loads b c d e f 3 6 9 12 15 10.20 10.10 9.18 9.23 8.98
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 a b c d e f 4 8 12 16 20 15.76 15.90 15.75 14.12 14.04 14.02 1. (CXX) g++ options: -O3
Selenium Benchmark: Kraken - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Firefox b c d e f 120 240 360 480 600 520.2 520.4 487.8 552.8 490.7 1. firefox 118.0.2
Selenium Benchmark: WASM collisionDetection - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Firefox b c d e f 50 100 150 200 250 242.3 242.5 216.0 214.3 214.5 1. firefox 118.0.2
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression b c d e f 1.206 2.412 3.618 4.824 6.03 4.74 4.74 5.36 5.28 5.36 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom b c d e f 0.45 0.9 1.35 1.8 2.25 1.77 1.77 1.99 1.99 2.00 1. (CXX) g++ options: -O3
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100 b c d e f 4 8 12 16 20 15.57 15.53 17.54 17.54 17.53 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless, Highest Compression b c d e f 0.2205 0.441 0.6615 0.882 1.1025 0.87 0.88 0.97 0.97 0.98 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
PyPerformance Benchmark: regex_compile OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: regex_compile b c d e f 16 32 48 64 80 71.6 71.4 63.6 67.0 64.8
Selenium Benchmark: Jetstream 2 - Browser: Firefox OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Firefox b c d e f 60 120 180 240 300 229.49 227.94 251.97 253.62 256.14 1. firefox 118.0.2
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU b c d e f 6 12 18 24 30 23.47 23.41 21.63 21.51 20.89 MIN: 11.19 / MAX: 40.86 MIN: 18.22 / MAX: 41.11 MIN: 9.02 / MAX: 46.9 MIN: 13.48 / MAX: 43.52 MIN: 12.72 / MAX: 44.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU b c d e f 3 6 9 12 15 9.84 9.79 9.01 8.93 8.77 MIN: 7.45 / MAX: 19.24 MIN: 7.41 / MAX: 19.03 MIN: 5.26 / MAX: 19.9 MIN: 5.27 / MAX: 19.37 MIN: 5 / MAX: 22.42 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark b c d e f 6 12 18 24 30 24.04 23.75 26.59 26.44 26.63
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless b c d e f 0.5513 1.1026 1.6539 2.2052 2.7565 2.20 2.19 2.45 2.44 2.45 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 a b c d e f 7 14 21 28 35 31.73 32.01 31.90 28.66 28.77 28.76 1. (CXX) g++ options: -O3
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU b c d e f 0.3173 0.6346 0.9519 1.2692 1.5865 1.27 1.28 1.41 1.40 1.39 MIN: 0.78 / MAX: 10.23 MIN: 0.75 / MAX: 10.14 MIN: 0.83 / MAX: 9.26 MIN: 0.75 / MAX: 8.02 MIN: 0.84 / MAX: 11.1 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 a b c d e f 7 14 21 28 35 31.61 31.91 31.91 28.81 28.77 28.76 1. (CXX) g++ options: -O3
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya b c d e f 1.1948 2.3896 3.5844 4.7792 5.974 4.80 4.81 5.30 5.30 5.31 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 a b c d e f 12 24 36 48 60 49.76 50.81 50.91 54.95 54.06 54.66 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 a b c d e f 7 14 21 28 35 29.65 30.34 29.81 27.83 27.55 27.49 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 a b c d e f 12 24 36 48 60 49.45 50.63 50.85 54.58 53.88 54.26 1. (CXX) g++ options: -O3
Selenium Benchmark: Octane - Browser: Firefox OpenBenchmarking.org Geometric Mean, More Is Better Selenium Benchmark: Octane - Browser: Firefox b c d e f 13K 26K 39K 52K 65K 55056 55067 60754 60711 60209 1. firefox 118.0.2
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU b c d e f 3 6 9 12 15 11.74 11.76 12.93 12.93 12.95 MIN: 9.52 / MAX: 22.89 MIN: 8.2 / MAX: 22.88 MIN: 8.18 / MAX: 24.53 MIN: 8.29 / MAX: 22.42 MIN: 8.15 / MAX: 35.46 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C b c d e f 6K 12K 18K 24K 30K 24092.42 26540.76 24498.20 25162.58 24381.01 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Selenium Benchmark: Jetstream 2 - Browser: Google Chrome OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Google Chrome b c d e f 90 180 270 360 450 382.67 388.31 419.01 421.02 419.30 1. chrome 118.0.5993.70
Selenium Benchmark: Kraken - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Google Chrome b c d e f 80 160 240 320 400 349.0 343.2 319.4 318.5 317.3 1. chrome 118.0.5993.70
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 a b c d e f 7 14 21 28 35 29.68 30.04 29.88 27.64 27.53 27.34 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 a b c d e f 7 14 21 28 35 29.39 29.52 29.87 27.63 27.40 27.34 1. (CXX) g++ options: -O3
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU b c d e f 30 60 90 120 150 134.23 135.00 128.32 126.22 123.75 MIN: 63.18 / MAX: 200.65 MIN: 61.37 / MAX: 200.78 MIN: 86.52 / MAX: 370.89 MIN: 85.87 / MAX: 245.66 MIN: 72.91 / MAX: 247.36 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU b c d e f 2 4 6 8 10 7.52 7.37 8.02 7.96 7.98 MIN: 5.11 / MAX: 16.43 MIN: 5.99 / MAX: 11.83 MIN: 5.89 / MAX: 15.12 MIN: 4.21 / MAX: 12.87 MIN: 5.16 / MAX: 18.32 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 a b c d e f 7 14 21 28 35 29.78 29.91 29.86 28.01 27.54 27.59 1. (CXX) g++ options: -O3
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU b c d e f 100 200 300 400 500 469.85 470.00 436.86 432.96 435.36 MIN: 366.95 / MAX: 965.67 MIN: 367.47 / MAX: 966.24 MIN: 257.97 / MAX: 949.62 MIN: 258.52 / MAX: 962.37 MIN: 259 / MAX: 1007.23 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
DuckDB Benchmark: TPC-H Parquet OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: TPC-H Parquet b c d 20 40 60 80 100 SE +/- 0.28, N = 3 SE +/- 0.50, N = 3 SE +/- 0.21, N = 3 89.75 90.45 83.67 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU b c d e f 3 6 9 12 15 11.88 11.91 12.05 11.83 11.17 MIN: 9.25 / MAX: 18.16 MIN: 5.89 / MAX: 17.91 MIN: 7.14 / MAX: 26.84 MIN: 6.65 / MAX: 27.36 MIN: 6.05 / MAX: 28.92 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU b c d e f 30 60 90 120 150 154.78 153.73 148.70 145.64 143.52 MIN: 79.79 / MAX: 219.52 MIN: 83.63 / MAX: 218.01 MIN: 107.41 / MAX: 259.59 MIN: 62.64 / MAX: 256.75 MIN: 72.01 / MAX: 257.8 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
GIMP Test: rotate OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: rotate b c d e f 3 6 9 12 15 9.199 9.447 8.806 8.854 8.828
Selenium Benchmark: Speedometer - Browser: Google Chrome OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Google Chrome b c d e f 110 220 330 440 550 486 471 498 502 504 1. chrome 118.0.5993.70
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 a b c d e f 13 26 39 52 65 58.42 59.20 59.17 55.88 55.58 55.37 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 a b c d e f 13 26 39 52 65 58.65 59.36 59.22 55.67 55.62 55.78 1. (CXX) g++ options: -O3
Selenium Benchmark: Speedometer - Browser: Firefox OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Firefox b c d e f 90 180 270 360 450 406 407 433 433 430 1. firefox 118.0.2
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 a b c d e f 5 10 15 20 25 18.11 18.53 18.32 19.24 18.92 18.88 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 a b c d e f 5 10 15 20 25 18.11 18.43 18.55 19.23 18.79 19.19 1. (CXX) g++ options: -O3
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU b c d e f 30 60 90 120 150 154.50 153.66 147.80 145.89 148.29 MIN: 131.36 / MAX: 217.7 MIN: 79.35 / MAX: 217.79 MIN: 108.01 / MAX: 317.25 MIN: 66.07 / MAX: 257.13 MIN: 66.12 / MAX: 269.72 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 a b c d e f 7 14 21 28 35 27.22 27.73 27.55 28.44 28.66 27.78 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 a b c d e f 7 14 21 28 35 27.16 27.53 27.62 28.48 28.55 28.25 1. (CXX) g++ options: -O3
GIMP Test: resize OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: resize b c d e f 3 6 9 12 15 12.63 12.39 12.52 12.11 12.70
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 a b c d e f 13 26 39 52 65 55.50 55.94 55.41 53.75 53.46 53.41 1. (CXX) g++ options: -O3
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU b c d e f 10 20 30 40 50 41.65 41.63 43.58 43.34 43.32 MIN: 23.28 / MAX: 44.79 MIN: 23.08 / MAX: 44.79 MIN: 31.3 / MAX: 72.83 MIN: 30.19 / MAX: 72.32 MIN: 31.46 / MAX: 73.03 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
PyPerformance Benchmark: python_startup OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: python_startup b c d e f 1.0913 2.1826 3.2739 4.3652 5.4565 4.85 4.84 4.65 4.68 4.66
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 a b c d e f 7 14 21 28 35 27.99 28.42 28.21 28.91 29.11 28.45 1. (CXX) g++ options: -O3
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 b c d e f 30 60 90 120 150 134.19 134.63 129.70 129.45 131.70 1. (CXX) g++ options: -O3 -fopenmp
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 a b c d e f 12 24 36 48 60 55.10 55.53 55.60 53.64 53.71 53.54 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 a b c d e f 7 14 21 28 35 27.99 28.34 28.40 28.94 28.91 28.53 1. (CXX) g++ options: -O3
CloverLeaf Input: clover_bm16 OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm16 a b c d e f 200 400 600 800 1000 1094.13 1093.01 1091.47 1058.26 1063.27 1062.13 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
GIMP Test: auto-levels OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: auto-levels b c d e f 3 6 9 12 15 9.615 9.748 9.459 9.526 9.526
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 a b c d e f 4 8 12 16 20 15.08 15.22 15.22 15.00 14.90 14.77 1. (CXX) g++ options: -O3
Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome b c d e f 500 1000 1500 2000 2500 2202 2170 2139 2140 2139 1. chrome 118.0.5993.70
CloverLeaf Input: clover_bm64_short OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm64_short a b c d e f 30 60 90 120 150 127.74 127.94 128.11 124.58 124.71 124.75 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
GIMP Test: unsharp-mask OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: unsharp-mask b c d e f 3 6 9 12 15 11.12 11.35 11.04 11.12 11.17
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 a b c d e f 4 8 12 16 20 15.11 15.20 15.18 14.79 14.79 15.03 1. (CXX) g++ options: -O3
Selenium Benchmark: PSPDFKit WASM - Browser: Firefox OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Firefox b c d e f 500 1000 1500 2000 2500 2108 2096 2059 2051 2055 1. firefox 118.0.2
QMCPACK Input: O_ae_pyscf_UHF OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: O_ae_pyscf_UHF b c d e f 60 120 180 240 300 262.57 264.99 258.40 258.48 259.37 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 a b c d e f 13 26 39 52 65 57.60 58.43 57.99 58.99 58.42 58.25 1. (CXX) g++ options: -O3
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: simple-H2O b c d e f 8 16 24 32 40 32.16 32.72 31.97 32.19 32.23 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU b c d e f 0.9892 1.9784 2.9676 3.9568 4.946 4.36881 4.39625 4.30701 4.30610 4.31754 MIN: 4.33 MIN: 4.36 MIN: 4.27 MIN: 4.28 MIN: 4.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 a b c d e f 13 26 39 52 65 57.98 58.15 58.47 58.98 58.05 58.59 1. (CXX) g++ options: -O3
QMCPACK Input: FeCO6_b3lyp_gms OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: FeCO6_b3lyp_gms b c d e f 60 120 180 240 300 266.24 266.65 265.33 264.55 263.94 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: LiH_ae_MSD OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: LiH_ae_MSD b c d e f 40 80 120 160 200 169.05 168.70 168.69 168.37 168.31 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare d e f 0.2583 0.5166 0.7749 1.0332 1.2915 1.143 1.148 1.148 1. (CXX) g++ options: -O3
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 23.0.1 Time To Compile d e f 50 100 150 200 250 SE +/- 0.37, N = 3 221.24 221.12 220.53
Phoronix Test Suite v10.8.5