new tests
Tests for a future article. Intel Core i9-14900K testing with an ASUS PRIME Z790-P WIFI (1402 BIOS) and AMD Radeon RX 7900 XTX 24GB on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310314-PTS-NEWTESTS84&sor&grs
new tests - configurations: a, b, c, d, e, f
Processor: Intel Core i5-13600K @ 5.10GHz (14 Cores / 20 Threads); Intel Core i5-14600K @ 5.30GHz (14 Cores / 20 Threads); Intel Core i9-14900K @ 5.70GHz (24 Cores / 32 Threads)
Motherboard: ASUS PRIME Z790-P WIFI (1402 BIOS)
Chipset: Intel Device 7a27
Memory: 32GB
Disk: 1000GB Western Digital WDS100T1X0E-00AFY0
Graphics: AMD Radeon RX 7900 XTX 24GB (2304/1249MHz)
Audio: Realtek ALC897
Monitor: ASUS VP28U
OS: Ubuntu 23.10
Kernel: 6.5.0-9-generic (x86_64)
Desktop: GNOME Shell 45.0
Display Server: X Server + Wayland
OpenGL: 4.6 Mesa 23.2.1-1ubuntu3 (LLVM 15.0.7 DRM 3.54)
Compiler: GCC 13.2.0
File-System: ext4
Screen Resolution: 3840x2160

Kernel Details - Transparent Huge Pages: madvise
Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
Processor Details - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x11d - Thermald 2.5.4
Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
Python Details - b, c, d, e, f: Python 3.11.6
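The Kernel, Processor and Security Details reported above correspond to values that a Linux system normally exposes under sysfs. The following is a minimal sketch for reading the same kind of information on another machine when comparing setups. It is not how the Phoronix Test Suite gathers its data; the helper names (`read_text_or_none`, `active_choice`, `collect_details`) are hypothetical, and the sysfs files are assumed to exist only on Linux with the relevant drivers loaded (missing files are reported as `None`).

```python
from pathlib import Path


def read_text_or_none(path):
    """Return the stripped contents of a file, or None if it cannot be read."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return None


def active_choice(raw):
    """Extract the bracketed option from text such as 'always [madvise] never'.

    Text without a bracketed token is returned unchanged.
    """
    if raw is None:
        return None
    for token in raw.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return raw


def collect_details(root="/"):
    """Collect THP, CPU frequency scaling and vulnerability status below `root`."""
    root = Path(root)
    cpu = root / "sys/devices/system/cpu"
    details = {
        "transparent_hugepage": active_choice(
            read_text_or_none(root / "sys/kernel/mm/transparent_hugepage/enabled")
        ),
        "scaling_governor": read_text_or_none(cpu / "cpu0/cpufreq/scaling_governor"),
        "energy_performance_preference": read_text_or_none(
            cpu / "cpu0/cpufreq/energy_performance_preference"
        ),
    }
    # One file per known CPU vulnerability, e.g. meltdown, spectre_v2.
    vuln_dir = cpu / "vulnerabilities"
    if vuln_dir.is_dir():
        for entry in sorted(vuln_dir.iterdir()):
            details[entry.name] = read_text_or_none(entry)
    return details


if __name__ == "__main__":
    for key, value in collect_details().items():
        print(f"{key}: {value}")
```

The `root` parameter exists only so the function can be pointed at a copied or mocked directory tree; on a live system the default `/` is used.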
new tests heffte: r2c - Stock - double-long - 128 rodinia: OpenMP Streamcluster heffte: c2c - FFTW - float-long - 128 pgbench: 100 - 1000 - Read Only - Average Latency heffte: c2c - Stock - float - 128 heffte: c2c - Stock - float-long - 128 pgbench: 100 - 1000 - Read Only npb: EP.C heffte: c2c - FFTW - float - 128 cpuminer-opt: LBC, LBRY Credits john-the-ripper: Blowfish easywave: e2Asean Grid + BengkuluSept2007 Source - 240 cpuminer-opt: Skeincoin cpuminer-opt: Blake-2 S heffte: r2c - FFTW - float - 128 heffte: r2c - Stock - float-long - 128 heffte: r2c - Stock - double - 128 minibude: OpenMP - BM1 minibude: OpenMP - BM1 cpuminer-opt: Magi cpuminer-opt: Ringcoin cpuminer-opt: Quad SHA-256, Pyrite pgbench: 100 - 1000 - Read Only - Average Latency cpuminer-opt: Deepcoin stockfish: Total Time rodinia: OpenMP CFD Solver specfem3d: Tomographic Model cpuminer-opt: Triple SHA-256, Onecoin specfem3d: Homogeneous Halfspace darktable: Server Rack - CPU-only compress-7zip: Decompression Rating john-the-ripper: WPA PSK cpuminer-opt: scrypt specfem3d: Mount St. 
Helens cpuminer-opt: Myriad-Groestl heffte: r2c - Stock - float - 128 openssl: SHA256 john-the-ripper: MD5 openvino: Face Detection Retail FP16 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU darktable: Boat - CPU-only quantlib: Multi-Threaded astcenc: Medium onednn: Recurrent Neural Network Inference - f32 - CPU openvino: Face Detection FP16 - CPU indigobench: CPU - Supercar openssl: AES-256-GCM astcenc: Thorough openssl: AES-128-GCM openssl: SHA512 minibude: OpenMP - BM2 minibude: OpenMP - BM2 john-the-ripper: HMAC-SHA512 ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU onednn: Recurrent Neural Network Training - f32 - CPU luxcorerender: Danish Mood - CPU openssl: ChaCha20 indigobench: CPU - Bedroom openssl: ChaCha20-Poly1305 ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU embree: Pathtracer ISPC - Crown onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU v-ray: CPU specfem3d: Layered Halfspace john-the-ripper: bcrypt openradioss: INIVOL and Fluid Structure Interaction Drop Container oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only embree: Pathtracer ISPC - Asian Dragon ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU luxcorerender: LuxCore Benchmark - CPU openradioss: Rubber O-Ring Seal Installation npb: LU.C openssl: RSA4096 uvg266: Bosphorus 4K - Medium asmfish: 1024 Hash Memory, 26 Depth nginx: 500 rodinia: OpenMP LavaMD openssl: RSA4096 specfem3d: Water-layered Halfspace blender: BMW27 - CPU-Only openvino: Vehicle Detection FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU build-linux-kernel: allmodconfig onednn: IP Shapes 1D - f32 - CPU kvazaar: Bosphorus 4K - Medium tensorflow: CPU - 16 - ResNet-50 heffte: r2c - FFTW - float-long - 128 nginx: 1000 luxcorerender: DLSC - CPU openradioss: Cell Phone Drop Test heffte: r2c - FFTW - double - 128 astcenc: Exhaustive heffte: r2c - FFTW - double-long - 128 blender: Fishy Cat - CPU-Only uvg266: Bosphorus 4K - Super Fast npb: BT.C compress-7zip: Compression Rating 
build-linux-kernel: defconfig openvino: Machine Translation EN To DE FP16 - CPU natron: Spaceship darktable: Masskrug - CPU-only xmrig: Wownero - 1M uvg266: Bosphorus 4K - Ultra Fast openvino: Face Detection FP16-INT8 - CPU liquid-dsp: 32 - 256 - 512 tensorflow: CPU - 64 - ResNet-50 openvino: Weld Porosity Detection FP16 - CPU luxcorerender: Rainbow Colors and Prism - CPU build-godot: Time To Compile ospray-studio: 2 - 4K - 1 - Path Tracer - CPU cpuminer-opt: Garlicoin ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU openvino: Person Detection FP16 - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU build-nodejs: Time To Compile build-llvm: Ninja ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/ao/real_time ospray-studio: 2 - 4K - 16 - Path Tracer - CPU luxcorerender: Orange Juice - CPU ospray-studio: 1 - 4K - 16 - Path Tracer - CPU openvino: Person Vehicle Bike Detection FP16 - CPU uvg266: Bosphorus 4K - Very Fast openvkl: vklBenchmarkCPU ISPC ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 3 - 4K - 16 - Path Tracer - CPU ospray-studio: 3 - 4K - 32 - Path Tracer - CPU qmcpack: H4_ae openvino: Face Detection Retail FP16-INT8 - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU openvino: Person Detection FP32 - CPU openradioss: Bird Strike on Windshield ospray: gravity_spheres_volume/dim_512/scivis/real_time qmcpack: Li2_STO_ae namd: ATPase Simulation - 327,506 Atoms kvazaar: Bosphorus 4K - Ultra Fast svt-hevc: 7 - Bosphorus 4K npb: CG.C ospray: particle_volume/ao/real_time ospray: particle_volume/scivis/real_time svt-vp9: Visual Quality Optimized - Bosphorus 4K openfoam: drivaerFastback, Small Mesh Size - Execution Time cloverleaf: clover_bm pgbench: 100 - 1000 - Read 
Write pgbench: 100 - 1000 - Read Write - Average Latency openvino: Age Gender Recognition Retail 0013 FP16 - CPU avifenc: 6, Lossless ospray: gravity_spheres_volume/dim_512/pathtracer/real_time build-mesa: Time To Compile simdjson: PartialTweets simdjson: DistinctUserID npb: FT.C openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU selenium: WASM collisionDetection - Google Chrome svt-vp9: PSNR/SSIM Optimized - Bosphorus 4K avifenc: 2 kvazaar: Bosphorus 4K - Super Fast npb: SP.B kvazaar: Bosphorus 4K - Very Fast openvino: Handwritten English Recognition FP16 - CPU memcached: 1:10 openradioss: Bumper Beam svt-av1: Preset 12 - Bosphorus 4K svt-hevc: 10 - Bosphorus 4K pgbench: 100 - 1000 - Read Write - Average Latency avifenc: 6 openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU svt-av1: Preset 13 - Bosphorus 4K darktable: Server Room - CPU-only x265: Bosphorus 4K svt-av1: Preset 8 - Bosphorus 4K memcached: 1:100 openfoam: drivaerFastback, Small Mesh Size - Mesh Time npb: SP.C y-cruncher: 500M xmrig: Monero - 1M ospray: particle_volume/pathtracer/real_time openvino: Face Detection Retail FP16 - CPU liquid-dsp: 8 - 256 - 512 vvenc: Bosphorus 4K - Fast selenium: Octane - Google Chrome duckdb: IMDB openvino: Face Detection FP16 - CPU selenium: WASM imageConvolute - Google Chrome hpcg: 104 104 104 - 60 openfoam: drivaerFastback, Medium Mesh Size - Mesh Time rodinia: OpenMP HotSpot3D openfoam: drivaerFastback, Medium Mesh Size - Execution Time openvino: Vehicle Detection FP16 - CPU heffte: c2c - Stock - double-long - 128 pybench: Total For Average Test Times heffte: c2c - FFTW - double - 512 heffte: c2c - Stock - double - 128 openvino: Road Segmentation ADAS FP16 - CPU quantlib: Single-Threaded heffte: c2c - FFTW - double-long - 256 liquid-dsp: 16 - 256 - 512 heffte: c2c - FFTW - double - 256 rawtherapee: Total Benchmark Time ngspice: C2670 svt-vp9: VMAF Optimized - Bosphorus 4K vvenc: Bosphorus 4K - Faster 
ngspice: C7552 rodinia: OpenMP Leukocyte heffte: c2c - FFTW - float-long - 256 heffte: c2c - FFTW - float - 256 heffte: c2c - FFTW - double-long - 512 libraw: Post-Processing Benchmark openvino: Vehicle Detection FP16 - CPU heffte: c2c - FFTW - float - 512 selenium: WASM imageConvolute - Firefox openvino: Road Segmentation ADAS FP16 - CPU heffte: c2c - Stock - double-long - 256 heffte: c2c - FFTW - float-long - 512 phpbench: PHP Benchmark Suite openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16 - CPU heffte: r2c - FFTW - double - 256 simdjson: TopTweet heffte: r2c - FFTW - double-long - 256 pyperformance: crypto_pyaes pyperformance: json_loads heffte: c2c - Stock - double - 256 selenium: Kraken - Firefox selenium: WASM collisionDetection - Firefox webp: Quality 100, Highest Compression simdjson: LargeRand webp: Quality 100 webp: Quality 100, Lossless, Highest Compression pyperformance: regex_compile selenium: Jetstream 2 - Firefox openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU node-web-tooling: webp: Quality 100, Lossless heffte: r2c - FFTW - double-long - 512 openvino: Age Gender Recognition Retail 0013 FP16 - CPU heffte: r2c - FFTW - double - 512 simdjson: Kostya heffte: r2c - Stock - float-long - 512 heffte: c2c - Stock - float - 256 heffte: r2c - Stock - float - 512 selenium: Octane - Firefox openvino: Weld Porosity Detection FP16-INT8 - CPU npb: MG.C selenium: Jetstream 2 - Google Chrome selenium: Kraken - Google Chrome heffte: r2c - Stock - double-long - 256 heffte: r2c - Stock - double - 256 openvino: Machine Translation EN To DE FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU heffte: c2c - Stock - float-long - 256 openvino: Face Detection FP16-INT8 - CPU duckdb: TPC-H Parquet openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Detection FP16 - CPU gimp: rotate selenium: 
Speedometer - Google Chrome heffte: r2c - FFTW - float-long - 512 heffte: r2c - FFTW - float - 512 selenium: Speedometer - Firefox heffte: c2c - FFTW - double-long - 128 heffte: c2c - FFTW - double - 128 openvino: Person Detection FP32 - CPU heffte: r2c - Stock - double-long - 512 heffte: r2c - Stock - double - 512 gimp: resize heffte: r2c - FFTW - float-long - 256 openvino: Weld Porosity Detection FP16 - CPU pyperformance: python_startup heffte: c2c - Stock - float-long - 512 easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 heffte: r2c - FFTW - float - 256 heffte: c2c - Stock - float - 512 cloverleaf: clover_bm16 gimp: auto-levels heffte: c2c - Stock - double - 512 selenium: PSPDFKit WASM - Google Chrome cloverleaf: clover_bm64_short gimp: unsharp-mask heffte: c2c - Stock - double-long - 512 selenium: PSPDFKit WASM - Firefox qmcpack: O_ae_pyscf_UHF heffte: r2c - Stock - float - 256 qmcpack: simple-H2O onednn: IP Shapes 3D - f32 - CPU heffte: r2c - Stock - float-long - 256 qmcpack: FeCO6_b3lyp_gms qmcpack: LiH_ae_MSD gromacs: MPI CPU - water_GMX50_bare build-gem5: Time To Compile onednn: IP Shapes 1D - bf16bf16bf16 - CPU a b c d e f 31.8111 50.9336 31.2635 31.2292 51.881 9035.2 21370 79320 131.935 66.6684 31.5989 417.24 2345.04 41360 5436.88 61070 185.62 8564.28 68.8833 55799.7 127.305 44.2936 43.71 1893.63 58.98 14.2837 18.1985 14.658 4818.8 16.1534 16.2749 31.3044 31.2736 18.239 35.2152 15.8331 35.2816 28.5862 28.6186 15.7616 31.7325 31.6129 49.7591 29.6502 49.4547 29.6848 29.3894 29.778 58.4188 58.6489 18.1147 18.1124 27.2229 27.1619 55.5018 27.985 55.1029 27.9897 1094.128868103 15.0777 127.74 15.1102 57.5957 57.9832 32.6945 15.894 52.499 1.054 31.9321 31.6351 947197 1773.71 53.1674 9845.56 26232 8.637 22770 85030 104.025 71.0106 32.6174 216.969 8.679 442.28 2508.93 43650 949091 5549.86 46079885 10.118 49.142967323 62870 62.094293176 0.266 92168 36403 195.41 51.618068965 8774.33 70.4153 21234053100 1134000 1011.66 1705.1 4.194 55963.1 81.1968 1705.51 3.45 
7.592 91408403390 10.6857 106224257980 6922574030 12.906 322.654 45526000 32575 3319.64 2.22 43683166890 2.689 31517307700 31957 19.033 3320.34 18743 135.10279615 31152 435.68 0.43 23.1049 37384 2.54 165.27 35845.04 231709.8 9.41 47742886 135692.91 131.529 3577.3 126.074892513 78.97 607.54 255 778.31 2.98963 13.07 23.91 134.246 127899.06 2.88 96.02 44.8791 1.0807 44.0648 109.94 29.3 34501.92 127009 65.562 44.66 4.6 2.364 11202.1 35.3 12.75 228820000 24.14 383.74 12.46 230.157 7338 2040.33 62478 38.73 8523 72426 61145 357.104 439.462 2156 1848 7176 3.6958 120716 4.88 118595 503.59 27.74 387 239054 139891 277357 22.73 2125.52 234540 1830 38.75 228.35 3.60869 285.85 0.86311 54.14 71.18 8025.9 7.37745 7.32159 88.07 224.43546 55.82 41736 23.5 12503.2 6.908 5.03922 29.825 5.39 6.04 18953.53 1361.39 269.31 232.97 111.84 44.202 40.71 16564.09 32.39 224.92 4464270.25 151.75 146.571 139.99 42553 4.515 32245.66 149.161 1.833 25.7 58.458 4058394.64 33.876155 12080 13.443 7676.5 169.646 5.89 139440000 6.539 97769 86.571 1730.89 15.95 8.28855 230.3022 55.996 2166.8213 19.33 14.4836 427 18.3225 14.8193 56.41 5016.1 16.4718 204060000 16.4798 40.086 66.392 93.49 13.272 65.896 53.871 31.8188 31.8292 18.2924 61.85 309.56 35.2893 16.3 106.23 16.0466 35.3807 1491682 0.49 59.37 71.08 29.014 7.29 28.9898 37.5 10.2 15.9022 520.2 242.3 4.74 1.77 15.57 0.87 71.6 229.492 23.47 9.84 24.04 2.20 32.0107 1.27 31.9085 4.8 50.8129 30.3369 50.6268 55056 11.74 24092.42 382.668 349 30.042 29.522 134.23 7.52 29.9091 469.85 89.754 11.88 154.78 9.199 486 59.1962 59.3627 406 18.5277 18.4321 154.5 27.7294 27.527 12.625 55.9418 41.65 4.85 28.418 134.185 55.526 28.337 1093.01 9.615 15.2166 2202 127.94 11.12 15.2009 2108 262.57 58.425 32.159 4.36881 58.1474 266.24 169.05 22.1694 15.834 51.9406 0.91 31.74 32.3869 833675 1775.67 52.2532 9996.74 31050 7.127 22780 86620 131.251 69.3558 32.4099 217.632 8.705 450.42 2494.87 43630 1098384 5604.27 41998826 10.121 51.562965731 62850 63.899206362 0.317 92137 36403 
194.68 51.749754354 8765.28 71.5008 21897801590 1131000 1028.14 1707.37 4.389 57317.5 81.2623 1705.89 3.45 7.778 94488932900 10.6929 106192145890 7042483840 12.907 322.668 46015000 32538 3319.3 2.27 43658009260 2.6 30264414210 32007 19.1475 3321.37 18885 140.585720491 31110 438.56 0.43 23.1978 37367 2.55 165.49 34481.92 231842.8 9.43 47676136 132783.22 129.977 3578.6 126.108597881 79.19 610.86 255.73 767.219 2.99292 13.09 23.93 131.511 127678.15 2.87 95.46 44.6026 1.0804 44.6157 110.19 29.34 34573.71 127943 64.959 44.39 4.4 2.423 11160.8 35.09 12.75 228970000 24.1 383.97 12.54 234.319 7320 2040.66 62329 38.98 8512 72092 61161 355.827 435.624 2150 1840 7195 3.79562 120664 4.91 118639 502.05 27.74 386 238496 139809 277524 25.09 2170.03 234968 1816 39 234.62 3.65807 284.18 0.86116 54.29 71.18 7422.74 7.3669 7.32078 88.36 223.87156 56.10 41656 23.705 12456.32 6.983 5.04566 29.202 6.51 6.04 18737.09 1359.67 266.96 193.39 112.34 43.999 40.69 16408.43 32.33 224.74 4305589.62 150.19 148.546 140.06 42185 4.581 32241.73 147.091 1.825 26.74 58.184 4149899.46 32.902566 12101.42 13.518 7660.4 169.835 5.81 140020000 6.55 105856 86.555 1733.4 15.95 8.26023 234.39558 58.024 2166.8275 19.09 14.3837 446 18.275 14.4448 56.71 5014.7 16.3572 204100000 16.4311 40.301 66.076 96 13.276 63.909 54.926 31.5524 31.6357 18.2462 63.14 313.32 35.4345 16.1 105.65 15.942 35.1791 1479761 0.49 59.91 71.15 28.8122 7.31 28.7765 37.4 10.1 15.7502 520.4 242.5 4.74 1.77 15.53 0.88 71.4 227.935 23.41 9.79 23.75 2.19 31.8965 1.28 31.9077 4.81 50.9069 29.8126 50.8481 55067 11.76 26540.76 388.309 343.2 29.8789 29.8734 135 7.37 29.8559 470 90.453 11.91 153.73 9.447 471 59.1701 59.223 407 18.3191 18.5488 153.66 27.5481 27.6161 12.394 55.4126 41.63 4.84 28.2076 134.633 55.6015 28.4031 1091.47 9.748 15.2168 2170 128.11 11.352 15.1774 2096 264.99 57.9937 32.718 4.39625 58.467 266.65 168.7 52.5398 7.224 99.7527 0.734 59.735 60.0213 1299800 3342.51 95.578 16610 47385 4.774 38520 141280 183.452 117.845 53.6889 
373.696 14.948 714.42 4039.03 70960 1361760 9210.49 71722427 5.935 30.329244698 102200 37.654774238 0.209 142133 60805 310.55 30.83045146 14360 115.198 35393577110 1862000 1564.5 1145.28 2.697 88060 130.3598 1062.21 5.48 12.02 145829960090 16.9563 168593428210 10970736110 20.411 510.266 71981000 20919 2113.83 3.44 68204421390 4.064 47030287560 20752 29.5036 2150.83 27940 91.860925305 47260 285.26 0.65 35.4539 24365 3.82 108.37 52753.11 353220.2 14.25 71318180 196302.45 86.77 5418.1 84.113422384 53.09 886.66 369.15 519.599 2.0115 19.38 35.36 186.551 184950.78 4.15 66.07 64.2029 1.5573 63.2219 75.4 41.03 50512.7 183859 44.92 62.26 6.4 1.694 16204 50.11 18.22 329800000 34.8 550.3 17.96 163.491 5095 2726.85 43450 53.71 6008 50740 43729 251.154 308.209 1547 1304 5114 5.25193 85258 6.93 83823 662.91 38.59 547 169828 99301 196772 18.59 2989.78 167519 1294 54.06 166.94 5.09456 208.7 0.61454 75.87 98.86 10390.33 10.1686 10.1188 122.15 162.53309 42.55 55901 18.249 16901.1 5.097 6.9477 21.8 7.37 8.25 24740.05 1854.34 353.52 211.66 137.97 33.2 54.02 22004.23 42.25 295.35 5693052.64 114.72 194.348 185.07 54797 3.852 41956.06 193.281 1.415 32.51 75.364 5061427.55 26.294883 15263.55 10.784 9559.6 207.158 5.1 168030000 7.829 117684 71.369 1448.07 14.1 9.91323 203.49517 48.845 1816.8935 22.7 16.9435 377 15.8089 16.751 66.49 5665 14.1676 223360000 14.1336 34.446 57.215 107.4 15.438 56.736 48.076 27.7795 27.8481 15.9782 71.16 351.84 30.8378 14.2 120.21 14.0877 31.0588 1692102 0.56 67.85 81.2 25.8193 8.29 25.7995 33 9.18 14.123 487.8 216 5.36 1.99 17.54 0.97 63.6 251.968 21.63 9.01 26.59 2.45 28.6645 1.41 28.806 5.3 54.9549 27.8339 54.5762 60754 12.93 24498.2 419.011 319.4 27.6358 27.6338 128.32 8.02 28.0061 436.86 83.672 12.05 148.7 8.806 498 55.8788 55.6705 433 19.2383 19.2256 147.8 28.4441 28.478 12.516 53.7468 43.58 4.65 28.9081 129.701 53.6358 28.9398 1058.26 9.459 15 2139 124.58 11.042 14.786 2059 258.4 58.9926 31.973 4.30701 58.982 265.33 168.69 1.143 221.237 56.6746 7.176 
98.9954 0.615 59.0159 40.5778 1591084 2893.39 97.4361 16950 47590 4.833 38020 140960 184.512 114.123 52.8988 375.175 15.007 710.53 4048.08 70910 1627171 9300.91 64256226 6.101 30.650349799 103680 38.235807506 0.187 155668 61351 311.73 31.651786074 14310 114.341 35045220680 1862000 1569.98 1064.7 2.705 89344 130.2225 1083.23 5.45 12.189 144868800380 16.9459 167529875870 10887827050 20.335 508.378 65609000 20740 2125.71 3.46 67647607600 4.062 47048102160 20746 29.5741 2145.16 29012 91.894740975 47942 291.63 0.66 35.3254 24526 3.89 109.41 52719.71 345908.2 14.29 71944056 200059.86 86.948 5294.3 83.591421969 52.94 894.24 371.16 527.01 2.01217 19.43 35.1 183.834 180058.09 4.17 65.66 64.9886 1.5633 64.0485 75.33 42.91 50229.93 171918 45.816 63.33 6.3 1.671 16088.6 50.81 18.46 331120000 34.83 553.27 17.74 162.567 5127 2605.6 44009 54.88 5932 50458 42719 249.43 307.199 1524 1294 5081 5.27642 84586 6.96 83524 675.24 38.46 545 168775 99388 196983 18.3 3010.6 166105 1305 54.78 170.46 5.03492 203.73 0.61420 73.99 99.72 8593.82 10.2038 10.1229 120.84 162.41459 42.83 56335 17.939 17036.01 5.063 6.92496 21.846 7.37 8.25 25591.23 1855.23 357.15 204.81 150.27 32.79 54.71 21129.21 42.88 298.8 5730258.98 114.07 190.158 178.47 55746 3.472 42386.78 190.914 1.434 33.23 75.647 5241319.8 27.068099 15270.26 10.864 9512.1 209.444 5.08 171400000 7.946 102783 1447.91 16.93 9.88187 204.95889 48.465 1813.292 22.63 16.8437 378 15.8476 16.6687 66.06 5672.2 14.0426 239170000 14.0817 35.193 57.464 107.34 15.297 56.807 47.475 27.5697 27.5778 15.9518 69.03 352.96 30.9894 14.3 120.99 14.0495 30.9595 1685554 0.55 67.16 80.25 25.5679 8.29 25.585 33.6 9.23 14.0404 552.8 214.3 5.28 1.99 17.54 0.97 67 253.624 21.51 8.93 26.44 2.44 28.7669 1.4 28.7749 5.3 54.0622 27.5452 53.8771 60711 12.93 25162.58 421.021 318.5 27.5275 27.4046 126.22 7.96 27.5381 432.96 11.83 145.64 8.854 502 55.5756 55.6246 433 18.9154 18.7925 145.89 28.6623 28.5518 12.111 53.4593 43.34 4.68 29.1118 129.448 53.7069 28.9079 1063.27 9.526 
14.9001 2140 124.71 11.12 14.7872 2051 258.48 58.4249 32.185 4.3061 58.05 264.55 168.37 1.148 221.121 53.0212 7.226 96.4633 0.623 60.5548 59.2394 1599780 3340.25 87.4539 16590 47203 4.785 38040 138510 179.454 115.053 54.7998 375.582 15.023 722 3946.27 71070 1604627 9248.07 71115165 6.49 30.471851303 102900 37.994095068 0.217 138934 61337 309.14 32.373449454 14200 115.093 35479543830 1888000 1668.97 1044.98 2.771 89756.1 130.417 1081.46 5.54 11.407 145929235660 16.9808 167740437850 10972703730 20.389 509.73 71931000 21146 2141.47 3.48 68291402680 4.067 47059792960 20598 29.1886 2188.89 28970 91.176640881 47587 290.16 0.66 35.2707 24512 3.85 108.16 52390.75 354387.3 14.38 72527312 201855.19 86.56 5367.5 83.64314069 52.67 910.84 382.3 526.015 2.03456 19.4 35.42 187.72 188170.4 4.22 65.34 63.746 1.5848 62.9035 75.24 42.68 49366.17 185607 45.723 64.57 6.4 1.667 15832.3 50.69 18.26 329870000 34.84 553.49 14.89 162.803 5175 2559.46 43624 55.69 6037 50705 42916 249.686 307.934 1508 1318 5039 5.05305 84845 6.91 83238 715.44 39.46 549 168378 98534 195691 17.7 3005.33 166313 1296 53.9 166.08 4.94223 202.62 0.61266 72.63 98.68 7707.48 10.2334 10.1689 120.89 161.88961 42.77 57567 19.694 17187.3 5.066 6.94737 21.985 7.36 8.24 25290.9 1852.66 363.79 171.17 152.13 33.165 53.99 21905.42 43.18 299.8 5727247.49 115.59 194.228 170.99 50776 3.854 42343.47 191.65 1.405 33.47 75.057 5171670.55 27.222561 15177.55 11.108 9501 204.454 4.78 169900000 7.834 118632 1431.86 14.04 9.84149 195.54508 49.009 1815.051 22.45 16.8575 376 15.5292 17.032 66.42 5666.3 14.0375 239250000 14.144 35.075 56.866 108.94 15.439 56.766 48.276 27.549 27.654 15.8822 70.56 355.77 31.0469 14.3 120.32 14.015 30.9269 1686956 0.55 65.93 79.99 25.449 8.31 25.4696 33 8.98 14.0224 490.7 214.5 5.36 2 17.53 0.98 64.8 256.136 20.89 8.77 26.63 2.45 28.762 1.39 28.7644 5.31 54.6591 27.4897 54.2616 60209 12.95 24381.01 419.3 317.3 27.3438 27.3379 123.75 7.98 27.5919 435.36 11.17 143.52 8.828 504 55.3711 55.782 430 18.8817 
19.1865 148.29 27.776 28.2529 12.695 53.406 43.32 4.66 28.4467 131.703 53.5429 28.5277 1062.13 9.526 14.7708 2139 124.75 11.167 15.0325 2055 259.37 58.2473 32.225 4.31754 58.5898 263.94 168.31 1.148 220.533 OpenBenchmarking.org
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128
GFLOP/s, More Is Better: e 56.67, f 53.02, d 52.54, b 32.69, a 31.81, c 22.17
1. (CXX) g++ options: -O3
Rodinia 3.1 - Test: OpenMP Streamcluster
Seconds, Fewer Is Better: e 7.176, d 7.224, f 7.226, c 15.834, b 15.894
1. (CXX) g++ options: -O2 -lOpenCL
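Because the charts mix "More Is Better" and "Fewer Is Better" units, comparing two configurations across tests requires inverting the ratio for time-based results. The sketch below is one common way to do that, using the d and b values from the HeFFTe and Rodinia charts above; the function names are hypothetical, and the geometric mean is only an illustrative summary, not a figure reported in this result file.

```python
from math import prod


def relative_performance(value, baseline, higher_is_better):
    """Return how many times better `value` is than `baseline`.

    For "Fewer Is Better" results (e.g. seconds) the ratio is inverted,
    so a result above 1.0 always means `value` is the better one.
    """
    if value <= 0 or baseline <= 0:
        raise ValueError("benchmark results must be positive")
    return value / baseline if higher_is_better else baseline / value


def geometric_mean(ratios):
    """Geometric mean of a non-empty sequence of positive ratios."""
    ratios = list(ratios)
    if not ratios:
        raise ValueError("need at least one ratio")
    return prod(ratios) ** (1.0 / len(ratios))


# (test name, result for d, result for b, higher_is_better)
results = [
    ("HeFFTe r2c Stock double-long 128 (GFLOP/s)", 52.54, 32.69, True),
    ("Rodinia OpenMP Streamcluster (Seconds)", 7.224, 15.894, False),
]

ratios = [
    relative_performance(d_value, b_value, higher)
    for _, d_value, b_value, higher in results
]

for (name, _, _, _), ratio in zip(results, ratios):
    print(f"{name}: d is {ratio:.2f}x b")
print(f"Geometric mean over {len(ratios)} tests: {geometric_mean(ratios):.2f}x")
```

A geometric mean is used here rather than an arithmetic mean so that no single test with a large ratio dominates the summary.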
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128
GFLOP/s, More Is Better: d 99.75, e 99.00, f 96.46, b 52.50, c 51.94, a 50.93
1. (CXX) g++ options: -O3
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency
ms, Fewer Is Better: e 0.615, f 0.623, d 0.734, c 0.910, b 1.054
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: float - X Y Z: 128
GFLOP/s, More Is Better: f 60.55, d 59.74, e 59.02, b 31.93, c 31.74, a 31.26
1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128
GFLOP/s, More Is Better: d 60.02, f 59.24, e 40.58, c 32.39, b 31.64, a 31.23
1. (CXX) g++ options: -O3
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only
TPS, More Is Better: f 1599780, e 1591084, d 1299800, b 947197, c 833675
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
NAS Parallel Benchmarks 3.4 - Test / Class: EP.C
Total Mop/s, More Is Better: d 3342.51, f 3340.25, e 2893.39, c 1775.67, b 1773.71
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.5
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128
GFLOP/s, More Is Better: e 97.44, d 95.58, f 87.45, b 53.17, c 52.25, a 51.88
1. (CXX) g++ options: -O3
Cpuminer-Opt 23.5 - Algorithm: LBC, LBRY Credits
kH/s, More Is Better: e 16950.00, d 16610.00, f 16590.00, c 9996.74, b 9845.56, a 9035.20
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
John The Ripper 2023.03.14 - Test: Blowfish
Real C/S, More Is Better: e 47590, d 47385, f 47203, c 31050, b 26232
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
easyWave r34 - Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240
Seconds, Fewer Is Better: d 4.774, f 4.785, e 4.833, c 7.127, b 8.637
1. (CXX) g++ options: -O3 -fopenmp
Cpuminer-Opt 23.5 - Algorithm: Skeincoin
kH/s, More Is Better: d 38520, f 38040, e 38020, c 22780, b 22770, a 21370
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 23.5 - Algorithm: Blake-2 S
kH/s, More Is Better: d 141280, e 140960, f 138510, c 86620, b 85030, a 79320
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128
GFLOP/s, More Is Better: e 184.51, d 183.45, f 179.45, a 131.94, c 131.25, b 104.03
1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128
GFLOP/s, More Is Better: d 117.85, f 115.05, e 114.12, b 71.01, c 69.36, a 66.67
1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: double - X Y Z: 128
GFLOP/s, More Is Better: f 54.80, d 53.69, e 52.90, b 32.62, c 32.41, a 31.60
1. (CXX) g++ options: -O3
miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1
GFInst/s, More Is Better: f 375.58, e 375.18, d 373.70, c 217.63, b 216.97
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM1
Billion Interactions/s, More Is Better: f 15.023, e 15.007, d 14.948, c 8.705, b 8.679
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Cpuminer-Opt 23.5 - Algorithm: Magi
kH/s, More Is Better: f 722.00, d 714.42, e 710.53, c 450.42, b 442.28, a 417.24
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 23.5 - Algorithm: Ringcoin
kH/s, More Is Better: e 4048.08, d 4039.03, f 3946.27, b 2508.93, c 2494.87, a 2345.04
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt 23.5 - Algorithm: Quad SHA-256, Pyrite
kH/s, More Is Better: f 71070, d 70960, e 70910, b 43650, c 43630, a 41360
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency
TPS, More Is Better: e 1627171, f 1604627, d 1361760, c 1098384, b 949091
1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
Cpuminer-Opt 23.5 - Algorithm: Deepcoin
kH/s, More Is Better: e 9300.91, f 9248.07, d 9210.49, c 5604.27, b 5549.86, a 5436.88
1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Stockfish 15 - Total Time
Nodes Per Second, More Is Better: d 71722427, f 71115165, e 64256226, b 46079885, c 41998826
1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
Rodinia 3.1 - Test: OpenMP CFD Solver
Seconds, Fewer Is Better: d 5.935, e 6.101, f 6.490, b 10.118, c 10.121
1. (CXX) g++ options: -O2 -lOpenCL
SPECFEM3D 4.0 - Model: Tomographic Model
Seconds, Fewer Is Better: d 30.33, f 30.47, e 30.65, b 49.14, c 51.56
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Triple SHA-256, Onecoin e f d b c a 20K 40K 60K 80K 100K 103680 102900 102200 62870 62850 61070 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace d f e b c 14 28 42 56 70 37.65 37.99 38.24 62.09 63.90 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Darktable Test: Server Rack - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Rack - Acceleration: CPU-only e d f b c 0.0713 0.1426 0.2139 0.2852 0.3565 0.187 0.209 0.217 0.266 0.317
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating e d f b c 30K 60K 90K 120K 150K 155668 142133 138934 92168 92137 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
John The Ripper 2023.03.14 - Test: WPA PSK (Real C/S, More Is Better): e: 61351, f: 61337, d: 60805, c: 36403, b: 36403. Notes: 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
Cpuminer-Opt 23.5 - Algorithm: scrypt (kH/s, More Is Better): e: 311.73, d: 310.55, f: 309.14, b: 195.41, c: 194.68, a: 185.62. Notes: 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
SPECFEM3D 4.0 - Model: Mount St. Helens (Seconds, Fewer Is Better): d: 30.83, e: 31.65, f: 32.37, b: 51.62, c: 51.75. Notes: 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Cpuminer-Opt 23.5 - Algorithm: Myriad-Groestl (kH/s, More Is Better): d: 14360.00, e: 14310.00, f: 14200.00, b: 8774.33, c: 8765.28, a: 8564.28. Notes: 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 (GFLOP/s, More Is Better): d: 115.20, f: 115.09, e: 114.34, c: 71.50, b: 70.42, a: 68.88. Notes: 1. (CXX) g++ options: -O3
OpenSSL 3.1 - Algorithm: SHA256 (byte/s, More Is Better): f: 35479543830, d: 35393577110, e: 35045220680, c: 21897801590, b: 21234053100. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
John The Ripper 2023.03.14 - Test: MD5 (Real C/S, More Is Better): f: 1888000, e: 1862000, d: 1862000, b: 1134000, c: 1131000. Notes: 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenVINO 2023.2.dev - Model: Face Detection Retail FP16 - Device: CPU (FPS, More Is Better): f: 1668.97, e: 1569.98, d: 1564.50, c: 1028.14, b: 1011.66. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
oneDNN 3.3 - Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better): f: 1044.98 (MIN: 983.94), e: 1064.70 (MIN: 979.41), d: 1145.28 (MIN: 996.9), b: 1705.10 (MIN: 1702.22), c: 1707.37 (MIN: 1704.07). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Darktable 4.4.2 - Test: Boat - Acceleration: CPU-only (Seconds, Fewer Is Better): d: 2.697, e: 2.705, f: 2.771, b: 4.194, c: 4.389.
QuantLib 1.32 - Configuration: Multi-Threaded (MFLOPS, More Is Better): f: 89756.1, e: 89344.0, d: 88060.0, c: 57317.5, b: 55963.1, a: 55799.7. Notes: 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
ASTC Encoder 4.0 - Preset: Medium (MT/s, More Is Better): f: 130.42, d: 130.36, e: 130.22, c: 81.26, b: 81.20. Notes: 1. (CXX) g++ options: -O3 -flto -pthread
oneDNN 3.3 - Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU (ms, Fewer Is Better): d: 1062.21 (MIN: 982.53), f: 1081.46 (MIN: 999.13), e: 1083.23 (MIN: 981.64), b: 1705.51 (MIN: 1702.42), c: 1705.89 (MIN: 1703.31). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenVINO 2023.2.dev - Model: Face Detection FP16 - Device: CPU (FPS, More Is Better): f: 5.54, d: 5.48, e: 5.45, c: 3.45, b: 3.45. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
IndigoBench 4.4 - Acceleration: CPU - Scene: Supercar (M samples/s, More Is Better): e: 12.189, d: 12.020, f: 11.407, c: 7.778, b: 7.592.
OpenSSL 3.1 - Algorithm: AES-256-GCM (byte/s, More Is Better): f: 145929235660, d: 145829960090, e: 144868800380, c: 94488932900, b: 91408403390. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
ASTC Encoder 4.0 - Preset: Thorough (MT/s, More Is Better): f: 16.98, d: 16.96, e: 16.95, c: 10.69, b: 10.69. Notes: 1. (CXX) g++ options: -O3 -flto -pthread
OpenSSL 3.1 - Algorithm: AES-128-GCM (byte/s, More Is Better): d: 168593428210, f: 167740437850, e: 167529875870, b: 106224257980, c: 106192145890. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL 3.1 - Algorithm: SHA512 (byte/s, More Is Better): f: 10972703730, d: 10970736110, e: 10887827050, c: 7042483840, b: 6922574030. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM2 (Billion Interactions/s, More Is Better): d: 20.41, f: 20.39, e: 20.34, c: 12.91, b: 12.91. Notes: 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE 20210901 - Implementation: OpenMP - Input Deck: BM2 (GFInst/s, More Is Better): d: 510.27, f: 509.73, e: 508.38, c: 322.67, b: 322.65. Notes: 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
John The Ripper 2023.03.14 - Test: HMAC-SHA512 (Real C/S, More Is Better): d: 71981000, f: 71931000, e: 65609000, c: 46015000, b: 45526000. Notes: 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OSPRay Studio 0.13 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): e: 20740, d: 20919, f: 21146, c: 32538, b: 32575.
oneDNN 3.3 - Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU (ms, Fewer Is Better): d: 2113.83 (MIN: 1927.97), e: 2125.71 (MIN: 1925.61), f: 2141.47 (MIN: 1947.18), c: 3319.30 (MIN: 3315.49), b: 3319.64 (MIN: 3316.21). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
LuxCoreRender 2.6 - Scene: Danish Mood - Acceleration: CPU (M samples/sec, More Is Better): f: 3.48 (MIN: 1.71 / MAX: 3.89), e: 3.46 (MIN: 1.38 / MAX: 3.89), d: 3.44 (MIN: 1.57 / MAX: 3.87), c: 2.27 (MIN: 0.86 / MAX: 2.63), b: 2.22 (MIN: 0.78 / MAX: 2.61).
OpenSSL 3.1 - Algorithm: ChaCha20 (byte/s, More Is Better): f: 68291402680, d: 68204421390, e: 67647607600, b: 43683166890, c: 43658009260. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
IndigoBench 4.4 - Acceleration: CPU - Scene: Bedroom (M samples/s, More Is Better): f: 4.067, d: 4.064, e: 4.062, b: 2.689, c: 2.600.
OpenSSL 3.1 - Algorithm: ChaCha20-Poly1305 (byte/s, More Is Better): f: 47059792960, e: 47048102160, d: 47030287560, b: 31517307700, c: 30264414210. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OSPRay Studio 0.13 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 20598, e: 20746, d: 20752, b: 31957, c: 32007.
Embree 4.3 - Binary: Pathtracer ISPC - Model: Crown (Frames Per Second, More Is Better): e: 29.57 (MIN: 29.06 / MAX: 32.27), d: 29.50 (MIN: 28.9 / MAX: 32.01), f: 29.19 (MIN: 28.59 / MAX: 31.95), c: 19.15 (MIN: 18.85 / MAX: 19.5), b: 19.03 (MIN: 18.79 / MAX: 19.41).
oneDNN 3.3 - Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU (ms, Fewer Is Better): e: 2145.16 (MIN: 1934.78), d: 2150.83 (MIN: 1926.06), f: 2188.89 (MIN: 1932.77), b: 3320.34 (MIN: 3316.93), c: 3321.37 (MIN: 3317.95). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Chaos Group V-RAY 5.02 - Mode: CPU (vsamples, More Is Better): e: 29012, f: 28970, d: 27940, c: 18885, b: 18743.
SPECFEM3D 4.0 - Model: Layered Halfspace (Seconds, Fewer Is Better): f: 91.18, d: 91.86, e: 91.89, b: 135.10, c: 140.59. Notes: 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
John The Ripper 2023.03.14 - Test: bcrypt (Real C/S, More Is Better): e: 47942, f: 47587, d: 47260, b: 31152, c: 31110. Notes: 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenRadioss 2023.09.15 - Model: INIVOL and Fluid Structure Interaction Drop Container (Seconds, Fewer Is Better): d: 285.26, f: 290.16, e: 291.63, b: 435.68, c: 438.56.
Intel Open Image Denoise 2.1 - Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only (Images / Sec, More Is Better): f: 0.66, e: 0.66, d: 0.65, c: 0.43, b: 0.43.
Embree 4.3 - Binary: Pathtracer ISPC - Model: Asian Dragon (Frames Per Second, More Is Better): d: 35.45 (MIN: 35.02 / MAX: 37.92), e: 35.33 (MIN: 34.92 / MAX: 37.85), f: 35.27 (MIN: 34.8 / MAX: 37.69), c: 23.20 (MIN: 23.04 / MAX: 23.45), b: 23.10 (MIN: 22.96 / MAX: 23.41).
OSPRay Studio 0.13 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): d: 24365, f: 24512, e: 24526, c: 37367, b: 37384.
LuxCoreRender 2.6 - Scene: LuxCore Benchmark - Acceleration: CPU (M samples/sec, More Is Better): e: 3.89 (MIN: 1.74 / MAX: 4.37), f: 3.85 (MIN: 1.53 / MAX: 4.35), d: 3.82 (MIN: 1.47 / MAX: 4.34), c: 2.55 (MIN: 0.93 / MAX: 2.98), b: 2.54 (MIN: 0.92 / MAX: 2.97).
OpenRadioss 2023.09.15 - Model: Rubber O-Ring Seal Installation (Seconds, Fewer Is Better): f: 108.16, d: 108.37, e: 109.41, b: 165.27, c: 165.49.
NAS Parallel Benchmarks 3.4 - Test / Class: LU.C (Total Mop/s, More Is Better): d: 52753.11, e: 52719.71, f: 52390.75, b: 35845.04, c: 34481.92. Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OpenSSL 3.1 - Algorithm: RSA4096 (verify/s, More Is Better): f: 354387.3, d: 353220.2, e: 345908.2, c: 231842.8, b: 231709.8. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better): f: 14.38, e: 14.29, d: 14.25, c: 9.43, b: 9.41.
asmFish 2018-07-23 - 1024 Hash Memory, 26 Depth (Nodes/second, More Is Better): f: 72527312, e: 71944056, d: 71318180, b: 47742886, c: 47676136.
nginx 1.23.2 - Connections: 500 (Requests Per Second, More Is Better): f: 201855.19, e: 200059.86, d: 196302.45, b: 135692.91, c: 132783.22. Notes: 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Rodinia 3.1 - Test: OpenMP LavaMD (Seconds, Fewer Is Better): f: 86.56, d: 86.77, e: 86.95, c: 129.98, b: 131.53. Notes: 1. (CXX) g++ options: -O2 -lOpenCL
OpenSSL 3.1 - Algorithm: RSA4096 (sign/s, More Is Better): d: 5418.1, f: 5367.5, e: 5294.3, c: 3578.6, b: 3577.3. Notes: 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
SPECFEM3D 4.0 - Model: Water-layered Halfspace (Seconds, Fewer Is Better): e: 83.59, f: 83.64, d: 84.11, b: 126.07, c: 126.11. Notes: 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Blender 3.6 - Blend File: BMW27 - Compute: CPU-Only (Seconds, Fewer Is Better): f: 52.67, e: 52.94, d: 53.09, b: 78.97, c: 79.19.
OpenVINO 2023.2.dev - Model: Vehicle Detection FP16-INT8 - Device: CPU (FPS, More Is Better): f: 910.84, e: 894.24, d: 886.66, c: 610.86, b: 607.54. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO 2023.2.dev - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (FPS, More Is Better): f: 382.30, e: 371.16, d: 369.15, c: 255.73, b: 255.00. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Timed Linux Kernel Compilation 6.1 - Build: allmodconfig (Seconds, Fewer Is Better): d: 519.60, f: 526.02, e: 527.01, c: 767.22, b: 778.31.
oneDNN 3.3 - Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better): d: 2.01150 (MIN: 1.64), e: 2.01217 (MIN: 1.64), f: 2.03456 (MIN: 1.64), b: 2.98963 (MIN: 2.87), c: 2.99292 (MIN: 2.87). Notes: 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Medium (Frames Per Second, More Is Better): e: 19.43, f: 19.40, d: 19.38, c: 13.09, b: 13.07. Notes: 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
TensorFlow 2.12 - Device: CPU - Batch Size: 16 - Model: ResNet-50 (images/sec, More Is Better): f: 35.42, d: 35.36, e: 35.10, c: 23.93, b: 23.91.
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 (GFLOP/s, More Is Better): f: 187.72, d: 186.55, e: 183.83, b: 134.25, c: 131.51, a: 127.31. Notes: 1. (CXX) g++ options: -O3
nginx 1.23.2 - Connections: 1000 (Requests Per Second, More Is Better): f: 188170.40, d: 184950.78, e: 180058.09, b: 127899.06, c: 127678.15. Notes: 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
LuxCoreRender 2.6 - Scene: DLSC - Acceleration: CPU (M samples/sec, More Is Better): f: 4.22 (MIN: 4.08 / MAX: 4.91), e: 4.17 (MIN: 4 / MAX: 4.47), d: 4.15 (MIN: 4.05 / MAX: 4.44), b: 2.88 (MIN: 2.81 / MAX: 3), c: 2.87 (MIN: 2.8 / MAX: 2.98).
OpenRadioss 2023.09.15 - Model: Cell Phone Drop Test (Seconds, Fewer Is Better): f: 65.34, e: 65.66, d: 66.07, c: 95.46, b: 96.02.
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better): e: 64.99, d: 64.20, f: 63.75, b: 44.88, c: 44.60, a: 44.29. Notes: 1. (CXX) g++ options: -O3
ASTC Encoder 4.0 - Preset: Exhaustive (MT/s, More Is Better): f: 1.5848, e: 1.5633, d: 1.5573, b: 1.0807, c: 1.0804. Notes: 1. (CXX) g++ options: -O3 -flto -pthread
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 (GFLOP/s, More Is Better): e: 64.05, d: 63.22, f: 62.90, c: 44.62, b: 44.06, a: 43.71. Notes: 1. (CXX) g++ options: -O3
Blender 3.6 - Blend File: Fishy Cat - Compute: CPU-Only (Seconds, Fewer Is Better): f: 75.24, e: 75.33, d: 75.40, b: 109.94, c: 110.19.
uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better): e: 42.91, f: 42.68, d: 41.03, c: 29.34, b: 29.30.
NAS Parallel Benchmarks 3.4 - Test / Class: BT.C (Total Mop/s, More Is Better): d: 50512.70, e: 50229.93, f: 49366.17, c: 34573.71, b: 34501.92. Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
7-Zip Compression 22.01 - Test: Compression Rating (MIPS, More Is Better): f: 185607, d: 183859, e: 171918, c: 127943, b: 127009. Notes: 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Timed Linux Kernel Compilation 6.1 - Build: defconfig (Seconds, Fewer Is Better): d: 44.92, f: 45.72, e: 45.82, c: 64.96, b: 65.56.
OpenVINO 2023.2.dev - Model: Machine Translation EN To DE FP16 - Device: CPU (FPS, More Is Better): f: 64.57, e: 63.33, d: 62.26, b: 44.66, c: 44.39. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Natron 2.4.3 - Input: Spaceship (FPS, More Is Better): f: 6.4, d: 6.4, e: 6.3, b: 4.6, c: 4.4.
Darktable 4.4.2 - Test: Masskrug - Acceleration: CPU-only (Seconds, Fewer Is Better): f: 1.667, e: 1.671, d: 1.694, b: 2.364, c: 2.423.
Xmrig 6.18.1 - Variant: Wownero - Hash Count: 1M (H/s, More Is Better): d: 16204.0, e: 16088.6, f: 15832.3, b: 11202.1, c: 11160.8. Notes: 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better): e: 50.81, f: 50.69, d: 50.11, b: 35.30, c: 35.09.
OpenVINO 2023.2.dev - Model: Face Detection FP16-INT8 - Device: CPU (FPS, More Is Better): e: 18.46, f: 18.26, d: 18.22, c: 12.75, b: 12.75. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Liquid-DSP 1.6 - Threads: 32 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better): e: 331120000, f: 329870000, d: 329800000, c: 228970000, b: 228820000. Notes: 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
TensorFlow 2.12 - Device: CPU - Batch Size: 64 - Model: ResNet-50 (images/sec, More Is Better): f: 34.84, e: 34.83, d: 34.80, b: 24.14, c: 24.10.
OpenVINO 2023.2.dev - Model: Weld Porosity Detection FP16 - Device: CPU (FPS, More Is Better): f: 553.49, e: 553.27, d: 550.30, c: 383.97, b: 383.74. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
LuxCoreRender 2.6 - Scene: Rainbow Colors and Prism - Acceleration: CPU (M samples/sec, More Is Better): d: 17.96 (MIN: 17.28 / MAX: 18.69), e: 17.74 (MIN: 16.79 / MAX: 18.55), f: 14.89 (MIN: 12.73 / MAX: 16.75), c: 12.54 (MIN: 12.43 / MAX: 12.6), b: 12.46 (MIN: 12.32 / MAX: 12.57).
Timed Godot Game Engine Compilation 4.0 - Time To Compile (Seconds, Fewer Is Better): e: 162.57, f: 162.80, d: 163.49, b: 230.16, c: 234.32.
OSPRay Studio 0.13 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): d: 5095, e: 5127, f: 5175, c: 7320, b: 7338.
Cpuminer-Opt 23.5 - Algorithm: Garlicoin (kH/s, More Is Better): d: 2726.85, e: 2605.60, f: 2559.46, c: 2040.66, b: 2040.33, a: 1893.63. Notes: 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OSPRay Studio 0.13 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): d: 43450, f: 43624, e: 44009, c: 62329, b: 62478.
OpenVINO 2023.2.dev - Model: Person Detection FP16 - Device: CPU (FPS, More Is Better): f: 55.69, e: 54.88, d: 53.71, c: 38.98, b: 38.73. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Studio 0.13 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): e: 5932, d: 6008, f: 6037, c: 8512, b: 8523.
OSPRay Studio 0.13 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): e: 50458, f: 50705, d: 50740, c: 72092, b: 72426.
OSPRay Studio 0.13 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): e: 42719, f: 42916, d: 43729, b: 61145, c: 61161.
Timed Node.js Compilation 19.8.1 - Time To Compile (Seconds, Fewer Is Better): e: 249.43, f: 249.69, d: 251.15, c: 355.83, b: 357.10.
Timed LLVM Compilation 16.0 - Build System: Ninja (Seconds, Fewer Is Better): e: 307.20, f: 307.93, d: 308.21, c: 435.62, b: 439.46.
OSPRay Studio 0.13 - Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 1508, e: 1524, d: 1547, c: 2150, b: 2156.
OSPRay Studio 0.13 - Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): e: 1294, d: 1304, f: 1318, c: 1840, b: 1848.
OSPRay Studio 0.13 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 5039, e: 5081, d: 5114, b: 7176, c: 7195.
OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time (Items Per Second, More Is Better): e: 5.27642, d: 5.25193, f: 5.05305, c: 3.79562, b: 3.69580.
OSPRay Studio 0.13 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): e: 84586, f: 84845, d: 85258, c: 120664, b: 120716.
LuxCoreRender 2.6 - Scene: Orange Juice - Acceleration: CPU (M samples/sec, More Is Better): e: 6.96 (MIN: 6.29 / MAX: 7.55), d: 6.93 (MIN: 6.23 / MAX: 7.51), f: 6.91 (MIN: 6.28 / MAX: 7.27), c: 4.91 (MIN: 4.72 / MAX: 4.97), b: 4.88 (MIN: 4.71 / MAX: 4.93).
OSPRay Studio 0.13 - Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 83238, e: 83524, d: 83823, b: 118595, c: 118639.
OpenVINO 2023.2.dev - Model: Person Vehicle Bike Detection FP16 - Device: CPU (FPS, More Is Better): f: 715.44, e: 675.24, d: 662.91, b: 503.59, c: 502.05. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
uvg266 0.4.1 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better): f: 39.46, d: 38.59, e: 38.46, c: 27.74, b: 27.74.
OpenVKL 2.0.0 - Benchmark: vklBenchmarkCPU ISPC (Items / Sec, More Is Better): f: 549 (MIN: 37 / MAX: 7452), d: 547 (MIN: 37 / MAX: 8225), e: 545 (MIN: 36 / MAX: 7468), b: 387 (MIN: 26 / MAX: 5487), c: 386 (MIN: 26 / MAX: 5478).
OSPRay Studio 0.13 - Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 168378, e: 168775, d: 169828, c: 238496, b: 239054.
OSPRay Studio 0.13 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 98534, d: 99301, e: 99388, c: 139809, b: 139891.
OSPRay Studio 0.13 - Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU (ms, Fewer Is Better): f: 195691, d: 196772, e: 196983, b: 277357, c: 277524.
QMCPACK Input: H4_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: H4_ae f e d b c 6 12 18 24 30 17.70 18.30 18.59 22.73 25.09 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU e f d c b 600 1200 1800 2400 3000 3010.60 3005.33 2989.78 2170.03 2125.52 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU e f d b c 50K 100K 150K 200K 250K 166105 166313 167519 234540 234968
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU d f e c b 400 800 1200 1600 2000 1294 1296 1305 1816 1830
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU e d f c b 12 24 36 48 60 54.78 54.06 53.90 39.00 38.75 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenRadioss 2023.09.15 - Model: Bird Strike on Windshield (Seconds, Fewer Is Better): f: 166.08, d: 166.94, e: 170.46, b: 228.35, c: 234.62
OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time (Items Per Second, More Is Better): d: 5.09456, e: 5.03492, f: 4.94223, c: 3.65807, b: 3.60869
QMCPACK 3.17.1 - Input: Li2_STO_ae (Total Execution Time - Seconds, Fewer Is Better): f: 202.62, e: 203.73, d: 208.70, c: 284.18, b: 285.85. Notes: 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
NAMD 2.14 - ATPase Simulation - 327,506 Atoms (days/ns, Fewer Is Better): f: 0.61266, e: 0.61420, d: 0.61454, c: 0.86116, b: 0.86311
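The NAMD figure above is reported in days/ns, which is easy to misread next to the "More Is Better" results around it. As a minimal sketch (the variable and function names here are illustrative only and not part of the Phoronix Test Suite), the reciprocal converts each value to the more familiar ns/day, and the ratio of the slowest to the fastest run gives the relative gap:

```python
# Values copied from the NAMD 2.14 ATPase result above (days/ns, fewer is better).
days_per_ns = {"f": 0.61266, "e": 0.61420, "d": 0.61454, "c": 0.86116, "b": 0.86311}


def to_ns_per_day(days_per_ns_value: float) -> float:
    """Invert a days/ns figure into ns/day (more is better)."""
    return 1.0 / days_per_ns_value


# Hypothetical helper output: the same five runs expressed as ns/day.
ns_per_day = {run: to_ns_per_day(value) for run, value in days_per_ns.items()}

# For a fewer-is-better metric, slowest / fastest is the speedup of the fastest run.
speedup = max(days_per_ns.values()) / min(days_per_ns.values())

for run, value in ns_per_day.items():
    print(f"{run}: {value:.3f} ns/day")
print(f"fastest vs slowest run: {speedup:.3f}x")
```

The same slowest-over-fastest ratio applies to any "Fewer Is Better" result in this listing; for "More Is Better" results the ratio is taken the other way round.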
Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Ultra Fast (Frames Per Second, More Is Better): d: 75.87, e: 73.99, f: 72.63, c: 54.29, b: 54.14. Notes: 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
SVT-HEVC 1.5.0 - Tuning: 7 - Input: Bosphorus 4K (Frames Per Second, More Is Better): e: 99.72, d: 98.86, f: 98.68, c: 71.18, b: 71.18. Notes: 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
NAS Parallel Benchmarks 3.4 - Test / Class: CG.C (Total Mop/s, More Is Better): d: 10390.33, e: 8593.82, b: 8025.90, f: 7707.48, c: 7422.74. Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OSPRay 2.12 - Benchmark: particle_volume/ao/real_time (Items Per Second, More Is Better): f: 10.23340, e: 10.20380, d: 10.16860, b: 7.37745, c: 7.36690
OSPRay 2.12 - Benchmark: particle_volume/scivis/real_time (Items Per Second, More Is Better): f: 10.16890, e: 10.12290, d: 10.11880, b: 7.32159, c: 7.32078
SVT-VP9 0.3 - Tuning: Visual Quality Optimized - Input: Bosphorus 4K (Frames Per Second, More Is Better): d: 122.15, f: 120.89, e: 120.84, c: 88.36, b: 88.07. Notes: 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Execution Time (Seconds, Fewer Is Better): f: 161.89, e: 162.41, d: 162.53, c: 223.87, b: 224.44. Notes: 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
CloverLeaf 1.3 - Input: clover_bm (Seconds, Fewer Is Better): d: 42.55, f: 42.77, e: 42.83, b: 55.82, c: 56.10, a: 58.98. Notes: 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write (TPS, More Is Better): f: 57567, e: 56335, d: 55901, b: 41736, c: 41656. Notes: 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (ms, Fewer Is Better): f: 17.37, e: 17.75, d: 17.89, b: 23.50, c: 23.71. Notes: 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
OpenVINO 2023.2.dev - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (FPS, More Is Better): f: 17187.30, e: 17036.01, d: 16901.10, b: 12503.20, c: 12456.32. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
libavif avifenc 1.0 - Encoder Speed: 6, Lossless (Seconds, Fewer Is Better): e: 5.063, f: 5.066, d: 5.097, b: 6.908, c: 6.983. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
OSPRay 2.12 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time (Items Per Second, More Is Better): d: 6.94770, f: 6.94737, e: 6.92496, c: 5.04566, b: 5.03922
Timed Mesa Compilation 21.0 - Time To Compile (Seconds, Fewer Is Better): d: 21.80, e: 21.85, f: 21.99, c: 29.20, b: 29.83
simdjson 2.0 - Throughput Test: PartialTweets (GB/s, More Is Better): e: 7.37, d: 7.37, f: 7.36, c: 6.51, b: 5.39. Notes: 1. (CXX) g++ options: -O3
simdjson 2.0 - Throughput Test: DistinctUserID (GB/s, More Is Better): e: 8.25, d: 8.25, f: 8.24, c: 6.04, b: 6.04. Notes: 1. (CXX) g++ options: -O3
NAS Parallel Benchmarks 3.4 - Test / Class: FT.C (Total Mop/s, More Is Better): e: 25591.23, f: 25290.90, d: 24740.05, b: 18953.53, c: 18737.09. Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OpenVINO 2023.2.dev - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (FPS, More Is Better): e: 1855.23, d: 1854.34, f: 1852.66, b: 1361.39, c: 1359.67. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO 2023.2.dev - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (FPS, More Is Better): f: 363.79, e: 357.15, d: 353.52, b: 269.31, c: 266.96. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Selenium - Benchmark: WASM collisionDetection - Browser: Google Chrome (ms, Fewer Is Better): f: 171.17, c: 193.39, e: 204.81, d: 211.66, b: 232.97. Notes: 1. chrome 118.0.5993.70
SVT-VP9 0.3 - Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K (Frames Per Second, More Is Better): f: 152.13, e: 150.27, d: 137.97, c: 112.34, b: 111.84. Notes: 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
libavif avifenc 1.0 - Encoder Speed: 2 (Seconds, Fewer Is Better): e: 32.79, f: 33.17, d: 33.20, c: 44.00, b: 44.20. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Super Fast (Frames Per Second, More Is Better): e: 54.71, d: 54.02, f: 53.99, b: 40.71, c: 40.69. Notes: 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
NAS Parallel Benchmarks 3.4 - Test / Class: SP.B (Total Mop/s, More Is Better): d: 22004.23, f: 21905.42, e: 21129.21, b: 16564.09, c: 16408.43. Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Kvazaar 2.2 - Video Input: Bosphorus 4K - Video Preset: Very Fast (Frames Per Second, More Is Better): f: 43.18, e: 42.88, d: 42.25, b: 32.39, c: 32.33. Notes: 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
OpenVINO 2023.2.dev - Model: Handwritten English Recognition FP16 - Device: CPU (FPS, More Is Better): f: 299.80, e: 298.80, d: 295.35, b: 224.92, c: 224.74. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Memcached 1.6.19 - Set To Get Ratio: 1:10 (Ops/sec, More Is Better): e: 5730258.98, f: 5727247.49, d: 5693052.64, b: 4464270.25, c: 4305589.62. Notes: 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenRadioss 2023.09.15 - Model: Bumper Beam (Seconds, Fewer Is Better): e: 114.07, d: 114.72, f: 115.59, c: 150.19, b: 151.75
SVT-AV1 1.7 - Encoder Mode: Preset 12 - Input: Bosphorus 4K (Frames Per Second, More Is Better): d: 194.35, f: 194.23, e: 190.16, c: 148.55, b: 146.57. Notes: 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-HEVC 1.5.0 - Tuning: 10 - Input: Bosphorus 4K (Frames Per Second, More Is Better): d: 185.07, e: 178.47, f: 170.99, c: 140.06, b: 139.99. Notes: 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
PostgreSQL 16 - Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency (TPS, More Is Better): e: 55746, d: 54797, f: 50776, b: 42553, c: 42185. Notes: 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
libavif avifenc 1.0 - Encoder Speed: 6 (Seconds, Fewer Is Better): e: 3.472, d: 3.852, f: 3.854, b: 4.515, c: 4.581. Notes: 1. (CXX) g++ options: -O3 -fPIC -lm
OpenVINO 2023.2.dev - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (FPS, More Is Better): e: 42386.78, f: 42343.47, d: 41956.06, b: 32245.66, c: 32241.73. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
SVT-AV1 1.7 - Encoder Mode: Preset 13 - Input: Bosphorus 4K (Frames Per Second, More Is Better): d: 193.28, f: 191.65, e: 190.91, b: 149.16, c: 147.09. Notes: 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Darktable 4.4.2 - Test: Server Room - Acceleration: CPU-only (Seconds, Fewer Is Better): f: 1.405, d: 1.415, e: 1.434, c: 1.825, b: 1.833
x265 3.4 - Video Input: Bosphorus 4K (Frames Per Second, More Is Better): f: 33.47, e: 33.23, d: 32.51, c: 26.74, b: 25.70. Notes: 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
SVT-AV1 1.7 - Encoder Mode: Preset 8 - Input: Bosphorus 4K (Frames Per Second, More Is Better): e: 75.65, d: 75.36, f: 75.06, b: 58.46, c: 58.18. Notes: 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
Memcached 1.6.19 - Set To Get Ratio: 1:100 (Ops/sec, More Is Better): e: 5241319.80, f: 5171670.55, d: 5061427.55, c: 4149899.46, b: 4058394.64. Notes: 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OpenFOAM 10 - Input: drivaerFastback, Small Mesh Size - Mesh Time (Seconds, Fewer Is Better): d: 26.29, e: 27.07, f: 27.22, c: 32.90, b: 33.88. Notes: 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
NAS Parallel Benchmarks 3.4 - Test / Class: SP.C (Total Mop/s, More Is Better): e: 15270.26, d: 15263.55, f: 15177.55, c: 12101.42, b: 12080.00. Notes: 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Y-Cruncher 0.7.10.9513 - Pi Digits To Calculate: 500M (Seconds, Fewer Is Better): d: 10.78, e: 10.86, f: 11.11, b: 13.44, c: 13.52
Xmrig 6.18.1 - Variant: Monero - Hash Count: 1M (H/s, More Is Better): d: 9559.6, e: 9512.1, f: 9501.0, b: 7676.5, c: 7660.4. Notes: 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OSPRay 2.12 - Benchmark: particle_volume/pathtracer/real_time (Items Per Second, More Is Better): e: 209.44, d: 207.16, f: 204.45, c: 169.84, b: 169.65
OpenVINO 2023.2.dev - Model: Face Detection Retail FP16 - Device: CPU (ms, Fewer Is Better): f: 4.78 (MIN: 2.77 / MAX: 19.3), e: 5.08 (MIN: 3.18 / MAX: 17.17), d: 5.10 (MIN: 2.36 / MAX: 12.91), c: 5.81 (MIN: 2.72 / MAX: 9.48), b: 5.89 (MIN: 4.59 / MAX: 9.41). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Liquid-DSP 1.6 - Threads: 8 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better): e: 171400000, f: 169900000, d: 168030000, c: 140020000, b: 139440000. Notes: 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
VVenC 1.9 - Video Input: Bosphorus 4K - Video Preset: Fast (Frames Per Second, More Is Better): e: 7.946, f: 7.834, d: 7.829, c: 6.550, b: 6.539. Notes: 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Selenium - Benchmark: Octane - Browser: Google Chrome (Geometric Mean, More Is Better): f: 118632, d: 117684, c: 105856, e: 102783, b: 97769. Notes: 1. chrome 118.0.5993.70
DuckDB 0.9.1 - Benchmark: IMDB (Seconds, Fewer Is Better): d: 71.37 (SE +/- 0.27, N = 3), c: 86.56 (SE +/- 0.54, N = 3), b: 86.57 (SE +/- 0.48, N = 3). Notes: 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
OpenVINO 2023.2.dev - Model: Face Detection FP16 - Device: CPU (ms, Fewer Is Better): f: 1431.86 (MIN: 963.6 / MAX: 2030.33), e: 1447.91 (MIN: 1094.97 / MAX: 2000.94), d: 1448.07 (MIN: 1089.83 / MAX: 1961.07), b: 1730.89 (MIN: 1541.53 / MAX: 2245.54), c: 1733.40 (MIN: 1541.89 / MAX: 2247.48). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Selenium - Benchmark: WASM imageConvolute - Browser: Google Chrome (ms, Fewer Is Better): f: 14.04, d: 14.10, b: 15.95, c: 15.95, e: 16.93. Notes: 1. chrome 118.0.5993.70
High Performance Conjugate Gradient 3.1 - X Y Z: 104 104 104 - RT: 60 (GFLOP/s, More Is Better): d: 9.91323, e: 9.88187, f: 9.84149, b: 8.28855, c: 8.26023. Notes: 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Mesh Time (Seconds, Fewer Is Better): f: 195.55, d: 203.50, e: 204.96, b: 230.30, c: 234.40. Notes: 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
Rodinia 3.1 - Test: OpenMP HotSpot3D (Seconds, Fewer Is Better): e: 48.47, d: 48.85, f: 49.01, b: 56.00, c: 58.02. Notes: 1. (CXX) g++ options: -O2 -lOpenCL
OpenFOAM 10 - Input: drivaerFastback, Medium Mesh Size - Execution Time (Seconds, Fewer Is Better): e: 1813.29, f: 1815.05, d: 1816.89, b: 2166.82, c: 2166.83. Notes: 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenVINO 2023.2.dev - Model: Vehicle Detection FP16 - Device: CPU (ms, Fewer Is Better): c: 19.09 (MIN: 15 / MAX: 32.89), b: 19.33 (MIN: 8.18 / MAX: 34.75), f: 22.45 (MIN: 11.6 / MAX: 50.7), e: 22.63 (MIN: 11.81 / MAX: 48.47), d: 22.70 (MIN: 11.49 / MAX: 48.33). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 (GFLOP/s, More Is Better): d: 16.94, f: 16.86, e: 16.84, b: 14.48, c: 14.38, a: 14.28. Notes: 1. (CXX) g++ options: -O3
PyBench 2018-02-16 - Total For Average Test Times (Milliseconds, Fewer Is Better): f: 376, d: 377, e: 378, b: 427, c: 446
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better): b: 18.32, c: 18.28, a: 18.20, e: 15.85, d: 15.81, f: 15.53. Notes: 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better): f: 17.03, d: 16.75, e: 16.67, b: 14.82, a: 14.66, c: 14.44. Notes: 1. (CXX) g++ options: -O3
OpenVINO 2023.2.dev - Model: Road Segmentation ADAS FP16 - Device: CPU (ms, Fewer Is Better): b: 56.41 (MIN: 41.43 / MAX: 98.24), c: 56.71 (MIN: 40.93 / MAX: 99.23), e: 66.06 (MIN: 32.08 / MAX: 125.5), f: 66.42 (MIN: 39.7 / MAX: 126.38), d: 66.49 (MIN: 40.75 / MAX: 125.11). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
QuantLib 1.32 - Configuration: Single-Threaded (MFLOPS, More Is Better): e: 5672.2, f: 5666.3, d: 5665.0, b: 5016.1, c: 5014.7, a: 4818.8. Notes: 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 (GFLOP/s, More Is Better): b: 16.47, c: 16.36, a: 16.15, d: 14.17, e: 14.04, f: 14.04. Notes: 1. (CXX) g++ options: -O3
Liquid-DSP 1.6 - Threads: 16 - Buffer Length: 256 - Filter Length: 512 (samples/s, More Is Better): f: 239250000, e: 239170000, d: 223360000, c: 204100000, b: 204060000. Notes: 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better): b: 16.48, c: 16.43, a: 16.27, f: 14.14, d: 14.13, e: 14.08. Notes: 1. (CXX) g++ options: -O3
RawTherapee - Total Benchmark Time (Seconds, Fewer Is Better): d: 34.45, f: 35.08, e: 35.19, b: 40.09, c: 40.30. Notes: 1. RawTherapee, version 5.9, command line.
Ngspice 34 - Circuit: C2670 (Seconds, Fewer Is Better): f: 56.87, d: 57.22, e: 57.46, c: 66.08, b: 66.39. Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
SVT-VP9 0.3 - Tuning: VMAF Optimized - Input: Bosphorus 4K (Frames Per Second, More Is Better): f: 108.94, d: 107.40, e: 107.34, c: 96.00, b: 93.49. Notes: 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
VVenC 1.9 - Video Input: Bosphorus 4K - Video Preset: Faster (Frames Per Second, More Is Better): f: 15.44, d: 15.44, e: 15.30, c: 13.28, b: 13.27. Notes: 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Ngspice 34 - Circuit: C7552 (Seconds, Fewer Is Better): d: 56.74, f: 56.77, e: 56.81, c: 63.91, b: 65.90. Notes: 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
Rodinia 3.1 - Test: OpenMP Leukocyte (Seconds, Fewer Is Better): e: 47.48, d: 48.08, f: 48.28, b: 53.87, c: 54.93. Notes: 1. (CXX) g++ options: -O2 -lOpenCL
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 (GFLOP/s, More Is Better): b: 31.82, c: 31.55, a: 31.30, d: 27.78, e: 27.57, f: 27.55. Notes: 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better): b: 31.83, c: 31.64, a: 31.27, d: 27.85, f: 27.65, e: 27.58. Notes: 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 (GFLOP/s, More Is Better): b: 18.29, c: 18.25, a: 18.24, d: 15.98, e: 15.95, f: 15.88. Notes: 1. (CXX) g++ options: -O3
LibRaw 0.20 - Post-Processing Benchmark (Mpix/sec, More Is Better): d: 71.16, f: 70.56, e: 69.03, c: 63.14, b: 61.85. Notes: 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
OpenVINO 2023.2.dev - Model: Vehicle Detection FP16 - Device: CPU (FPS, More Is Better): f: 355.77, e: 352.96, d: 351.84, c: 313.32, b: 309.56. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better): c: 35.43, b: 35.29, a: 35.22, f: 31.05, e: 30.99, d: 30.84. Notes: 1. (CXX) g++ options: -O3
Selenium - Benchmark: WASM imageConvolute - Browser: Firefox (ms, Fewer Is Better): d: 14.2, e: 14.3, f: 14.3, c: 16.1, b: 16.3. Notes: 1. firefox 118.0.2
OpenVINO 2023.2.dev - Model: Road Segmentation ADAS FP16 - Device: CPU (FPS, More Is Better): e: 120.99, f: 120.32, d: 120.21, b: 106.23, c: 105.65. Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 (GFLOP/s, More Is Better): b: 16.05, c: 15.94, a: 15.83, d: 14.09, e: 14.05, f: 14.02. Notes: 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 (GFLOP/s, More Is Better): b: 35.38, a: 35.28, c: 35.18, d: 31.06, e: 30.96, f: 30.93. Notes: 1. (CXX) g++ options: -O3
PHPBench 0.8.1 - PHP Benchmark Suite (Score, More Is Better): d: 1692102, f: 1686956, e: 1685554, b: 1491682, c: 1479761
OpenVINO 2023.2.dev - Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU (ms, Fewer Is Better): b: 0.49 (MIN: 0.29 / MAX: 3.73), c: 0.49 (MIN: 0.29 / MAX: 3.11), e: 0.55 (MIN: 0.28 / MAX: 13.45), f: 0.55 (MIN: 0.27 / MAX: 5.91), d: 0.56 (MIN: 0.28 / MAX: 9.03). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO 2023.2.dev - Model: Handwritten English Recognition FP16-INT8 - Device: CPU (ms, Fewer Is Better): b: 59.37 (MIN: 52.65 / MAX: 80.39), c: 59.91 (MIN: 30.66 / MAX: 74.01), f: 65.93 (MIN: 31.99 / MAX: 150.16), e: 67.16 (MIN: 42.11 / MAX: 149.84), d: 67.85 (MIN: 31.98 / MAX: 153.29). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO 2023.2.dev - Model: Handwritten English Recognition FP16 - Device: CPU (ms, Fewer Is Better): b: 71.08 (MIN: 55.16 / MAX: 87.27), c: 71.15 (MIN: 53.9 / MAX: 90.11), f: 79.99 (MIN: 47.77 / MAX: 173.9), e: 80.25 (MIN: 63.46 / MAX: 183.13), d: 81.20 (MIN: 43.5 / MAX: 176.88). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better): b: 29.01, c: 28.81, a: 28.59, d: 25.82, e: 25.57, f: 25.45. Notes: 1. (CXX) g++ options: -O3
simdjson 2.0 - Throughput Test: TopTweet (GB/s, More Is Better): f: 8.31, e: 8.29, d: 8.29, c: 7.31, b: 7.29. Notes: 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 (GFLOP/s, More Is Better): b: 28.99, c: 28.78, a: 28.62, d: 25.80, e: 25.59, f: 25.47. Notes: 1. (CXX) g++ options: -O3
PyPerformance 1.0.0 - Benchmark: crypto_pyaes (Milliseconds, Fewer Is Better): d: 33.0, f: 33.0, e: 33.6, c: 37.4, b: 37.5
PyPerformance 1.0.0 - Benchmark: json_loads (Milliseconds, Fewer Is Better): f: 8.98, d: 9.18, e: 9.23, c: 10.10, b: 10.20
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better): b: 15.90, a: 15.76, c: 15.75, d: 14.12, e: 14.04, f: 14.02. Notes: 1. (CXX) g++ options: -O3
Selenium - Benchmark: Kraken - Browser: Firefox (ms, Fewer Is Better): d: 487.8, f: 490.7, b: 520.2, c: 520.4, e: 552.8. Notes: 1. firefox 118.0.2
Selenium - Benchmark: WASM collisionDetection - Browser: Firefox (ms, Fewer Is Better): e: 214.3, f: 214.5, d: 216.0, b: 242.3, c: 242.5. Notes: 1. firefox 118.0.2
WebP Image Encode 1.2.4 - Encode Settings: Quality 100, Highest Compression (MP/s, More Is Better): f: 5.36, d: 5.36, e: 5.28, c: 4.74, b: 4.74. Notes: 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
simdjson 2.0 - Throughput Test: LargeRandom (GB/s, More Is Better): f: 2.00, e: 1.99, d: 1.99, c: 1.77, b: 1.77. Notes: 1. (CXX) g++ options: -O3
WebP Image Encode 1.2.4 - Encode Settings: Quality 100 (MP/s, More Is Better): e: 17.54, d: 17.54, f: 17.53, b: 15.57, c: 15.53. Notes: 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
WebP Image Encode 1.2.4 - Encode Settings: Quality 100, Lossless, Highest Compression (MP/s, More Is Better): f: 0.98, e: 0.97, d: 0.97, c: 0.88, b: 0.87. Notes: 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
PyPerformance 1.0.0 - Benchmark: regex_compile (Milliseconds, Fewer Is Better): d: 63.6, f: 64.8, e: 67.0, c: 71.4, b: 71.6
Selenium - Benchmark: Jetstream 2 - Browser: Firefox (Score, More Is Better): f: 256.14, e: 253.62, d: 251.97, b: 229.49, c: 227.94. Notes: 1. firefox 118.0.2
OpenVINO 2023.2.dev - Model: Road Segmentation ADAS FP16-INT8 - Device: CPU (ms, Fewer Is Better): f: 20.89 (MIN: 12.72 / MAX: 44.33), e: 21.51 (MIN: 13.48 / MAX: 43.52), d: 21.63 (MIN: 9.02 / MAX: 46.9), c: 23.41 (MIN: 18.22 / MAX: 41.11), b: 23.47 (MIN: 11.19 / MAX: 40.86). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO 2023.2.dev - Model: Vehicle Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better): f: 8.77 (MIN: 5 / MAX: 22.42), e: 8.93 (MIN: 5.27 / MAX: 19.37), d: 9.01 (MIN: 5.26 / MAX: 19.9), c: 9.79 (MIN: 7.41 / MAX: 19.03), b: 9.84 (MIN: 7.45 / MAX: 19.24). Notes: 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
Node.js V8 Web Tooling Benchmark (runs/s, More Is Better): f: 26.63, d: 26.59, e: 26.44, b: 24.04, c: 23.75
WebP Image Encode 1.2.4 - Encode Settings: Quality 100, Lossless (MP/s, More Is Better): f: 2.45, d: 2.45, e: 2.44, b: 2.20, c: 2.19. Notes: 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 (GFLOP/s, More Is Better): b 32.01, c 31.90, a 31.73, e 28.77, f 28.76, d 28.66 [1. (CXX) g++ options: -O3]
OpenVINO 2023.2.dev - Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU (ms, Fewer Is Better): b 1.27, c 1.28, f 1.39, e 1.40, d 1.41; MIN: 0.78 / MAX: 10.23, MIN: 0.75 / MAX: 10.14, MIN: 0.84 / MAX: 11.1, MIN: 0.75 / MAX: 8.02, MIN: 0.83 / MAX: 9.26 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better): b 31.91, c 31.91, a 31.61, d 28.81, e 28.77, f 28.76 [1. (CXX) g++ options: -O3]
simdjson 2.0 - Throughput Test: Kostya (GB/s, More Is Better): f 5.31, e 5.30, d 5.30, c 4.81, b 4.80 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 (GFLOP/s, More Is Better): d 54.95, f 54.66, e 54.06, c 50.91, b 50.81, a 49.76 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better): b 30.34, c 29.81, a 29.65, d 27.83, e 27.55, f 27.49 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better): d 54.58, f 54.26, e 53.88, c 50.85, b 50.63, a 49.45 [1. (CXX) g++ options: -O3]
Selenium - Benchmark: Octane - Browser: Firefox (Geometric Mean, More Is Better): d 60754, e 60711, f 60209, c 55067, b 55056 [1. firefox 118.0.2]
OpenVINO 2023.2.dev - Model: Weld Porosity Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better): b 11.74, c 11.76, d 12.93, e 12.93, f 12.95; MIN: 9.52 / MAX: 22.89, MIN: 8.2 / MAX: 22.88, MIN: 8.18 / MAX: 24.53, MIN: 8.29 / MAX: 22.42, MIN: 8.15 / MAX: 35.46 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
NAS Parallel Benchmarks 3.4 - Test / Class: MG.C (Total Mop/s, More Is Better): c 26540.76, e 25162.58, d 24498.20, f 24381.01, b 24092.42 [1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz] [2. Open MPI 4.1.5]
Selenium - Benchmark: Jetstream 2 - Browser: Google Chrome (Score, More Is Better): e 421.02, f 419.30, d 419.01, c 388.31, b 382.67 [1. chrome 118.0.5993.70]
Selenium - Benchmark: Kraken - Browser: Google Chrome (ms, Fewer Is Better): f 317.3, e 318.5, d 319.4, c 343.2, b 349.0 [1. chrome 118.0.5993.70]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 (GFLOP/s, More Is Better): b 30.04, c 29.88, a 29.68, d 27.64, e 27.53, f 27.34 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 (GFLOP/s, More Is Better): c 29.87, b 29.52, a 29.39, d 27.63, e 27.40, f 27.34 [1. (CXX) g++ options: -O3]
OpenVINO 2023.2.dev - Model: Machine Translation EN To DE FP16 - Device: CPU (ms, Fewer Is Better): f 123.75, e 126.22, d 128.32, b 134.23, c 135.00; MIN: 72.91 / MAX: 247.36, MIN: 85.87 / MAX: 245.66, MIN: 86.52 / MAX: 370.89, MIN: 63.18 / MAX: 200.65, MIN: 61.37 / MAX: 200.78 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
OpenVINO 2023.2.dev - Model: Face Detection Retail FP16-INT8 - Device: CPU (ms, Fewer Is Better): c 7.37, b 7.52, e 7.96, f 7.98, d 8.02; MIN: 5.99 / MAX: 11.83, MIN: 5.11 / MAX: 16.43, MIN: 4.21 / MAX: 12.87, MIN: 5.16 / MAX: 18.32, MIN: 5.89 / MAX: 15.12 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 (GFLOP/s, More Is Better): b 29.91, c 29.86, a 29.78, d 28.01, f 27.59, e 27.54 [1. (CXX) g++ options: -O3]
OpenVINO 2023.2.dev - Model: Face Detection FP16-INT8 - Device: CPU (ms, Fewer Is Better): e 432.96, f 435.36, d 436.86, b 469.85, c 470.00; MIN: 258.52 / MAX: 962.37, MIN: 259 / MAX: 1007.23, MIN: 257.97 / MAX: 949.62, MIN: 366.95 / MAX: 965.67, MIN: 367.47 / MAX: 966.24 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
DuckDB 0.9.1 - Benchmark: TPC-H Parquet (Seconds, Fewer Is Better): d 83.67, b 89.75, c 90.45; SE +/- 0.21, N = 3, SE +/- 0.28, N = 3, SE +/- 0.50, N = 3 [1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl]
OpenVINO 2023.2.dev - Model: Person Vehicle Bike Detection FP16 - Device: CPU (ms, Fewer Is Better): f 11.17, e 11.83, b 11.88, c 11.91, d 12.05; MIN: 6.05 / MAX: 28.92, MIN: 6.65 / MAX: 27.36, MIN: 9.25 / MAX: 18.16, MIN: 5.89 / MAX: 17.91, MIN: 7.14 / MAX: 26.84 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
OpenVINO 2023.2.dev - Model: Person Detection FP16 - Device: CPU (ms, Fewer Is Better): f 143.52, e 145.64, d 148.70, c 153.73, b 154.78; MIN: 72.01 / MAX: 257.8, MIN: 62.64 / MAX: 256.75, MIN: 107.41 / MAX: 259.59, MIN: 83.63 / MAX: 218.01, MIN: 79.79 / MAX: 219.52 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
GIMP 2.10.34 - Test: rotate (Seconds, Fewer Is Better): d 8.806, f 8.828, e 8.854, b 9.199, c 9.447
Selenium - Benchmark: Speedometer - Browser: Google Chrome (Runs Per Minute, More Is Better): f 504, e 502, d 498, b 486, c 471 [1. chrome 118.0.5993.70]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 (GFLOP/s, More Is Better): b 59.20, c 59.17, a 58.42, d 55.88, e 55.58, f 55.37 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better): b 59.36, c 59.22, a 58.65, f 55.78, d 55.67, e 55.62 [1. (CXX) g++ options: -O3]
Selenium - Benchmark: Speedometer - Browser: Firefox (Runs Per Minute, More Is Better): e 433, d 433, f 430, c 407, b 406 [1. firefox 118.0.2]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 (GFLOP/s, More Is Better): d 19.24, e 18.92, f 18.88, b 18.53, c 18.32, a 18.11 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 (GFLOP/s, More Is Better): d 19.23, f 19.19, e 18.79, c 18.55, b 18.43, a 18.11 [1. (CXX) g++ options: -O3]
OpenVINO 2023.2.dev - Model: Person Detection FP32 - Device: CPU (ms, Fewer Is Better): e 145.89, d 147.80, f 148.29, c 153.66, b 154.50; MIN: 66.07 / MAX: 257.13, MIN: 108.01 / MAX: 317.25, MIN: 66.12 / MAX: 269.72, MIN: 79.35 / MAX: 217.79, MIN: 131.36 / MAX: 217.7 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 (GFLOP/s, More Is Better): e 28.66, d 28.44, f 27.78, b 27.73, c 27.55, a 27.22 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better): e 28.55, d 28.48, f 28.25, c 27.62, b 27.53, a 27.16 [1. (CXX) g++ options: -O3]
GIMP 2.10.34 - Test: resize (Seconds, Fewer Is Better): e 12.11, c 12.39, d 12.52, b 12.63, f 12.70
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 (GFLOP/s, More Is Better): b 55.94, a 55.50, c 55.41, d 53.75, e 53.46, f 53.41 [1. (CXX) g++ options: -O3]
OpenVINO 2023.2.dev - Model: Weld Porosity Detection FP16 - Device: CPU (ms, Fewer Is Better): c 41.63, b 41.65, f 43.32, e 43.34, d 43.58; MIN: 23.08 / MAX: 44.79, MIN: 23.28 / MAX: 44.79, MIN: 31.46 / MAX: 73.03, MIN: 30.19 / MAX: 72.32, MIN: 31.3 / MAX: 72.83 [1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie]
PyPerformance 1.0.0 - Benchmark: python_startup (Milliseconds, Fewer Is Better): d 4.65, f 4.66, e 4.68, c 4.84, b 4.85
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 (GFLOP/s, More Is Better): e 29.11, d 28.91, f 28.45, b 28.42, c 28.21, a 27.99 [1. (CXX) g++ options: -O3]
easyWave r34 - Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 (Seconds, Fewer Is Better): e 129.45, d 129.70, f 131.70, b 134.19, c 134.63 [1. (CXX) g++ options: -O3 -fopenmp]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better): c 55.60, b 55.53, a 55.10, e 53.71, d 53.64, f 53.54 [1. (CXX) g++ options: -O3]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 (GFLOP/s, More Is Better): d 28.94, e 28.91, f 28.53, c 28.40, b 28.34, a 27.99 [1. (CXX) g++ options: -O3]
CloverLeaf 1.3 - Input: clover_bm16 (Seconds, Fewer Is Better): d 1058.26, f 1062.13, e 1063.27, c 1091.47, b 1093.01, a 1094.13 [1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp]
GIMP 2.10.34 - Test: auto-levels (Seconds, Fewer Is Better): d 9.459, e 9.526, f 9.526, b 9.615, c 9.748
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 (GFLOP/s, More Is Better): c 15.22, b 15.22, a 15.08, d 15.00, e 14.90, f 14.77 [1. (CXX) g++ options: -O3]
Selenium - Benchmark: PSPDFKit WASM - Browser: Google Chrome (Score, Fewer Is Better): d 2139, f 2139, e 2140, c 2170, b 2202 [1. chrome 118.0.5993.70]
CloverLeaf 1.3 - Input: clover_bm64_short (Seconds, Fewer Is Better): d 124.58, e 124.71, f 124.75, a 127.74, b 127.94, c 128.11 [1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp]
GIMP 2.10.34 - Test: unsharp-mask (Seconds, Fewer Is Better): d 11.04, b 11.12, e 11.12, f 11.17, c 11.35
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 (GFLOP/s, More Is Better): b 15.20, c 15.18, a 15.11, f 15.03, e 14.79, d 14.79 [1. (CXX) g++ options: -O3]
Selenium - Benchmark: PSPDFKit WASM - Browser: Firefox (Score, Fewer Is Better): e 2051, f 2055, d 2059, c 2096, b 2108 [1. firefox 118.0.2]
QMCPACK 3.17.1 - Input: O_ae_pyscf_UHF (Total Execution Time - Seconds, Fewer Is Better): d 258.40, e 258.48, f 259.37, b 262.57, c 264.99 [1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 (GFLOP/s, More Is Better): d 58.99, b 58.43, e 58.42, f 58.25, c 57.99, a 57.60 [1. (CXX) g++ options: -O3]
QMCPACK 3.17.1 - Input: simple-H2O (Total Execution Time - Seconds, Fewer Is Better): d 31.97, b 32.16, e 32.19, f 32.23, c 32.72 [1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl]
oneDNN 3.3 - Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU (ms, Fewer Is Better): e 4.30610, d 4.30701, f 4.31754, b 4.36881, c 4.39625; MIN: 4.28, MIN: 4.27, MIN: 4.27, MIN: 4.33, MIN: 4.36 [1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl]
HeFFTe - Highly Efficient FFT for Exascale 2.4 - Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 (GFLOP/s, More Is Better): d 58.98, f 58.59, c 58.47, b 58.15, e 58.05, a 57.98 [1. (CXX) g++ options: -O3]
QMCPACK 3.17.1 - Input: FeCO6_b3lyp_gms (Total Execution Time - Seconds, Fewer Is Better): f 263.94, e 264.55, d 265.33, b 266.24, c 266.65 [1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl]
QMCPACK 3.17.1 - Input: LiH_ae_MSD (Total Execution Time - Seconds, Fewer Is Better): f 168.31, e 168.37, d 168.69, c 168.70, b 169.05 [1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl]
GROMACS 2023 - Implementation: MPI CPU - Input: water_GMX50_bare (Ns Per Day, More Is Better): f 1.148, e 1.148, d 1.143 [1. (CXX) g++ options: -O3]
Timed Gem5 Compilation 23.0.1 - Time To Compile (Seconds, Fewer Is Better): f 220.53, e 221.12, d 221.24; SE +/- 0.37, N = 3
Phoronix Test Suite v10.8.5