new tests Tests for a future article. Intel Core i9-14900K testing with a ASUS PRIME Z790-P WIFI (1402 BIOS) and AMD Radeon RX 7900 XTX 24GB on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310314-PTS-NEWTESTS84&grr&rdt .
new tests Processor Motherboard Chipset Memory Disk Graphics Audio Monitor OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution a b c d e f Intel Core i5-13600K @ 5.10GHz (14 Cores / 20 Threads) ASUS PRIME Z790-P WIFI (1402 BIOS) Intel Device 7a27 32GB 1000GB Western Digital WDS100T1X0E-00AFY0 AMD Radeon RX 7900 XTX 24GB (2304/1249MHz) Realtek ALC897 ASUS VP28U Ubuntu 23.10 6.5.0-9-generic (x86_64) GNOME Shell 45.0 X Server + Wayland 4.6 Mesa 23.2.1-1ubuntu3 (LLVM 15.0.7 DRM 3.54) GCC 13.2.0 ext4 3840x2160 Intel Core i5-14600K @ 5.30GHz (14 Cores / 20 Threads) Intel Core i9-14900K @ 5.70GHz (24 Cores / 32 Threads) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x11d - Thermald 2.5.4 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected Python Details - b, c, d, e, f: Python 3.11.6
new tests openfoam: drivaerFastback, Medium Mesh Size - Execution Time openfoam: drivaerFastback, Medium Mesh Size - Mesh Time cloverleaf: clover_bm16 build-linux-kernel: allmodconfig openvkl: vklBenchmarkCPU ISPC build-gem5: Time To Compile build-llvm: Ninja openradioss: INIVOL and Fluid Structure Interaction Drop Container minibude: OpenMP - BM2 minibude: OpenMP - BM2 build-nodejs: Time To Compile qmcpack: FeCO6_b3lyp_gms qmcpack: O_ae_pyscf_UHF duckdb: TPC-H Parquet tensorflow: CPU - 64 - ResNet-50 qmcpack: Li2_STO_ae ospray-studio: 3 - 4K - 32 - Path Tracer - CPU duckdb: IMDB openfoam: drivaerFastback, Small Mesh Size - Execution Time openfoam: drivaerFastback, Small Mesh Size - Mesh Time ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU hpcg: 104 104 104 - 60 openradioss: Bird Strike on Windshield build-godot: Time To Compile openssl: AES-128-GCM openssl: SHA256 openssl: ChaCha20-Poly1305 openssl: AES-256-GCM openssl: ChaCha20 openssl: SHA512 qmcpack: LiH_ae_MSD ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time pgbench: 100 - 1000 - Read Only pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Write - Average Latency pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Write - Average Latency pgbench: 100 - 1000 - Read Write easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 openradioss: Rubber O-Ring Seal Installation openradioss: Bumper Beam cloverleaf: clover_bm64_short ospray-studio: 3 - 4K - 16 - Path Tracer - CPU asmfish: 1024 Hash Memory, 26 Depth xmrig: Monero - 1M specfem3d: Layered Halfspace ospray: particle_volume/ao/real_time npb: SP.C ospray-studio: 2 - 4K - 16 - Path Tracer - CPU rodinia: OpenMP LavaMD ospray-studio: 1 - 4K - 16 - Path Tracer - CPU specfem3d: Water-layered Halfspace selenium: Jetstream 2 - Firefox nginx: 1000 nginx: 500 blender: Fishy Cat - CPU-Only gromacs: MPI CPU - water_GMX50_bare simdjson: Kostya ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray-studio: 2 - 4K - 1 - Path Tracer - CPU vvenc: Bosphorus 4K - Fast onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - f32 - CPU openradioss: Cell Phone Drop Test onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU simdjson: LargeRand pyperformance: python_startup xmrig: Wownero - 1M ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/scivis/real_time v-ray: CPU ospray: gravity_spheres_volume/dim_512/ao/real_time ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU minibude: OpenMP - BM1 minibude: OpenMP - BM1 npb: BT.C ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU memcached: 1:100 memcached: 1:10 ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU ospray: gravity_spheres_volume/dim_512/pathtracer/real_time blender: BMW27 - CPU-Only luxcorerender: Orange Juice - CPU luxcorerender: Danish Mood - CPU ngspice: C7552 tensorflow: CPU - 16 - ResNet-50 openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU luxcorerender: LuxCore Benchmark - CPU ngspice: C2670 luxcorerender: DLSC - CPU indigobench: CPU - Bedroom indigobench: CPU - Supercar openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU john-the-ripper: MD5 john-the-ripper: HMAC-SHA512 openssl: RSA4096 openssl: RSA4096 stockfish: Total Time simdjson: PartialTweets ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU simdjson: DistinctUserID ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU simdjson: TopTweet oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only selenium: Jetstream 2 - Google Chrome build-linux-kernel: defconfig rodinia: OpenMP HotSpot3D uvg266: Bosphorus 4K - Medium rodinia: OpenMP Leukocyte cloverleaf: clover_bm specfem3d: Homogeneous Halfspace npb: LU.C node-web-tooling: vvenc: Bosphorus 4K - Faster selenium: Octane - Firefox specfem3d: Mount St. Helens specfem3d: Tomographic Model astcenc: Exhaustive selenium: Octane - Google Chrome namd: ATPase Simulation - 327,506 Atoms rawtherapee: Total Benchmark Time avifenc: 2 selenium: Speedometer - Firefox kvazaar: Bosphorus 4K - Medium qmcpack: simple-H2O quantlib: Multi-Threaded embree: Pathtracer ISPC - Crown selenium: Speedometer - Google Chrome john-the-ripper: WPA PSK cpuminer-opt: scrypt john-the-ripper: bcrypt john-the-ripper: Blowfish liquid-dsp: 32 - 256 - 512 liquid-dsp: 8 - 256 - 512 liquid-dsp: 16 - 256 - 512 cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Deepcoin cpuminer-opt: Magi cpuminer-opt: Myriad-Groestl cpuminer-opt: Ringcoin cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Skeincoin selenium: PSPDFKit WASM - Firefox selenium: PSPDFKit WASM - Google Chrome embree: Pathtracer ISPC - Asian Dragon webp: Quality 100, Lossless, Highest Compression pyperformance: regex_compile build-mesa: Time To Compile pyperformance: crypto_pyaes pyperformance: json_loads compress-7zip: Decompression Rating compress-7zip: Compression Rating selenium: Kraken - Firefox astcenc: Thorough qmcpack: H4_ae x265: Bosphorus 4K natron: Spaceship npb: FT.C npb: SP.B libraw: Post-Processing Benchmark npb: CG.C uvg266: Bosphorus 4K - Very Fast uvg266: Bosphorus 4K - Super Fast heffte: c2c - Stock - double-long - 512 heffte: c2c - Stock - double - 512 kvazaar: Bosphorus 4K - Very Fast selenium: Kraken - Google Chrome quantlib: Single-Threaded onednn: IP Shapes 1D - f32 - CPU heffte: c2c - FFTW - double - 512 heffte: c2c - FFTW - double-long - 512 uvg266: Bosphorus 4K - Ultra Fast selenium: WASM collisionDetection - Firefox y-cruncher: 500M kvazaar: Bosphorus 4K - Super Fast phpbench: PHP Benchmark Suite gimp: resize luxcorerender: Rainbow Colors and Prism - CPU gimp: unsharp-mask rodinia: OpenMP Streamcluster selenium: WASM collisionDetection - Google Chrome webp: Quality 100, Lossless gimp: auto-levels svt-av1: Preset 8 - Bosphorus 4K pybench: Total For Average Test Times kvazaar: Bosphorus 4K - Ultra Fast onednn: IP Shapes 3D - f32 - CPU gimp: rotate heffte: r2c - Stock - double-long - 512 heffte: r2c - Stock - double - 512 heffte: c2c - Stock - float-long - 512 heffte: c2c - Stock - float - 512 selenium: WASM imageConvolute - Firefox heffte: r2c - FFTW - double-long - 512 heffte: r2c - FFTW - double - 512 rodinia: OpenMP CFD Solver heffte: c2c - FFTW - float-long - 512 heffte: c2c - FFTW - float - 512 svt-hevc: 7 - Bosphorus 4K astcenc: Medium npb: MG.C easywave: e2Asean Grid + BengkuluSept2007 Source - 240 svt-vp9: VMAF Optimized - Bosphorus 4K svt-vp9: Visual Quality Optimized - Bosphorus 4K selenium: WASM imageConvolute - Google Chrome avifenc: 6, Lossless svt-vp9: PSNR/SSIM Optimized - Bosphorus 4K heffte: r2c - Stock - float - 512 heffte: r2c - Stock - float-long - 512 webp: Quality 100, Highest Compression heffte: r2c - FFTW - float-long - 512 heffte: r2c - FFTW - float - 512 svt-hevc: 10 - Bosphorus 4K svt-av1: Preset 13 - Bosphorus 4K svt-av1: Preset 12 - Bosphorus 4K avifenc: 6 darktable: Boat - CPU-only npb: EP.C darktable: Masskrug - CPU-only darktable: Server Room - CPU-only heffte: c2c - Stock - double - 256 heffte: c2c - Stock - double-long - 256 heffte: c2c - FFTW - double-long - 256 heffte: c2c - FFTW - double - 256 webp: Quality 100 heffte: r2c - FFTW - double - 256 heffte: r2c - FFTW - double-long - 256 heffte: r2c - Stock - double-long - 256 heffte: c2c - Stock - float - 256 heffte: r2c - Stock - double - 256 heffte: c2c - Stock - float-long - 256 heffte: c2c - FFTW - float-long - 256 heffte: c2c - FFTW - float - 256 heffte: r2c - FFTW - float-long - 256 heffte: r2c - FFTW - float - 256 heffte: r2c - Stock - float-long - 256 heffte: r2c - Stock - float - 256 darktable: Server Rack - CPU-only heffte: c2c - Stock - double-long - 128 heffte: c2c - Stock - double - 128 heffte: c2c - FFTW - double - 128 heffte: c2c - FFTW - double-long - 128 heffte: r2c - Stock - double-long - 128 heffte: c2c - Stock - float-long - 128 heffte: r2c - Stock - double - 128 heffte: c2c - Stock - float - 128 heffte: c2c - FFTW - float - 128 heffte: r2c - FFTW - double-long - 128 heffte: r2c - FFTW - double - 128 heffte: c2c - FFTW - float-long - 128 heffte: r2c - Stock - float-long - 128 heffte: r2c - Stock - float - 128 heffte: r2c - FFTW - float-long - 128 heffte: r2c - FFTW - float - 128 onednn: IP Shapes 3D - bf16bf16bf16 - CPU a b c d e f 1094.128868103 127.74 58.98 55799.7 185.62 79320 1893.63 5436.88 417.24 8564.28 2345.04 41360 61070 9035.2 21370 15.1102 15.0777 4818.8 18.1985 18.239 27.2229 27.1619 27.985 27.9897 31.7325 31.6129 35.2816 35.2152 49.4547 49.7591 58.4188 58.6489 15.7616 15.8331 16.1534 16.2749 28.5862 28.6186 29.6848 29.6502 29.3894 29.778 31.3044 31.2736 55.5018 55.1029 57.9832 57.5957 14.2837 14.658 18.1124 18.1147 31.8111 31.2292 31.5989 31.2635 51.881 43.71 44.2936 50.9336 66.6684 68.8833 127.305 131.935 2166.8213 230.3022 1093.01 778.31 387 439.462 435.68 12.906 322.654 357.104 266.24 262.57 89.754 24.14 285.85 277357 86.571 224.43546 33.876155 239054 234540 8.28855 228.35 230.157 106224257980 21234053100 31517307700 91408403390 43683166890 6922574030 169.05 7.32159 169.646 947197 1.056 42553 949091 23.96 41736 134.185 165.27 151.75 127.94 139891 47742886 7676.5 135.10279615 7.37745 12080 120716 131.529 118595 126.074892513 229.492 127899.06 135692.91 109.94 4.8 7176 7338 6.539 3320.34 3319.64 96.02 1705.51 1705.1 1.77 4.85 11202.1 8523 3.60869 18743 3.6958 2156 8.679 216.969 34501.92 1848 1830 4058394.64 4464270.25 37384 72426 1730.89 3.45 5.03922 78.97 4.88 2.22 65.896 23.91 469.85 12.75 2.54 66.392 2.88 2.689 7.592 134.23 44.66 154.78 38.73 154.5 38.75 23.47 255 11.88 503.59 71.08 224.92 59.37 269.31 56.41 106.23 7.52 2125.52 9.84 607.54 41.65 383.74 0.49 32245.66 19.33 309.56 11.74 1361.39 5.89 1011.66 1.27 12503.2 1134000 45526000 231709.8 3577.3 46079885 5.39 32575 6.04 62478 31957 61145 7.29 0.43 382.668 65.562 55.996 9.41 53.871 55.82 62.094293176 35845.04 24.04 13.272 55056 51.618068965 49.142967323 1.0807 97769 0.86311 40.086 44.202 406 13.07 32.159 55963.1 19.033 486 36403 195.41 31152 26232 228820000 139440000 204060000 85030 2040.33 5549.86 442.28 8774.33 2508.93 43650 62870 9845.56 22770 2108 2202 23.1049 0.87 71.6 29.825 37.5 10.2 92168 127009 520.2 10.6857 22.73 25.7 4.6 18953.53 16564.09 61.85 8025.9 27.74 29.3 15.2009 15.2166 32.39 349 5016.1 2.98963 18.3225 18.2924 35.3 242.3 13.443 40.71 1491682 12.625 12.46 11.12 15.894 232.97 2.20 9.615 58.458 427 54.14 4.36881 9.199 27.7294 27.527 28.418 28.337 16.3 32.0107 31.9085 10.118 35.3807 35.2893 71.18 81.1968 24092.42 8.637 93.49 88.07 15.95 6.908 111.84 50.6268 50.8129 4.74 59.1962 59.3627 139.99 149.161 146.571 4.515 4.194 1773.71 2.364 1.833 15.9022 16.0466 16.4718 16.4798 15.57 29.014 28.9898 30.042 30.3369 29.522 29.9091 31.8188 31.8292 55.9418 55.526 58.1474 58.425 0.266 14.4836 14.8193 18.4321 18.5277 32.6945 31.6351 32.6174 31.9321 53.1674 44.0648 44.8791 52.499 71.0106 70.4153 134.246 104.025 2166.8275 234.39558 1091.47 767.219 386 435.624 438.56 12.907 322.668 355.827 266.65 264.99 90.453 24.1 284.18 277524 86.555 223.87156 32.902566 238496 234968 8.26023 234.62 234.319 106192145890 21897801590 30264414210 94488932900 43658009260 7042483840 168.7 7.32078 169.835 833675 1.2 42185 1098384 24.006 41656 134.633 165.49 150.19 128.11 139809 47676136 7660.4 140.585720491 7.3669 12101.42 120664 129.977 118639 126.108597881 227.935 127678.15 132783.22 110.19 4.81 7195 7320 6.55 3321.37 3319.3 95.46 1705.89 1707.37 1.77 4.84 11160.8 8512 3.65807 18885 3.79562 2150 8.705 217.632 34573.71 1840 1816 4149899.46 4305589.62 37367 72092 1733.4 3.45 5.04566 79.19 4.91 2.27 63.909 23.93 470 12.75 2.55 66.076 2.87 2.6 7.778 135 44.39 153.73 38.98 153.66 39 23.41 255.73 11.91 502.05 71.15 224.74 59.91 266.96 56.71 105.65 7.37 2170.03 9.79 610.86 41.63 383.97 0.49 32241.73 19.09 313.32 11.76 1359.67 5.81 1028.14 1.28 12456.32 1131000 46015000 231842.8 3578.6 41998826 6.51 32538 6.04 62329 32007 61161 7.31 0.43 388.309 64.959 58.024 9.43 54.926 56.10 63.899206362 34481.92 23.75 13.276 55067 51.749754354 51.562965731 1.0804 105856 0.86116 40.301 43.999 407 13.09 32.718 57317.5 19.1475 471 36403 194.68 31110 31050 228970000 140020000 204100000 86620 2040.66 5604.27 450.42 8765.28 2494.87 43630 62850 9996.74 22780 2096 2170 23.1978 0.88 71.4 29.202 37.4 10.1 92137 127943 520.4 10.6929 25.09 26.74 4.4 18737.09 16408.43 63.14 7422.74 27.74 29.34 15.1774 15.2168 32.33 343.2 5014.7 2.99292 18.275 18.2462 35.09 242.5 13.518 40.69 1479761 12.394 12.54 11.352 15.834 193.39 2.19 9.748 58.184 446 54.29 4.39625 9.447 27.5481 27.6161 28.2076 28.4031 16.1 31.8965 31.9077 10.121 35.1791 35.4345 71.18 81.2623 26540.76 7.127 96 88.36 15.95 6.983 112.34 50.8481 50.9069 4.74 59.1701 59.223 140.06 147.091 148.546 4.581 4.389 1775.67 2.423 1.825 15.7502 15.942 16.3572 16.4311 15.53 28.8122 28.7765 29.8789 29.8126 29.8734 29.8559 31.5524 31.6357 55.4126 55.6015 58.467 57.9937 0.317 14.3837 14.4448 18.5488 18.3191 22.1694 32.3869 32.4099 31.74 52.2532 44.6157 44.6026 51.9406 69.3558 71.5008 131.511 131.251 1816.8935 203.49517 1058.26 519.599 547 221.237 308.209 285.26 20.411 510.266 251.154 265.33 258.4 83.672 34.8 208.7 196772 71.369 162.53309 26.294883 169828 167519 9.91323 166.94 163.491 168593428210 35393577110 47030287560 145829960090 68204421390 10970736110 168.69 10.1188 207.158 1299800 0.769 54797 1361760 17.889 55901 129.701 108.37 114.72 124.58 99301 71318180 9559.6 91.860925305 10.1686 15263.55 85258 86.77 83823 84.113422384 251.968 184950.78 196302.45 75.4 1.143 5.3 5114 5095 7.829 2150.83 2113.83 66.07 1062.21 1145.28 1.99 4.65 16204 6008 5.09456 27940 5.25193 1547 14.948 373.696 50512.7 1304 1294 5061427.55 5693052.64 24365 50740 1448.07 5.48 6.9477 53.09 6.93 3.44 56.736 35.36 436.86 18.22 3.82 57.215 4.15 4.064 12.02 128.32 62.26 148.7 53.71 147.8 54.06 21.63 369.15 12.05 662.91 81.2 295.35 67.85 353.52 66.49 120.21 8.02 2989.78 9.01 886.66 43.58 550.3 0.56 41956.06 22.7 351.84 12.93 1854.34 5.1 1564.5 1.41 16901.1 1862000 71981000 353220.2 5418.1 71722427 7.37 20919 8.25 43450 20752 43729 8.29 0.65 419.011 44.92 48.845 14.25 48.076 42.55 37.654774238 52753.11 26.59 15.438 60754 30.83045146 30.329244698 1.5573 117684 0.61454 34.446 33.2 433 19.38 31.973 88060 29.5036 498 60805 310.55 47260 47385 329800000 168030000 223360000 141280 2726.85 9210.49 714.42 14360 4039.03 70960 102200 16610 38520 2059 2139 35.4539 0.97 63.6 21.8 33 9.18 142133 183859 487.8 16.9563 18.59 32.51 6.4 24740.05 22004.23 71.16 10390.33 38.59 41.03 14.786 15 42.25 319.4 5665 2.0115 15.8089 15.9782 50.11 216 10.784 54.02 1692102 12.516 17.96 11.042 7.224 211.66 2.45 9.459 75.364 377 75.87 4.30701 8.806 28.4441 28.478 28.9081 28.9398 14.2 28.6645 28.806 5.935 31.0588 30.8378 98.86 130.3598 24498.2 4.774 107.4 122.15 14.1 5.097 137.97 54.5762 54.9549 5.36 55.8788 55.6705 185.07 193.281 194.348 3.852 2.697 3342.51 1.694 1.415 14.123 14.0877 14.1676 14.1336 17.54 25.8193 25.7995 27.6358 27.8339 27.6338 28.0061 27.7795 27.8481 53.7468 53.6358 58.982 58.9926 0.209 16.9435 16.751 19.2256 19.2383 52.5398 60.0213 53.6889 59.735 95.578 63.2219 64.2029 99.7527 117.845 115.198 186.551 183.452 1813.292 204.95889 1063.27 527.01 545 221.121 307.199 291.63 20.335 508.378 249.43 264.55 258.48 34.83 203.73 196983 162.41459 27.068099 168775 166105 9.88187 170.46 162.567 167529875870 35045220680 47048102160 144868800380 67647607600 10887827050 168.37 10.1229 209.444 1591084 0.629 55746 1627171 17.751 56335 129.448 109.41 114.07 124.71 99388 71944056 9512.1 91.894740975 10.2038 15270.26 84586 86.948 83524 83.591421969 253.624 180058.09 200059.86 75.33 1.148 5.3 5081 5127 7.946 2145.16 2125.71 65.66 1083.23 1064.7 1.99 4.68 16088.6 5932 5.03492 29012 5.27642 1524 15.007 375.175 50229.93 1294 1305 5241319.8 5730258.98 24526 50458 1447.91 5.45 6.92496 52.94 6.96 3.46 56.807 35.1 432.96 18.46 3.89 57.464 4.17 4.062 12.189 126.22 63.33 145.64 54.88 145.89 54.78 21.51 371.16 11.83 675.24 80.25 298.8 67.16 357.15 66.06 120.99 7.96 3010.6 8.93 894.24 43.34 553.27 0.55 42386.78 22.63 352.96 12.93 1855.23 5.08 1569.98 1.4 17036.01 1862000 65609000 345908.2 5294.3 64256226 7.37 20740 8.25 44009 20746 42719 8.29 0.66 421.021 45.816 48.465 14.29 47.475 42.83 38.235807506 52719.71 26.44 15.297 60711 31.651786074 30.650349799 1.5633 102783 0.61420 35.193 32.79 433 19.43 32.185 89344 29.5741 502 61351 311.73 47942 47590 331120000 171400000 239170000 140960 2605.6 9300.91 710.53 14310 4048.08 70910 103680 16950 38020 2051 2140 35.3254 0.97 67 21.846 33.6 9.23 155668 171918 552.8 16.9459 18.3 33.23 6.3 25591.23 21129.21 69.03 8593.82 38.46 42.91 14.7872 14.9001 42.88 318.5 5672.2 2.01217 15.8476 15.9518 50.81 214.3 10.864 54.71 1685554 12.111 17.74 11.12 7.176 204.81 2.44 9.526 75.647 378 73.99 4.3061 8.854 28.6623 28.5518 29.1118 28.9079 14.3 28.7669 28.7749 6.101 30.9595 30.9894 99.72 130.2225 25162.58 4.833 107.34 120.84 16.93 5.063 150.27 53.8771 54.0622 5.28 55.5756 55.6246 178.47 190.914 190.158 3.472 2.705 2893.39 1.671 1.434 14.0404 14.0495 14.0426 14.0817 17.54 25.5679 25.585 27.5275 27.5452 27.4046 27.5381 27.5697 27.5778 53.4593 53.7069 58.05 58.4249 0.187 16.8437 16.6687 18.7925 18.9154 56.6746 40.5778 52.8988 59.0159 97.4361 64.0485 64.9886 98.9954 114.123 114.341 183.834 184.512 1815.051 195.54508 1062.13 526.015 549 220.533 307.934 290.16 20.389 509.73 249.686 263.94 259.37 34.84 202.62 195691 161.88961 27.222561 168378 166313 9.84149 166.08 162.803 167740437850 35479543830 47059792960 145929235660 68291402680 10972703730 168.31 10.1689 204.454 1599780 0.625 50776 1604627 17.371 57567 131.703 108.16 115.59 124.75 98534 72527312 9501 91.176640881 10.2334 15177.55 84845 86.56 83238 83.64314069 256.136 188170.4 201855.19 75.24 1.148 5.31 5039 5175 7.834 2188.89 2141.47 65.34 1081.46 1044.98 2 4.66 15832.3 6037 4.94223 28970 5.05305 1508 15.023 375.582 49366.17 1318 1296 5171670.55 5727247.49 24512 50705 1431.86 5.54 6.94737 52.67 6.91 3.48 56.766 35.42 435.36 18.26 3.85 56.866 4.22 4.067 11.407 123.75 64.57 143.52 55.69 148.29 53.9 20.89 382.3 11.17 715.44 79.99 299.8 65.93 363.79 66.42 120.32 7.98 3005.33 8.77 910.84 43.32 553.49 0.55 42343.47 22.45 355.77 12.95 1852.66 4.78 1668.97 1.39 17187.3 1888000 71931000 354387.3 5367.5 71115165 7.36 21146 8.24 43624 20598 42916 8.31 0.66 419.3 45.723 49.009 14.38 48.276 42.77 37.994095068 52390.75 26.63 15.439 60209 32.373449454 30.471851303 1.5848 118632 0.61266 35.075 33.165 430 19.4 32.225 89756.1 29.1886 504 61337 309.14 47587 47203 329870000 169900000 239250000 138510 2559.46 9248.07 722 14200 3946.27 71070 102900 16590 38040 2055 2139 35.2707 0.98 64.8 21.985 33 8.98 138934 185607 490.7 16.9808 17.7 33.47 6.4 25290.9 21905.42 70.56 7707.48 39.46 42.68 15.0325 14.7708 43.18 317.3 5666.3 2.03456 15.5292 15.8822 50.69 214.5 11.108 53.99 1686956 12.695 14.89 11.167 7.226 171.17 2.45 9.526 75.057 376 72.63 4.31754 8.828 27.776 28.2529 28.4467 28.5277 14.3 28.762 28.7644 6.49 30.9269 31.0469 98.68 130.417 24381.01 4.785 108.94 120.89 14.04 5.066 152.13 54.2616 54.6591 5.36 55.3711 55.782 170.99 191.65 194.228 3.854 2.771 3340.25 1.667 1.405 14.0224 14.015 14.0375 14.144 17.53 25.449 25.4696 27.3438 27.4897 27.3379 27.5919 27.549 27.654 53.406 53.5429 58.5898 58.2473 0.217 16.8575 17.032 19.1865 18.8817 53.0212 59.2394 54.7998 60.5548 87.4539 62.9035 63.746 96.4633 115.053 115.093 187.72 179.454 OpenBenchmarking.org
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time b c d e f 500 1000 1500 2000 2500 2166.82 2166.83 1816.89 1813.29 1815.05 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time b c d e f 50 100 150 200 250 230.30 234.40 203.50 204.96 195.55 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
CloverLeaf Input: clover_bm16 OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm16 a b c d e f 200 400 600 800 1000 1094.13 1093.01 1091.47 1058.26 1063.27 1062.13 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig b c d e f 200 400 600 800 1000 778.31 767.22 519.60 527.01 526.02
OpenVKL Benchmark: vklBenchmarkCPU ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC b c d e f 120 240 360 480 600 387 386 547 545 549 MIN: 26 / MAX: 5487 MIN: 26 / MAX: 5478 MIN: 37 / MAX: 8225 MIN: 36 / MAX: 7468 MIN: 37 / MAX: 7452
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 23.0.1 Time To Compile d e f 50 100 150 200 250 SE +/- 0.37, N = 3 221.24 221.12 220.53
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja b c d e f 100 200 300 400 500 439.46 435.62 308.21 307.20 307.93
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container b c d e f 100 200 300 400 500 435.68 438.56 285.26 291.63 290.16
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 b c d e f 5 10 15 20 25 12.91 12.91 20.41 20.34 20.39 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 b c d e f 110 220 330 440 550 322.65 322.67 510.27 508.38 509.73 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile b c d e f 80 160 240 320 400 357.10 355.83 251.15 249.43 249.69
QMCPACK Input: FeCO6_b3lyp_gms OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: FeCO6_b3lyp_gms b c d e f 60 120 180 240 300 266.24 266.65 265.33 264.55 263.94 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: O_ae_pyscf_UHF OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: O_ae_pyscf_UHF b c d e f 60 120 180 240 300 262.57 264.99 258.40 258.48 259.37 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
DuckDB Benchmark: TPC-H Parquet OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: TPC-H Parquet b c d 20 40 60 80 100 SE +/- 0.28, N = 3 SE +/- 0.50, N = 3 SE +/- 0.21, N = 3 89.75 90.45 83.67 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
TensorFlow Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 b c d e f 8 16 24 32 40 24.14 24.10 34.80 34.83 34.84
QMCPACK Input: Li2_STO_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: Li2_STO_ae b c d e f 60 120 180 240 300 285.85 284.18 208.70 203.73 202.62 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 60K 120K 180K 240K 300K 277357 277524 196772 196983 195691
DuckDB Benchmark: IMDB OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: IMDB b c d 20 40 60 80 100 SE +/- 0.48, N = 3 SE +/- 0.54, N = 3 SE +/- 0.27, N = 3 86.57 86.56 71.37 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
OpenFOAM Input: drivaerFastback, Small Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time b c d e f 50 100 150 200 250 224.44 223.87 162.53 162.41 161.89 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: drivaerFastback, Small Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time b c d e f 8 16 24 32 40 33.88 32.90 26.29 27.07 27.22 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 50K 100K 150K 200K 250K 239054 238496 169828 168775 168378
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 50K 100K 150K 200K 250K 234540 234968 167519 166105 166313
High Performance Conjugate Gradient X Y Z: 104 104 104 - RT: 60 OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 b c d e f 3 6 9 12 15 8.28855 8.26023 9.91323 9.88187 9.84149 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield b c d e f 50 100 150 200 250 228.35 234.62 166.94 170.46 166.08
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile b c d e f 50 100 150 200 250 230.16 234.32 163.49 162.57 162.80
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM b c d e f 40000M 80000M 120000M 160000M 200000M 106224257980 106192145890 168593428210 167529875870 167740437850 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 b c d e f 8000M 16000M 24000M 32000M 40000M 21234053100 21897801590 35393577110 35045220680 35479543830 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 b c d e f 10000M 20000M 30000M 40000M 50000M 31517307700 30264414210 47030287560 47048102160 47059792960 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM b c d e f 30000M 60000M 90000M 120000M 150000M 91408403390 94488932900 145829960090 144868800380 145929235660 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 b c d e f 15000M 30000M 45000M 60000M 75000M 43683166890 43658009260 68204421390 67647607600 68291402680 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 b c d e f 2000M 4000M 6000M 8000M 10000M 6922574030 7042483840 10970736110 10887827050 10972703730 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
QMCPACK Input: LiH_ae_MSD OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: LiH_ae_MSD b c d e f 40 80 120 160 200 169.05 168.70 168.69 168.37 168.31 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time b c d e f 3 6 9 12 15 7.32159 7.32078 10.11880 10.12290 10.16890
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time b c d e f 50 100 150 200 250 169.65 169.84 207.16 209.44 204.45
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only b c d e f 300K 600K 900K 1200K 1500K 947197 833675 1299800 1591084 1599780 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency b c d e f 0.27 0.54 0.81 1.08 1.35 1.054 0.910 0.734 0.615 0.623 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency b c d e f 12K 24K 36K 48K 60K 42553 42185 54797 55746 50776 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency b c d e f 300K 600K 900K 1200K 1500K 949091 1098384 1361760 1627171 1604627 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency b c d e f 6 12 18 24 30 23.50 23.71 18.25 17.94 19.69 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write b c d e f 12K 24K 36K 48K 60K 41736 41656 55901 56335 57567 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 b c d e f 30 60 90 120 150 134.19 134.63 129.70 129.45 131.70 1. (CXX) g++ options: -O3 -fopenmp
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation b c d e f 40 80 120 160 200 165.27 165.49 108.37 109.41 108.16
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam b c d e f 30 60 90 120 150 151.75 150.19 114.72 114.07 115.59
CloverLeaf Input: clover_bm64_short OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm64_short a b c d e f 30 60 90 120 150 127.74 127.94 128.11 124.58 124.71 124.75 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 30K 60K 90K 120K 150K 139891 139809 99301 99388 98534
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth b c d e f 16M 32M 48M 64M 80M 47742886 47676136 71318180 71944056 72527312
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.18.1 Variant: Monero - Hash Count: 1M b c d e f 2K 4K 6K 8K 10K 7676.5 7660.4 9559.6 9512.1 9501.0 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
SPECFEM3D Model: Layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace b c d e f 30 60 90 120 150 135.10 140.59 91.86 91.89 91.18 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time b c d e f 3 6 9 12 15 7.37745 7.36690 10.16860 10.20380 10.23340
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C b c d e f 3K 6K 9K 12K 15K 12080.00 12101.42 15263.55 15270.26 15177.55 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 30K 60K 90K 120K 150K 120716 120664 85258 84586 84845
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD b c d e f 30 60 90 120 150 131.53 129.98 86.77 86.95 86.56 1. (CXX) g++ options: -O2 -lOpenCL
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 30K 60K 90K 120K 150K 118595 118639 83823 83524 83238
SPECFEM3D Model: Water-layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace b c d e f 30 60 90 120 150 126.07 126.11 84.11 83.59 83.64 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Selenium Benchmark: Jetstream 2 - Browser: Firefox OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Firefox b c d e f 60 120 180 240 300 229.49 227.94 251.97 253.62 256.14 1. firefox 118.0.2
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 b c d e f 40K 80K 120K 160K 200K 127899.06 127678.15 184950.78 180058.09 188170.40 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 b c d e f 40K 80K 120K 160K 200K 135692.91 132783.22 196302.45 200059.86 201855.19 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only b c d e f 20 40 60 80 100 109.94 110.19 75.40 75.33 75.24
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare d e f 0.2583 0.5166 0.7749 1.0332 1.2915 1.143 1.148 1.148 1. (CXX) g++ options: -O3
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya b c d e f 1.1948 2.3896 3.5844 4.7792 5.974 4.80 4.81 5.30 5.30 5.31 1. (CXX) g++ options: -O3
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 1500 3000 4500 6000 7500 7176 7195 5114 5081 5039
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 1600 3200 4800 6400 8000 7338 7320 5095 5127 5175
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast b c d e f 2 4 6 8 10 6.539 6.550 7.829 7.946 7.834 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU b c d e f 700 1400 2100 2800 3500 3320.34 3321.37 2150.83 2145.16 2188.89 MIN: 3316.93 MIN: 3317.95 MIN: 1926.06 MIN: 1934.78 MIN: 1932.77 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU b c d e f 700 1400 2100 2800 3500 3319.64 3319.30 2113.83 2125.71 2141.47 MIN: 3316.21 MIN: 3315.49 MIN: 1927.97 MIN: 1925.61 MIN: 1947.18 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test b c d e f 20 40 60 80 100 96.02 95.46 66.07 65.66 65.34
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU b c d e f 400 800 1200 1600 2000 1705.51 1705.89 1062.21 1083.23 1081.46 MIN: 1702.42 MIN: 1703.31 MIN: 982.53 MIN: 981.64 MIN: 999.13 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU b c d e f 400 800 1200 1600 2000 1705.10 1707.37 1145.28 1064.70 1044.98 MIN: 1702.22 MIN: 1704.07 MIN: 996.9 MIN: 979.41 MIN: 983.94 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom b c d e f 0.45 0.9 1.35 1.8 2.25 1.77 1.77 1.99 1.99 2.00 1. (CXX) g++ options: -O3
PyPerformance Benchmark: python_startup OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: python_startup b c d e f 1.0913 2.1826 3.2739 4.3652 5.4565 4.85 4.84 4.65 4.68 4.66
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.18.1 Variant: Wownero - Hash Count: 1M b c d e f 3K 6K 9K 12K 15K 11202.1 11160.8 16204.0 16088.6 15832.3 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 2K 4K 6K 8K 10K 8523 8512 6008 5932 6037
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time b c d e f 1.1463 2.2926 3.4389 4.5852 5.7315 3.60869 3.65807 5.09456 5.03492 4.94223
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5.02 Mode: CPU b c d e f 6K 12K 18K 24K 30K 18743 18885 27940 29012 28970
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time b c d e f 1.1872 2.3744 3.5616 4.7488 5.936 3.69580 3.79562 5.25193 5.27642 5.05305
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 500 1000 1500 2000 2500 2156 2150 1547 1524 1508
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 b c d e f 4 8 12 16 20 8.679 8.705 14.948 15.007 15.023 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 b c d e f 80 160 240 320 400 216.97 217.63 373.70 375.18 375.58 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C b c d e f 11K 22K 33K 44K 55K 34501.92 34573.71 50512.70 50229.93 49366.17 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 400 800 1200 1600 2000 1848 1840 1304 1294 1318
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU b c d e f 400 800 1200 1600 2000 1830 1816 1294 1305 1296
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 b c d e f 1.1M 2.2M 3.3M 4.4M 5.5M 4058394.64 4149899.46 5061427.55 5241319.80 5171670.55 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Memcached Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 b c d e f 1.2M 2.4M 3.6M 4.8M 6M 4464270.25 4305589.62 5693052.64 5730258.98 5727247.49 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 8K 16K 24K 32K 40K 37384 37367 24365 24526 24512
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 16K 32K 48K 64K 80K 72426 72092 50740 50458 50705
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU b c d e f 400 800 1200 1600 2000 1730.89 1733.40 1448.07 1447.91 1431.86 MIN: 1541.53 / MAX: 2245.54 MIN: 1541.89 / MAX: 2247.48 MIN: 1089.83 / MAX: 1961.07 MIN: 1094.97 / MAX: 2000.94 MIN: 963.6 / MAX: 2030.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU b c d e f 1.2465 2.493 3.7395 4.986 6.2325 3.45 3.45 5.48 5.45 5.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time b c d e f 2 4 6 8 10 5.03922 5.04566 6.94770 6.92496 6.94737
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only b c d e f 20 40 60 80 100 78.97 79.19 53.09 52.94 52.67
LuxCoreRender Scene: Orange Juice - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU b c d e f 2 4 6 8 10 4.88 4.91 6.93 6.96 6.91 MIN: 4.71 / MAX: 4.93 MIN: 4.72 / MAX: 4.97 MIN: 6.23 / MAX: 7.51 MIN: 6.29 / MAX: 7.55 MIN: 6.28 / MAX: 7.27
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU b c d e f 0.783 1.566 2.349 3.132 3.915 2.22 2.27 3.44 3.46 3.48 MIN: 0.78 / MAX: 2.61 MIN: 0.86 / MAX: 2.63 MIN: 1.57 / MAX: 3.87 MIN: 1.38 / MAX: 3.89 MIN: 1.71 / MAX: 3.89
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 b c d e f 15 30 45 60 75 65.90 63.91 56.74 56.81 56.77 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
TensorFlow Device: CPU - Batch Size: 16 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 16 - Model: ResNet-50 b c d e f 8 16 24 32 40 23.91 23.93 35.36 35.10 35.42
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU b c d e f 100 200 300 400 500 469.85 470.00 436.86 432.96 435.36 MIN: 366.95 / MAX: 965.67 MIN: 367.47 / MAX: 966.24 MIN: 257.97 / MAX: 949.62 MIN: 258.52 / MAX: 962.37 MIN: 259 / MAX: 1007.23 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU b c d e f 5 10 15 20 25 12.75 12.75 18.22 18.46 18.26 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU b c d e f 0.8753 1.7506 2.6259 3.5012 4.3765 2.54 2.55 3.82 3.89 3.85 MIN: 0.92 / MAX: 2.97 MIN: 0.93 / MAX: 2.98 MIN: 1.47 / MAX: 4.34 MIN: 1.74 / MAX: 4.37 MIN: 1.53 / MAX: 4.35
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 b c d e f 15 30 45 60 75 66.39 66.08 57.22 57.46 56.87 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU b c d e f 0.9495 1.899 2.8485 3.798 4.7475 2.88 2.87 4.15 4.17 4.22 MIN: 2.81 / MAX: 3 MIN: 2.8 / MAX: 2.98 MIN: 4.05 / MAX: 4.44 MIN: 4 / MAX: 4.47 MIN: 4.08 / MAX: 4.91
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom b c d e f 0.9151 1.8302 2.7453 3.6604 4.5755 2.689 2.600 4.064 4.062 4.067
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar b c d e f 3 6 9 12 15 7.592 7.778 12.020 12.189 11.407
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU b c d e f 30 60 90 120 150 134.23 135.00 128.32 126.22 123.75 MIN: 63.18 / MAX: 200.65 MIN: 61.37 / MAX: 200.78 MIN: 86.52 / MAX: 370.89 MIN: 85.87 / MAX: 245.66 MIN: 72.91 / MAX: 247.36 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU b c d e f 14 28 42 56 70 44.66 44.39 62.26 63.33 64.57 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU b c d e f 30 60 90 120 150 154.78 153.73 148.70 145.64 143.52 MIN: 79.79 / MAX: 219.52 MIN: 83.63 / MAX: 218.01 MIN: 107.41 / MAX: 259.59 MIN: 62.64 / MAX: 256.75 MIN: 72.01 / MAX: 257.8 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU b c d e f 13 26 39 52 65 38.73 38.98 53.71 54.88 55.69 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU b c d e f 30 60 90 120 150 154.50 153.66 147.80 145.89 148.29 MIN: 131.36 / MAX: 217.7 MIN: 79.35 / MAX: 217.79 MIN: 108.01 / MAX: 317.25 MIN: 66.07 / MAX: 257.13 MIN: 66.12 / MAX: 269.72 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU b c d e f 12 24 36 48 60 38.75 39.00 54.06 54.78 53.90 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU b c d e f 6 12 18 24 30 23.47 23.41 21.63 21.51 20.89 MIN: 11.19 / MAX: 40.86 MIN: 18.22 / MAX: 41.11 MIN: 9.02 / MAX: 46.9 MIN: 13.48 / MAX: 43.52 MIN: 12.72 / MAX: 44.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU b c d e f 80 160 240 320 400 255.00 255.73 369.15 371.16 382.30 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU b c d e f 3 6 9 12 15 11.88 11.91 12.05 11.83 11.17 MIN: 9.25 / MAX: 18.16 MIN: 5.89 / MAX: 17.91 MIN: 7.14 / MAX: 26.84 MIN: 6.65 / MAX: 27.36 MIN: 6.05 / MAX: 28.92 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU b c d e f 150 300 450 600 750 503.59 502.05 662.91 675.24 715.44 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU b c d e f 20 40 60 80 100 71.08 71.15 81.20 80.25 79.99 MIN: 55.16 / MAX: 87.27 MIN: 53.9 / MAX: 90.11 MIN: 43.5 / MAX: 176.88 MIN: 63.46 / MAX: 183.13 MIN: 47.77 / MAX: 173.9 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU b c d e f 70 140 210 280 350 224.92 224.74 295.35 298.80 299.80 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU b c d e f 15 30 45 60 75 59.37 59.91 67.85 67.16 65.93 MIN: 52.65 / MAX: 80.39 MIN: 30.66 / MAX: 74.01 MIN: 31.98 / MAX: 153.29 MIN: 42.11 / MAX: 149.84 MIN: 31.99 / MAX: 150.16 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU b c d e f 80 160 240 320 400 269.31 266.96 353.52 357.15 363.79 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU b c d e f 15 30 45 60 75 56.41 56.71 66.49 66.06 66.42 MIN: 41.43 / MAX: 98.24 MIN: 40.93 / MAX: 99.23 MIN: 40.75 / MAX: 125.11 MIN: 32.08 / MAX: 125.5 MIN: 39.7 / MAX: 126.38 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU b c d e f 30 60 90 120 150 106.23 105.65 120.21 120.99 120.32 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU b c d e f 2 4 6 8 10 7.52 7.37 8.02 7.96 7.98 MIN: 5.11 / MAX: 16.43 MIN: 5.99 / MAX: 11.83 MIN: 5.89 / MAX: 15.12 MIN: 4.21 / MAX: 12.87 MIN: 5.16 / MAX: 18.32 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU b c d e f 600 1200 1800 2400 3000 2125.52 2170.03 2989.78 3010.60 3005.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU b c d e f 3 6 9 12 15 9.84 9.79 9.01 8.93 8.77 MIN: 7.45 / MAX: 19.24 MIN: 7.41 / MAX: 19.03 MIN: 5.26 / MAX: 19.9 MIN: 5.27 / MAX: 19.37 MIN: 5 / MAX: 22.42 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU b c d e f 200 400 600 800 1000 607.54 610.86 886.66 894.24 910.84 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU b c d e f 10 20 30 40 50 41.65 41.63 43.58 43.34 43.32 MIN: 23.28 / MAX: 44.79 MIN: 23.08 / MAX: 44.79 MIN: 31.3 / MAX: 72.83 MIN: 30.19 / MAX: 72.32 MIN: 31.46 / MAX: 73.03 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU b c d e f 120 240 360 480 600 383.74 383.97 550.30 553.27 553.49 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU b c d e f 0.126 0.252 0.378 0.504 0.63 0.49 0.49 0.56 0.55 0.55 MIN: 0.29 / MAX: 3.73 MIN: 0.29 / MAX: 3.11 MIN: 0.28 / MAX: 9.03 MIN: 0.28 / MAX: 13.45 MIN: 0.27 / MAX: 5.91 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU b c d e f 9K 18K 27K 36K 45K 32245.66 32241.73 41956.06 42386.78 42343.47 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU b c d e f 5 10 15 20 25 19.33 19.09 22.70 22.63 22.45 MIN: 8.18 / MAX: 34.75 MIN: 15 / MAX: 32.89 MIN: 11.49 / MAX: 48.33 MIN: 11.81 / MAX: 48.47 MIN: 11.6 / MAX: 50.7 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU b c d e f 80 160 240 320 400 309.56 313.32 351.84 352.96 355.77 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU b c d e f 3 6 9 12 15 11.74 11.76 12.93 12.93 12.95 MIN: 9.52 / MAX: 22.89 MIN: 8.2 / MAX: 22.88 MIN: 8.18 / MAX: 24.53 MIN: 8.29 / MAX: 22.42 MIN: 8.15 / MAX: 35.46 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU b c d e f 400 800 1200 1600 2000 1361.39 1359.67 1854.34 1855.23 1852.66 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU b c d e f 1.3253 2.6506 3.9759 5.3012 6.6265 5.89 5.81 5.10 5.08 4.78 MIN: 4.59 / MAX: 9.41 MIN: 2.72 / MAX: 9.48 MIN: 2.36 / MAX: 12.91 MIN: 3.18 / MAX: 17.17 MIN: 2.77 / MAX: 19.3 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU b c d e f 400 800 1200 1600 2000 1011.66 1028.14 1564.50 1569.98 1668.97 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU b c d e f 0.3173 0.6346 0.9519 1.2692 1.5865 1.27 1.28 1.41 1.40 1.39 MIN: 0.78 / MAX: 10.23 MIN: 0.75 / MAX: 10.14 MIN: 0.83 / MAX: 9.26 MIN: 0.75 / MAX: 8.02 MIN: 0.84 / MAX: 11.1 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU b c d e f 4K 8K 12K 16K 20K 12503.20 12456.32 16901.10 17036.01 17187.30 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 b c d e f 400K 800K 1200K 1600K 2000K 1134000 1131000 1862000 1862000 1888000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
John The Ripper Test: HMAC-SHA512 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 b c d e f 15M 30M 45M 60M 75M 45526000 46015000 71981000 65609000 71931000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 b c d e f 80K 160K 240K 320K 400K 231709.8 231842.8 353220.2 345908.2 354387.3 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 b c d e f 1200 2400 3600 4800 6000 3577.3 3578.6 5418.1 5294.3 5367.5 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time b c d e f 15M 30M 45M 60M 75M 46079885 41998826 71722427 64256226 71115165 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: PartialTweets b c d e f 2 4 6 8 10 5.39 6.51 7.37 7.37 7.36 1. (CXX) g++ options: -O3
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 7K 14K 21K 28K 35K 32575 32538 20919 20740 21146
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID b c d e f 2 4 6 8 10 6.04 6.04 8.25 8.25 8.24 1. (CXX) g++ options: -O3
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 13K 26K 39K 52K 65K 62478 62329 43450 44009 43624
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU b c d e f 7K 14K 21K 28K 35K 31957 32007 20752 20746 20598
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU b c d e f 13K 26K 39K 52K 65K 61145 61161 43729 42719 42916
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: TopTweet b c d e f 2 4 6 8 10 7.29 7.31 8.29 8.29 8.31 1. (CXX) g++ options: -O3
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only b c d e f 0.1485 0.297 0.4455 0.594 0.7425 0.43 0.43 0.65 0.66 0.66
Selenium Benchmark: Jetstream 2 - Browser: Google Chrome OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Google Chrome b c d e f 90 180 270 360 450 382.67 388.31 419.01 421.02 419.30 1. chrome 118.0.5993.70
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig b c d e f 15 30 45 60 75 65.56 64.96 44.92 45.82 45.72
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D b c d e f 13 26 39 52 65 56.00 58.02 48.85 48.47 49.01 1. (CXX) g++ options: -O2 -lOpenCL
uvg266 Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium b c d e f 4 8 12 16 20 9.41 9.43 14.25 14.29 14.38
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte b c d e f 12 24 36 48 60 53.87 54.93 48.08 47.48 48.28 1. (CXX) g++ options: -O2 -lOpenCL
CloverLeaf Input: clover_bm OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm a b c d e f 13 26 39 52 65 58.98 55.82 56.10 42.55 42.83 42.77 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace b c d e f 14 28 42 56 70 62.09 63.90 37.65 38.24 37.99 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C b c d e f 11K 22K 33K 44K 55K 35845.04 34481.92 52753.11 52719.71 52390.75 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark b c d e f 6 12 18 24 30 24.04 23.75 26.59 26.44 26.63
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster b c d e f 4 8 12 16 20 13.27 13.28 15.44 15.30 15.44 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Selenium Benchmark: Octane - Browser: Firefox OpenBenchmarking.org Geometric Mean, More Is Better Selenium Benchmark: Octane - Browser: Firefox b c d e f 13K 26K 39K 52K 65K 55056 55067 60754 60711 60209 1. firefox 118.0.2
SPECFEM3D Model: Mount St. Helens OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens b c d e f 12 24 36 48 60 51.62 51.75 30.83 31.65 32.37 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SPECFEM3D Model: Tomographic Model OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model b c d e f 12 24 36 48 60 49.14 51.56 30.33 30.65 30.47 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Exhaustive b c d e f 0.3566 0.7132 1.0698 1.4264 1.783 1.0807 1.0804 1.5573 1.5633 1.5848 1. (CXX) g++ options: -O3 -flto -pthread
Selenium Benchmark: Octane - Browser: Google Chrome OpenBenchmarking.org Geometric Mean, More Is Better Selenium Benchmark: Octane - Browser: Google Chrome b c d e f 30K 60K 90K 120K 150K 97769 105856 117684 102783 118632 1. chrome 118.0.5993.70
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms b c d e f 0.1942 0.3884 0.5826 0.7768 0.971 0.86311 0.86116 0.61454 0.61420 0.61266
RawTherapee Total Benchmark Time OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time b c d e f 9 18 27 36 45 40.09 40.30 34.45 35.19 35.08 1. RawTherapee, version 5.9, command line.
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 b c d e f 10 20 30 40 50 44.20 44.00 33.20 32.79 33.17 1. (CXX) g++ options: -O3 -fPIC -lm
Selenium Benchmark: Speedometer - Browser: Firefox OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Firefox b c d e f 90 180 270 360 450 406 407 433 433 430 1. firefox 118.0.2
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium b c d e f 5 10 15 20 25 13.07 13.09 19.38 19.43 19.40 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: simple-H2O b c d e f 8 16 24 32 40 32.16 32.72 31.97 32.19 32.23 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QuantLib Configuration: Multi-Threaded OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Multi-Threaded a b c d e f 20K 40K 60K 80K 100K 55799.7 55963.1 57317.5 88060.0 89344.0 89756.1 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown b c d e f 7 14 21 28 35 19.03 19.15 29.50 29.57 29.19 MIN: 18.79 / MAX: 19.41 MIN: 18.85 / MAX: 19.5 MIN: 28.9 / MAX: 32.01 MIN: 29.06 / MAX: 32.27 MIN: 28.59 / MAX: 31.95
Selenium Benchmark: Speedometer - Browser: Google Chrome OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Google Chrome b c d e f 110 220 330 440 550 486 471 498 502 504 1. chrome 118.0.5993.70
John The Ripper Test: WPA PSK OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK b c d e f 13K 26K 39K 52K 65K 36403 36403 60805 61351 61337 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
Cpuminer-Opt Algorithm: scrypt OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: scrypt a b c d e f 70 140 210 280 350 185.62 195.41 194.68 310.55 311.73 309.14 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
John The Ripper Test: bcrypt OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: bcrypt b c d e f 10K 20K 30K 40K 50K 31152 31110 47260 47942 47587 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish b c d e f 10K 20K 30K 40K 50K 26232 31050 47385 47590 47203 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 512 b c d e f 70M 140M 210M 280M 350M 228820000 228970000 329800000 331120000 329870000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 b c d e f 40M 80M 120M 160M 200M 139440000 140020000 168030000 171400000 169900000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 b c d e f 50M 100M 150M 200M 250M 204060000 204100000 223360000 239170000 239250000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Blake-2 S a b c d e f 30K 60K 90K 120K 150K 79320 85030 86620 141280 140960 138510 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Garlicoin a b c d e f 600 1200 1800 2400 3000 1893.63 2040.33 2040.66 2726.85 2605.60 2559.46 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Deepcoin a b c d e f 2K 4K 6K 8K 10K 5436.88 5549.86 5604.27 9210.49 9300.91 9248.07 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Magi a b c d e f 160 320 480 640 800 417.24 442.28 450.42 714.42 710.53 722.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Myriad-Groestl a b c d e f 3K 6K 9K 12K 15K 8564.28 8774.33 8765.28 14360.00 14310.00 14200.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Ringcoin a b c d e f 900 1800 2700 3600 4500 2345.04 2508.93 2494.87 4039.03 4048.08 3946.27 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Quad SHA-256, Pyrite a b c d e f 15K 30K 45K 60K 75K 41360 43650 43630 70960 70910 71070 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Triple SHA-256, Onecoin a b c d e f 20K 40K 60K 80K 100K 61070 62870 62850 102200 103680 102900 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: LBC, LBRY Credits a b c d e f 4K 8K 12K 16K 20K 9035.20 9845.56 9996.74 16610.00 16950.00 16590.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Skeincoin a b c d e f 8K 16K 24K 32K 40K 21370 22770 22780 38520 38020 38040 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Selenium Benchmark: PSPDFKit WASM - Browser: Firefox OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Firefox b c d e f 500 1000 1500 2000 2500 2108 2096 2059 2051 2055 1. firefox 118.0.2
Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome b c d e f 500 1000 1500 2000 2500 2202 2170 2139 2140 2139 1. chrome 118.0.5993.70
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon b c d e f 8 16 24 32 40 23.10 23.20 35.45 35.33 35.27 MIN: 22.96 / MAX: 23.41 MIN: 23.04 / MAX: 23.45 MIN: 35.02 / MAX: 37.92 MIN: 34.92 / MAX: 37.85 MIN: 34.8 / MAX: 37.69
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless, Highest Compression b c d e f 0.2205 0.441 0.6615 0.882 1.1025 0.87 0.88 0.97 0.97 0.98 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
PyPerformance Benchmark: regex_compile OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: regex_compile b c d e f 16 32 48 64 80 71.6 71.4 63.6 67.0 64.8
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile b c d e f 7 14 21 28 35 29.83 29.20 21.80 21.85 21.99
PyPerformance Benchmark: crypto_pyaes OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: crypto_pyaes b c d e f 9 18 27 36 45 37.5 37.4 33.0 33.6 33.0
PyPerformance Benchmark: json_loads OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: json_loads b c d e f 3 6 9 12 15 10.20 10.10 9.18 9.23 8.98
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating b c d e f 30K 60K 90K 120K 150K 92168 92137 142133 155668 138934 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating b c d e f 40K 80K 120K 160K 200K 127009 127943 183859 171918 185607 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Selenium Benchmark: Kraken - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Firefox b c d e f 120 240 360 480 600 520.2 520.4 487.8 552.8 490.7 1. firefox 118.0.2
ASTC Encoder Preset: Thorough OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Thorough b c d e f 4 8 12 16 20 10.69 10.69 16.96 16.95 16.98 1. (CXX) g++ options: -O3 -flto -pthread
QMCPACK Input: H4_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: H4_ae b c d e f 6 12 18 24 30 22.73 25.09 18.59 18.30 17.70 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K b c d e f 8 16 24 32 40 25.70 26.74 32.51 33.23 33.47 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Natron Input: Spaceship OpenBenchmarking.org FPS, More Is Better Natron 2.4.3 Input: Spaceship b c d e f 2 4 6 8 10 4.6 4.4 6.4 6.3 6.4
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C b c d e f 5K 10K 15K 20K 25K 18953.53 18737.09 24740.05 25591.23 25290.90 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B b c d e f 5K 10K 15K 20K 25K 16564.09 16408.43 22004.23 21129.21 21905.42 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark b c d e f 16 32 48 64 80 61.85 63.14 71.16 69.03 70.56 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C b c d e f 2K 4K 6K 8K 10K 8025.90 7422.74 10390.33 8593.82 7707.48 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
uvg266 Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast b c d e f 9 18 27 36 45 27.74 27.74 38.59 38.46 39.46
uvg266 Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast b c d e f 10 20 30 40 50 29.30 29.34 41.03 42.91 42.68
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 a b c d e f 4 8 12 16 20 15.11 15.20 15.18 14.79 14.79 15.03 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 a b c d e f 4 8 12 16 20 15.08 15.22 15.22 15.00 14.90 14.77 1. (CXX) g++ options: -O3
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast b c d e f 10 20 30 40 50 32.39 32.33 42.25 42.88 43.18 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Selenium Benchmark: Kraken - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Google Chrome b c d e f 80 160 240 320 400 349.0 343.2 319.4 318.5 317.3 1. chrome 118.0.5993.70
QuantLib Configuration: Single-Threaded OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Single-Threaded a b c d e f 1200 2400 3600 4800 6000 4818.8 5016.1 5014.7 5665.0 5672.2 5666.3 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU b c d e f 0.6734 1.3468 2.0202 2.6936 3.367 2.98963 2.99292 2.01150 2.01217 2.03456 MIN: 2.87 MIN: 2.87 MIN: 1.64 MIN: 1.64 MIN: 1.64 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 a b c d e f 5 10 15 20 25 18.20 18.32 18.28 15.81 15.85 15.53 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 a b c d e f 5 10 15 20 25 18.24 18.29 18.25 15.98 15.95 15.88 1. (CXX) g++ options: -O3
uvg266 Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast b c d e f 11 22 33 44 55 35.30 35.09 50.11 50.81 50.69
Selenium Benchmark: WASM collisionDetection - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Firefox b c d e f 50 100 150 200 250 242.3 242.5 216.0 214.3 214.5 1. firefox 118.0.2
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.7.10.9513 Pi Digits To Calculate: 500M b c d e f 3 6 9 12 15 13.44 13.52 10.78 10.86 11.11
Kvazaar Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast b c d e f 12 24 36 48 60 40.71 40.69 54.02 54.71 53.99 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite b c d e f 400K 800K 1200K 1600K 2000K 1491682 1479761 1692102 1685554 1686956
GIMP Test: resize OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: resize b c d e f 3 6 9 12 15 12.63 12.39 12.52 12.11 12.70
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU b c d e f 4 8 12 16 20 12.46 12.54 17.96 17.74 14.89 MIN: 12.32 / MAX: 12.57 MIN: 12.43 / MAX: 12.6 MIN: 17.28 / MAX: 18.69 MIN: 16.79 / MAX: 18.55 MIN: 12.73 / MAX: 16.75
GIMP Test: unsharp-mask OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: unsharp-mask b c d e f 3 6 9 12 15 11.12 11.35 11.04 11.12 11.17
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster b c d e f 4 8 12 16 20 15.894 15.834 7.224 7.176 7.226 1. (CXX) g++ options: -O2 -lOpenCL
Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome b c d e f 50 100 150 200 250 232.97 193.39 211.66 204.81 171.17 1. chrome 118.0.5993.70
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless b c d e f 0.5513 1.1026 1.6539 2.2052 2.7565 2.20 2.19 2.45 2.44 2.45 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
GIMP Test: auto-levels OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: auto-levels b c d e f 3 6 9 12 15 9.615 9.748 9.459 9.526 9.526
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K b c d e f 20 40 60 80 100 58.46 58.18 75.36 75.65 75.06 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
PyBench Total For Average Test Times OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times b c d e f 100 200 300 400 500 427 446 377 378 376
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast b c d e f 20 40 60 80 100 54.14 54.29 75.87 73.99 72.63 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU b c d e f 0.9892 1.9784 2.9676 3.9568 4.946 4.36881 4.39625 4.30701 4.30610 4.31754 MIN: 4.33 MIN: 4.36 MIN: 4.27 MIN: 4.28 MIN: 4.27 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
GIMP Test: rotate OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: rotate b c d e f 3 6 9 12 15 9.199 9.447 8.806 8.854 8.828
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 a b c d e f 7 14 21 28 35 27.22 27.73 27.55 28.44 28.66 27.78 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 a b c d e f 7 14 21 28 35 27.16 27.53 27.62 28.48 28.55 28.25 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 a b c d e f 7 14 21 28 35 27.99 28.42 28.21 28.91 29.11 28.45 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 a b c d e f 7 14 21 28 35 27.99 28.34 28.40 28.94 28.91 28.53 1. (CXX) g++ options: -O3
Selenium Benchmark: WASM imageConvolute - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Firefox b c d e f 4 8 12 16 20 16.3 16.1 14.2 14.3 14.3 1. firefox 118.0.2
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 a b c d e f 7 14 21 28 35 31.73 32.01 31.90 28.66 28.77 28.76 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 a b c d e f 7 14 21 28 35 31.61 31.91 31.91 28.81 28.77 28.76 1. (CXX) g++ options: -O3
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver b c d e f 3 6 9 12 15 10.118 10.121 5.935 6.101 6.490 1. (CXX) g++ options: -O2 -lOpenCL
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 a b c d e f 8 16 24 32 40 35.28 35.38 35.18 31.06 30.96 30.93 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 a b c d e f 8 16 24 32 40 35.22 35.29 35.43 30.84 30.99 31.05 1. (CXX) g++ options: -O3
SVT-HEVC Tuning: 7 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K b c d e f 20 40 60 80 100 71.18 71.18 98.86 99.72 98.68 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
ASTC Encoder Preset: Medium OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Medium b c d e f 30 60 90 120 150 81.20 81.26 130.36 130.22 130.42 1. (CXX) g++ options: -O3 -flto -pthread
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C b c d e f 6K 12K 18K 24K 30K 24092.42 26540.76 24498.20 25162.58 24381.01 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 b c d e f 2 4 6 8 10 8.637 7.127 4.774 4.833 4.785 1. (CXX) g++ options: -O3 -fopenmp
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K b c d e f 20 40 60 80 100 93.49 96.00 107.40 107.34 108.94 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K b c d e f 30 60 90 120 150 88.07 88.36 122.15 120.84 120.89 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome b c d e f 4 8 12 16 20 15.95 15.95 14.10 16.93 14.04 1. chrome 118.0.5993.70
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless b c d e f 2 4 6 8 10 6.908 6.983 5.097 5.063 5.066 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K b c d e f 30 60 90 120 150 111.84 112.34 137.97 150.27 152.13 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 a b c d e f 12 24 36 48 60 49.45 50.63 50.85 54.58 53.88 54.26 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 a b c d e f 12 24 36 48 60 49.76 50.81 50.91 54.95 54.06 54.66 1. (CXX) g++ options: -O3
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression b c d e f 1.206 2.412 3.618 4.824 6.03 4.74 4.74 5.36 5.28 5.36 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 a b c d e f 13 26 39 52 65 58.42 59.20 59.17 55.88 55.58 55.37 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 a b c d e f 13 26 39 52 65 58.65 59.36 59.22 55.67 55.62 55.78 1. (CXX) g++ options: -O3
SVT-HEVC Tuning: 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K b c d e f 40 80 120 160 200 139.99 140.06 185.07 178.47 170.99 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K b c d e f 40 80 120 160 200 149.16 147.09 193.28 190.91 191.65 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K b c d e f 40 80 120 160 200 146.57 148.55 194.35 190.16 194.23 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 b c d e f 1.0307 2.0614 3.0921 4.1228 5.1535 4.515 4.581 3.852 3.472 3.854 1. (CXX) g++ options: -O3 -fPIC -lm
Darktable Test: Boat - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Boat - Acceleration: CPU-only b c d e f 0.9875 1.975 2.9625 3.95 4.9375 4.194 4.389 2.697 2.705 2.771
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C b c d e f 700 1400 2100 2800 3500 1773.71 1775.67 3342.51 2893.39 3340.25 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Darktable Test: Masskrug - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Masskrug - Acceleration: CPU-only b c d e f 0.5452 1.0904 1.6356 2.1808 2.726 2.364 2.423 1.694 1.671 1.667
Darktable Test: Server Room - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Room - Acceleration: CPU-only b c d e f 0.4124 0.8248 1.2372 1.6496 2.062 1.833 1.825 1.415 1.434 1.405
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 a b c d e f 4 8 12 16 20 15.76 15.90 15.75 14.12 14.04 14.02 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 a b c d e f 4 8 12 16 20 15.83 16.05 15.94 14.09 14.05 14.02 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 a b c d e f 4 8 12 16 20 16.15 16.47 16.36 14.17 14.04 14.04 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 a b c d e f 4 8 12 16 20 16.27 16.48 16.43 14.13 14.08 14.14 1. (CXX) g++ options: -O3
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100 b c d e f 4 8 12 16 20 15.57 15.53 17.54 17.54 17.53 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 a b c d e f 7 14 21 28 35 28.59 29.01 28.81 25.82 25.57 25.45 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 a b c d e f 7 14 21 28 35 28.62 28.99 28.78 25.80 25.59 25.47 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 a b c d e f 7 14 21 28 35 29.68 30.04 29.88 27.64 27.53 27.34 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 a b c d e f 7 14 21 28 35 29.65 30.34 29.81 27.83 27.55 27.49 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 a b c d e f 7 14 21 28 35 29.39 29.52 29.87 27.63 27.40 27.34 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 a b c d e f 7 14 21 28 35 29.78 29.91 29.86 28.01 27.54 27.59 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 a b c d e f 7 14 21 28 35 31.30 31.82 31.55 27.78 27.57 27.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 a b c d e f 7 14 21 28 35 31.27 31.83 31.64 27.85 27.58 27.65 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 a b c d e f 13 26 39 52 65 55.50 55.94 55.41 53.75 53.46 53.41 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 a b c d e f 12 24 36 48 60 55.10 55.53 55.60 53.64 53.71 53.54 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 a b c d e f 13 26 39 52 65 57.98 58.15 58.47 58.98 58.05 58.59 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 a b c d e f 13 26 39 52 65 57.60 58.43 57.99 58.99 58.42 58.25 1. (CXX) g++ options: -O3
Darktable Test: Server Rack - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Rack - Acceleration: CPU-only b c d e f 0.0713 0.1426 0.2139 0.2852 0.3565 0.266 0.317 0.209 0.187 0.217
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 a b c d e f 4 8 12 16 20 14.28 14.48 14.38 16.94 16.84 16.86 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 a b c d e f 4 8 12 16 20 14.66 14.82 14.44 16.75 16.67 17.03 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 a b c d e f 5 10 15 20 25 18.11 18.43 18.55 19.23 18.79 19.19 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 a b c d e f 5 10 15 20 25 18.11 18.53 18.32 19.24 18.92 18.88 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 a b c d e f 13 26 39 52 65 31.81 32.69 22.17 52.54 56.67 53.02 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 a b c d e f 13 26 39 52 65 31.23 31.64 32.39 60.02 40.58 59.24 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 a b c d e f 12 24 36 48 60 31.60 32.62 32.41 53.69 52.90 54.80 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 a b c d e f 14 28 42 56 70 31.26 31.93 31.74 59.74 59.02 60.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 a b c d e f 20 40 60 80 100 51.88 53.17 52.25 95.58 97.44 87.45 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 a b c d e f 14 28 42 56 70 43.71 44.06 44.62 63.22 64.05 62.90 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 a b c d e f 15 30 45 60 75 44.29 44.88 44.60 64.20 64.99 63.75 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 a b c d e f 20 40 60 80 100 50.93 52.50 51.94 99.75 99.00 96.46 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 a b c d e f 30 60 90 120 150 66.67 71.01 69.36 117.85 114.12 115.05 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 a b c d e f 30 60 90 120 150 68.88 70.42 71.50 115.20 114.34 115.09 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 a b c d e f 40 80 120 160 200 127.31 134.25 131.51 186.55 183.83 187.72 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 a b c d e f 40 80 120 160 200 131.94 104.03 131.25 183.45 184.51 179.45 1. (CXX) g++ options: -O3
Phoronix Test Suite v10.8.5