new tests Tests for a future article. Intel Core i9-14900K testing with a ASUS PRIME Z790-P WIFI (1402 BIOS) and AMD Radeon RX 7900 XTX 24GB on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2310314-PTS-NEWTESTS84&sor&grr .
new tests Processor Motherboard Chipset Memory Disk Graphics Audio Monitor OS Kernel Desktop Display Server OpenGL Compiler File-System Screen Resolution a b c d e f Intel Core i5-13600K @ 5.10GHz (14 Cores / 20 Threads) ASUS PRIME Z790-P WIFI (1402 BIOS) Intel Device 7a27 32GB 1000GB Western Digital WDS100T1X0E-00AFY0 AMD Radeon RX 7900 XTX 24GB (2304/1249MHz) Realtek ALC897 ASUS VP28U Ubuntu 23.10 6.5.0-9-generic (x86_64) GNOME Shell 45.0 X Server + Wayland 4.6 Mesa 23.2.1-1ubuntu3 (LLVM 15.0.7 DRM 3.54) GCC 13.2.0 ext4 3840x2160 Intel Core i5-14600K @ 5.30GHz (14 Cores / 20 Threads) Intel Core i9-14900K @ 5.70GHz (24 Cores / 32 Threads) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x11d - Thermald 2.5.4 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected Python Details - b, c, d, e, f: Python 3.11.6
new tests openfoam: drivaerFastback, Medium Mesh Size - Execution Time openfoam: drivaerFastback, Medium Mesh Size - Mesh Time cloverleaf: clover_bm16 build-linux-kernel: allmodconfig openvkl: vklBenchmarkCPU ISPC build-gem5: Time To Compile build-llvm: Ninja openradioss: INIVOL and Fluid Structure Interaction Drop Container minibude: OpenMP - BM2 minibude: OpenMP - BM2 build-nodejs: Time To Compile qmcpack: FeCO6_b3lyp_gms qmcpack: O_ae_pyscf_UHF duckdb: TPC-H Parquet tensorflow: CPU - 64 - ResNet-50 qmcpack: Li2_STO_ae ospray-studio: 3 - 4K - 32 - Path Tracer - CPU duckdb: IMDB openfoam: drivaerFastback, Small Mesh Size - Execution Time openfoam: drivaerFastback, Small Mesh Size - Mesh Time ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU hpcg: 104 104 104 - 60 openradioss: Bird Strike on Windshield build-godot: Time To Compile openssl: AES-128-GCM openssl: SHA256 openssl: ChaCha20-Poly1305 openssl: AES-256-GCM openssl: ChaCha20 openssl: SHA512 qmcpack: LiH_ae_MSD ospray: particle_volume/scivis/real_time ospray: particle_volume/pathtracer/real_time pgbench: 100 - 1000 - Read Only pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Write - Average Latency pgbench: 100 - 1000 - Read Only - Average Latency pgbench: 100 - 1000 - Read Write - Average Latency pgbench: 100 - 1000 - Read Write easywave: e2Asean Grid + BengkuluSept2007 Source - 1200 openradioss: Rubber O-Ring Seal Installation openradioss: Bumper Beam cloverleaf: clover_bm64_short ospray-studio: 3 - 4K - 16 - Path Tracer - CPU asmfish: 1024 Hash Memory, 26 Depth xmrig: Monero - 1M specfem3d: Layered Halfspace ospray: particle_volume/ao/real_time npb: SP.C ospray-studio: 2 - 4K - 16 - Path Tracer - CPU rodinia: OpenMP LavaMD ospray-studio: 1 - 4K - 16 - Path Tracer - CPU specfem3d: Water-layered Halfspace selenium: Jetstream 2 - Firefox nginx: 1000 nginx: 500 blender: Fishy Cat - CPU-Only gromacs: MPI CPU - water_GMX50_bare simdjson: Kostya ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray-studio: 2 - 4K - 1 - Path Tracer - CPU vvenc: Bosphorus 4K - Fast onednn: Recurrent Neural Network Training - bf16bf16bf16 - CPU onednn: Recurrent Neural Network Training - f32 - CPU openradioss: Cell Phone Drop Test onednn: Recurrent Neural Network Inference - f32 - CPU onednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPU simdjson: LargeRand pyperformance: python_startup xmrig: Wownero - 1M ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray: gravity_spheres_volume/dim_512/scivis/real_time v-ray: CPU ospray: gravity_spheres_volume/dim_512/ao/real_time ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU minibude: OpenMP - BM1 minibude: OpenMP - BM1 npb: BT.C ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU memcached: 1:100 memcached: 1:10 ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU ospray: gravity_spheres_volume/dim_512/pathtracer/real_time blender: BMW27 - CPU-Only luxcorerender: Orange Juice - CPU luxcorerender: Danish Mood - CPU ngspice: C7552 tensorflow: CPU - 16 - ResNet-50 openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU luxcorerender: LuxCore Benchmark - CPU ngspice: C2670 luxcorerender: DLSC - CPU indigobench: CPU - Bedroom indigobench: CPU - Supercar openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU john-the-ripper: MD5 john-the-ripper: HMAC-SHA512 openssl: RSA4096 openssl: RSA4096 stockfish: Total Time simdjson: PartialTweets ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU simdjson: DistinctUserID ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU simdjson: TopTweet oidn: RT.ldr_alb_nrm.3840x2160 - CPU-Only selenium: Jetstream 2 - Google Chrome build-linux-kernel: defconfig rodinia: OpenMP HotSpot3D uvg266: Bosphorus 4K - Medium rodinia: OpenMP Leukocyte cloverleaf: clover_bm specfem3d: Homogeneous Halfspace npb: LU.C node-web-tooling: vvenc: Bosphorus 4K - Faster selenium: Octane - Firefox specfem3d: Mount St. Helens specfem3d: Tomographic Model astcenc: Exhaustive selenium: Octane - Google Chrome namd: ATPase Simulation - 327,506 Atoms rawtherapee: Total Benchmark Time avifenc: 2 selenium: Speedometer - Firefox kvazaar: Bosphorus 4K - Medium qmcpack: simple-H2O quantlib: Multi-Threaded embree: Pathtracer ISPC - Crown selenium: Speedometer - Google Chrome john-the-ripper: WPA PSK cpuminer-opt: scrypt john-the-ripper: bcrypt john-the-ripper: Blowfish liquid-dsp: 32 - 256 - 512 liquid-dsp: 8 - 256 - 512 liquid-dsp: 16 - 256 - 512 cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Deepcoin cpuminer-opt: Magi cpuminer-opt: Myriad-Groestl cpuminer-opt: Ringcoin cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Skeincoin selenium: PSPDFKit WASM - Firefox selenium: PSPDFKit WASM - Google Chrome embree: Pathtracer ISPC - Asian Dragon webp: Quality 100, Lossless, Highest Compression pyperformance: regex_compile build-mesa: Time To Compile pyperformance: crypto_pyaes pyperformance: json_loads compress-7zip: Decompression Rating compress-7zip: Compression Rating selenium: Kraken - Firefox astcenc: Thorough qmcpack: H4_ae x265: Bosphorus 4K natron: Spaceship npb: FT.C npb: SP.B libraw: Post-Processing Benchmark npb: CG.C uvg266: Bosphorus 4K - Very Fast uvg266: Bosphorus 4K - Super Fast heffte: c2c - Stock - double-long - 512 heffte: c2c - Stock - double - 512 kvazaar: Bosphorus 4K - Very Fast selenium: Kraken - Google Chrome quantlib: Single-Threaded onednn: IP Shapes 1D - f32 - CPU heffte: c2c - FFTW - double - 512 heffte: c2c - FFTW - double-long - 512 uvg266: Bosphorus 4K - Ultra Fast selenium: WASM collisionDetection - Firefox y-cruncher: 500M kvazaar: Bosphorus 4K - Super Fast phpbench: PHP Benchmark Suite gimp: resize luxcorerender: Rainbow Colors and Prism - CPU gimp: unsharp-mask rodinia: OpenMP Streamcluster selenium: WASM collisionDetection - Google Chrome webp: Quality 100, Lossless gimp: auto-levels svt-av1: Preset 8 - Bosphorus 4K pybench: Total For Average Test Times kvazaar: Bosphorus 4K - Ultra Fast onednn: IP Shapes 3D - f32 - CPU gimp: rotate heffte: r2c - Stock - double-long - 512 heffte: r2c - Stock - double - 512 heffte: c2c - Stock - float-long - 512 heffte: c2c - Stock - float - 512 selenium: WASM imageConvolute - Firefox heffte: r2c - FFTW - double-long - 512 heffte: r2c - FFTW - double - 512 rodinia: OpenMP CFD Solver heffte: c2c - FFTW - float-long - 512 heffte: c2c - FFTW - float - 512 svt-hevc: 7 - Bosphorus 4K astcenc: Medium npb: MG.C easywave: e2Asean Grid + BengkuluSept2007 Source - 240 svt-vp9: VMAF Optimized - Bosphorus 4K svt-vp9: Visual Quality Optimized - Bosphorus 4K selenium: WASM imageConvolute - Google Chrome avifenc: 6, Lossless svt-vp9: PSNR/SSIM Optimized - Bosphorus 4K heffte: r2c - Stock - float - 512 heffte: r2c - Stock - float-long - 512 webp: Quality 100, Highest Compression heffte: r2c - FFTW - float-long - 512 heffte: r2c - FFTW - float - 512 svt-hevc: 10 - Bosphorus 4K svt-av1: Preset 13 - Bosphorus 4K svt-av1: Preset 12 - Bosphorus 4K avifenc: 6 darktable: Boat - CPU-only npb: EP.C darktable: Masskrug - CPU-only darktable: Server Room - CPU-only heffte: c2c - Stock - double - 256 heffte: c2c - Stock - double-long - 256 heffte: c2c - FFTW - double-long - 256 heffte: c2c - FFTW - double - 256 webp: Quality 100 heffte: r2c - FFTW - double - 256 heffte: r2c - FFTW - double-long - 256 heffte: r2c - Stock - double-long - 256 heffte: c2c - Stock - float - 256 heffte: r2c - Stock - double - 256 heffte: c2c - Stock - float-long - 256 heffte: c2c - FFTW - float-long - 256 heffte: c2c - FFTW - float - 256 heffte: r2c - FFTW - float-long - 256 heffte: r2c - FFTW - float - 256 heffte: r2c - Stock - float-long - 256 heffte: r2c - Stock - float - 256 darktable: Server Rack - CPU-only heffte: c2c - Stock - double-long - 128 heffte: c2c - Stock - double - 128 heffte: c2c - FFTW - double - 128 heffte: c2c - FFTW - double-long - 128 heffte: r2c - Stock - double-long - 128 heffte: c2c - Stock - float-long - 128 heffte: r2c - Stock - double - 128 heffte: c2c - Stock - float - 128 heffte: c2c - FFTW - float - 128 heffte: r2c - FFTW - double-long - 128 heffte: r2c - FFTW - double - 128 heffte: c2c - FFTW - float-long - 128 heffte: r2c - Stock - float-long - 128 heffte: r2c - Stock - float - 128 heffte: r2c - FFTW - float-long - 128 heffte: r2c - FFTW - float - 128 onednn: IP Shapes 3D - bf16bf16bf16 - CPU a b c d e f 1094.128868103 127.74 58.98 55799.7 185.62 79320 1893.63 5436.88 417.24 8564.28 2345.04 41360 61070 9035.2 21370 15.1102 15.0777 4818.8 18.1985 18.239 27.2229 27.1619 27.985 27.9897 31.7325 31.6129 35.2816 35.2152 49.4547 49.7591 58.4188 58.6489 15.7616 15.8331 16.1534 16.2749 28.5862 28.6186 29.6848 29.6502 29.3894 29.778 31.3044 31.2736 55.5018 55.1029 57.9832 57.5957 14.2837 14.658 18.1124 18.1147 31.8111 31.2292 31.5989 31.2635 51.881 43.71 44.2936 50.9336 66.6684 68.8833 127.305 131.935 2166.8213 230.3022 1093.01 778.31 387 439.462 435.68 12.906 322.654 357.104 266.24 262.57 89.754 24.14 285.85 277357 86.571 224.43546 33.876155 239054 234540 8.28855 228.35 230.157 106224257980 21234053100 31517307700 91408403390 43683166890 6922574030 169.05 7.32159 169.646 947197 1.054 42553 949091 23.5 41736 134.185 165.27 151.75 127.94 139891 47742886 7676.5 135.10279615 7.37745 12080 120716 131.529 118595 126.074892513 229.492 127899.06 135692.91 109.94 4.8 7176 7338 6.539 3320.34 3319.64 96.02 1705.51 1705.1 1.77 4.85 11202.1 8523 3.60869 18743 3.6958 2156 8.679 216.969 34501.92 1848 1830 4058394.64 4464270.25 37384 72426 1730.89 3.45 5.03922 78.97 4.88 2.22 65.896 23.91 469.85 12.75 2.54 66.392 2.88 2.689 7.592 134.23 44.66 154.78 38.73 154.5 38.75 23.47 255 11.88 503.59 71.08 224.92 59.37 269.31 56.41 106.23 7.52 2125.52 9.84 607.54 41.65 383.74 0.49 32245.66 19.33 309.56 11.74 1361.39 5.89 1011.66 1.27 12503.2 1134000 45526000 231709.8 3577.3 46079885 5.39 32575 6.04 62478 31957 61145 7.29 0.43 382.668 65.562 55.996 9.41 53.871 55.82 62.094293176 35845.04 24.04 13.272 55056 51.618068965 49.142967323 1.0807 97769 0.86311 40.086 44.202 406 13.07 32.159 55963.1 19.033 486 36403 195.41 31152 26232 228820000 139440000 204060000 85030 2040.33 5549.86 442.28 8774.33 2508.93 43650 62870 9845.56 22770 2108 2202 23.1049 0.87 71.6 29.825 37.5 10.2 92168 127009 520.2 10.6857 22.73 25.7 4.6 18953.53 16564.09 61.85 8025.9 27.74 29.3 15.2009 15.2166 32.39 349 5016.1 2.98963 18.3225 18.2924 35.3 242.3 13.443 40.71 1491682 12.625 12.46 11.12 15.894 232.97 2.20 9.615 58.458 427 54.14 4.36881 9.199 27.7294 27.527 28.418 28.337 16.3 32.0107 31.9085 10.118 35.3807 35.2893 71.18 81.1968 24092.42 8.637 93.49 88.07 15.95 6.908 111.84 50.6268 50.8129 4.74 59.1962 59.3627 139.99 149.161 146.571 4.515 4.194 1773.71 2.364 1.833 15.9022 16.0466 16.4718 16.4798 15.57 29.014 28.9898 30.042 30.3369 29.522 29.9091 31.8188 31.8292 55.9418 55.526 58.1474 58.425 0.266 14.4836 14.8193 18.4321 18.5277 32.6945 31.6351 32.6174 31.9321 53.1674 44.0648 44.8791 52.499 71.0106 70.4153 134.246 104.025 2166.8275 234.39558 1091.47 767.219 386 435.624 438.56 12.907 322.668 355.827 266.65 264.99 90.453 24.1 284.18 277524 86.555 223.87156 32.902566 238496 234968 8.26023 234.62 234.319 106192145890 21897801590 30264414210 94488932900 43658009260 7042483840 168.7 7.32078 169.835 833675 0.91 42185 1098384 23.705 41656 134.633 165.49 150.19 128.11 139809 47676136 7660.4 140.585720491 7.3669 12101.42 120664 129.977 118639 126.108597881 227.935 127678.15 132783.22 110.19 4.81 7195 7320 6.55 3321.37 3319.3 95.46 1705.89 1707.37 1.77 4.84 11160.8 8512 3.65807 18885 3.79562 2150 8.705 217.632 34573.71 1840 1816 4149899.46 4305589.62 37367 72092 1733.4 3.45 5.04566 79.19 4.91 2.27 63.909 23.93 470 12.75 2.55 66.076 2.87 2.6 7.778 135 44.39 153.73 38.98 153.66 39 23.41 255.73 11.91 502.05 71.15 224.74 59.91 266.96 56.71 105.65 7.37 2170.03 9.79 610.86 41.63 383.97 0.49 32241.73 19.09 313.32 11.76 1359.67 5.81 1028.14 1.28 12456.32 1131000 46015000 231842.8 3578.6 41998826 6.51 32538 6.04 62329 32007 61161 7.31 0.43 388.309 64.959 58.024 9.43 54.926 56.10 63.899206362 34481.92 23.75 13.276 55067 51.749754354 51.562965731 1.0804 105856 0.86116 40.301 43.999 407 13.09 32.718 57317.5 19.1475 471 36403 194.68 31110 31050 228970000 140020000 204100000 86620 2040.66 5604.27 450.42 8765.28 2494.87 43630 62850 9996.74 22780 2096 2170 23.1978 0.88 71.4 29.202 37.4 10.1 92137 127943 520.4 10.6929 25.09 26.74 4.4 18737.09 16408.43 63.14 7422.74 27.74 29.34 15.1774 15.2168 32.33 343.2 5014.7 2.99292 18.275 18.2462 35.09 242.5 13.518 40.69 1479761 12.394 12.54 11.352 15.834 193.39 2.19 9.748 58.184 446 54.29 4.39625 9.447 27.5481 27.6161 28.2076 28.4031 16.1 31.8965 31.9077 10.121 35.1791 35.4345 71.18 81.2623 26540.76 7.127 96 88.36 15.95 6.983 112.34 50.8481 50.9069 4.74 59.1701 59.223 140.06 147.091 148.546 4.581 4.389 1775.67 2.423 1.825 15.7502 15.942 16.3572 16.4311 15.53 28.8122 28.7765 29.8789 29.8126 29.8734 29.8559 31.5524 31.6357 55.4126 55.6015 58.467 57.9937 0.317 14.3837 14.4448 18.5488 18.3191 22.1694 32.3869 32.4099 31.74 52.2532 44.6157 44.6026 51.9406 69.3558 71.5008 131.511 131.251 1816.8935 203.49517 1058.26 519.599 547 221.237 308.209 285.26 20.411 510.266 251.154 265.33 258.4 83.672 34.8 208.7 196772 71.369 162.53309 26.294883 169828 167519 9.91323 166.94 163.491 168593428210 35393577110 47030287560 145829960090 68204421390 10970736110 168.69 10.1188 207.158 1299800 0.734 54797 1361760 18.249 55901 129.701 108.37 114.72 124.58 99301 71318180 9559.6 91.860925305 10.1686 15263.55 85258 86.77 83823 84.113422384 251.968 184950.78 196302.45 75.4 1.143 5.3 5114 5095 7.829 2150.83 2113.83 66.07 1062.21 1145.28 1.99 4.65 16204 6008 5.09456 27940 5.25193 1547 14.948 373.696 50512.7 1304 1294 5061427.55 5693052.64 24365 50740 1448.07 5.48 6.9477 53.09 6.93 3.44 56.736 35.36 436.86 18.22 3.82 57.215 4.15 4.064 12.02 128.32 62.26 148.7 53.71 147.8 54.06 21.63 369.15 12.05 662.91 81.2 295.35 67.85 353.52 66.49 120.21 8.02 2989.78 9.01 886.66 43.58 550.3 0.56 41956.06 22.7 351.84 12.93 1854.34 5.1 1564.5 1.41 16901.1 1862000 71981000 353220.2 5418.1 71722427 7.37 20919 8.25 43450 20752 43729 8.29 0.65 419.011 44.92 48.845 14.25 48.076 42.55 37.654774238 52753.11 26.59 15.438 60754 30.83045146 30.329244698 1.5573 117684 0.61454 34.446 33.2 433 19.38 31.973 88060 29.5036 498 60805 310.55 47260 47385 329800000 168030000 223360000 141280 2726.85 9210.49 714.42 14360 4039.03 70960 102200 16610 38520 2059 2139 35.4539 0.97 63.6 21.8 33 9.18 142133 183859 487.8 16.9563 18.59 32.51 6.4 24740.05 22004.23 71.16 10390.33 38.59 41.03 14.786 15 42.25 319.4 5665 2.0115 15.8089 15.9782 50.11 216 10.784 54.02 1692102 12.516 17.96 11.042 7.224 211.66 2.45 9.459 75.364 377 75.87 4.30701 8.806 28.4441 28.478 28.9081 28.9398 14.2 28.6645 28.806 5.935 31.0588 30.8378 98.86 130.3598 24498.2 4.774 107.4 122.15 14.1 5.097 137.97 54.5762 54.9549 5.36 55.8788 55.6705 185.07 193.281 194.348 3.852 2.697 3342.51 1.694 1.415 14.123 14.0877 14.1676 14.1336 17.54 25.8193 25.7995 27.6358 27.8339 27.6338 28.0061 27.7795 27.8481 53.7468 53.6358 58.982 58.9926 0.209 16.9435 16.751 19.2256 19.2383 52.5398 60.0213 53.6889 59.735 95.578 63.2219 64.2029 99.7527 117.845 115.198 186.551 183.452 1813.292 204.95889 1063.27 527.01 545 221.121 307.199 291.63 20.335 508.378 249.43 264.55 258.48 34.83 203.73 196983 162.41459 27.068099 168775 166105 9.88187 170.46 162.567 167529875870 35045220680 47048102160 144868800380 67647607600 10887827050 168.37 10.1229 209.444 1591084 0.615 55746 1627171 17.939 56335 129.448 109.41 114.07 124.71 99388 71944056 9512.1 91.894740975 10.2038 15270.26 84586 86.948 83524 83.591421969 253.624 180058.09 200059.86 75.33 1.148 5.3 5081 5127 7.946 2145.16 2125.71 65.66 1083.23 1064.7 1.99 4.68 16088.6 5932 5.03492 29012 5.27642 1524 15.007 375.175 50229.93 1294 1305 5241319.8 5730258.98 24526 50458 1447.91 5.45 6.92496 52.94 6.96 3.46 56.807 35.1 432.96 18.46 3.89 57.464 4.17 4.062 12.189 126.22 63.33 145.64 54.88 145.89 54.78 21.51 371.16 11.83 675.24 80.25 298.8 67.16 357.15 66.06 120.99 7.96 3010.6 8.93 894.24 43.34 553.27 0.55 42386.78 22.63 352.96 12.93 1855.23 5.08 1569.98 1.4 17036.01 1862000 65609000 345908.2 5294.3 64256226 7.37 20740 8.25 44009 20746 42719 8.29 0.66 421.021 45.816 48.465 14.29 47.475 42.83 38.235807506 52719.71 26.44 15.297 60711 31.651786074 30.650349799 1.5633 102783 0.61420 35.193 32.79 433 19.43 32.185 89344 29.5741 502 61351 311.73 47942 47590 331120000 171400000 239170000 140960 2605.6 9300.91 710.53 14310 4048.08 70910 103680 16950 38020 2051 2140 35.3254 0.97 67 21.846 33.6 9.23 155668 171918 552.8 16.9459 18.3 33.23 6.3 25591.23 21129.21 69.03 8593.82 38.46 42.91 14.7872 14.9001 42.88 318.5 5672.2 2.01217 15.8476 15.9518 50.81 214.3 10.864 54.71 1685554 12.111 17.74 11.12 7.176 204.81 2.44 9.526 75.647 378 73.99 4.3061 8.854 28.6623 28.5518 29.1118 28.9079 14.3 28.7669 28.7749 6.101 30.9595 30.9894 99.72 130.2225 25162.58 4.833 107.34 120.84 16.93 5.063 150.27 53.8771 54.0622 5.28 55.5756 55.6246 178.47 190.914 190.158 3.472 2.705 2893.39 1.671 1.434 14.0404 14.0495 14.0426 14.0817 17.54 25.5679 25.585 27.5275 27.5452 27.4046 27.5381 27.5697 27.5778 53.4593 53.7069 58.05 58.4249 0.187 16.8437 16.6687 18.7925 18.9154 56.6746 40.5778 52.8988 59.0159 97.4361 64.0485 64.9886 98.9954 114.123 114.341 183.834 184.512 1815.051 195.54508 1062.13 526.015 549 220.533 307.934 290.16 20.389 509.73 249.686 263.94 259.37 34.84 202.62 195691 161.88961 27.222561 168378 166313 9.84149 166.08 162.803 167740437850 35479543830 47059792960 145929235660 68291402680 10972703730 168.31 10.1689 204.454 1599780 0.623 50776 1604627 19.694 57567 131.703 108.16 115.59 124.75 98534 72527312 9501 91.176640881 10.2334 15177.55 84845 86.56 83238 83.64314069 256.136 188170.4 201855.19 75.24 1.148 5.31 5039 5175 7.834 2188.89 2141.47 65.34 1081.46 1044.98 2 4.66 15832.3 6037 4.94223 28970 5.05305 1508 15.023 375.582 49366.17 1318 1296 5171670.55 5727247.49 24512 50705 1431.86 5.54 6.94737 52.67 6.91 3.48 56.766 35.42 435.36 18.26 3.85 56.866 4.22 4.067 11.407 123.75 64.57 143.52 55.69 148.29 53.9 20.89 382.3 11.17 715.44 79.99 299.8 65.93 363.79 66.42 120.32 7.98 3005.33 8.77 910.84 43.32 553.49 0.55 42343.47 22.45 355.77 12.95 1852.66 4.78 1668.97 1.39 17187.3 1888000 71931000 354387.3 5367.5 71115165 7.36 21146 8.24 43624 20598 42916 8.31 0.66 419.3 45.723 49.009 14.38 48.276 42.77 37.994095068 52390.75 26.63 15.439 60209 32.373449454 30.471851303 1.5848 118632 0.61266 35.075 33.165 430 19.4 32.225 89756.1 29.1886 504 61337 309.14 47587 47203 329870000 169900000 239250000 138510 2559.46 9248.07 722 14200 3946.27 71070 102900 16590 38040 2055 2139 35.2707 0.98 64.8 21.985 33 8.98 138934 185607 490.7 16.9808 17.7 33.47 6.4 25290.9 21905.42 70.56 7707.48 39.46 42.68 15.0325 14.7708 43.18 317.3 5666.3 2.03456 15.5292 15.8822 50.69 214.5 11.108 53.99 1686956 12.695 14.89 11.167 7.226 171.17 2.45 9.526 75.057 376 72.63 4.31754 8.828 27.776 28.2529 28.4467 28.5277 14.3 28.762 28.7644 6.49 30.9269 31.0469 98.68 130.417 24381.01 4.785 108.94 120.89 14.04 5.066 152.13 54.2616 54.6591 5.36 55.3711 55.782 170.99 191.65 194.228 3.854 2.771 3340.25 1.667 1.405 14.0224 14.015 14.0375 14.144 17.53 25.449 25.4696 27.3438 27.4897 27.3379 27.5919 27.549 27.654 53.406 53.5429 58.5898 58.2473 0.217 16.8575 17.032 19.1865 18.8817 53.0212 59.2394 54.7998 60.5548 87.4539 62.9035 63.746 96.4633 115.053 115.093 187.72 179.454 OpenBenchmarking.org
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Execution Time e f d b c 500 1000 1500 2000 2500 1813.29 1815.05 1816.89 2166.82 2166.83 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: drivaerFastback, Medium Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Medium Mesh Size - Mesh Time f d e b c 50 100 150 200 250 195.55 203.50 204.96 230.30 234.40 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
CloverLeaf Input: clover_bm16 OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm16 d f e c b a 200 400 600 800 1000 1058.26 1062.13 1063.27 1091.47 1093.01 1094.13 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
Timed Linux Kernel Compilation Build: allmodconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: allmodconfig d f e c b 200 400 600 800 1000 519.60 526.02 527.01 767.22 778.31
OpenVKL Benchmark: vklBenchmarkCPU ISPC OpenBenchmarking.org Items / Sec, More Is Better OpenVKL 2.0.0 Benchmark: vklBenchmarkCPU ISPC f d e b c 120 240 360 480 600 549 547 545 387 386 MIN: 37 / MAX: 7452 MIN: 37 / MAX: 8225 MIN: 36 / MAX: 7468 MIN: 26 / MAX: 5487 MIN: 26 / MAX: 5478
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 23.0.1 Time To Compile f e d 50 100 150 200 250 SE +/- 0.37, N = 3 220.53 221.12 221.24
Timed LLVM Compilation Build System: Ninja OpenBenchmarking.org Seconds, Fewer Is Better Timed LLVM Compilation 16.0 Build System: Ninja e f d c b 100 200 300 400 500 307.20 307.93 308.21 435.62 439.46
OpenRadioss Model: INIVOL and Fluid Structure Interaction Drop Container OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: INIVOL and Fluid Structure Interaction Drop Container d f e b c 100 200 300 400 500 285.26 290.16 291.63 435.68 438.56
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 d f e c b 5 10 15 20 25 20.41 20.39 20.34 12.91 12.91 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM2 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM2 d f e c b 110 220 330 440 550 510.27 509.73 508.38 322.67 322.65 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
Timed Node.js Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Node.js Compilation 19.8.1 Time To Compile e f d c b 80 160 240 320 400 249.43 249.69 251.15 355.83 357.10
QMCPACK Input: FeCO6_b3lyp_gms OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: FeCO6_b3lyp_gms f e d b c 60 120 180 240 300 263.94 264.55 265.33 266.24 266.65 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: O_ae_pyscf_UHF OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: O_ae_pyscf_UHF d e f b c 60 120 180 240 300 258.40 258.48 259.37 262.57 264.99 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
DuckDB Benchmark: TPC-H Parquet OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: TPC-H Parquet d b c 20 40 60 80 100 SE +/- 0.21, N = 3 SE +/- 0.28, N = 3 SE +/- 0.50, N = 3 83.67 89.75 90.45 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
TensorFlow Device: CPU - Batch Size: 64 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 64 - Model: ResNet-50 f e d b c 8 16 24 32 40 34.84 34.83 34.80 24.14 24.10
QMCPACK Input: Li2_STO_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: Li2_STO_ae f e d c b 60 120 180 240 300 202.62 203.73 208.70 284.18 285.85 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU f d e b c 60K 120K 180K 240K 300K 195691 196772 196983 277357 277524
DuckDB Benchmark: IMDB OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: IMDB d c b 20 40 60 80 100 SE +/- 0.27, N = 3 SE +/- 0.54, N = 3 SE +/- 0.48, N = 3 71.37 86.56 86.57 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
OpenFOAM Input: drivaerFastback, Small Mesh Size - Execution Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Execution Time f e d c b 50 100 150 200 250 161.89 162.41 162.53 223.87 224.44 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OpenFOAM Input: drivaerFastback, Small Mesh Size - Mesh Time OpenBenchmarking.org Seconds, Fewer Is Better OpenFOAM 10 Input: drivaerFastback, Small Mesh Size - Mesh Time d e f c b 8 16 24 32 40 26.29 27.07 27.22 32.90 33.88 1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU f e d c b 50K 100K 150K 200K 250K 168378 168775 169828 238496 239054
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU e f d b c 50K 100K 150K 200K 250K 166105 166313 167519 234540 234968
High Performance Conjugate Gradient X Y Z: 104 104 104 - RT: 60 OpenBenchmarking.org GFLOP/s, More Is Better High Performance Conjugate Gradient 3.1 X Y Z: 104 104 104 - RT: 60 d e f b c 3 6 9 12 15 9.91323 9.88187 9.84149 8.28855 8.26023 1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi
OpenRadioss Model: Bird Strike on Windshield OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bird Strike on Windshield f d e b c 50 100 150 200 250 166.08 166.94 170.46 228.35 234.62
Timed Godot Game Engine Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Godot Game Engine Compilation 4.0 Time To Compile e f d b c 50 100 150 200 250 162.57 162.80 163.49 230.16 234.32
OpenSSL Algorithm: AES-128-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-128-GCM d f e b c 40000M 80000M 120000M 160000M 200000M 168593428210 167740437850 167529875870 106224257980 106192145890 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA256 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA256 f d e c b 8000M 16000M 24000M 32000M 40000M 35479543830 35393577110 35045220680 21897801590 21234053100 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20-Poly1305 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20-Poly1305 f e d b c 10000M 20000M 30000M 40000M 50000M 47059792960 47048102160 47030287560 31517307700 30264414210 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: AES-256-GCM OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: AES-256-GCM f d e c b 30000M 60000M 90000M 120000M 150000M 145929235660 145829960090 144868800380 94488932900 91408403390 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: ChaCha20 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: ChaCha20 f d e b c 15000M 30000M 45000M 60000M 75000M 68291402680 68204421390 67647607600 43683166890 43658009260 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: SHA512 OpenBenchmarking.org byte/s, More Is Better OpenSSL 3.1 Algorithm: SHA512 f d e c b 2000M 4000M 6000M 8000M 10000M 10972703730 10970736110 10887827050 7042483840 6922574030 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
QMCPACK Input: LiH_ae_MSD OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: LiH_ae_MSD f e d c b 40 80 120 160 200 168.31 168.37 168.69 168.70 169.05 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
OSPRay Benchmark: particle_volume/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/scivis/real_time f e d b c 3 6 9 12 15 10.16890 10.12290 10.11880 7.32159 7.32078
OSPRay Benchmark: particle_volume/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/pathtracer/real_time e d f c b 50 100 150 200 250 209.44 207.16 204.45 169.84 169.65
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only f e d b c 300K 600K 900K 1200K 1500K 1599780 1591084 1299800 947197 833675 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency e f d c b 0.27 0.54 0.81 1.08 1.35 0.615 0.623 0.734 0.910 1.054 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency e d f b c 12K 24K 36K 48K 60K 55746 54797 50776 42553 42185 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency e f d c b 300K 600K 900K 1200K 1500K 1627171 1604627 1361760 1098384 949091 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency OpenBenchmarking.org ms, Fewer Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write - Average Latency f e d b c 6 12 18 24 30 17.37 17.75 17.89 23.50 23.71 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
PostgreSQL Scaling Factor: 100 - Clients: 1000 - Mode: Read Write OpenBenchmarking.org TPS, More Is Better PostgreSQL 16 Scaling Factor: 100 - Clients: 1000 - Mode: Read Write f e d b c 12K 24K 36K 48K 60K 57567 56335 55901 41736 41656 1. (CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 1200 e d f b c 30 60 90 120 150 129.45 129.70 131.70 134.19 134.63 1. (CXX) g++ options: -O3 -fopenmp
OpenRadioss Model: Rubber O-Ring Seal Installation OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Rubber O-Ring Seal Installation f d e b c 40 80 120 160 200 108.16 108.37 109.41 165.27 165.49
OpenRadioss Model: Bumper Beam OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Bumper Beam e d f c b 30 60 90 120 150 114.07 114.72 115.59 150.19 151.75
CloverLeaf Input: clover_bm64_short OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm64_short d e f a b c 30 60 90 120 150 124.58 124.71 124.75 127.74 127.94 128.11 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU f d e c b 30K 60K 90K 120K 150K 98534 99301 99388 139809 139891
asmFish 1024 Hash Memory, 26 Depth OpenBenchmarking.org Nodes/second, More Is Better asmFish 2018-07-23 1024 Hash Memory, 26 Depth f e d b c 16M 32M 48M 64M 80M 72527312 71944056 71318180 47742886 47676136
Xmrig Variant: Monero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.18.1 Variant: Monero - Hash Count: 1M d e f b c 2K 4K 6K 8K 10K 9559.6 9512.1 9501.0 7676.5 7660.4 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
SPECFEM3D Model: Layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Layered Halfspace f d e b c 30 60 90 120 150 91.18 91.86 91.89 135.10 140.59 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
OSPRay Benchmark: particle_volume/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: particle_volume/ao/real_time f e d b c 3 6 9 12 15 10.23340 10.20380 10.16860 7.37745 7.36690
NAS Parallel Benchmarks Test / Class: SP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.C e d f c b 3K 6K 9K 12K 15K 15270.26 15263.55 15177.55 12101.42 12080.00 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU e f d c b 30K 60K 90K 120K 150K 84586 84845 85258 120664 120716
Rodinia Test: OpenMP LavaMD OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP LavaMD f d e c b 30 60 90 120 150 86.56 86.77 86.95 129.98 131.53 1. (CXX) g++ options: -O2 -lOpenCL
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU f e d b c 30K 60K 90K 120K 150K 83238 83524 83823 118595 118639
SPECFEM3D Model: Water-layered Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Water-layered Halfspace e f d b c 30 60 90 120 150 83.59 83.64 84.11 126.07 126.11 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
Selenium Benchmark: Jetstream 2 - Browser: Firefox OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Firefox f e d b c 60 120 180 240 300 256.14 253.62 251.97 229.49 227.94 1. firefox 118.0.2
nginx Connections: 1000 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 1000 f d e b c 40K 80K 120K 160K 200K 188170.40 184950.78 180058.09 127899.06 127678.15 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
nginx Connections: 500 OpenBenchmarking.org Requests Per Second, More Is Better nginx 1.23.2 Connections: 500 f e d b c 40K 80K 120K 160K 200K 201855.19 200059.86 196302.45 135692.91 132783.22 1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2
Blender Blend File: Fishy Cat - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: Fishy Cat - Compute: CPU-Only f e d b c 20 40 60 80 100 75.24 75.33 75.40 109.94 110.19
GROMACS Implementation: MPI CPU - Input: water_GMX50_bare OpenBenchmarking.org Ns Per Day, More Is Better GROMACS 2023 Implementation: MPI CPU - Input: water_GMX50_bare f e d 0.2583 0.5166 0.7749 1.0332 1.2915 1.148 1.148 1.143 1. (CXX) g++ options: -O3
simdjson Throughput Test: Kostya OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: Kostya f e d c b 1.1948 2.3896 3.5844 4.7792 5.974 5.31 5.30 5.30 4.81 4.80 1. (CXX) g++ options: -O3
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU f e d b c 1500 3000 4500 6000 7500 5039 5081 5114 7176 7195
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU d e f c b 1600 3200 4800 6400 8000 5095 5127 5175 7320 7338
VVenC Video Input: Bosphorus 4K - Video Preset: Fast OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Fast e f d c b 2 4 6 8 10 7.946 7.834 7.829 6.550 6.539 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
oneDNN Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU e d f b c 700 1400 2100 2800 3500 2145.16 2150.83 2188.89 3320.34 3321.37 MIN: 1934.78 MIN: 1926.06 MIN: 1932.77 MIN: 3316.93 MIN: 3317.95 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU d e f c b 700 1400 2100 2800 3500 2113.83 2125.71 2141.47 3319.30 3319.64 MIN: 1927.97 MIN: 1925.61 MIN: 1947.18 MIN: 3315.49 MIN: 3316.21 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
OpenRadioss Model: Cell Phone Drop Test OpenBenchmarking.org Seconds, Fewer Is Better OpenRadioss 2023.09.15 Model: Cell Phone Drop Test f e d c b 20 40 60 80 100 65.34 65.66 66.07 95.46 96.02
oneDNN Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU d f e b c 400 800 1200 1600 2000 1062.21 1081.46 1083.23 1705.51 1705.89 MIN: 982.53 MIN: 999.13 MIN: 981.64 MIN: 1702.42 MIN: 1703.31 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
oneDNN Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU f e d b c 400 800 1200 1600 2000 1044.98 1064.70 1145.28 1705.10 1707.37 MIN: 983.94 MIN: 979.41 MIN: 996.9 MIN: 1702.22 MIN: 1704.07 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
simdjson Throughput Test: LargeRandom OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: LargeRandom f e d c b 0.45 0.9 1.35 1.8 2.25 2.00 1.99 1.99 1.77 1.77 1. (CXX) g++ options: -O3
PyPerformance Benchmark: python_startup OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: python_startup d f e c b 1.0913 2.1826 3.2739 4.3652 5.4565 4.65 4.66 4.68 4.84 4.85
Xmrig Variant: Wownero - Hash Count: 1M OpenBenchmarking.org H/s, More Is Better Xmrig 6.18.1 Variant: Wownero - Hash Count: 1M d e f b c 3K 6K 9K 12K 15K 16204.0 16088.6 15832.3 11202.1 11160.8 1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU e d f c b 2K 4K 6K 8K 10K 5932 6008 6037 8512 8523
OSPRay Benchmark: gravity_spheres_volume/dim_512/scivis/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/scivis/real_time d e f c b 1.1463 2.2926 3.4389 4.5852 5.7315 5.09456 5.03492 4.94223 3.65807 3.60869
Chaos Group V-RAY Mode: CPU OpenBenchmarking.org vsamples, More Is Better Chaos Group V-RAY 5.02 Mode: CPU e f d c b 6K 12K 18K 24K 30K 29012 28970 27940 18885 18743
OSPRay Benchmark: gravity_spheres_volume/dim_512/ao/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/ao/real_time e d f c b 1.1872 2.3744 3.5616 4.7488 5.936 5.27642 5.25193 5.05305 3.79562 3.69580
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU f e d c b 500 1000 1500 2000 2500 1508 1524 1547 2150 2156
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org Billion Interactions/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 f e d c b 4 8 12 16 20 15.023 15.007 14.948 8.705 8.679 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
miniBUDE Implementation: OpenMP - Input Deck: BM1 OpenBenchmarking.org GFInst/s, More Is Better miniBUDE 20210901 Implementation: OpenMP - Input Deck: BM1 f e d c b 80 160 240 320 400 375.58 375.18 373.70 217.63 216.97 1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
NAS Parallel Benchmarks Test / Class: BT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: BT.C d e f c b 11K 22K 33K 44K 55K 50512.70 50229.93 49366.17 34573.71 34501.92 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU e d f c b 400 800 1200 1600 2000 1294 1304 1318 1840 1848
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU d f e c b 400 800 1200 1600 2000 1294 1296 1305 1816 1830
Memcached Set To Get Ratio: 1:100 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:100 e f d c b 1.1M 2.2M 3.3M 4.4M 5.5M 5241319.80 5171670.55 5061427.55 4149899.46 4058394.64 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
Memcached Set To Get Ratio: 1:10 OpenBenchmarking.org Ops/sec, More Is Better Memcached 1.6.19 Set To Get Ratio: 1:10 e f d b c 1.2M 2.4M 3.6M 4.8M 6M 5730258.98 5727247.49 5693052.64 4464270.25 4305589.62 1. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU d f e c b 8K 16K 24K 32K 40K 24365 24512 24526 37367 37384
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU e f d c b 16K 32K 48K 64K 80K 50458 50705 50740 72092 72426
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU f e d b c 400 800 1200 1600 2000 1431.86 1447.91 1448.07 1730.89 1733.40 MIN: 963.6 / MAX: 2030.33 MIN: 1094.97 / MAX: 2000.94 MIN: 1089.83 / MAX: 1961.07 MIN: 1541.53 / MAX: 2245.54 MIN: 1541.89 / MAX: 2247.48 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU f d e c b 1.2465 2.493 3.7395 4.986 6.2325 5.54 5.48 5.45 3.45 3.45 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OSPRay Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time OpenBenchmarking.org Items Per Second, More Is Better OSPRay 2.12 Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time d f e c b 2 4 6 8 10 6.94770 6.94737 6.92496 5.04566 5.03922
Blender Blend File: BMW27 - Compute: CPU-Only OpenBenchmarking.org Seconds, Fewer Is Better Blender 3.6 Blend File: BMW27 - Compute: CPU-Only f e d b c 20 40 60 80 100 52.67 52.94 53.09 78.97 79.19
LuxCoreRender Scene: Orange Juice - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Orange Juice - Acceleration: CPU e d f c b 2 4 6 8 10 6.96 6.93 6.91 4.91 4.88 MIN: 6.29 / MAX: 7.55 MIN: 6.23 / MAX: 7.51 MIN: 6.28 / MAX: 7.27 MIN: 4.72 / MAX: 4.97 MIN: 4.71 / MAX: 4.93
LuxCoreRender Scene: Danish Mood - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Danish Mood - Acceleration: CPU f e d c b 0.783 1.566 2.349 3.132 3.915 3.48 3.46 3.44 2.27 2.22 MIN: 1.71 / MAX: 3.89 MIN: 1.38 / MAX: 3.89 MIN: 1.57 / MAX: 3.87 MIN: 0.86 / MAX: 2.63 MIN: 0.78 / MAX: 2.61
Ngspice Circuit: C7552 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C7552 d f e c b 15 30 45 60 75 56.74 56.77 56.81 63.91 65.90 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
TensorFlow Device: CPU - Batch Size: 16 - Model: ResNet-50 OpenBenchmarking.org images/sec, More Is Better TensorFlow 2.12 Device: CPU - Batch Size: 16 - Model: ResNet-50 f d e c b 8 16 24 32 40 35.42 35.36 35.10 23.93 23.91
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU e f d b c 100 200 300 400 500 432.96 435.36 436.86 469.85 470.00 MIN: 258.52 / MAX: 962.37 MIN: 259 / MAX: 1007.23 MIN: 257.97 / MAX: 949.62 MIN: 366.95 / MAX: 965.67 MIN: 367.47 / MAX: 966.24 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU e f d c b 5 10 15 20 25 18.46 18.26 18.22 12.75 12.75 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
LuxCoreRender Scene: LuxCore Benchmark - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: LuxCore Benchmark - Acceleration: CPU e f d c b 0.8753 1.7506 2.6259 3.5012 4.3765 3.89 3.85 3.82 2.55 2.54 MIN: 1.74 / MAX: 4.37 MIN: 1.53 / MAX: 4.35 MIN: 1.47 / MAX: 4.34 MIN: 0.93 / MAX: 2.98 MIN: 0.92 / MAX: 2.97
Ngspice Circuit: C2670 OpenBenchmarking.org Seconds, Fewer Is Better Ngspice 34 Circuit: C2670 f d e c b 15 30 45 60 75 56.87 57.22 57.46 66.08 66.39 1. (CC) gcc options: -O0 -fopenmp -lm -lstdc++ -lfftw3 -lXaw -lXmu -lXt -lXext -lX11 -lXft -lfontconfig -lXrender -lfreetype -lSM -lICE
LuxCoreRender Scene: DLSC - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: DLSC - Acceleration: CPU f e d b c 0.9495 1.899 2.8485 3.798 4.7475 4.22 4.17 4.15 2.88 2.87 MIN: 4.08 / MAX: 4.91 MIN: 4 / MAX: 4.47 MIN: 4.05 / MAX: 4.44 MIN: 2.81 / MAX: 3 MIN: 2.8 / MAX: 2.98
IndigoBench Acceleration: CPU - Scene: Bedroom OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Bedroom f d e b c 0.9151 1.8302 2.7453 3.6604 4.5755 4.067 4.064 4.062 2.689 2.600
IndigoBench Acceleration: CPU - Scene: Supercar OpenBenchmarking.org M samples/s, More Is Better IndigoBench 4.4 Acceleration: CPU - Scene: Supercar e d f c b 3 6 9 12 15 12.189 12.020 11.407 7.778 7.592
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU f e d b c 30 60 90 120 150 123.75 126.22 128.32 134.23 135.00 MIN: 72.91 / MAX: 247.36 MIN: 85.87 / MAX: 245.66 MIN: 86.52 / MAX: 370.89 MIN: 63.18 / MAX: 200.65 MIN: 61.37 / MAX: 200.78 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU f e d b c 14 28 42 56 70 64.57 63.33 62.26 44.66 44.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU f e d c b 30 60 90 120 150 143.52 145.64 148.70 153.73 154.78 MIN: 72.01 / MAX: 257.8 MIN: 62.64 / MAX: 256.75 MIN: 107.41 / MAX: 259.59 MIN: 83.63 / MAX: 218.01 MIN: 79.79 / MAX: 219.52 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU f e d c b 13 26 39 52 65 55.69 54.88 53.71 38.98 38.73 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU e d f c b 30 60 90 120 150 145.89 147.80 148.29 153.66 154.50 MIN: 66.07 / MAX: 257.13 MIN: 108.01 / MAX: 317.25 MIN: 66.12 / MAX: 269.72 MIN: 79.35 / MAX: 217.79 MIN: 131.36 / MAX: 217.7 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU e d f c b 12 24 36 48 60 54.78 54.06 53.90 39.00 38.75 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU f e d c b 6 12 18 24 30 20.89 21.51 21.63 23.41 23.47 MIN: 12.72 / MAX: 44.33 MIN: 13.48 / MAX: 43.52 MIN: 9.02 / MAX: 46.9 MIN: 18.22 / MAX: 41.11 MIN: 11.19 / MAX: 40.86 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU f e d c b 80 160 240 320 400 382.30 371.16 369.15 255.73 255.00 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU f e b c d 3 6 9 12 15 11.17 11.83 11.88 11.91 12.05 MIN: 6.05 / MAX: 28.92 MIN: 6.65 / MAX: 27.36 MIN: 9.25 / MAX: 18.16 MIN: 5.89 / MAX: 17.91 MIN: 7.14 / MAX: 26.84 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU f e d b c 150 300 450 600 750 715.44 675.24 662.91 503.59 502.05 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU b c f e d 20 40 60 80 100 71.08 71.15 79.99 80.25 81.20 MIN: 55.16 / MAX: 87.27 MIN: 53.9 / MAX: 90.11 MIN: 47.77 / MAX: 173.9 MIN: 63.46 / MAX: 183.13 MIN: 43.5 / MAX: 176.88 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU f e d b c 70 140 210 280 350 299.80 298.80 295.35 224.92 224.74 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU b c f e d 15 30 45 60 75 59.37 59.91 65.93 67.16 67.85 MIN: 52.65 / MAX: 80.39 MIN: 30.66 / MAX: 74.01 MIN: 31.99 / MAX: 150.16 MIN: 42.11 / MAX: 149.84 MIN: 31.98 / MAX: 153.29 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU f e d b c 80 160 240 320 400 363.79 357.15 353.52 269.31 266.96 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU b c e f d 15 30 45 60 75 56.41 56.71 66.06 66.42 66.49 MIN: 41.43 / MAX: 98.24 MIN: 40.93 / MAX: 99.23 MIN: 32.08 / MAX: 125.5 MIN: 39.7 / MAX: 126.38 MIN: 40.75 / MAX: 125.11 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU e f d b c 30 60 90 120 150 120.99 120.32 120.21 106.23 105.65 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU c b e f d 2 4 6 8 10 7.37 7.52 7.96 7.98 8.02 MIN: 5.99 / MAX: 11.83 MIN: 5.11 / MAX: 16.43 MIN: 4.21 / MAX: 12.87 MIN: 5.16 / MAX: 18.32 MIN: 5.89 / MAX: 15.12 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU e f d c b 600 1200 1800 2400 3000 3010.60 3005.33 2989.78 2170.03 2125.52 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU f e d c b 3 6 9 12 15 8.77 8.93 9.01 9.79 9.84 MIN: 5 / MAX: 22.42 MIN: 5.27 / MAX: 19.37 MIN: 5.26 / MAX: 19.9 MIN: 7.41 / MAX: 19.03 MIN: 7.45 / MAX: 19.24 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU f e d c b 200 400 600 800 1000 910.84 894.24 886.66 610.86 607.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU c b f e d 10 20 30 40 50 41.63 41.65 43.32 43.34 43.58 MIN: 23.08 / MAX: 44.79 MIN: 23.28 / MAX: 44.79 MIN: 31.46 / MAX: 73.03 MIN: 30.19 / MAX: 72.32 MIN: 31.3 / MAX: 72.83 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU f e d c b 120 240 360 480 600 553.49 553.27 550.30 383.97 383.74 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU b c e f d 0.126 0.252 0.378 0.504 0.63 0.49 0.49 0.55 0.55 0.56 MIN: 0.29 / MAX: 3.73 MIN: 0.29 / MAX: 3.11 MIN: 0.28 / MAX: 13.45 MIN: 0.27 / MAX: 5.91 MIN: 0.28 / MAX: 9.03 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU e f d b c 9K 18K 27K 36K 45K 42386.78 42343.47 41956.06 32245.66 32241.73 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU c b f e d 5 10 15 20 25 19.09 19.33 22.45 22.63 22.70 MIN: 15 / MAX: 32.89 MIN: 8.18 / MAX: 34.75 MIN: 11.6 / MAX: 50.7 MIN: 11.81 / MAX: 48.47 MIN: 11.49 / MAX: 48.33 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU f e d c b 80 160 240 320 400 355.77 352.96 351.84 313.32 309.56 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU b c d e f 3 6 9 12 15 11.74 11.76 12.93 12.93 12.95 MIN: 9.52 / MAX: 22.89 MIN: 8.2 / MAX: 22.88 MIN: 8.18 / MAX: 24.53 MIN: 8.29 / MAX: 22.42 MIN: 8.15 / MAX: 35.46 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU e d f b c 400 800 1200 1600 2000 1855.23 1854.34 1852.66 1361.39 1359.67 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU f e d c b 1.3253 2.6506 3.9759 5.3012 6.6265 4.78 5.08 5.10 5.81 5.89 MIN: 2.77 / MAX: 19.3 MIN: 3.18 / MAX: 17.17 MIN: 2.36 / MAX: 12.91 MIN: 2.72 / MAX: 9.48 MIN: 4.59 / MAX: 9.41 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU f e d c b 400 800 1200 1600 2000 1668.97 1569.98 1564.50 1028.14 1011.66 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU b c f e d 0.3173 0.6346 0.9519 1.2692 1.5865 1.27 1.28 1.39 1.40 1.41 MIN: 0.78 / MAX: 10.23 MIN: 0.75 / MAX: 10.14 MIN: 0.84 / MAX: 11.1 MIN: 0.75 / MAX: 8.02 MIN: 0.83 / MAX: 9.26 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU f e d b c 4K 8K 12K 16K 20K 17187.30 17036.01 16901.10 12503.20 12456.32 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
John The Ripper Test: MD5 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: MD5 f e d b c 400K 800K 1200K 1600K 2000K 1888000 1862000 1862000 1134000 1131000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
John The Ripper Test: HMAC-SHA512 OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: HMAC-SHA512 d f e c b 15M 30M 45M 60M 75M 71981000 71931000 65609000 46015000 45526000 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org verify/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 f d e c b 80K 160K 240K 320K 400K 354387.3 353220.2 345908.2 231842.8 231709.8 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
OpenSSL Algorithm: RSA4096 OpenBenchmarking.org sign/s, More Is Better OpenSSL 3.1 Algorithm: RSA4096 d f e c b 1200 2400 3600 4800 6000 5418.1 5367.5 5294.3 3578.6 3577.3 1. (CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl
Stockfish Total Time OpenBenchmarking.org Nodes Per Second, More Is Better Stockfish 15 Total Time d f e b c 15M 30M 45M 60M 75M 71722427 71115165 64256226 46079885 41998826 1. (CXX) g++ options: -lgcov -m64 -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -msse -msse3 -mpopcnt -mavx2 -msse4.1 -mssse3 -msse2 -mbmi2 -flto -flto=jobserver
simdjson Throughput Test: PartialTweets OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: PartialTweets e d f c b 2 4 6 8 10 7.37 7.37 7.36 6.51 5.39 1. (CXX) g++ options: -O3
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU e d f c b 7K 14K 21K 28K 35K 20740 20919 21146 32538 32575
simdjson Throughput Test: DistinctUserID OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: DistinctUserID e d f c b 2 4 6 8 10 8.25 8.25 8.24 6.04 6.04 1. (CXX) g++ options: -O3
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU d f e c b 13K 26K 39K 52K 65K 43450 43624 44009 62329 62478
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU f e d b c 7K 14K 21K 28K 35K 20598 20746 20752 31957 32007
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU e f d b c 13K 26K 39K 52K 65K 42719 42916 43729 61145 61161
simdjson Throughput Test: TopTweet OpenBenchmarking.org GB/s, More Is Better simdjson 2.0 Throughput Test: TopTweet f e d c b 2 4 6 8 10 8.31 8.29 8.29 7.31 7.29 1. (CXX) g++ options: -O3
Intel Open Image Denoise Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only OpenBenchmarking.org Images / Sec, More Is Better Intel Open Image Denoise 2.1 Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only f e d c b 0.1485 0.297 0.4455 0.594 0.7425 0.66 0.66 0.65 0.43 0.43
Selenium Benchmark: Jetstream 2 - Browser: Google Chrome OpenBenchmarking.org Score, More Is Better Selenium Benchmark: Jetstream 2 - Browser: Google Chrome e f d c b 90 180 270 360 450 421.02 419.30 419.01 388.31 382.67 1. chrome 118.0.5993.70
Timed Linux Kernel Compilation Build: defconfig OpenBenchmarking.org Seconds, Fewer Is Better Timed Linux Kernel Compilation 6.1 Build: defconfig d f e c b 15 30 45 60 75 44.92 45.72 45.82 64.96 65.56
Rodinia Test: OpenMP HotSpot3D OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP HotSpot3D e d f b c 13 26 39 52 65 48.47 48.85 49.01 56.00 58.02 1. (CXX) g++ options: -O2 -lOpenCL
uvg266 Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Medium f e d c b 4 8 12 16 20 14.38 14.29 14.25 9.43 9.41
Rodinia Test: OpenMP Leukocyte OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Leukocyte e d f b c 12 24 36 48 60 47.48 48.08 48.28 53.87 54.93 1. (CXX) g++ options: -O2 -lOpenCL
CloverLeaf Input: clover_bm OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm d f e b c a 13 26 39 52 65 42.55 42.77 42.83 55.82 56.10 58.98 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
SPECFEM3D Model: Homogeneous Halfspace OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Homogeneous Halfspace d f e b c 14 28 42 56 70 37.65 37.99 38.24 62.09 63.90 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
NAS Parallel Benchmarks Test / Class: LU.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: LU.C d e f b c 11K 22K 33K 44K 55K 52753.11 52719.71 52390.75 35845.04 34481.92 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Node.js V8 Web Tooling Benchmark OpenBenchmarking.org runs/s, More Is Better Node.js V8 Web Tooling Benchmark f d e b c 6 12 18 24 30 26.63 26.59 26.44 24.04 23.75
VVenC Video Input: Bosphorus 4K - Video Preset: Faster OpenBenchmarking.org Frames Per Second, More Is Better VVenC 1.9 Video Input: Bosphorus 4K - Video Preset: Faster f d e c b 4 8 12 16 20 15.44 15.44 15.30 13.28 13.27 1. (CXX) g++ options: -O3 -flto=auto -fno-fat-lto-objects
Selenium Benchmark: Octane - Browser: Firefox OpenBenchmarking.org Geometric Mean, More Is Better Selenium Benchmark: Octane - Browser: Firefox d e f c b 13K 26K 39K 52K 65K 60754 60711 60209 55067 55056 1. firefox 118.0.2
SPECFEM3D Model: Mount St. Helens OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Mount St. Helens d e f b c 12 24 36 48 60 30.83 31.65 32.37 51.62 51.75 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
SPECFEM3D Model: Tomographic Model OpenBenchmarking.org Seconds, Fewer Is Better SPECFEM3D 4.0 Model: Tomographic Model d f e b c 12 24 36 48 60 30.33 30.47 30.65 49.14 51.56 1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
ASTC Encoder Preset: Exhaustive OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Exhaustive f e d b c 0.3566 0.7132 1.0698 1.4264 1.783 1.5848 1.5633 1.5573 1.0807 1.0804 1. (CXX) g++ options: -O3 -flto -pthread
Selenium Benchmark: Octane - Browser: Google Chrome OpenBenchmarking.org Geometric Mean, More Is Better Selenium Benchmark: Octane - Browser: Google Chrome f d c e b 30K 60K 90K 120K 150K 118632 117684 105856 102783 97769 1. chrome 118.0.5993.70
NAMD ATPase Simulation - 327,506 Atoms OpenBenchmarking.org days/ns, Fewer Is Better NAMD 2.14 ATPase Simulation - 327,506 Atoms f e d c b 0.1942 0.3884 0.5826 0.7768 0.971 0.61266 0.61420 0.61454 0.86116 0.86311
RawTherapee Total Benchmark Time OpenBenchmarking.org Seconds, Fewer Is Better RawTherapee Total Benchmark Time d f e b c 9 18 27 36 45 34.45 35.08 35.19 40.09 40.30 1. RawTherapee, version 5.9, command line.
libavif avifenc Encoder Speed: 2 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 2 e f d c b 10 20 30 40 50 32.79 33.17 33.20 44.00 44.20 1. (CXX) g++ options: -O3 -fPIC -lm
Selenium Benchmark: Speedometer - Browser: Firefox OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Firefox e d f c b 90 180 270 360 450 433 433 430 407 406 1. firefox 118.0.2
Kvazaar Video Input: Bosphorus 4K - Video Preset: Medium OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Medium e f d c b 5 10 15 20 25 19.43 19.40 19.38 13.09 13.07 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: simple-H2O d b e f c 8 16 24 32 40 31.97 32.16 32.19 32.23 32.72 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QuantLib Configuration: Multi-Threaded OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Multi-Threaded f e d c b a 20K 40K 60K 80K 100K 89756.1 89344.0 88060.0 57317.5 55963.1 55799.7 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
Embree Binary: Pathtracer ISPC - Model: Crown OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Crown e d f c b 7 14 21 28 35 29.57 29.50 29.19 19.15 19.03 MIN: 29.06 / MAX: 32.27 MIN: 28.9 / MAX: 32.01 MIN: 28.59 / MAX: 31.95 MIN: 18.85 / MAX: 19.5 MIN: 18.79 / MAX: 19.41
Selenium Benchmark: Speedometer - Browser: Google Chrome OpenBenchmarking.org Runs Per Minute, More Is Better Selenium Benchmark: Speedometer - Browser: Google Chrome f e d b c 110 220 330 440 550 504 502 498 486 471 1. chrome 118.0.5993.70
John The Ripper Test: WPA PSK OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: WPA PSK e f d c b 13K 26K 39K 52K 65K 61351 61337 60805 36403 36403 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
Cpuminer-Opt Algorithm: scrypt OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: scrypt e d f b c a 70 140 210 280 350 311.73 310.55 309.14 195.41 194.68 185.62 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
John The Ripper Test: bcrypt OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: bcrypt e f d b c 10K 20K 30K 40K 50K 47942 47587 47260 31152 31110 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
John The Ripper Test: Blowfish OpenBenchmarking.org Real C/S, More Is Better John The Ripper 2023.03.14 Test: Blowfish e d f c b 10K 20K 30K 40K 50K 47590 47385 47203 31050 26232 1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt -lbz2
Liquid-DSP Threads: 32 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 32 - Buffer Length: 256 - Filter Length: 512 e f d c b 70M 140M 210M 280M 350M 331120000 329870000 329800000 228970000 228820000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 8 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 8 - Buffer Length: 256 - Filter Length: 512 e f d c b 40M 80M 120M 160M 200M 171400000 169900000 168030000 140020000 139440000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Liquid-DSP Threads: 16 - Buffer Length: 256 - Filter Length: 512 OpenBenchmarking.org samples/s, More Is Better Liquid-DSP 1.6 Threads: 16 - Buffer Length: 256 - Filter Length: 512 f e d c b 50M 100M 150M 200M 250M 239250000 239170000 223360000 204100000 204060000 1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Blake-2 S d e f c b a 30K 60K 90K 120K 150K 141280 140960 138510 86620 85030 79320 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Garlicoin d e f c b a 600 1200 1800 2400 3000 2726.85 2605.60 2559.46 2040.66 2040.33 1893.63 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Deepcoin e f d c b a 2K 4K 6K 8K 10K 9300.91 9248.07 9210.49 5604.27 5549.86 5436.88 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Magi f d e c b a 160 320 480 640 800 722.00 714.42 710.53 450.42 442.28 417.24 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Myriad-Groestl d e f b c a 3K 6K 9K 12K 15K 14360.00 14310.00 14200.00 8774.33 8765.28 8564.28 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Ringcoin e d f b c a 900 1800 2700 3600 4500 4048.08 4039.03 3946.27 2508.93 2494.87 2345.04 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Quad SHA-256, Pyrite f d e b c a 15K 30K 45K 60K 75K 71070 70960 70910 43650 43630 41360 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Triple SHA-256, Onecoin e f d b c a 20K 40K 60K 80K 100K 103680 102900 102200 62870 62850 61070 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: LBC, LBRY Credits e d f c b a 4K 8K 12K 16K 20K 16950.00 16610.00 16590.00 9996.74 9845.56 9035.20 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Skeincoin d f e c b a 8K 16K 24K 32K 40K 38520 38040 38020 22780 22770 21370 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Selenium Benchmark: PSPDFKit WASM - Browser: Firefox OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Firefox e f d c b 500 1000 1500 2000 2500 2051 2055 2059 2096 2108 1. firefox 118.0.2
Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome OpenBenchmarking.org Score, Fewer Is Better Selenium Benchmark: PSPDFKit WASM - Browser: Google Chrome d f e c b 500 1000 1500 2000 2500 2139 2139 2140 2170 2202 1. chrome 118.0.5993.70
Embree Binary: Pathtracer ISPC - Model: Asian Dragon OpenBenchmarking.org Frames Per Second, More Is Better Embree 4.3 Binary: Pathtracer ISPC - Model: Asian Dragon d e f c b 8 16 24 32 40 35.45 35.33 35.27 23.20 23.10 MIN: 35.02 / MAX: 37.92 MIN: 34.92 / MAX: 37.85 MIN: 34.8 / MAX: 37.69 MIN: 23.04 / MAX: 23.45 MIN: 22.96 / MAX: 23.41
WebP Image Encode Encode Settings: Quality 100, Lossless, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless, Highest Compression f e d c b 0.2205 0.441 0.6615 0.882 1.1025 0.98 0.97 0.97 0.88 0.87 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
PyPerformance Benchmark: regex_compile OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: regex_compile d f e c b 16 32 48 64 80 63.6 64.8 67.0 71.4 71.6
Timed Mesa Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Mesa Compilation 21.0 Time To Compile d e f c b 7 14 21 28 35 21.80 21.85 21.99 29.20 29.83
PyPerformance Benchmark: crypto_pyaes OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: crypto_pyaes d f e c b 9 18 27 36 45 33.0 33.0 33.6 37.4 37.5
PyPerformance Benchmark: json_loads OpenBenchmarking.org Milliseconds, Fewer Is Better PyPerformance 1.0.0 Benchmark: json_loads f d e c b 3 6 9 12 15 8.98 9.18 9.23 10.10 10.20
7-Zip Compression Test: Decompression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Decompression Rating e d f b c 30K 60K 90K 120K 150K 155668 142133 138934 92168 92137 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
7-Zip Compression Test: Compression Rating OpenBenchmarking.org MIPS, More Is Better 7-Zip Compression 22.01 Test: Compression Rating f d e c b 40K 80K 120K 160K 200K 185607 183859 171918 127943 127009 1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
Selenium Benchmark: Kraken - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Firefox d f b c e 120 240 360 480 600 487.8 490.7 520.2 520.4 552.8 1. firefox 118.0.2
ASTC Encoder Preset: Thorough OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Thorough f d e c b 4 8 12 16 20 16.98 16.96 16.95 10.69 10.69 1. (CXX) g++ options: -O3 -flto -pthread
QMCPACK Input: H4_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: H4_ae f e d b c 6 12 18 24 30 17.70 18.30 18.59 22.73 25.09 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
x265 Video Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better x265 3.4 Video Input: Bosphorus 4K f e d c b 8 16 24 32 40 33.47 33.23 32.51 26.74 25.70 1. (CXX) g++ options: -O3 -rdynamic -lpthread -lrt -ldl -lnuma
Natron Input: Spaceship OpenBenchmarking.org FPS, More Is Better Natron 2.4.3 Input: Spaceship f d e b c 2 4 6 8 10 6.4 6.4 6.3 4.6 4.4
NAS Parallel Benchmarks Test / Class: FT.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: FT.C e f d b c 5K 10K 15K 20K 25K 25591.23 25290.90 24740.05 18953.53 18737.09 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
NAS Parallel Benchmarks Test / Class: SP.B OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: SP.B d f e b c 5K 10K 15K 20K 25K 22004.23 21905.42 21129.21 16564.09 16408.43 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
LibRaw Post-Processing Benchmark OpenBenchmarking.org Mpix/sec, More Is Better LibRaw 0.20 Post-Processing Benchmark d f e c b 16 32 48 64 80 71.16 70.56 69.03 63.14 61.85 1. (CXX) g++ options: -O2 -fopenmp -ljpeg -lz -lm
NAS Parallel Benchmarks Test / Class: CG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: CG.C d e b f c 2K 4K 6K 8K 10K 10390.33 8593.82 8025.90 7707.48 7422.74 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
uvg266 Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Very Fast f d e c b 9 18 27 36 45 39.46 38.59 38.46 27.74 27.74
uvg266 Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Super Fast e f d c b 10 20 30 40 50 42.91 42.68 41.03 29.34 29.30
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 b c a f e d 4 8 12 16 20 15.20 15.18 15.11 15.03 14.79 14.79 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 c b a d e f 4 8 12 16 20 15.22 15.22 15.08 15.00 14.90 14.77 1. (CXX) g++ options: -O3
Kvazaar Video Input: Bosphorus 4K - Video Preset: Very Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Very Fast f e d b c 10 20 30 40 50 43.18 42.88 42.25 32.39 32.33 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
Selenium Benchmark: Kraken - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: Kraken - Browser: Google Chrome f e d c b 80 160 240 320 400 317.3 318.5 319.4 343.2 349.0 1. chrome 118.0.5993.70
QuantLib Configuration: Single-Threaded OpenBenchmarking.org MFLOPS, More Is Better QuantLib 1.32 Configuration: Single-Threaded e f d b c a 1200 2400 3600 4800 6000 5672.2 5666.3 5665.0 5016.1 5014.7 4818.8 1. (CXX) g++ options: -O3 -march=native -fPIE -pie
oneDNN Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU d e f b c 0.6734 1.3468 2.0202 2.6936 3.367 2.01150 2.01217 2.03456 2.98963 2.99292 MIN: 1.64 MIN: 1.64 MIN: 1.64 MIN: 2.87 MIN: 2.87 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 b c a e d f 5 10 15 20 25 18.32 18.28 18.20 15.85 15.81 15.53 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 b c a d e f 5 10 15 20 25 18.29 18.25 18.24 15.98 15.95 15.88 1. (CXX) g++ options: -O3
uvg266 Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better uvg266 0.4.1 Video Input: Bosphorus 4K - Video Preset: Ultra Fast e f d b c 11 22 33 44 55 50.81 50.69 50.11 35.30 35.09
Selenium Benchmark: WASM collisionDetection - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Firefox e f d b c 50 100 150 200 250 214.3 214.5 216.0 242.3 242.5 1. firefox 118.0.2
Y-Cruncher Pi Digits To Calculate: 500M OpenBenchmarking.org Seconds, Fewer Is Better Y-Cruncher 0.7.10.9513 Pi Digits To Calculate: 500M d e f b c 3 6 9 12 15 10.78 10.86 11.11 13.44 13.52
Kvazaar Video Input: Bosphorus 4K - Video Preset: Super Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Super Fast e d f b c 12 24 36 48 60 54.71 54.02 53.99 40.71 40.69 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
PHPBench PHP Benchmark Suite OpenBenchmarking.org Score, More Is Better PHPBench 0.8.1 PHP Benchmark Suite d f e b c 400K 800K 1200K 1600K 2000K 1692102 1686956 1685554 1491682 1479761
GIMP Test: resize OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: resize e c d b f 3 6 9 12 15 12.11 12.39 12.52 12.63 12.70
LuxCoreRender Scene: Rainbow Colors and Prism - Acceleration: CPU OpenBenchmarking.org M samples/sec, More Is Better LuxCoreRender 2.6 Scene: Rainbow Colors and Prism - Acceleration: CPU d e f c b 4 8 12 16 20 17.96 17.74 14.89 12.54 12.46 MIN: 17.28 / MAX: 18.69 MIN: 16.79 / MAX: 18.55 MIN: 12.73 / MAX: 16.75 MIN: 12.43 / MAX: 12.6 MIN: 12.32 / MAX: 12.57
GIMP Test: unsharp-mask OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: unsharp-mask d b e f c 3 6 9 12 15 11.04 11.12 11.12 11.17 11.35
Rodinia Test: OpenMP Streamcluster OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP Streamcluster e d f c b 4 8 12 16 20 7.176 7.224 7.226 15.834 15.894 1. (CXX) g++ options: -O2 -lOpenCL
Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM collisionDetection - Browser: Google Chrome f c e d b 50 100 150 200 250 171.17 193.39 204.81 211.66 232.97 1. chrome 118.0.5993.70
WebP Image Encode Encode Settings: Quality 100, Lossless OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Lossless f d e b c 0.5513 1.1026 1.6539 2.2052 2.7565 2.45 2.45 2.44 2.20 2.19 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
GIMP Test: auto-levels OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: auto-levels d e f b c 3 6 9 12 15 9.459 9.526 9.526 9.615 9.748
SVT-AV1 Encoder Mode: Preset 8 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 8 - Input: Bosphorus 4K e d f b c 20 40 60 80 100 75.65 75.36 75.06 58.46 58.18 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
PyBench Total For Average Test Times OpenBenchmarking.org Milliseconds, Fewer Is Better PyBench 2018-02-16 Total For Average Test Times f d e b c 100 200 300 400 500 376 377 378 427 446
Kvazaar Video Input: Bosphorus 4K - Video Preset: Ultra Fast OpenBenchmarking.org Frames Per Second, More Is Better Kvazaar 2.2 Video Input: Bosphorus 4K - Video Preset: Ultra Fast d e f c b 20 40 60 80 100 75.87 73.99 72.63 54.29 54.14 1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt
oneDNN Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU OpenBenchmarking.org ms, Fewer Is Better oneDNN 3.3 Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU e d f b c 0.9892 1.9784 2.9676 3.9568 4.946 4.30610 4.30701 4.31754 4.36881 4.39625 MIN: 4.28 MIN: 4.27 MIN: 4.27 MIN: 4.33 MIN: 4.36 1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl
GIMP Test: rotate OpenBenchmarking.org Seconds, Fewer Is Better GIMP 2.10.34 Test: rotate d f e b c 3 6 9 12 15 8.806 8.828 8.854 9.199 9.447
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 e d f b c a 7 14 21 28 35 28.66 28.44 27.78 27.73 27.55 27.22 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 e d f c b a 7 14 21 28 35 28.55 28.48 28.25 27.62 27.53 27.16 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 e d f b c a 7 14 21 28 35 29.11 28.91 28.45 28.42 28.21 27.99 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 d e f c b a 7 14 21 28 35 28.94 28.91 28.53 28.40 28.34 27.99 1. (CXX) g++ options: -O3
Selenium Benchmark: WASM imageConvolute - Browser: Firefox OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Firefox d e f c b 4 8 12 16 20 14.2 14.3 14.3 16.1 16.3 1. firefox 118.0.2
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 b c a e f d 7 14 21 28 35 32.01 31.90 31.73 28.77 28.76 28.66 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 b c a d e f 7 14 21 28 35 31.91 31.91 31.61 28.81 28.77 28.76 1. (CXX) g++ options: -O3
Rodinia Test: OpenMP CFD Solver OpenBenchmarking.org Seconds, Fewer Is Better Rodinia 3.1 Test: OpenMP CFD Solver d e f b c 3 6 9 12 15 5.935 6.101 6.490 10.118 10.121 1. (CXX) g++ options: -O2 -lOpenCL
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 b a c d e f 8 16 24 32 40 35.38 35.28 35.18 31.06 30.96 30.93 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 c b a f e d 8 16 24 32 40 35.43 35.29 35.22 31.05 30.99 30.84 1. (CXX) g++ options: -O3
SVT-HEVC Tuning: 7 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 7 - Input: Bosphorus 4K e d f c b 20 40 60 80 100 99.72 98.86 98.68 71.18 71.18 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
ASTC Encoder Preset: Medium OpenBenchmarking.org MT/s, More Is Better ASTC Encoder 4.0 Preset: Medium f d e c b 30 60 90 120 150 130.42 130.36 130.22 81.26 81.20 1. (CXX) g++ options: -O3 -flto -pthread
NAS Parallel Benchmarks Test / Class: MG.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: MG.C c e d f b 6K 12K 18K 24K 30K 26540.76 25162.58 24498.20 24381.01 24092.42 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
easyWave Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 OpenBenchmarking.org Seconds, Fewer Is Better easyWave r34 Input: e2Asean Grid + BengkuluSept2007 Source - Time: 240 d f e c b 2 4 6 8 10 4.774 4.785 4.833 7.127 8.637 1. (CXX) g++ options: -O3 -fopenmp
SVT-VP9 Tuning: VMAF Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: VMAF Optimized - Input: Bosphorus 4K f d e c b 20 40 60 80 100 108.94 107.40 107.34 96.00 93.49 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
SVT-VP9 Tuning: Visual Quality Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: Visual Quality Optimized - Input: Bosphorus 4K d f e c b 30 60 90 120 150 122.15 120.89 120.84 88.36 88.07 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome OpenBenchmarking.org ms, Fewer Is Better Selenium Benchmark: WASM imageConvolute - Browser: Google Chrome f d b c e 4 8 12 16 20 14.04 14.10 15.95 15.95 16.93 1. chrome 118.0.5993.70
libavif avifenc Encoder Speed: 6, Lossless OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6, Lossless e f d b c 2 4 6 8 10 5.063 5.066 5.097 6.908 6.983 1. (CXX) g++ options: -O3 -fPIC -lm
SVT-VP9 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-VP9 0.3 Tuning: PSNR/SSIM Optimized - Input: Bosphorus 4K f e d c b 30 60 90 120 150 152.13 150.27 137.97 112.34 111.84 1. (CC) gcc options: -O3 -fcommon -fPIE -fPIC -fvisibility=hidden -pie -rdynamic -lpthread -lrt -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 d f e c b a 12 24 36 48 60 54.58 54.26 53.88 50.85 50.63 49.45 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 d f e c b a 12 24 36 48 60 54.95 54.66 54.06 50.91 50.81 49.76 1. (CXX) g++ options: -O3
WebP Image Encode Encode Settings: Quality 100, Highest Compression OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100, Highest Compression f d e c b 1.206 2.412 3.618 4.824 6.03 5.36 5.36 5.28 4.74 4.74 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 b c a d e f 13 26 39 52 65 59.20 59.17 58.42 55.88 55.58 55.37 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 b c a f d e 13 26 39 52 65 59.36 59.22 58.65 55.78 55.67 55.62 1. (CXX) g++ options: -O3
SVT-HEVC Tuning: 10 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-HEVC 1.5.0 Tuning: 10 - Input: Bosphorus 4K d e f c b 40 80 120 160 200 185.07 178.47 170.99 140.06 139.99 1. (CC) gcc options: -fPIE -fPIC -O3 -O2 -pie -rdynamic -lpthread -lrt
SVT-AV1 Encoder Mode: Preset 13 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 13 - Input: Bosphorus 4K d f e b c 40 80 120 160 200 193.28 191.65 190.91 149.16 147.09 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
SVT-AV1 Encoder Mode: Preset 12 - Input: Bosphorus 4K OpenBenchmarking.org Frames Per Second, More Is Better SVT-AV1 1.7 Encoder Mode: Preset 12 - Input: Bosphorus 4K d f e c b 40 80 120 160 200 194.35 194.23 190.16 148.55 146.57 1. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq
libavif avifenc Encoder Speed: 6 OpenBenchmarking.org Seconds, Fewer Is Better libavif avifenc 1.0 Encoder Speed: 6 e d f b c 1.0307 2.0614 3.0921 4.1228 5.1535 3.472 3.852 3.854 4.515 4.581 1. (CXX) g++ options: -O3 -fPIC -lm
Darktable Test: Boat - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Boat - Acceleration: CPU-only d e f b c 0.9875 1.975 2.9625 3.95 4.9375 2.697 2.705 2.771 4.194 4.389
NAS Parallel Benchmarks Test / Class: EP.C OpenBenchmarking.org Total Mop/s, More Is Better NAS Parallel Benchmarks 3.4 Test / Class: EP.C d f e c b 700 1400 2100 2800 3500 3342.51 3340.25 2893.39 1775.67 1773.71 1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.5
Darktable Test: Masskrug - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Masskrug - Acceleration: CPU-only f e d b c 0.5452 1.0904 1.6356 2.1808 2.726 1.667 1.671 1.694 2.364 2.423
Darktable Test: Server Room - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Room - Acceleration: CPU-only f d e c b 0.4124 0.8248 1.2372 1.6496 2.062 1.405 1.415 1.434 1.825 1.833
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 b a c d e f 4 8 12 16 20 15.90 15.76 15.75 14.12 14.04 14.02 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 b c a d e f 4 8 12 16 20 16.05 15.94 15.83 14.09 14.05 14.02 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 b c a d e f 4 8 12 16 20 16.47 16.36 16.15 14.17 14.04 14.04 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 b c a f d e 4 8 12 16 20 16.48 16.43 16.27 14.14 14.13 14.08 1. (CXX) g++ options: -O3
WebP Image Encode Encode Settings: Quality 100 OpenBenchmarking.org MP/s, More Is Better WebP Image Encode 1.2.4 Encode Settings: Quality 100 e d f b c 4 8 12 16 20 17.54 17.54 17.53 15.57 15.53 1. (CC) gcc options: -fvisibility=hidden -O2 -lm
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 b c a d e f 7 14 21 28 35 29.01 28.81 28.59 25.82 25.57 25.45 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 b c a d e f 7 14 21 28 35 28.99 28.78 28.62 25.80 25.59 25.47 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 b c a d e f 7 14 21 28 35 30.04 29.88 29.68 27.64 27.53 27.34 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 b c a d e f 7 14 21 28 35 30.34 29.81 29.65 27.83 27.55 27.49 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 c b a d e f 7 14 21 28 35 29.87 29.52 29.39 27.63 27.40 27.34 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 b c a d f e 7 14 21 28 35 29.91 29.86 29.78 28.01 27.59 27.54 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 b c a d e f 7 14 21 28 35 31.82 31.55 31.30 27.78 27.57 27.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 b c a d f e 7 14 21 28 35 31.83 31.64 31.27 27.85 27.65 27.58 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 b a c d e f 13 26 39 52 65 55.94 55.50 55.41 53.75 53.46 53.41 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 c b a e d f 12 24 36 48 60 55.60 55.53 55.10 53.71 53.64 53.54 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 d f c b e a 13 26 39 52 65 58.98 58.59 58.47 58.15 58.05 57.98 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 d b e f c a 13 26 39 52 65 58.99 58.43 58.42 58.25 57.99 57.60 1. (CXX) g++ options: -O3
Darktable Test: Server Rack - Acceleration: CPU-only OpenBenchmarking.org Seconds, Fewer Is Better Darktable 4.4.2 Test: Server Rack - Acceleration: CPU-only e d f b c 0.0713 0.1426 0.2139 0.2852 0.3565 0.187 0.209 0.217 0.266 0.317
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 d f e b c a 4 8 12 16 20 16.94 16.86 16.84 14.48 14.38 14.28 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 f d e b a c 4 8 12 16 20 17.03 16.75 16.67 14.82 14.66 14.44 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 d f e c b a 5 10 15 20 25 19.23 19.19 18.79 18.55 18.43 18.11 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 d e f b c a 5 10 15 20 25 19.24 18.92 18.88 18.53 18.32 18.11 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 e f d b a c 13 26 39 52 65 56.67 53.02 52.54 32.69 31.81 22.17 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 d f e c b a 13 26 39 52 65 60.02 59.24 40.58 32.39 31.64 31.23 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 f d e b c a 12 24 36 48 60 54.80 53.69 52.90 32.62 32.41 31.60 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 f d e b c a 14 28 42 56 70 60.55 59.74 59.02 31.93 31.74 31.26 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 e d f b c a 20 40 60 80 100 97.44 95.58 87.45 53.17 52.25 51.88 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 e d f c b a 14 28 42 56 70 64.05 63.22 62.90 44.62 44.06 43.71 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 e d f b c a 15 30 45 60 75 64.99 64.20 63.75 44.88 44.60 44.29 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 d e f b c a 20 40 60 80 100 99.75 99.00 96.46 52.50 51.94 50.93 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 d f e b c a 30 60 90 120 150 117.85 115.05 114.12 71.01 69.36 66.67 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 d f e c b a 30 60 90 120 150 115.20 115.09 114.34 71.50 70.42 68.88 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 f d e b c a 40 80 120 160 200 187.72 186.55 183.83 134.25 131.51 127.31 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 e d f a c b 40 80 120 160 200 184.51 183.45 179.45 131.94 131.25 104.03 1. (CXX) g++ options: -O3
Phoronix Test Suite v10.8.5