2 x Intel Xeon Platinum 8490H testing with a Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.
Compare your own system(s) to this result file with the
Phoronix Test Suite by running the command:
phoronix-test-suite benchmark 2311059-NE-NEWSAT27963 new-sat - Phoronix Test Suite new-sat 2 x Intel Xeon Platinum 8490H testing with a Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.
HTML result view exported from: https://openbenchmarking.org/result/2311059-NE-NEWSAT27963&grw&export=pdf&rdt .
new-sat Processor Motherboard Chipset Memory Disk Graphics Network OS Kernel Desktop Display Server Compiler File-System Screen Resolution a b c d e f g h i j k l m n 2 x Intel Xeon Platinum 8490H @ 3.50GHz (120 Cores / 240 Threads) Quanta Cloud S6Q-MB-MPS (3A10.uh BIOS) Intel Device 1bce 1008GB 3201GB Micron_7450_MTFDKCC3T2TFS ASPEED 2 x Intel X710 for 10GBASE-T Ubuntu 23.10 6.6.0-rc5-phx-patched (x86_64) GNOME Shell 45.0 X Server 1.21.1.7 GCC 13.2.0 ext4 1920x1200 Intel Xeon Platinum 8490H @ 3.50GHz (60 Cores / 120 Threads) 512GB Intel Xeon Max 9468 @ 3.50GHz (48 Cores / 96 Threads) 2 x Intel Xeon Max 9468 @ 3.50GHz (96 Cores / 192 Threads) 1008GB 2 x Intel X710 for 10GBASE-T 2 x Intel Xeon Platinum 8490H @ 3.50GHz (120 Cores / 240 Threads) OpenBenchmarking.org Kernel Details - Transparent Huge Pages: madvise Compiler Details - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-bootstrap --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-13-XYspKM/gcc-13-13.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details - a: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x2b0004b1 - b: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x2b0004b1 - c: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x2b0004b1 - d: Scaling Governor: intel_pstate powersave (EPP: balance_performance) - CPU Microcode: 0x2b0004b1 - e: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004b1 - f: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004b1 - g: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2c000271 - h: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2c000271 - i: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2c000271 - j: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2c000271 - k: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2c000271 - l: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004b1 - m: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004b1 - n: Scaling Governor: intel_pstate performance (EPP: performance) - CPU Microcode: 0x2b0004b1 Security Details - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced / Automatic IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected Python Details - e, f, g, h, i, j, k, l, m, n: Python 3.11.6
new-sat heffte: r2c - Stock - float-long - 512 cloverleaf: clover_bm heffte: r2c - Stock - double-long - 256 cloverleaf: clover_bm16 heffte: c2c - Stock - double-long - 512 heffte: c2c - Stock - float-long - 256 heffte: r2c - FFTW - double-long - 512 heffte: r2c - FFTW - float-long - 128 heffte: c2c - FFTW - double-long - 256 heffte: c2c - FFTW - double - 128 heffte: c2c - FFTW - double - 256 heffte: r2c - FFTW - float - 256 heffte: r2c - FFTW - float - 512 heffte: r2c - FFTW - float - 128 heffte: c2c - FFTW - float - 512 heffte: c2c - FFTW - float - 256 heffte: r2c - Stock - double-long - 1024 heffte: c2c - FFTW - double - 512 heffte: c2c - FFTW - float-long - 256 heffte: c2c - FFTW - float - 1024 heffte: r2c - FFTW - float-long - 512 heffte: c2c - Stock - float - 128 heffte: c2c - FFTW - float-long - 1024 heffte: c2c - Stock - float - 256 heffte: r2c - FFTW - double-long - 128 heffte: c2c - Stock - float - 512 heffte: r2c - Stock - float-long - 128 heffte: r2c - FFTW - double - 128 heffte: c2c - Stock - double-long - 128 heffte: r2c - FFTW - double - 256 heffte: r2c - FFTW - double-long - 1024 heffte: r2c - FFTW - double - 512 heffte: r2c - Stock - float-long - 1024 heffte: r2c - FFTW - float - 1024 heffte: c2c - FFTW - float-long - 128 heffte: r2c - Stock - float - 128 heffte: c2c - FFTW - float-long - 512 heffte: r2c - Stock - float - 256 heffte: r2c - FFTW - float-long - 256 heffte: r2c - Stock - float - 512 heffte: c2c - FFTW - double-long - 128 heffte: c2c - FFTW - double - 1024 heffte: c2c - FFTW - double-long - 512 heffte: c2c - Stock - double - 128 heffte: c2c - Stock - float-long - 128 heffte: c2c - Stock - double - 256 heffte: c2c - Stock - float-long - 512 heffte: c2c - Stock - double - 512 heffte: r2c - FFTW - double-long - 256 heffte: c2c - Stock - float - 1024 heffte: r2c - FFTW - float-long - 1024 heffte: r2c - FFTW - double - 1024 heffte: r2c - Stock - float-long - 256 heffte: r2c - Stock - double - 128 heffte: c2c - FFTW - float - 128 heffte: r2c - Stock - double - 256 heffte: c2c - Stock - double-long - 256 heffte: r2c - Stock - double - 512 heffte: c2c - Stock - float-long - 1024 heffte: r2c - Stock - float - 1024 heffte: r2c - Stock - double-long - 128 heffte: c2c - Stock - double - 1024 heffte: r2c - Stock - double-long - 512 heffte: r2c - Stock - double - 1024 heffte: c2c - Stock - double-long - 1024 heffte: c2c - FFTW - double-long - 1024 cloverleaf: clover_bm64_short openvino: Face Detection FP16 - CPU openvino: Face Detection FP16 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Face Detection FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP16 - CPU openvino: Person Detection FP32 - CPU openvino: Person Detection FP32 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16-INT8 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Weld Porosity Detection FP16 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16-INT8 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Vehicle Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Person Vehicle Bike Detection FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Machine Translation EN To DE FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection Retail FP16 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Face Detection Retail FP16-INT8 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Handwritten English Recognition FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Road Segmentation ADAS FP16 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU openvino: Road Segmentation ADAS FP16-INT8 - CPU qmcpack: H4_ae qmcpack: Li2_STO_ae qmcpack: LiH_ae_MSD qmcpack: simple-H2O qmcpack: O_ae_pyscf_UHF qmcpack: FeCO6_b3lyp_gms cpuminer-opt: Magi cpuminer-opt: scrypt cpuminer-opt: Deepcoin cpuminer-opt: Ringcoin cpuminer-opt: Blake-2 S cpuminer-opt: Garlicoin cpuminer-opt: Skeincoin cpuminer-opt: Myriad-Groestl cpuminer-opt: LBC, LBRY Credits cpuminer-opt: Quad SHA-256, Pyrite cpuminer-opt: Triple SHA-256, Onecoin ospray-studio: 1 - 4K - 1 - Path Tracer - CPU ospray-studio: 2 - 4K - 1 - Path Tracer - CPU ospray-studio: 3 - 4K - 1 - Path Tracer - CPU ospray-studio: 1 - 4K - 16 - Path Tracer - CPU ospray-studio: 1 - 4K - 32 - Path Tracer - CPU ospray-studio: 2 - 4K - 16 - Path Tracer - CPU ospray-studio: 2 - 4K - 32 - Path Tracer - CPU ospray-studio: 3 - 4K - 16 - Path Tracer - CPU ospray-studio: 3 - 4K - 32 - Path Tracer - CPU ospray-studio: 1 - 1080p - 1 - Path Tracer - CPU ospray-studio: 2 - 1080p - 1 - Path Tracer - CPU ospray-studio: 3 - 1080p - 1 - Path Tracer - CPU ospray-studio: 1 - 1080p - 16 - Path Tracer - CPU ospray-studio: 1 - 1080p - 32 - Path Tracer - CPU ospray-studio: 2 - 1080p - 16 - Path Tracer - CPU ospray-studio: 2 - 1080p - 32 - Path Tracer - CPU ospray-studio: 3 - 1080p - 16 - Path Tracer - CPU ospray-studio: 3 - 1080p - 32 - Path Tracer - CPU build-gem5: Time To Compile blosc: blosclz shuffle - 8MB blosc: blosclz shuffle - 16MB blosc: blosclz shuffle - 32MB blosc: blosclz shuffle - 64MB blosc: blosclz noshuffle - 8MB blosc: blosclz shuffle - 128MB blosc: blosclz shuffle - 256MB blosc: blosclz bitshuffle - 8MB blosc: blosclz noshuffle - 16MB blosc: blosclz noshuffle - 32MB blosc: blosclz noshuffle - 64MB blosc: blosclz bitshuffle - 16MB blosc: blosclz bitshuffle - 32MB blosc: blosclz bitshuffle - 64MB blosc: blosclz noshuffle - 128MB blosc: blosclz noshuffle - 256MB blosc: blosclz bitshuffle - 128MB blosc: blosclz bitshuffle - 256MB duckdb: IMDB duckdb: TPC-H Parquet a b c d e f g h i j k l m n 244.606 14.03 156.736 224.98 71.7427 162.018 124.494 165.324 67.4501 71.4774 64.8474 345.601 242.8 175.042 136.085 159.217 154.939 69.6908 166.056 151.717 246.417 125.363 152.251 172.186 71.9574 119.576 164.817 89.7639 87.9144 130.153 138.026 123.786 299.635 261.576 143.402 161.815 134.078 360.023 323.078 242.763 29.7765 82.0966 67.0103 89.0188 123.364 65.75 128.23 72.0271 127.353 152.494 262.189 137.37 322.715 115.201 140.098 149.777 67.4313 133.759 153.32 298.194 120.195 82.6559 135.652 154.89 82.7248 82.8177 24.34 2683.56 1220.91 32300 5799.03 535240 3725.51 133860 47950 58960 267960 384930 889 907 1066 14067 33358 14223 34608 16770 38573 225 230 269 3539 7076 3620 7192 4239 8459 259.420 178.042 241.839 13.69 157.55 214.08 71.4551 158.115 123.112 175.737 63.5861 66.6381 63.6836 331.454 238.309 176.183 133.271 173.449 155.044 65.4142 158.009 152.472 238.175 126.182 153.363 157.025 38.4964 129.547 158.419 41.5556 90.2338 138.926 136.462 122.949 297.937 262.003 132.641 161.364 133.67 323.206 348.229 245.522 92.731 80.4689 66.1912 90.0107 123.259 66.9959 129.794 71.3098 131.689 152.916 262.087 137.378 327.718 125.701 145.298 146.839 65.5461 131.436 153.985 297.888 124.546 82.8318 132.365 154.81 82.771 81.9652 24.97 2681.44 1194.3 32320 5933.75 535430 3727.55 133600 46280 59690 269860 384820 889 908 1064 13991 33490 14248 33567 16786 38554 226 230 268 3542 7113 3594 7193 4213 8429 14.05 218.67 92.6394 62.712 341.133 236.917 175.272 130.463 160.511 65.1177 152.035 108.631 159.458 118.43 115.397 144.142 24.12 19.28 303.86 32.19 2672.68 1189.10 32317 5770.26 533880 4200.71 134140 47623 58620 267450 384423 894 910 1064 14042 33652 14338 33817 16765 39029 227 231 270 3558 7142 3636 7245 4258 8493 259.283 178.825 8.74 323.53 38.96 77.69 385.45 165.82 360.89 86847.05 0.61 106187.7 0.48 290.36 103.21 290.88 103.01 16489.33 3.51 9078.33 6.44 3221.62 18.43 1878.87 15.86 3566.26 16.72 462.66 64.66 6106.91 9.66 9344.43 6.28 1545.36 38.54 1224.11 48.77 751.14 39.86 1101.87 54.3 1370.86 618.59 16340 5867.89 267760 11540 67000 25310 29550 136700 194980 1644 1663 1962 30876 57130 31373 58032 36028 67291 416 421 496 6633 13272 6711 13393 7892 15787 197.545 8.68 323.38 38.94 1647 1661 1962 30951 57572 31194 58436 36109 67337 416 422 497 6628 13231 6711 10.00 183.71 20.86 104.31 114.95 196.21 244.29 90384.31 0.51 131083.06 0.3 339.17 35.35 338.84 35.39 19358.98 2.47 10978.58 4.36 3319.81 14.44 1884.46 6.35 3685.78 13 433.9 27.53 6581.71 7.28 8871.46 5.4 1979.56 24.23 1200.82 39.95 682.77 17.55 923.91 51.91 968.45 472.76 11730 5205 191940 2256.38 50420 17860 21190 97890 139540 1984 1990 2352 36833 68562 36770 68506 42592 80104 500 505 595 7939 15955 8009 16151 9424 18877 206.079 9.87 182.63 20.99 104.12 115.13 196.28 244.11 88444.43 0.52 131445.26 0.3 339.22 35.35 338.61 35.41 19208.93 2.49 11783.36 4.06 3314 14.46 1889.02 6.34 3678.97 13.03 430.23 27.75 6566.24 7.3 8855.95 5.41 1988.3 24.12 1192.76 40.22 681.9 17.58 925.43 51.84 986.33 456.46 12140 5300.21 191940 2332.32 48010 18010 21180 97960 140170 1975 1989 2345 36575 68355 36787 68593 42412 79992 500 505 595 7952 15942 8010 16114 9487 18945 216.878 9.94 78.70 1584.37 117.20 193.11 124.17 370.5 258.62 115992.72 0.41 114581.51 0.31 632.07 37.93 631.31 37.98 35303.47 2.68 21815.71 4.38 6234.26 15.33 3484.18 6.87 6992.06 13.68 806.26 29.63 12561.59 7.62 16935.65 5.65 3910.94 24.52 2375.2 40.39 1283.22 18.68 1712.71 56 1946.4 939.58 23390 15560 382680 20460 95670 35700 42230 195710 278140 1366 1535 1691 31587 51593 22407 51940 30736 60905 356 361 420 5349 10852 5659 11801 6496 13319 195.489 63.32 1812.17 73.01 191.2 125.39 365.91 261.89 103901.05 0.41 119756.16 0.31 626.67 38.26 627.12 38.23 34620.99 2.74 21635.47 4.42 6196.74 15.43 3465.57 6.9 7049.72 13.59 803.87 29.75 12534.63 7.64 16935.03 5.65 3918.19 24.48 2377.19 40.36 1273.73 18.82 1709.38 56.11 1946.34 900.68 23850 15770 382610 20550 95990 35610 42230 196100 278150 1361 1381 1672 21000 48393 22062 52535 34459 59689 344 370 450 5771 10581 5812 11652 7011 13049 197.948 218.22 137.34 415.38 288.36 144299.06 0.37 151691.73 0.29 636.9 47.07 636.48 47.1 47357.41 2.52 27232.91 4.35 5831.93 20.51 3893.77 7.69 7617.42 15.66 733.98 40.61 14938.83 8.02 21075.26 5.68 4387.28 27.3 3004.85 39.91 1530.63 19.58 1914.91 62.61 12.52 99.816 94.265 29.288 214.65 128.73 168.582 14338.2 16680.2 16111.4 12239.3 12794.5 8707.4 5519.3 14873.2 15231.9 14712.6 11321.9 16895 15811.9 11927.7 8089.4 5409.4 8697.6 5615.4 218.39 137.23 415.53 288.29 142293.67 0.38 150804.01 0.29 638.13 46.98 639.58 46.87 47468.22 2.51 27109.5 4.35 5825.24 20.53 3895.49 7.69 7626.39 15.63 744.86 40.04 14932.7 8.02 21071.09 5.68 4379.43 27.35 3004.85 39.91 1531.37 19.57 1915.98 62.56 12.68 98.175 94.023 29.186 207.84 129.2 162.385 14241.2 16762.6 16050.9 12161.8 12855.3 8640 5580.3 14846.5 15269.2 14746.7 11315 16810.4 15827.9 11905.5 8059.2 5333.4 8864.5 5389.3 14.07 222.01 24.90 217.79 137.61 414.39 289.07 142499.69 0.38 146682.31 0.29 638.56 46.94 635.92 47.14 47369.39 2.52 27112.85 4.35 5799.67 20.62 3895.01 7.69 7650.29 15.59 729.58 40.9 14888.49 8.05 21033.04 5.69 4433.69 27.04 2991.51 40.09 1530.84 19.58 1912.82 62.67 12.7 99.261 94.301 29.146 208.62 129.23 2694.61 1201.63 32400 7748.29 534840 26000 133840 46430 59060 271570 385290 889 903 1057 14017 33085 14289 33789 16672 38979 225 230 268 3545 7076 3605 7194 4206 8428 165.809 14082.4 16392.9 15827.9 12008 12742.3 8564.3 5520.6 14815.4 15041.6 14569.9 11238.1 16633.4 15691 11839.7 7960.2 5324.3 8289.6 5455.4 OpenBenchmarking.org
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512 a b 50 100 150 200 250 244.61 241.84 1. (CXX) g++ options: -O3
CloverLeaf Input: clover_bm OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm a b c d e f g h i j k n 20 40 60 80 100 SE +/- 0.11, N = 3 14.03 13.69 14.05 19.28 8.74 8.68 10.00 9.87 9.94 78.70 63.32 14.07 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256 a b 30 60 90 120 150 156.74 157.55 1. (CXX) g++ options: -O3
CloverLeaf Input: clover_bm16 OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm16 a b c d e f g h j k n 400 800 1200 1600 2000 SE +/- 0.98, N = 3 224.98 214.08 218.67 303.86 323.53 323.38 183.71 182.63 1584.37 1812.17 222.01 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512 a b 16 32 48 64 80 71.74 71.46 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256 a b 40 80 120 160 200 162.02 158.12 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512 a b 30 60 90 120 150 124.49 123.11 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128 a b 40 80 120 160 200 165.32 175.74 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256 a b 15 30 45 60 75 67.45 63.59 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128 a b c 20 40 60 80 100 71.48 66.64 92.64 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256 a b c 14 28 42 56 70 64.85 63.68 62.71 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256 a b c 80 160 240 320 400 345.60 331.45 341.13 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512 a b c 50 100 150 200 250 242.80 238.31 236.92 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128 a b c 40 80 120 160 200 175.04 176.18 175.27 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512 a b c 30 60 90 120 150 136.09 133.27 130.46 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256 a b c 40 80 120 160 200 159.22 173.45 160.51 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024 a b 30 60 90 120 150 154.94 155.04 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512 a b c 16 32 48 64 80 69.69 65.41 65.12 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256 a b 40 80 120 160 200 166.06 158.01 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024 a b c 30 60 90 120 150 151.72 152.47 152.04 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512 a b 50 100 150 200 250 246.42 238.18 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 128 a b c 30 60 90 120 150 125.36 126.18 108.63 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024 a b 30 60 90 120 150 152.25 153.36 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 256 a b c 40 80 120 160 200 172.19 157.03 159.46 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128 a b 16 32 48 64 80 71.96 38.50 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 512 a b c 30 60 90 120 150 119.58 129.55 118.43 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128 a b 40 80 120 160 200 164.82 158.42 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128 a b c 30 60 90 120 150 89.76 41.56 115.40 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128 a b 20 40 60 80 100 87.91 90.23 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256 a b 30 60 90 120 150 130.15 138.93 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024 a b 30 60 90 120 150 138.03 136.46 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512 a b 30 60 90 120 150 123.79 122.95 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024 a b 70 140 210 280 350 299.64 297.94 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024 a b 60 120 180 240 300 261.58 262.00 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128 a b 30 60 90 120 150 143.40 132.64 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 128 a b 40 80 120 160 200 161.82 161.36 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512 a b 30 60 90 120 150 134.08 133.67 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 256 a b 80 160 240 320 400 360.02 323.21 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256 a b 80 160 240 320 400 323.08 348.23 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 512 a b 50 100 150 200 250 242.76 245.52 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128 a b 20 40 60 80 100 29.78 92.73 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024 a b 20 40 60 80 100 82.10 80.47 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512 a b 15 30 45 60 75 67.01 66.19 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 128 a b 20 40 60 80 100 89.02 90.01 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128 a b 30 60 90 120 150 123.36 123.26 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 256 a b 15 30 45 60 75 65.75 67.00 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512 a b 30 60 90 120 150 128.23 129.79 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 512 a b 16 32 48 64 80 72.03 71.31 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256 a b 30 60 90 120 150 127.35 131.69 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024 a b 30 60 90 120 150 152.49 152.92 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024 a b 60 120 180 240 300 262.19 262.09 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024 a b 30 60 90 120 150 137.37 137.38 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256 a b 70 140 210 280 350 322.72 327.72 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 128 a b 30 60 90 120 150 115.20 125.70 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128 a b c 30 60 90 120 150 140.10 145.30 144.14 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 256 a b 30 60 90 120 150 149.78 146.84 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256 a b 15 30 45 60 75 67.43 65.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 512 a b 30 60 90 120 150 133.76 131.44 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024 a b 30 60 90 120 150 153.32 153.99 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024 a b 60 120 180 240 300 298.19 297.89 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128 a b 30 60 90 120 150 120.20 124.55 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024 a b 20 40 60 80 100 82.66 82.83 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512 a b 30 60 90 120 150 135.65 132.37 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024 a b 30 60 90 120 150 154.89 154.81 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024 a b 20 40 60 80 100 82.72 82.77 1. (CXX) g++ options: -O3
HeFFTe - Highly Efficient FFT for Exascale Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024 OpenBenchmarking.org GFLOP/s, More Is Better HeFFTe - Highly Efficient FFT for Exascale 2.4 Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024 a b 20 40 60 80 100 82.82 81.97 1. (CXX) g++ options: -O3
CloverLeaf Input: clover_bm64_short OpenBenchmarking.org Seconds, Fewer Is Better CloverLeaf 1.3 Input: clover_bm64_short a b c d e f g h j k n 30 60 90 120 150 SE +/- 0.19, N = 3 24.34 24.97 24.12 32.19 38.96 38.94 20.86 20.99 117.20 73.01 24.90 1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU e g h j k l m n 50 100 150 200 250 77.69 104.31 104.12 193.11 191.20 218.22 218.39 217.79 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16 - Device: CPU e g h j k l m n 80 160 240 320 400 385.45 114.95 115.13 124.17 125.39 137.34 137.23 137.61 MIN: 237.16 / MAX: 411.93 MIN: 101.98 / MAX: 211.99 MIN: 92.5 / MAX: 159.18 MIN: 94.99 / MAX: 281.08 MIN: 100.15 / MAX: 195.42 MIN: 107.88 / MAX: 226.99 MIN: 105.45 / MAX: 172.87 MIN: 109.14 / MAX: 177.75 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU e g h j k l m n 90 180 270 360 450 165.82 196.21 196.28 370.50 365.91 415.38 415.53 414.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection FP16-INT8 - Device: CPU e g h j k l m n 80 160 240 320 400 360.89 244.29 244.11 258.62 261.89 288.36 288.29 289.07 MIN: 130.24 / MAX: 396.13 MIN: 209.59 / MAX: 272.46 MIN: 188.68 / MAX: 273.96 MIN: 201.42 / MAX: 286.91 MIN: 205.93 / MAX: 306.15 MIN: 214.15 / MAX: 340.62 MIN: 214.96 / MAX: 344.63 MIN: 215.04 / MAX: 341.97 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU e g h j k l m n 30K 60K 90K 120K 150K 86847.05 90384.31 88444.43 115992.72 103901.05 144299.06 142293.67 142499.69 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU e g h j k l m n 0.1373 0.2746 0.4119 0.5492 0.6865 0.61 0.51 0.52 0.41 0.41 0.37 0.38 0.38 MIN: 0.26 / MAX: 17.87 MIN: 0.35 / MAX: 25.07 MIN: 0.35 / MAX: 25.08 MIN: 0.34 / MAX: 35.61 MIN: 0.33 / MAX: 43.64 MIN: 0.34 / MAX: 39.96 MIN: 0.29 / MAX: 38.89 MIN: 0.3 / MAX: 43.5 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU e g h j k l m n 30K 60K 90K 120K 150K 106187.70 131083.06 131445.26 114581.51 119756.16 151691.73 150804.01 146682.31 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU e g h j k l m n 0.108 0.216 0.324 0.432 0.54 0.48 0.30 0.30 0.31 0.31 0.29 0.29 0.29 MIN: 0.23 / MAX: 17.56 MIN: 0.25 / MAX: 20.77 MIN: 0.25 / MAX: 21.33 MIN: 0.26 / MAX: 16.56 MIN: 0.27 / MAX: 17.64 MIN: 0.24 / MAX: 41.51 MIN: 0.26 / MAX: 40.41 MIN: 0.23 / MAX: 39.74 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU e g h j k l m n 140 280 420 560 700 290.36 339.17 339.22 632.07 626.67 636.90 638.13 638.56 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP16 - Device: CPU e g h j k l m n 20 40 60 80 100 103.21 35.35 35.35 37.93 38.26 47.07 46.98 46.94 MIN: 57.77 / MAX: 142.16 MIN: 29.67 / MAX: 62.19 MIN: 32.55 / MAX: 55.19 MIN: 33.58 / MAX: 128.85 MIN: 31.97 / MAX: 99.85 MIN: 39.06 / MAX: 99.76 MIN: 35.9 / MAX: 87.78 MIN: 35.97 / MAX: 111.06 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU e g h j k l m n 140 280 420 560 700 290.88 338.84 338.61 631.31 627.12 636.48 639.58 635.92 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Detection FP32 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Detection FP32 - Device: CPU e g h j k l m n 20 40 60 80 100 103.01 35.39 35.41 37.98 38.23 47.10 46.87 47.14 MIN: 35.6 / MAX: 143.64 MIN: 32.69 / MAX: 62.18 MIN: 32.93 / MAX: 61.75 MIN: 33.57 / MAX: 87.62 MIN: 34.05 / MAX: 82 MIN: 35.94 / MAX: 89.59 MIN: 34.63 / MAX: 89.14 MIN: 38.01 / MAX: 97.48 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU e g h j k l m n 10K 20K 30K 40K 50K 16489.33 19358.98 19208.93 35303.47 34620.99 47357.41 47468.22 47369.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16-INT8 - Device: CPU e g h j k l m n 0.7898 1.5796 2.3694 3.1592 3.949 3.51 2.47 2.49 2.68 2.74 2.52 2.51 2.52 MIN: 1.45 / MAX: 17.74 MIN: 2.24 / MAX: 9.11 MIN: 2.24 / MAX: 10.47 MIN: 2.36 / MAX: 31.84 MIN: 2.35 / MAX: 26.63 MIN: 2.28 / MAX: 15.14 MIN: 2.29 / MAX: 14.64 MIN: 2.3 / MAX: 15.72 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU e g h j k l m n 6K 12K 18K 24K 30K 9078.33 10978.58 11783.36 21815.71 21635.47 27232.91 27109.50 27112.85 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Weld Porosity Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Weld Porosity Detection FP16 - Device: CPU e g h j k l m n 2 4 6 8 10 6.44 4.36 4.06 4.38 4.42 4.35 4.35 4.35 MIN: 2.24 / MAX: 17.31 MIN: 3.58 / MAX: 23 MIN: 3.62 / MAX: 12.04 MIN: 3.7 / MAX: 39.28 MIN: 3.73 / MAX: 38.36 MIN: 3.72 / MAX: 18.63 MIN: 3.73 / MAX: 16.96 MIN: 3.76 / MAX: 33.39 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU e g h j k l m n 1300 2600 3900 5200 6500 3221.62 3319.81 3314.00 6234.26 6196.74 5831.93 5825.24 5799.67 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16-INT8 - Device: CPU e g h j k l m n 5 10 15 20 25 18.43 14.44 14.46 15.33 15.43 20.51 20.53 20.62 MIN: 8.82 / MAX: 34.87 MIN: 12.83 / MAX: 23.01 MIN: 13.16 / MAX: 25.98 MIN: 13.52 / MAX: 43.56 MIN: 12.64 / MAX: 42.13 MIN: 12.93 / MAX: 38.6 MIN: 14.84 / MAX: 38.04 MIN: 14.04 / MAX: 37.86 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU e g h j k l m n 800 1600 2400 3200 4000 1878.87 1884.46 1889.02 3484.18 3465.57 3893.77 3895.49 3895.01 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Vehicle Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Vehicle Detection FP16 - Device: CPU e g h j k l m n 4 8 12 16 20 15.86 6.35 6.34 6.87 6.90 7.69 7.69 7.69 MIN: 11.52 / MAX: 32.13 MIN: 5.9 / MAX: 14.78 MIN: 5.9 / MAX: 14.06 MIN: 6.09 / MAX: 26.57 MIN: 6.08 / MAX: 44.64 MIN: 6.84 / MAX: 22.74 MIN: 6.78 / MAX: 23.97 MIN: 6.75 / MAX: 23.64 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU e g h j k l m n 1600 3200 4800 6400 8000 3566.26 3685.78 3678.97 6992.06 7049.72 7617.42 7626.39 7650.29 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Person Vehicle Bike Detection FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Person Vehicle Bike Detection FP16 - Device: CPU e g h j k l m n 4 8 12 16 20 16.72 13.00 13.03 13.68 13.59 15.66 15.63 15.59 MIN: 9.44 / MAX: 29.62 MIN: 11.57 / MAX: 20.67 MIN: 11.65 / MAX: 22.06 MIN: 11.91 / MAX: 28.69 MIN: 11.93 / MAX: 41.34 MIN: 13.15 / MAX: 33.2 MIN: 12.45 / MAX: 32.59 MIN: 13.17 / MAX: 34.41 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU e g h j k l m n 200 400 600 800 1000 462.66 433.90 430.23 806.26 803.87 733.98 744.86 729.58 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Machine Translation EN To DE FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Machine Translation EN To DE FP16 - Device: CPU e g h j k l m n 14 28 42 56 70 64.66 27.53 27.75 29.63 29.75 40.61 40.04 40.90 MIN: 49.3 / MAX: 84.95 MIN: 23.27 / MAX: 212.35 MIN: 23.59 / MAX: 217.1 MIN: 23.86 / MAX: 333.15 MIN: 23.81 / MAX: 291.46 MIN: 28.19 / MAX: 258.26 MIN: 22.98 / MAX: 252.16 MIN: 23.54 / MAX: 269.54 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU e g h j k l m n 3K 6K 9K 12K 15K 6106.91 6581.71 6566.24 12561.59 12534.63 14938.83 14932.70 14888.49 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16 - Device: CPU e g h j k l m n 3 6 9 12 15 9.66 7.28 7.30 7.62 7.64 8.02 8.02 8.05 MIN: 5.23 / MAX: 20.95 MIN: 6.68 / MAX: 15.94 MIN: 6.66 / MAX: 16.96 MIN: 6.81 / MAX: 24.56 MIN: 6.82 / MAX: 24.32 MIN: 7.02 / MAX: 20.74 MIN: 7.06 / MAX: 19.79 MIN: 7.01 / MAX: 22.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU e g h j k l m n 5K 10K 15K 20K 25K 9344.43 8871.46 8855.95 16935.65 16935.03 21075.26 21071.09 21033.04 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Face Detection Retail FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Face Detection Retail FP16-INT8 - Device: CPU e g h j k l m n 2 4 6 8 10 6.28 5.40 5.41 5.65 5.65 5.68 5.68 5.69 MIN: 3.85 / MAX: 24.26 MIN: 5 / MAX: 12.46 MIN: 5 / MAX: 13.32 MIN: 5.22 / MAX: 20.27 MIN: 5.13 / MAX: 21.07 MIN: 4.97 / MAX: 15.4 MIN: 5.25 / MAX: 14.86 MIN: 5.2 / MAX: 16.43 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU e g h j k l m n 1000 2000 3000 4000 5000 1545.36 1979.56 1988.30 3910.94 3918.19 4387.28 4379.43 4433.69 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16 - Device: CPU e g h j k l m n 9 18 27 36 45 38.54 24.23 24.12 24.52 24.48 27.30 27.35 27.04 MIN: 23.05 / MAX: 52.61 MIN: 23.13 / MAX: 32.82 MIN: 22.95 / MAX: 31.04 MIN: 23.19 / MAX: 54.56 MIN: 23.23 / MAX: 49.9 MIN: 24.8 / MAX: 50.96 MIN: 21.13 / MAX: 51.46 MIN: 24.72 / MAX: 50.96 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU e g h j k l m n 600 1200 1800 2400 3000 1224.11 1200.82 1192.76 2375.20 2377.19 3004.85 3004.85 2991.51 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Handwritten English Recognition FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Handwritten English Recognition FP16-INT8 - Device: CPU e g h j k l m n 11 22 33 44 55 48.77 39.95 40.22 40.39 40.36 39.91 39.91 40.09 MIN: 29.81 / MAX: 58.65 MIN: 38.59 / MAX: 46.29 MIN: 39.04 / MAX: 47.11 MIN: 39.1 / MAX: 63.97 MIN: 39.01 / MAX: 73.27 MIN: 37.94 / MAX: 58.88 MIN: 37.95 / MAX: 57.96 MIN: 38.11 / MAX: 59.9 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU e g h j k l m n 300 600 900 1200 1500 751.14 682.77 681.90 1283.22 1273.73 1530.63 1531.37 1530.84 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16 - Device: CPU e g h j k l m n 9 18 27 36 45 39.86 17.55 17.58 18.68 18.82 19.58 19.57 19.58 MIN: 30.89 / MAX: 59.23 MIN: 16.33 / MAX: 25.28 MIN: 16.23 / MAX: 27.09 MIN: 16.75 / MAX: 79.48 MIN: 16.8 / MAX: 96.74 MIN: 17.86 / MAX: 41.74 MIN: 17.99 / MAX: 38.79 MIN: 17.93 / MAX: 45.71 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org FPS, More Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU e g h j k l m n 400 800 1200 1600 2000 1101.87 923.91 925.43 1712.71 1709.38 1914.91 1915.98 1912.82 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
OpenVINO Model: Road Segmentation ADAS FP16-INT8 - Device: CPU OpenBenchmarking.org ms, Fewer Is Better OpenVINO 2023.2.dev Model: Road Segmentation ADAS FP16-INT8 - Device: CPU e g h j k l m n 14 28 42 56 70 54.30 51.91 51.84 56.00 56.11 62.61 62.56 62.67 MIN: 27.95 / MAX: 73.79 MIN: 44.4 / MAX: 72.89 MIN: 49.58 / MAX: 74.45 MIN: 49.06 / MAX: 176.61 MIN: 49.26 / MAX: 169.49 MIN: 52.19 / MAX: 111.48 MIN: 49.16 / MAX: 109.26 MIN: 51.78 / MAX: 114.18 1. (CXX) g++ options: -fsigned-char -ffunction-sections -fdata-sections -O3 -fno-strict-overflow -fwrapv -pie
QMCPACK Input: H4_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: H4_ae l m n 3 6 9 12 15 12.52 12.68 12.70 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: Li2_STO_ae OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: Li2_STO_ae l m n 20 40 60 80 100 99.82 98.18 99.26 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: LiH_ae_MSD OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: LiH_ae_MSD l m n 20 40 60 80 100 94.27 94.02 94.30 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: simple-H2O OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: simple-H2O l m n 7 14 21 28 35 29.29 29.19 29.15 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: O_ae_pyscf_UHF OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: O_ae_pyscf_UHF l m n 50 100 150 200 250 214.65 207.84 208.62 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
QMCPACK Input: FeCO6_b3lyp_gms OpenBenchmarking.org Total Execution Time - Seconds, Fewer Is Better QMCPACK 3.17.1 Input: FeCO6_b3lyp_gms l m n 30 60 90 120 150 128.73 129.20 129.23 1. (CXX) g++ options: -fopenmp -foffload=disable -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -ffast-math -march=native -O3 -lm -ldl
Cpuminer-Opt Algorithm: Magi OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Magi a b d e g h j k n 600 1200 1800 2400 3000 SE +/- 3.06, N = 3 2683.56 2681.44 2672.68 1370.86 968.45 986.33 1946.40 1946.34 2694.61 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: scrypt OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: scrypt a b d e g h j k n 300 600 900 1200 1500 SE +/- 0.70, N = 3 1220.91 1194.30 1189.10 618.59 472.76 456.46 939.58 900.68 1201.63 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Deepcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Deepcoin a b d e g h j k n 7K 14K 21K 28K 35K SE +/- 38.44, N = 3 32300 32320 32317 16340 11730 12140 23390 23850 32400 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Ringcoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Ringcoin a b d e g h j k n 3K 6K 9K 12K 15K SE +/- 117.64, N = 12 5799.03 5933.75 5770.26 5867.89 5205.00 5300.21 15560.00 15770.00 7748.29 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Blake-2 S OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Blake-2 S a b d e g h j k n 110K 220K 330K 440K 550K SE +/- 611.34, N = 3 535240 535430 533880 267760 191940 191940 382680 382610 534840 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Garlicoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Garlicoin a b d e g h j k n 6K 12K 18K 24K 30K SE +/- 466.61, N = 12 3725.51 3727.55 4200.71 11540.00 2256.38 2332.32 20460.00 20550.00 26000.00 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Skeincoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Skeincoin a b d e g h j k n 30K 60K 90K 120K 150K SE +/- 922.19, N = 3 133860 133600 134140 67000 50420 48010 95670 95990 133840 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Myriad-Groestl OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Myriad-Groestl a b d e g h j k n 10K 20K 30K 40K 50K SE +/- 226.96, N = 3 47950 46280 47623 25310 17860 18010 35700 35610 46430 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: LBC, LBRY Credits OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: LBC, LBRY Credits a b d e g h j k n 13K 26K 39K 52K 65K SE +/- 30.55, N = 3 58960 59690 58620 29550 21190 21180 42230 42230 59060 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Quad SHA-256, Pyrite OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Quad SHA-256, Pyrite a b d e g h j k n 60K 120K 180K 240K 300K SE +/- 616.52, N = 3 267960 269860 267450 136700 97890 97960 195710 196100 271570 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
Cpuminer-Opt Algorithm: Triple SHA-256, Onecoin OpenBenchmarking.org kH/s, More Is Better Cpuminer-Opt 23.5 Algorithm: Triple SHA-256, Onecoin a b d e g h j k n 80K 160K 240K 320K 400K SE +/- 1685.56, N = 3 384930 384820 384423 194980 139540 140170 278140 278150 385290 1. (CXX) g++ options: -O2 -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 400 800 1200 1600 2000 SE +/- 1.15, N = 3 889 889 894 1644 1647 1984 1975 1366 1361 889
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 400 800 1200 1600 2000 SE +/- 0.88, N = 3 907 908 910 1663 1661 1990 1989 1535 1381 903
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 500 1000 1500 2000 2500 SE +/- 2.03, N = 3 1066 1064 1064 1962 1962 2352 2345 1691 1672 1057
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 8K 16K 24K 32K 40K SE +/- 11.26, N = 3 14067 13991 14042 30876 30951 36833 36575 31587 21000 14017
OSPRay Studio Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 15K 30K 45K 60K 75K SE +/- 63.66, N = 3 33358 33490 33652 57130 57572 68562 68355 51593 48393 33085
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 8K 16K 24K 32K 40K SE +/- 34.26, N = 3 14223 14248 14338 31373 31194 36770 36787 22407 22062 14289
OSPRay Studio Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 15K 30K 45K 60K 75K SE +/- 21.94, N = 3 34608 33567 33817 58032 58436 68506 68593 51940 52535 33789
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 9K 18K 27K 36K 45K SE +/- 30.39, N = 3 16770 16786 16765 36028 36109 42592 42412 30736 34459 16672
OSPRay Studio Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 20K 40K 60K 80K 100K SE +/- 299.89, N = 3 38573 38554 39029 67291 67337 80104 79992 60905 59689 38979
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 110 220 330 440 550 SE +/- 0.00, N = 3 225 226 227 416 416 500 500 356 344 225
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 110 220 330 440 550 SE +/- 0.00, N = 3 230 230 231 421 422 505 505 361 370 230
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 130 260 390 520 650 SE +/- 0.00, N = 3 269 268 270 496 497 595 595 420 450 268
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 2K 4K 6K 8K 10K SE +/- 3.71, N = 3 3539 3542 3558 6633 6628 7939 7952 5349 5771 3545
OSPRay Studio Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 3K 6K 9K 12K 15K SE +/- 20.95, N = 3 7076 7113 7142 13272 13231 15955 15942 10852 10581 7076
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b d e f g h j k n 2K 4K 6K 8K 10K SE +/- 5.36, N = 3 3620 3594 3636 6711 6711 8009 8010 5659 5812 3605
OSPRay Studio Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b d e g h j k n 3K 6K 9K 12K 15K SE +/- 6.36, N = 3 7192 7193 7245 13393 16151 16114 11801 11652 7194
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer - Acceleration: CPU a b d e g h j k n 2K 4K 6K 8K 10K SE +/- 2.33, N = 3 4239 4213 4258 7892 9424 9487 6496 7011 4206
OSPRay Studio Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU OpenBenchmarking.org ms, Fewer Is Better OSPRay Studio 0.13 Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer - Acceleration: CPU a b d e g h j k n 4K 8K 12K 16K 20K SE +/- 14.44, N = 3 8459 8429 8493 15787 18877 18945 13319 13049 8428
Timed Gem5 Compilation Time To Compile OpenBenchmarking.org Seconds, Fewer Is Better Timed Gem5 Compilation 23.0.1 Time To Compile e g h j k l m n 50 100 150 200 250 197.55 206.08 216.88 195.49 197.95 168.58 162.39 165.81
C-Blosc Test: blosclz shuffle - Buffer Size: 8MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 8MB l m n 3K 6K 9K 12K 15K 14338.2 14241.2 14082.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 16MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 16MB l m n 4K 8K 12K 16K 20K 16680.2 16762.6 16392.9 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 32MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 32MB l m n 3K 6K 9K 12K 15K 16111.4 16050.9 15827.9 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 64MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 64MB l m n 3K 6K 9K 12K 15K 12239.3 12161.8 12008.0 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 8MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 8MB l m n 3K 6K 9K 12K 15K 12794.5 12855.3 12742.3 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 128MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 128MB l m n 2K 4K 6K 8K 10K 8707.4 8640.0 8564.3 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz shuffle - Buffer Size: 256MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz shuffle - Buffer Size: 256MB l m n 1200 2400 3600 4800 6000 5519.3 5580.3 5520.6 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 8MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 8MB l m n 3K 6K 9K 12K 15K 14873.2 14846.5 14815.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 16MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 16MB l m n 3K 6K 9K 12K 15K 15231.9 15269.2 15041.6 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 32MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 32MB l m n 3K 6K 9K 12K 15K 14712.6 14746.7 14569.9 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 64MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 64MB l m n 2K 4K 6K 8K 10K 11321.9 11315.0 11238.1 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 16MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 16MB l m n 4K 8K 12K 16K 20K 16895.0 16810.4 16633.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 32MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 32MB l m n 3K 6K 9K 12K 15K 15811.9 15827.9 15691.0 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 64MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 64MB l m n 3K 6K 9K 12K 15K 11927.7 11905.5 11839.7 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 128MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 128MB l m n 2K 4K 6K 8K 10K 8089.4 8059.2 7960.2 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz noshuffle - Buffer Size: 256MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz noshuffle - Buffer Size: 256MB l m n 1200 2400 3600 4800 6000 5409.4 5333.4 5324.3 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 128MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 128MB l m n 2K 4K 6K 8K 10K 8697.6 8864.5 8289.6 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
C-Blosc Test: blosclz bitshuffle - Buffer Size: 256MB OpenBenchmarking.org MB/s, More Is Better C-Blosc 2.11 Test: blosclz bitshuffle - Buffer Size: 256MB l m n 1200 2400 3600 4800 6000 5615.4 5389.3 5455.4 1. (CC) gcc options: -std=gnu99 -O3 -ldl -lrt -lm
DuckDB Benchmark: IMDB OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: IMDB a d 60 120 180 240 300 SE +/- 1.33, N = 3 SE +/- 3.23, N = 4 259.42 259.28 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
DuckDB Benchmark: TPC-H Parquet OpenBenchmarking.org Seconds, Fewer Is Better DuckDB 0.9.1 Benchmark: TPC-H Parquet a d 40 80 120 160 200 SE +/- 1.27, N = 3 SE +/- 2.06, N = 3 178.04 178.83 1. (CXX) g++ options: -O3 -rdynamic -lssl -lcrypto -ldl
Phoronix Test Suite v10.8.4