bettah

AMD EPYC 7R13 48-Core testing with a Supermicro H12SSL-I v1.02 (2.6a BIOS) and NVIDIA GeForce RTX 3080 10GB on EndeavourOS rolling via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2311152-NE-BETTAH65530
Jump To Table - Results

Statistics

Remove Outliers Before Calculating Averages

Graph Settings

Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
mo
November 16 2023
  45 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


bettahOpenBenchmarking.orgPhoronix Test SuiteAMD EPYC 7R13 48-Core @ 3.73GHz (48 Cores / 96 Threads)Supermicro H12SSL-I v1.02 (2.6a BIOS)AMD Starship/Matisse256GB15363GB Micron_7450_MTFDKCC15T3TFRNVIDIA GeForce RTX 3080 10GBNVIDIA GA102 HD Audio38GN9502 x Intel X710 for 10GbE SFP+EndeavourOS rolling6.6.1-zen1-1-zen (x86_64)Xfce 4.18X Server 1.21.1.9NVIDIA 545.29.02GCC 13.2.1 20230801 + Clang 16.0.6 + LLVM 16.0.6 + CUDA 12.3btrfs1024x768ProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverCompilerFile-SystemScreen ResolutionBettah BenchmarksSystem Logs- Transparent Huge Pages: always- NVCC_PREPEND_FLAGS="-ccbin /opt/cuda/bin"- --disable-libssp --disable-libstdcxx-pch --disable-werror --enable-__cxa_atexit --enable-bootstrap --enable-cet=auto --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-default-ssp --enable-gnu-indirect-function --enable-gnu-unique-object --enable-languages=ada,c,c++,d,fortran,go,lto,objc,obj-c++ --enable-libstdcxx-backtrace --enable-link-serialization=1 --enable-lto --enable-multilib --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-build-config=bootstrap-lto --with-linker-hash-style=gnu - Scaling Governor: amd-pstate-epp performance (EPP: performance) - CPU Microcode: 0xa0011d1 - gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Mitigation of safe RET + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

bettahheffte: c2c - FFTW - float - 128heffte: c2c - FFTW - float - 256heffte: c2c - FFTW - float - 512heffte: r2c - FFTW - float - 128heffte: r2c - FFTW - float - 256heffte: r2c - FFTW - float - 512heffte: c2c - FFTW - double - 128heffte: c2c - FFTW - double - 256heffte: c2c - FFTW - double - 512heffte: c2c - FFTW - float - 1024heffte: c2c - Stock - float - 128heffte: c2c - Stock - float - 256heffte: c2c - Stock - float - 512heffte: r2c - FFTW - double - 128heffte: r2c - FFTW - double - 256heffte: r2c - FFTW - double - 512heffte: r2c - FFTW - float - 1024heffte: r2c - Stock - float - 128heffte: r2c - Stock - float - 256heffte: r2c - Stock - float - 512heffte: c2c - FFTW - double - 1024heffte: c2c - Stock - double - 128heffte: c2c - Stock - double - 256heffte: c2c - Stock - double - 512heffte: c2c - Stock - float - 1024heffte: r2c - FFTW - double - 1024heffte: r2c - Stock - double - 128heffte: r2c - Stock - double - 256heffte: r2c - Stock - double - 512heffte: r2c - Stock - float - 1024heffte: c2c - Stock - double - 1024heffte: r2c - Stock - double - 1024heffte: c2c - FFTW - float-long - 128heffte: c2c - FFTW - float-long - 256heffte: c2c - FFTW - float-long - 512heffte: r2c - FFTW - float-long - 128heffte: r2c - FFTW - float-long - 256heffte: r2c - FFTW - float-long - 512heffte: c2c - FFTW - double-long - 128heffte: c2c - FFTW - double-long - 256heffte: c2c - FFTW - double-long - 512heffte: c2c - FFTW - float-long - 1024heffte: c2c - Stock - float-long - 128heffte: c2c - Stock - float-long - 256heffte: c2c - Stock - float-long - 512heffte: r2c - FFTW - double-long - 128heffte: r2c - FFTW - double-long - 256heffte: r2c - FFTW - double-long - 512heffte: r2c - FFTW - float-long - 1024heffte: r2c - Stock - float-long - 128heffte: r2c - Stock - float-long - 256heffte: r2c - Stock - float-long - 512heffte: c2c - FFTW - double-long - 1024heffte: c2c - Stock - double-long - 128heffte: c2c - Stock - double-long - 256heffte: c2c - Stock - double-long - 512heffte: c2c - Stock - float-long - 1024heffte: r2c - FFTW - double-long - 1024heffte: r2c - Stock - double-long - 128heffte: r2c - Stock - double-long - 256heffte: r2c - Stock - double-long - 512heffte: r2c - Stock - float-long - 1024heffte: c2c - Stock - double-long - 1024heffte: r2c - Stock - double-long - 1024mo56.508949.772545.458588.8484111.07384.709932.607421.836622.977949.852048.065651.125645.190452.819945.042342.578392.240481.7842121.73087.751225.011428.354722.157422.921850.019845.987847.262049.284244.766999.134525.086949.481955.864249.566945.536988.4918110.39784.750226.714821.942922.951450.137047.871851.157145.307648.374445.035842.585492.385779.9453121.42287.596925.040128.887722.194022.903450.221045.939946.569548.978844.684199.291825.086049.4816OpenBenchmarking.org

HeFFTe - Highly Efficient FFT for Exascale

HeFFTe is the Highly Efficient FFT for Exascale software developed as part of the Exascale Computing Project. This test profile uses HeFFTe's built-in speed benchmarks under a variety of configuration options and currently catering to CPU/processor testing. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 128mo1326395265SE +/- 0.61, N = 356.511. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 256mo1122334455SE +/- 0.09, N = 349.771. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 512mo1020304050SE +/- 0.02, N = 345.461. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 128mo20406080100SE +/- 0.92, N = 388.851. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 256mo20406080100SE +/- 0.50, N = 3111.071. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 512mo20406080100SE +/- 0.01, N = 384.711. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 128mo816243240SE +/- 0.10, N = 332.611. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 256mo510152025SE +/- 0.04, N = 321.841. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 512mo612182430SE +/- 0.01, N = 322.981. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float - X Y Z: 1024mo1122334455SE +/- 0.06, N = 349.851. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 128mo1122334455SE +/- 0.59, N = 348.071. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 256mo1224364860SE +/- 0.05, N = 351.131. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 512mo1020304050SE +/- 0.04, N = 345.191. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 128mo1224364860SE +/- 0.19, N = 352.821. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 256mo1020304050SE +/- 0.07, N = 345.041. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 512mo1020304050SE +/- 0.00, N = 342.581. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float - X Y Z: 1024mo20406080100SE +/- 0.02, N = 392.241. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 128mo20406080100SE +/- 0.78, N = 381.781. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 256mo306090120150SE +/- 0.06, N = 3121.731. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 512mo20406080100SE +/- 0.01, N = 387.751. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double - X Y Z: 1024mo612182430SE +/- 0.03, N = 325.011. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 128mo714212835SE +/- 0.34, N = 428.351. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 256mo510152025SE +/- 0.01, N = 322.161. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 512mo510152025SE +/- 0.01, N = 322.921. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float - X Y Z: 1024mo1122334455SE +/- 0.00, N = 350.021. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double - X Y Z: 1024mo1020304050SE +/- 0.06, N = 345.991. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 128mo1122334455SE +/- 0.35, N = 347.261. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 256mo1122334455SE +/- 0.10, N = 349.281. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 512mo1020304050SE +/- 0.06, N = 344.771. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float - X Y Z: 1024mo20406080100SE +/- 0.01, N = 399.131. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double - X Y Z: 1024mo612182430SE +/- 0.01, N = 325.091. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double - X Y Z: 1024mo1122334455SE +/- 0.01, N = 349.481. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 128mo1326395265SE +/- 0.20, N = 355.861. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 256mo1122334455SE +/- 0.08, N = 349.571. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 512mo1020304050SE +/- 0.02, N = 345.541. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 128mo20406080100SE +/- 0.27, N = 388.491. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 256mo20406080100SE +/- 0.37, N = 3110.401. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 512mo20406080100SE +/- 0.02, N = 384.751. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 128mo612182430SE +/- 0.64, N = 1526.711. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 256mo510152025SE +/- 0.09, N = 321.941. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 512mo510152025SE +/- 0.02, N = 322.951. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: float-long - X Y Z: 1024mo1122334455SE +/- 0.01, N = 350.141. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 128mo1122334455SE +/- 0.12, N = 347.871. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 256mo1224364860SE +/- 0.11, N = 351.161. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 512mo1020304050SE +/- 0.02, N = 345.311. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 128mo1122334455SE +/- 1.43, N = 1548.371. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 256mo1020304050SE +/- 0.04, N = 345.041. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 512mo1020304050SE +/- 0.01, N = 342.591. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: float-long - X Y Z: 1024mo20406080100SE +/- 0.01, N = 392.391. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 128mo20406080100SE +/- 0.60, N = 379.951. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 256mo306090120150SE +/- 0.14, N = 3121.421. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 512mo20406080100SE +/- 0.35, N = 387.601. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: FFTW - Precision: double-long - X Y Z: 1024mo612182430SE +/- 0.01, N = 325.041. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 128mo714212835SE +/- 0.24, N = 928.891. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 256mo510152025SE +/- 0.01, N = 322.191. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 512mo510152025SE +/- 0.01, N = 322.901. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: float-long - X Y Z: 1024mo1122334455SE +/- 0.01, N = 350.221. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: FFTW - Precision: double-long - X Y Z: 1024mo1020304050SE +/- 0.02, N = 345.941. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 128mo1122334455SE +/- 0.37, N = 346.571. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 256mo1122334455SE +/- 0.05, N = 348.981. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 512mo1020304050SE +/- 0.07, N = 344.681. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: float-long - X Y Z: 1024mo20406080100SE +/- 0.02, N = 399.291. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: c2c - Backend: Stock - Precision: double-long - X Y Z: 1024mo612182430SE +/- 0.01, N = 325.091. (CXX) g++ options: -O3

OpenBenchmarking.orgGFLOP/s, More Is BetterHeFFTe - Highly Efficient FFT for Exascale 2.4Test: r2c - Backend: Stock - Precision: double-long - X Y Z: 1024mo1122334455SE +/- 0.01, N = 349.481. (CXX) g++ options: -O3

64 Results Shown

HeFFTe - Highly Efficient FFT for Exascale:
  c2c - FFTW - float - 128
  c2c - FFTW - float - 256
  c2c - FFTW - float - 512
  r2c - FFTW - float - 128
  r2c - FFTW - float - 256
  r2c - FFTW - float - 512
  c2c - FFTW - double - 128
  c2c - FFTW - double - 256
  c2c - FFTW - double - 512
  c2c - FFTW - float - 1024
  c2c - Stock - float - 128
  c2c - Stock - float - 256
  c2c - Stock - float - 512
  r2c - FFTW - double - 128
  r2c - FFTW - double - 256
  r2c - FFTW - double - 512
  r2c - FFTW - float - 1024
  r2c - Stock - float - 128
  r2c - Stock - float - 256
  r2c - Stock - float - 512
  c2c - FFTW - double - 1024
  c2c - Stock - double - 128
  c2c - Stock - double - 256
  c2c - Stock - double - 512
  c2c - Stock - float - 1024
  r2c - FFTW - double - 1024
  r2c - Stock - double - 128
  r2c - Stock - double - 256
  r2c - Stock - double - 512
  r2c - Stock - float - 1024
  c2c - Stock - double - 1024
  r2c - Stock - double - 1024
  c2c - FFTW - float-long - 128
  c2c - FFTW - float-long - 256
  c2c - FFTW - float-long - 512
  r2c - FFTW - float-long - 128
  r2c - FFTW - float-long - 256
  r2c - FFTW - float-long - 512
  c2c - FFTW - double-long - 128
  c2c - FFTW - double-long - 256
  c2c - FFTW - double-long - 512
  c2c - FFTW - float-long - 1024
  c2c - Stock - float-long - 128
  c2c - Stock - float-long - 256
  c2c - Stock - float-long - 512
  r2c - FFTW - double-long - 128
  r2c - FFTW - double-long - 256
  r2c - FFTW - double-long - 512
  r2c - FFTW - float-long - 1024
  r2c - Stock - float-long - 128
  r2c - Stock - float-long - 256
  r2c - Stock - float-long - 512
  c2c - FFTW - double-long - 1024
  c2c - Stock - double-long - 128
  c2c - Stock - double-long - 256
  c2c - Stock - double-long - 512
  c2c - Stock - float-long - 1024
  r2c - FFTW - double-long - 1024
  r2c - Stock - double-long - 128
  r2c - Stock - double-long - 256
  r2c - Stock - double-long - 512
  r2c - Stock - float-long - 1024
  c2c - Stock - double-long - 1024
  r2c - Stock - double-long - 1024