v1: Initial release
Usage
from dfttest2 import DFTTest
output = DFTTest(input)
The Python wrapper dfttest2.DFTTest() is the only stable interface and generally matches the original DFTTest plugin interface.
The new dfttest2.DFTTest2() interface is still a work in progress. The dfttest2.Backend interface may change in the future.
Performance is mostly limited by the memory bandwidth of the GPU. Expect ~2x performance on conventional hardware (i.e. with GDDR memory rather than HBM memory).
Specify backend=Backend.cuFFT(in_place=False) to use more efficient kernels at the cost of increased (1.15x to 1.30x) device memory usage.
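The backend choice can be wrapped in a small helper. This is only a sketch: denoise and its out_of_place flag are hypothetical names, not part of dfttest2, and the only calls assumed are DFTTest(..., backend=...) and Backend.cuFFT(in_place=False) as described in this note.

```python
def denoise(clip, out_of_place=True):
    """Hypothetical helper (not part of dfttest2) that picks the cuFFT mode."""
    # Imported lazily so the script can still be loaded without the plugin.
    from dfttest2 import DFTTest, Backend

    if out_of_place:
        # More efficient kernels, at roughly 1.15x to 1.30x the device
        # memory usage, per the release note.
        return DFTTest(clip, backend=Backend.cuFFT(in_place=False))
    # Otherwise leave the backend at the wrapper's default.
    return DFTTest(clip)
```

In a VapourSynth script this would be called as output = denoise(input), falling back to denoise(input, out_of_place=False) when device memory is tight.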
Benchmark for this release.
Known issues
- nlocation is not implemented yet
- other backends