More recipes & datasets
New features
- Better
argparse
interface with dictionary. - STFT is now prefectly invertible with default values. Also
perfect_synthesis_window
enables perfect synthesis with a large range if windows for even overlaps. Encoder
andDecoder
now support arbitrary number of input dimensions.- More support for complex numbers (angle, magnitude, interface to numpy and torchaudio)
- Add
SingleSrcMultiScaleSpectralLoss
from DDSP (magenta) - Huge improvements on tests and coverage
New recipes
- ConvTasnet full recipe on WHAM
- DPRNN full recipe on WHAM
- Full DNS Challenge (microsoft) baseline.
- Deep clustering and Chimera++ recipe on WSJ0-2mix (ongoing)
- WHAMR dataset support.
Breaking change
Encoder
looses itspost_process_inputs
andapply_mask
methods which were not really useful. We consider it is better the user applies these methods knowingly
Big thanks to the contributors on this release @popcornell @sunits @JorisCos