Skip to content

More recipes & datasets

Compare
Choose a tag to compare
@mpariente mpariente released this 09 Mar 16:37
· 624 commits to master since this release

New features

  • Better argparse interface with dictionary.
  • STFT is now prefectly invertible with default values. Also perfect_synthesis_window enables perfect synthesis with a large range if windows for even overlaps.
  • Encoder and Decoder now support arbitrary number of input dimensions.
  • More support for complex numbers (angle, magnitude, interface to numpy and torchaudio)
  • Add SingleSrcMultiScaleSpectralLoss from DDSP (magenta)
  • Huge improvements on tests and coverage

New recipes

  • ConvTasnet full recipe on WHAM
  • DPRNN full recipe on WHAM
  • Full DNS Challenge (microsoft) baseline.
  • Deep clustering and Chimera++ recipe on WSJ0-2mix (ongoing)
  • WHAMR dataset support.

Breaking change

  • Encoder looses its post_process_inputs and apply_mask methods which were not really useful. We consider it is better the user applies these methods knowingly

Big thanks to the contributors on this release @popcornell @sunits @JorisCos