In this project we try out image based morphological filtering to audio spectrograms for removing noise from audio signals. Regions of the spectrogram having high energy are estimated to be of more importance and likely to contain the original audio signal. The process of erosion can remove noise while dilation can then restore any erroneously removed signal regions. The combination of the two techniques results in a non-linear, time-frequency filter. We then recreate the denoised audio from the processed spectrogram.
This project was part of a bigger project called "Visual Mic", where we extract audio from motion due to vibrations caused by sound waves on materials. Here is the link to the project: https://github.com/joeljose/Visual-Mic
This denoising algorithm was effective in extracting out the signal from the noisy audio.