Image to audio

Python package to convert image to audio. The system interpretates the images as raw spectrograms and synthetize the audio using customizable iFFT parameters.

Install

pip install -r requirements.txt

Process an image

python main.py --image_path=<PATH_TO_THE_IMAGE> --hop_size=512 --out_name=<NAME_OF_OUTPUT_AUDIO>.wav --sample_rate=16000

The higher the sample rate, the higher the content of high frequencies, but the length of the output audio gets shorter
The higher the hop_size, the higher the resolution and longer the length of the audio file (it slows down the playback). Very slow hop_size will lead into harmonic artifacts (could or could not be wanted...).

I recomend using images with large resolution (~4000px x 3000px), a hop size of 512 and a sample_rate of 16000 to obtain ~1.5 min of audio.

Test

TODO

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
classes		classes
test		test
README.md		README.md
agua3.jpeg		agua3.jpeg
main.py		main.py
requirements.txt		requirements.txt
spectrogram.png		spectrogram.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image to audio

Install

Process an image

Test

About

Releases

Packages

Languages

nschmidtg/image_to_audio

Folders and files

Latest commit

History

Repository files navigation

Image to audio

Install

Process an image

Test

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages