Music2P is a multi-modal music ML service that serves the promotion needs of musicians and companies. All you need to do is provide music and an image, and describe the kind of album promotion cover you want.
There are two strategies to generate captions: 'Music-to-Caption' via BART and 'Image-to-Caption' via BLIP.
custom_data has mp3 files and jpg files.
demo/app.py gets the mp3 file and outputs a caption by BART. It also generates captions based on an image by BLIP.
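As a rough illustration of how the two caption strategies could be applied to a folder such as custom_data, the sketch below routes each file by its extension. It is not the code in demo/app.py: the function name `caption_files` and the two captioner callables are hypothetical stand-ins for the BART and BLIP models, which are not loaded here.

```python
from pathlib import Path
from typing import Callable, Dict

# A captioner takes a file path and returns a caption string.
# In the real project these would wrap BART (music) and BLIP (image);
# here they are hypothetical callables supplied by the caller.
Captioner = Callable[[Path], str]


def caption_files(
    data_dir: Path,
    music_captioner: Captioner,
    image_captioner: Captioner,
) -> Dict[str, str]:
    """Send every .mp3 to the music captioner and every .jpg to the image captioner."""
    routes = {".mp3": music_captioner, ".jpg": image_captioner}
    captions: Dict[str, str] = {}
    for path in sorted(data_dir.iterdir()):
        captioner = routes.get(path.suffix.lower())
        # Files with other extensions, and subdirectories, are skipped.
        if captioner is not None and path.is_file():
            captions[path.name] = captioner(path)
    return captions
```

Passing the captioners in as arguments keeps the routing logic independent of the models, so it can be tried with simple placeholder functions before the real models are available.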
Run the ColabToFlask.ipynb Colab notebook, copy the URL it generates, and paste it into the variable named 'ngrok_url' in app.py (a comment highlights this variable). Then run app.py, which serves Music2P on your localhost.
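To make the role of 'ngrok_url' more concrete, here is a minimal sketch of how a local script might address the Flask server that the notebook exposes. Only the variable name 'ngrok_url' comes from this README. The URL value is a placeholder, and the '/caption' route, the raw-bytes payload, and the helper `build_caption_request` are assumptions for illustration; the actual app.py may work differently.

```python
from urllib.parse import urljoin
from urllib.request import Request

# Placeholder value: replace with the URL printed by ColabToFlask.ipynb.
ngrok_url = "https://example.ngrok.io"


def build_caption_request(base_url: str, mp3_bytes: bytes, route: str = "/caption") -> Request:
    """Build (but do not send) a POST request carrying mp3 bytes.

    The route name and payload format are hypothetical.
    """
    # Normalise slashes so a trailing "/" on the pasted URL does not matter.
    endpoint = urljoin(base_url.rstrip("/") + "/", route.lstrip("/"))
    return Request(
        endpoint,
        data=mp3_bytes,
        headers={"Content-Type": "audio/mpeg"},
        method="POST",
    )
```

With the notebook running, such a request could be sent with `urllib.request.urlopen(build_caption_request(ngrok_url, mp3_bytes))`. Because the URL generated by the notebook can differ from run to run, it helps to keep it in a single variable like this.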