- Cross-modal Music-to-Story Retrieval task
- Retrieving Story from Music
- Using emotion labels, we explore embedding spaces that map stories and music to one another.
- ① GST style Reference Encoder
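
To illustrate what retrieving a story from music can look like once both modalities are embedded in a shared space, here is a minimal sketch. It assumes cosine similarity as the ranking metric and uses toy three-dimensional vectors; the function names, the embeddings, and the choice of metric are illustrative assumptions, not the code or settings of this repository.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_a * norm_b)


def retrieve_stories(music_embedding, story_embeddings, top_k=3):
    """Rank stories by similarity to one music embedding (hypothetical helper)."""
    scored = [
        (story_id, cosine_similarity(music_embedding, emb))
        for story_id, emb in story_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy embeddings, made up for illustration only.
story_embeddings = {
    "story_a": [0.9, 0.1, 0.0],
    "story_b": [0.0, 1.0, 0.0],
    "story_c": [0.7, 0.7, 0.1],
}
music_embedding = [1.0, 0.2, 0.0]

results = retrieve_stories(music_embedding, story_embeddings, top_k=2)
for story_id, score in results:
    print(f"{story_id}: {score:.3f}")
```

In practice the embeddings would come from the trained story and music branches rather than hand-written lists.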
pip install -r requirements.txt
- Sentences from poetry, novels, dramas, scenarios, etc.
- Seven emotion labels (happy, neutral, flustered, anxious, angry, sad, hurt)
- https://mtg.github.io/mtg-jamendo-dataset
- You need to download the audio files for the `autotagging_moodtheme.tsv` subset.
- Many mood/theme tags (action, adventure, advertising, ambiental, background, ballad, ...)
- The music tags do not match the story emotion labels, so we manually map music labels onto story labels.
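
The manual label mapping can be pictured as a simple lookup table. The sketch below is hypothetical: the three entries in `EXAMPLE_TAG_TO_STORY_LABEL` are invented for illustration and are not the mapping used in this repository, and `map_music_tags` is an assumed helper name. Only the seven story labels and the tag names come from the lists above.

```python
# The seven story emotion labels listed above.
STORY_LABELS = ("happy", "neutral", "flustered", "anxious", "angry", "sad", "hurt")

# Hypothetical entries for illustration only; NOT the repository's actual mapping.
EXAMPLE_TAG_TO_STORY_LABEL = {
    "action": "angry",
    "background": "neutral",
    "ballad": "sad",
}


def map_music_tags(tags, mapping=EXAMPLE_TAG_TO_STORY_LABEL):
    """Map music tags to story labels, skipping tags that have no mapping.

    Duplicate labels are dropped while preserving first-seen order.
    """
    labels = []
    for tag in tags:
        label = mapping.get(tag.strip().lower())
        if label is not None and label not in labels:
            labels.append(label)
    return labels


# Sanity check: every mapped value must be one of the seven story labels.
assert set(EXAMPLE_TAG_TO_STORY_LABEL.values()) <= set(STORY_LABELS)

print(map_music_tags(["ballad", "adventure", "background"]))  # ['sad', 'neutral']
```

Tags without an entry (here `adventure`) are skipped, which is one possible way to handle music tags that have no clear emotional counterpart.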
bash ./run_twobranch_train.sh
bash ./run_train.sh
- Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, and Xavier Serra
@inproceedings{won2021emotion,
title={Emotion embedding spaces for matching music to stories},
author={Won, Minz and Salamon, Justin and Bryan, Nicholas J. and Mysore, Gautham J. and Serra, Xavier},
booktitle={ISMIR},
year={2021}
}