In this notebook I combine Spotify audio features and BERT word embeddings to predict track sentiment. I use a Hugging Face pre-trained BERT transformer as an embedding layer, and train an additional bidirectional GRU layer for the sentiment-regression task (point prediction in the range [0, 1]). To train the fine-tuning layer of the model I use Spotify's valence attribute, which I added to a lyrics dataset.
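The architecture described above can be sketched roughly as follows. This is a minimal PyTorch illustration, not the notebook's actual code: a plain frozen `nn.Embedding` stands in for the Hugging Face BERT encoder, and all dimensions and the class name `LyricsValenceRegressor` are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LyricsValenceRegressor(nn.Module):
    """Sketch: a frozen pre-trained embedding layer feeding a trainable
    bidirectional GRU head that regresses valence into [0, 1].
    NOTE: nn.Embedding is a stand-in here for the real BERT encoder."""

    def __init__(self, vocab_size=30522, embed_dim=768, hidden_dim=128):
        super().__init__()
        # Frozen "pre-trained" embedding layer (BERT stand-in).
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.embedding.weight.requires_grad = False
        # Trainable fine-tuning head: bidirectional GRU + linear + sigmoid.
        self.gru = nn.GRU(embed_dim, hidden_dim,
                          batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden_dim, 1)

    def forward(self, token_ids):
        x = self.embedding(token_ids)        # (batch, seq, embed_dim)
        _, h = self.gru(x)                   # h: (2, batch, hidden_dim)
        h = torch.cat([h[0], h[1]], dim=-1)  # concatenate both directions
        return torch.sigmoid(self.head(h)).squeeze(-1)  # valence in [0, 1]

model = LyricsValenceRegressor()
dummy_batch = torch.randint(0, 30522, (4, 16))  # 4 "lyrics" of 16 tokens
preds = model(dummy_batch)
print(preds.shape)  # torch.Size([4])
```

The sigmoid output keeps predictions inside [0, 1], matching the range of Spotify's valence label.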
The examples below use the NLTK demo and Spotify valence to measure a track's positiveness. They demonstrate that relying on audio alone or lyrics alone can be inaccurate.
- Positive Sentiment Example: Baz Luhrmann - Everybody's Free To Wear Sunscreen.
- NLTK sentiment classification: Negative.
- Spotify Valence: 0.8.
- Negative Sentiment Example: Otis Redding - Mr. Pitiful.
- NLTK sentiment classification: Negative.
- Spotify Valence: 0.9.
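To compare the two signals directly, it helps to put them on the same scale: NLTK's VADER analyzer returns a compound score in [-1, 1], while Spotify valence lives in [0, 1]. The helper `compound_to_valence_scale` below is a hypothetical name of mine, and the scores are illustrative numbers in the spirit of the examples above, not actual NLTK output.

```python
def compound_to_valence_scale(compound):
    """Map a VADER-style compound score from [-1, 1] onto Spotify's
    valence range [0, 1] so the two measures are directly comparable."""
    return (compound + 1.0) / 2.0

# Illustrative numbers: lyrics read as negative (compound ~ -0.6)
# while the audio is upbeat (Spotify valence 0.9) -- the signals disagree.
lyrics_score = compound_to_valence_scale(-0.6)   # 0.2
audio_valence = 0.9
print(round(abs(audio_valence - lyrics_score), 2))  # 0.7
```

A large gap between the two rescaled scores is exactly the disagreement the Mr. Pitiful example illustrates.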
- Database: gathering song lyrics, adding the Spotify valence attribute, and pre-processing. I uploaded the final result to Kaggle as the 150K Lyrics Labeled with Spotify Valence dataset.
- Model Design: Iteratively improved model capacity.
- Evaluation: loss and accuracy metrics across three buckets - negative, neutral, and positive sentiment.
- Interpretation: Understanding what the model is learning using word clouds.
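The labeling step in the Database stage amounts to joining the scraped lyrics with Spotify's audio features per track. The sketch below uses toy pandas frames; the column names and example rows are illustrative, not the dataset's actual schema.

```python
import pandas as pd

# Toy stand-ins for the real tables: scraped lyrics and Spotify features.
lyrics = pd.DataFrame({
    "artist": ["Otis Redding", "Armin van Buuren"],
    "track":  ["Mr. Pitiful", "Blah Blah Blah"],
    "lyrics": ["...", "..."],
})
audio = pd.DataFrame({
    "artist":  ["Otis Redding", "Armin van Buuren"],
    "track":   ["Mr. Pitiful", "Blah Blah Blah"],
    "valence": [0.9, 0.18],
})

# Attach the valence label to each lyric; an inner join drops tracks
# that could not be matched on the Spotify side.
labeled = lyrics.merge(audio, on=["artist", "track"], how="inner")
print(labeled[["track", "valence"]])
```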
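The bucketed evaluation mentioned above can be sketched as follows. The 1/3 and 2/3 thresholds and the sample scores are my own illustrative assumptions, not the notebook's actual cutoffs or results.

```python
def bucket(v, low=1/3, high=2/3):
    """Map a valence score in [0, 1] to a coarse sentiment bucket.
    The thresholds are illustrative, not the notebook's."""
    return "negative" if v < low else ("positive" if v > high else "neutral")

def bucket_accuracy(y_true, y_pred):
    """Fraction of predictions landing in the same bucket as the label."""
    hits = sum(bucket(t) == bucket(p) for t, p in zip(y_true, y_pred))
    return hits / len(y_true)

# Illustrative labels and predictions.
y_true = [0.9, 0.18, 0.5, 0.8]
y_pred = [0.76, 0.2, 0.55, 0.4]
print(bucket_accuracy(y_true, y_pred))  # 0.75
```

Reporting accuracy per bucket (rather than raw regression loss alone) shows whether the model confuses, say, neutral and positive tracks more often than negative and positive ones.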
Words in the word cloud are sized by the magnitude of their effect on the model's prediction and colored by whether their influence is positive (green) or negative (red).
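One common way to obtain such per-word influences is leave-one-word-out occlusion: re-score the lyric with each word removed and record the change in the prediction. The sketch below assumes this technique; `word_influences` and the toy lexicon scorer are hypothetical stand-ins for the trained model.

```python
def word_influences(words, predict):
    """Leave-one-word-out influence: for each word, the change in the
    model's prediction when that word is removed.  Positive deltas
    (word pushes valence up) would be drawn green, negative ones red,
    with font size proportional to |delta|."""
    base = predict(words)
    return {w: round(base - predict(words[:i] + words[i + 1:]), 2)
            for i, w in enumerate(words)}

# Toy scorer: a tiny lexicon standing in for the trained model.
LEX = {"free": 0.3, "sunscreen": 0.1, "pitiful": -0.4}
def toy_predict(words):
    return 0.5 + sum(LEX.get(w, 0.0) for w in words)

infl = word_influences(["everybody", "free", "pitiful"], toy_predict)
print(infl)  # {'everybody': 0.0, 'free': 0.3, 'pitiful': -0.4}
```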
- Positive Sentiment Example: Armin van Buuren - Blah Blah Blah.
- NLTK sentiment classification: Negative.
- Spotify Valence: 0.18.
- LyricsAudioBoost Model: 0.76.