Spotify Data Analysis

The main goal of this project was to find out what properties a song needs to have to be popular. I tried to approach this question from two different angles and work with the two datasets:

Dataset A: UK Number 1 vs. random songs (ca. 2,600 observations of 16 variables) It contains data from approximately 1,300 songs that were Number 1 songs in the UK (from playlist “Every Official UK Number 1 Ever” from user “Official Charts”) and 1,300 random songs.

For Dataset A the aim is to detect the main differences between the two groups.

Dataset B: 10,000 songs (ca. 10,000 observations of ca. 130 variables)

Here the focus lies on the variable popularity, that measures the popularity of a song on Spotify.

Collecting data with Spotipy

With get_token.py it is possible to get access the Spotify API. Then if the access established then one can use:

random_songs.py to get randon song ids (the are not really picked randomly!)
playlist_get_songs.py to collect track infos from a user playlist
search_song_get_id.py to get and id for track name and artist name
song_analysis.py get summarized values of the audio analysis for song ids
song_features.py get audio features for song ids

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Spotify Data Analysis

Collecting data with Spotipy

Files

README.md

Latest commit

History

README.md

File metadata and controls

Spotify Data Analysis

Collecting data with Spotipy