Skip to content

Latest commit

 

History

History
22 lines (14 loc) · 1.21 KB

README.md

File metadata and controls

22 lines (14 loc) · 1.21 KB

Spotify Data Analysis

The main goal of this project was to find out what properties a song needs to have to be popular. I tried to approach this question from two different angles and work with the two datasets:

Dataset A: UK Number 1 vs. random songs (ca. 2,600 observations of 16 variables) It contains data from approximately 1,300 songs that were Number 1 songs in the UK (from playlist “Every Official UK Number 1 Ever” from user “Official Charts”) and 1,300 random songs.

For Dataset A the aim is to detect the main differences between the two groups.

Dataset B: 10,000 songs (ca. 10,000 observations of ca. 130 variables)

Here the focus lies on the variable popularity, that measures the popularity of a song on Spotify.

Collecting data with Spotipy

With get_token.py it is possible to get access the Spotify API. Then if the access established then one can use:

  • random_songs.py to get randon song ids (the are not really picked randomly!)
  • playlist_get_songs.py to collect track infos from a user playlist
  • search_song_get_id.py to get and id for track name and artist name
  • song_analysis.py get summarized values of the audio analysis for song ids
  • song_features.py get audio features for song ids