Small Python Tool, to webscrape webradio playlists.
Idea: There are plenty of good webradio stations. But I want their songs in a spotify, deezer, etc. playlist without commercials. Unfortunately the playlist of the most stations is resettet every couple of hours. The Python tool scrapes the songs, makes a proper data formatting, removes double values and saves an unique playlist per station. This playlist is then ready to import to your favourite music-streaming-service.
- add the URLs of your favourite webradio stations to
url_list.txt
- I recommend using Onlineradiobox.com to get the playlists, because they have the total list of each day. Example:
https://onlineradiobox.com/de/starfmhell/playlist/1?cs=de.starfmhell
- run the main.py
- the program will
- scrape the playlist data of all stations
- adds all the new songs to a playlist file, named by the station
- remove duplicates and sort the songs alphabetical
- save the playlist files
- write some statistics to the
stat.txt
(for instance, how many new songs were added each day)
Comment: Keep an eye on the stats ;-)
It's really astonishing, that after about 5 Days the playlist of the most radio stations will be freezed to a few hundred songs. So after this 5 days, they just keep on playing the same stuff every day over and over again. ** Happy Groundhog Day :D**