This script is used to download audio files from the website https://domianarchiv.de.
- Python 3
- BeautifulSoup4
- requests
You can install these packages using pip:
pip install beautifulsoup4 requests
You can run the script using the following command:
python domianarchiv-downloader.py --start_page START_PAGE --end_page END_PAGE --download_dir DOWNLOAD_DIR
Where:
START_PAGE
is the page number to start downloading from (default is 0).
END_PAGE
is the page number to stop downloading at (default is 184).
DOWNLOAD_DIR
is the directory where the downloaded files will be saved (default is "downloads").
The script uses BeautifulSoup to parse the HTML of the website and find the links to the audio files. It then downloads these files using the requests library. The script handles HTTP 503 errors by retrying failed requests and adding a delay between requests to avoid overloading the server. It also includes a User-Agent string in the request headers, which some servers require.
Please respect the rights of the content creators and only download the files if you have permission to do so.