BYOND Medals Scraper

This script scrapes medals earned by BYOND users and saves the data in JSON format. It handles different date formats and converts them to ISO 8601 format.

Features

- Scrapes medals for a list of BYOND usernames
- Handles date formats: today, yesterday, on DAY, and specific dates
- Saves data in JSON format
- Supports concurrent scraping for faster execution
- Includes a progress bar to show scraping progress
- Adds a delay between batches to be considerate to the web server
- Can resume scraping from where it left off if interrupted
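
The date handling listed above can be sketched roughly as follows. This is a minimal illustration, not the script's actual code: the function name `to_iso`, the assumed input shapes (`today, 10:22 am`, `on Monday, 9:05 pm`, `Nov 29 2023`), and the reading of `on DAY` as the most recent past weekday are all assumptions.

```python
from datetime import datetime, time, timedelta

WEEKDAYS = ["monday", "tuesday", "wednesday", "thursday",
            "friday", "saturday", "sunday"]


def to_iso(raw: str, now: datetime) -> str:
    """Convert a relative or absolute date string to ISO 8601.

    Hypothetical helper: the accepted input shapes are assumptions.
    """
    # Split an optional trailing time, e.g. "today, 10:22 am".
    date_part, _, time_part = raw.strip().partition(",")
    date_part = date_part.strip()
    time_part = time_part.strip()
    key = date_part.lower()

    if key == "today":
        day = now.date()
    elif key == "yesterday":
        day = (now - timedelta(days=1)).date()
    elif key.startswith("on ") and key[3:] in WEEKDAYS:
        # Assumption: "on DAY" means the most recent such weekday before today.
        target = WEEKDAYS.index(key[3:])
        delta = (now.weekday() - target) % 7 or 7
        day = (now - timedelta(days=delta)).date()
    else:
        # Assumption: absolute dates look like "Nov 29 2023".
        day = datetime.strptime(date_part, "%b %d %Y").date()

    # Assumption: times look like "10:22 am"; default to midnight if absent.
    clock = (datetime.strptime(time_part, "%I:%M %p").time()
             if time_part else time(0, 0))
    return datetime.combine(day, clock).isoformat()
```

Passing `now` explicitly keeps the function deterministic, which makes the relative cases easy to test.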

Requirements

- Python 3.x
- requests library
- beautifulsoup4 library
- tqdm library

Installation

Clone this repository:
```
git clone https://github.com/yourusername/byond-medals-scraper.git
cd byond-medals-scraper
```

Create a virtual environment (optional but recommended):
```
python -m venv venv
```
Activate the virtual environment:
- On Windows:
```
venv\Scripts\activate
```
- On macOS/Linux:
```
source venv/bin/activate
```
Install the required libraries:
```
pip install -r requirements.txt
```
Ensure your requirements.txt contains the following:
```
requests
beautifulsoup4
tqdm
```

Usage

Create a usernames.txt file in the same directory as the script. This file should contain one username per line.

Example usernames.txt:
```
user1
user2
user3
```
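
Reading a file in this format might look like the sketch below. The helper name `load_usernames` is hypothetical, and skipping blank lines and duplicates is an assumption about sensible behavior, not something the script is confirmed to do.

```python
from pathlib import Path
from typing import List


def load_usernames(path: str) -> List[str]:
    """Return usernames from a one-per-line file, in file order.

    Hypothetical helper: blank lines and repeated names are skipped.
    """
    seen = set()
    names = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        name = line.strip()
        if name and name not in seen:
            seen.add(name)
            names.append(name)
    return names
```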
Set the desired mode and parameters at the top of the script:
- DELAY: Delay between each batch in seconds. Default is 1.
- MAX_WORKERS: Maximum number of concurrent workers. Default is 10.
- ERROR_DELAY: Delay between retries after a network failure. Default is 3.
- RETRIES: Max retries per user. Default is 3.
- OUTPUT_FILE: Output file name. Default is 'all_users_medals.json'.
- INPUT_FILE: Input file name. Default is 'usernames.txt'.
- SECTION_TITLE: Section title to search for. Default is 'Space Station 13 Medals'.
- APPEND_MODE: Boolean to either append with checks (True) or start fresh (False). Default is False.
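
To illustrate how DELAY, MAX_WORKERS, ERROR_DELAY, and RETRIES could interact, here is a minimal sketch using the standard library's thread pool. It is not the script's actual implementation: the function names, the choice of batch size equal to MAX_WORKERS, the reading of RETRIES as total attempts, and the `fetch` callable (standing in for the real page request and parsing) are all assumptions.

```python
import time
from concurrent.futures import ThreadPoolExecutor

DELAY = 1         # seconds between batches
MAX_WORKERS = 10  # concurrent workers (also used as batch size here)
ERROR_DELAY = 3   # pause between attempts after a failure
RETRIES = 3       # assumption: total attempts per user


def fetch_with_retries(fetch, username, retries=RETRIES,
                       error_delay=ERROR_DELAY):
    """Call fetch(username); return None if every attempt fails."""
    for attempt in range(1, retries + 1):
        try:
            return fetch(username)
        except Exception:
            # A real scraper would likely catch only network errors.
            if attempt < retries:
                time.sleep(error_delay)
    return None


def scrape_in_batches(usernames, fetch, max_workers=MAX_WORKERS,
                      delay=DELAY, error_delay=ERROR_DELAY,
                      retries=RETRIES):
    """Fetch medals batch by batch, pausing between batches."""
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for start in range(0, len(usernames), max_workers):
            batch = usernames[start:start + max_workers]
            outcomes = pool.map(
                lambda u: fetch_with_retries(fetch, u, retries, error_delay),
                batch,
            )
            for username, medals in zip(batch, outcomes):
                if medals is not None:
                    results[username] = medals
            # Pause only if another batch follows.
            if start + max_workers < len(usernames):
                time.sleep(delay)
    return results
```

Keeping `fetch` as a parameter lets the batching and retry logic be exercised without touching the network.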
Run the script:
```
python scrape_medals_batch.py
```
The script will create an all_users_medals.json file containing the scraped data. Errors will be logged in error_log.txt.
When you are finished, deactivate the virtual environment to restore your shell to its previous state:
```
deactivate
```

Example Output

Example JSON structure:

```
{
    "user1": [
        {
            "Name": "Fish",
            "Date": "2023-11-29T10:22:00"
        },
        {
            "Name": "It'sa me, Mario",
            "Date": "2023-11-29T10:23:00"
        }
    ],
    "user2": [
        {
            "Name": "HIGH VOLTAGE",
            "Date": "2023-11-29T10:34:00"
        }
    ]
}
```
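
Because the output is keyed by username, a resumed run could decide which users still need scraping by checking the existing file. The sketch below shows one way to do that. The helper name `pending_usernames` is hypothetical, and treating "already present in the output" as "done" is an assumption about what the APPEND_MODE checks involve.

```python
import json
from pathlib import Path


def pending_usernames(usernames, output_file):
    """Return (usernames not yet in the output file, existing data).

    Hypothetical helper: a user counts as done if the output has a key
    for them. A missing output file means nothing is done yet.
    """
    path = Path(output_file)
    done = {}
    if path.exists():
        done = json.loads(path.read_text(encoding="utf-8"))
    remaining = [u for u in usernames if u not in done]
    return remaining, done
```

The returned `done` dictionary can then be extended with new results and written back as a whole, so an interrupted run loses at most the work since the last write.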

Sovexe/byond-medal-scraper
