School Test Results Scraper

This project is a web scraper for extracting school test results from the NECTA website. The scraper collects data from nested links on a specified page and consolidates the results into an Excel spreadsheet.

Prerequisites

Ensure you have Python installed. You will also need to install the following Python libraries:

requests
beautifulsoup4
pandas
openpyxl

You can install these libraries using pip:

pip install requests beautifulsoup4 pandas openpyxl

Usage

Clone the Repository

git clone https://github.com/yourusername/school-test-results-scraper.git
cd school-test-results-scraper

Edit the Script (if necessary)

Update the url variable in scraper.py if you want to scrape a different page.
Run the Script
```
python scraper.py
```
Check the Output

The results will be saved in an Excel file named school_test_results.xlsx in the same directory.

Notes

Ensure the structure of the scraped data matches the defined schema. Adjust the columns list if necessary.
This script currently assumes that the second table on each page contains the desired data. Modify the script as needed for different structures.

License

This project is licensed under the MIT License.

Feel free to adjust the instructions or content based on your specific requirements or preferences.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
scrape.py		scrape.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

School Test Results Scraper

Prerequisites

Usage

Notes

License

About

Releases

Packages

Languages

License

ChanceMark/site-scrape

Folders and files

Latest commit

History

Repository files navigation

School Test Results Scraper

Prerequisites

Usage

Notes

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages