group_assessment_esd

Checked for open data and found a dataset on air pollution in Scotland at https://www.scottishairquality.scot/
Chose the data for Aberdeen from 1 January 2022 until 25 March to meet the requirement of 2000 to 7000 rows.
Designed the database using draw.io.
Identified the need to add station details, which were not available in the dataset but only on the website.
Coded a web scraper using Selenium and BeautifulSoup to retrieve the station details from the website.
Set up models.py to structure the database in our code, according to the database design mentioned above.
Coded the parser, which handles both the downloaded CSV file and the web scraper.
Implemented extensive error handling in the parser, since the dataset contains many missing data points.
This way, no manual manipulation of the CSV file was required.
One problem was the limitations of DecimalField and IntegerField, which did not allow empty cells where data was missing.
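
The missing-value handling described above could look roughly like the following sketch. It is not the project's actual parser: the column names (date, no2, pm10), the placeholder strings in MISSING_MARKERS and the sample data are all assumptions made for illustration. Returning None for a missing cell only works if the corresponding DecimalField or IntegerField is declared with null=True and blank=True, which is also an assumption about models.py.

```python
import csv
import io
from decimal import Decimal, InvalidOperation

# Hypothetical placeholders for missing readings; the real dataset may use others.
MISSING_MARKERS = {"", "No data", "n/a"}


def parse_decimal(cell):
    """Return a Decimal for a numeric cell, or None if it is missing or malformed."""
    text = (cell or "").strip()
    if text in MISSING_MARKERS:
        return None
    try:
        value = Decimal(text)
    except InvalidOperation:
        return None
    # Decimal accepts "NaN" and "Infinity"; treat those as missing too.
    return value if value.is_finite() else None


def parse_rows(csv_text):
    """Turn CSV text into dicts whose numeric fields are Decimal or None."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = []
    for record in reader:
        rows.append({
            "date": record["date"],  # assumed column name
            "no2": parse_decimal(record.get("no2")),  # assumed column name
            "pm10": parse_decimal(record.get("pm10")),  # assumed column name
        })
    return rows


# Made-up sample with an empty cell, a text placeholder and a malformed value.
SAMPLE = "date,no2,pm10\n2022-01-01,12.5,\n2022-01-02,No data,8\n2022-01-03,abc,7.25\n"
```

Calling parse_rows(SAMPLE) gives three rows in which every unreadable cell is None, so the CSV file itself never has to be edited by hand.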
Had a group discussion on how to progress and how to split up the tasks.
Considered GitHub issues and a Kanban board, and decided on GitHub issues, since they can be created, assigned and closed with terminal commands from our IDEs.
Designed a first draft of the templates, wrote tests for the templates and then filled in the templates.
Included a paginator in the data template.
Since the paginator did not work well with the filter/input form on our data page, we added sessions.
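
The idea behind combining sessions with pagination can be sketched without Django as follows. This is an illustration, not the project's view code: a plain dict stands in for Django's dict-like request.session, the paginate function stands in for Django's Paginator, and the names resolve_filters, data_page, PAGE_SIZE and the station filter key are hypothetical. The point is that the filter is submitted once, stored in the session, and reused when the user clicks through to later pages that carry no form data.

```python
from math import ceil

PAGE_SIZE = 2  # hypothetical page size, kept small for the example


def resolve_filters(session, submitted):
    """Store newly submitted filters in the session; otherwise reuse the stored ones."""
    if submitted is not None:
        session["filters"] = dict(submitted)
    return session.get("filters", {})


def paginate(items, page, page_size=PAGE_SIZE):
    """Return the items on the requested page and the total number of pages."""
    total_pages = max(1, ceil(len(items) / page_size))
    page = min(max(page, 1), total_pages)  # clamp out-of-range page numbers
    start = (page - 1) * page_size
    return items[start:start + page_size], total_pages


def data_page(session, readings, page=1, submitted=None):
    """Filter readings by the remembered station, then return one page of them."""
    filters = resolve_filters(session, submitted)
    station = filters.get("station")
    matching = [r for r in readings if not station or r["station"] == station]
    return paginate(matching, page)
```

A first call with submitted={"station": "A"} stores the filter; a second call with only page=2 and the same session dict still returns station A readings, which is the behaviour the plain paginator lacked.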
