This project automates end-to-end water quality compliance checks using data from the UK Environment Agency Hydrology API (environment.data.gov.uk).
Compliance is evaluated against the following criteria:
| Determinand | Unit | Compliance Criteria |
|---|---|---|
| Dissolved Oxygen | mg/L | > 4 |
| pH | — | Between 6.5 and 8.5 |
| Ammonium | mg/L | < 0.5 |
| Turbidity | NTU | < 25 |
| Temperature | °C | < 20 |
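The criteria above map naturally onto simple predicates. A minimal sketch of how they might be expressed in code (the actual rule definitions live in `src/compliance_checks.py` and may differ in naming and structure):

```python
# Hypothetical encoding of the compliance table; determinand keys and the
# check_reading helper are illustrative, not the repository's real API.
COMPLIANCE_RULES = {
    "dissolved-oxygen": lambda v: v > 4,        # mg/L
    "ph": lambda v: 6.5 <= v <= 8.5,            # unitless
    "ammonium": lambda v: v < 0.5,              # mg/L
    "turbidity": lambda v: v < 25,              # NTU
    "temperature": lambda v: v < 20,            # °C
}

def check_reading(determinand: str, value: float) -> bool:
    """Return True when a single reading satisfies its compliance rule."""
    rule = COMPLIANCE_RULES.get(determinand)
    if rule is None:
        raise KeyError(f"No compliance rule for {determinand!r}")
    return rule(value)
```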
The project follows a four-step workflow and uses a medallion data architecture (bronze, silver, and gold layers) for data storage and processing.
- `00_data_ingestion.ipynb`: Retrieves data from the Environment Agency API, focusing on active monitoring stations that publish the required water properties within the 2020–2025 period. The retrieved data is stored in the bronze layer.
- `01_EDA.ipynb`: Performs exploratory data analysis (EDA) on a selected station to design data preparation steps such as resampling and data quality checks. Processed data is saved to the silver layer.
- `02_compliance_check.ipynb`: Executes compliance checks using the silver-layer dataset and generates an Excel report. The final processed data is stored in the gold layer, which can also feed dashboard visualisations.
- `streamlit_app.py`: Provides an interactive Streamlit interface that automates the above steps end to end. Users can download data, perform real-time compliance checks, and export Excel reports.
- `docker-compose.yml`: Enables running the entire project in a containerised environment.
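The ingestion step pages through the EA Hydrology API. A minimal sketch of such a paginated fetch, with the HTTP call injectable for testing; the `_limit`/`_offset` parameter names and the `items` response key follow common EA API conventions but should be verified against the API documentation:

```python
def fetch_all(url, params=None, page_size=500, fetch=None):
    """Collect every item from a paginated API endpoint.

    `fetch` is a callable (url, params) -> list of items; by default it
    issues an HTTP GET with the third-party `requests` library.
    """
    if fetch is None:
        import requests  # third-party; pip install requests

        def fetch(u, p):
            resp = requests.get(u, params=p, timeout=30)
            resp.raise_for_status()
            return resp.json().get("items", [])

    items, offset = [], 0
    while True:
        page = fetch(url, {**(params or {}), "_limit": page_size, "_offset": offset})
        items.extend(page)
        if len(page) < page_size:  # a short page means the last page was reached
            break
        offset += page_size
    return items
```

Injecting `fetch` keeps the paging logic unit-testable without network access.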
```
.
|-- streamlit_app.py              # Streamlit UI for data download and compliance checks
|-- src/
|   |-- app_services.py           # File management, caching, ingest and preprocessing helpers
|   |-- compliance_checks.py      # Compliance rule definitions and evaluation logic
|   |-- config.py                 # API endpoints and colour configuration
|   |-- station_catalog.py        # Station catalogue discovery using the EA Hydrology API
|   |-- visualisations.py         # Plotly charts for data quality and compliance
|   `-- water_quality.py          # API ingestion, resampling, and data quality tagging
|-- data/                         # Bronze, silver, and gold outputs (generated at runtime)
|-- assets/                       # UI assets (e.g. logo)
|-- 00_data_ingestion.ipynb       # Notebook: API ingestion prototype
|-- 01_EDA.ipynb                  # Notebook: exploratory data analysis
|-- 02_compliance_check.ipynb     # Notebook: compliance rule development
|-- requirements.txt
|-- Dockerfile
`-- docker-compose.yml
```
- Python 3.11 or later
- pip for dependency management
- Optional: Docker and Docker Compose for containerised deployment
1. Create and activate a virtual environment.

   ```bash
   python -m venv .venv
   source .venv/bin/activate
   ```

2. Install dependencies.

   ```bash
   pip install --upgrade pip
   pip install -r requirements.txt
   ```

3. Launch the Streamlit app.

   ```bash
   streamlit run streamlit_app.py
   ```

Streamlit runs on http://localhost:8501. The first download may take several minutes because the API is paginated.
Build and run with Compose:

```bash
docker compose up --build
```

The app runs on http://localhost:8501. The `data/` directory is mounted to persist downloads between sessions.
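A minimal Compose file consistent with the description above might look like the following; the service name and container-side mount path are assumptions, and the repository's actual `docker-compose.yml` may differ:

```yaml
# Sketch only: "app" and /app/data are assumed names, not taken from the repo.
services:
  app:
    build: .
    ports:
      - "8501:8501"        # Streamlit's default port
    volumes:
      - ./data:/app/data   # persist downloads between sessions
```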
- `data/meta/`: caches the station catalogue (`stations_water_quality.parquet`).
- `data/bronze/`: stores raw readings per station (`*_readings.parquet`).
- `data/silver/`: contains hourly, data-quality-tagged datasets (`*_readings.parquet`).
- `data/gold/`: stores compliance outputs (`*_compliance_results.parquet`) and generates Excel reports on demand.
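Under these layer conventions, the silver-to-gold step can be sketched as follows; the column names `determinand` and `value` are assumptions for illustration, not necessarily the repository's schema:

```python
# Hypothetical silver -> gold step: flag each hourly reading against its rule
# and persist the result as a compliance dataset.
import pandas as pd

RULES = {
    "dissolved-oxygen": lambda v: v > 4,
    "ph": lambda v: 6.5 <= v <= 8.5,
    "ammonium": lambda v: v < 0.5,
    "turbidity": lambda v: v < 25,
    "temperature": lambda v: v < 20,
}

def build_gold(silver: pd.DataFrame) -> pd.DataFrame:
    """Add a `compliant` column; unknown determinands are left as None."""
    out = silver.copy()
    out["compliant"] = [
        RULES[d](v) if d in RULES else None
        for d, v in zip(out["determinand"], out["value"])
    ]
    return out

# Usage, following the layer layout above (station name is a placeholder):
# silver = pd.read_parquet("data/silver/station_readings.parquet")
# build_gold(silver).to_parquet("data/gold/station_compliance_results.parquet")
```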
These folders are created automatically; ensure the repository's `data/` directory is writable.
