This project focuses on the retrieval, transformation, and storage of data from Velib Metropole. It uses Apache Airflow as the central orchestrator to automate and manage the data workflow. The processed data is stored in both Amazon S3 and PostgreSQL (Supabase) for further analysis and visualization in a Streamlit application.
-
Velib API
- Provide the raw data on velib stations
-
Python Scripts
- Fetch data from the Vélib API and store it in AWS S3 and Supabase PostgreSQL db.
-
Storing Data in Supabase PostgreSQL
- PostgreSQL Database: Transformed data is stored in a PostgreSQL database hosted on Supabase.
-
Supabase PostgreSQL
- Transformed data is stored in a PostgreSQL database hosted on Supabase.
-
Streamlit App
- A Streamlit app to fetch data from the PostgreSQL database and visualize it.
- The Streamlit app displays maps and tables to show where there is not enough e-bike availability.
- The Streamlit app can trigger the Airflow DAG to update the data
-
Apache Airflow
- Airflow DAGs to orchestrate the ETL process.
-
Python Scripts
