Skip to content

raghul3/Airflow_project_weather_api

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Weather API Workflow with Apache Airflow

Project Overview

This project involves creating an automated ETL (Extract, Transform, Load) pipeline to fetch weather data from an API, process and store it in CSV format using Python and Pandas, and finally load it into an Amazon S3 bucket. Apache Airflow is used to orchestrate and automate the workflow, while EC2 instances are used for hosting the Airflow scheduler and workers.

Components

  1. Weather API: Source of weather data.
  2. Python: Used for data extraction and transformation.
  3. Apache Airflow: Orchestration tool for managing the ETL workflow.
  4. EC2 Instances: Hosts the Airflow scheduler and workers.
  5. Pandas: Python library used to process and store the data in CSV format.
  6. Amazon S3: Storage service where the processed data is stored.

Weather API Key

Obtain Weather API Key: - Sign up at OpenWeatherMap to get your API key. - Follow the instructions to create an account and generate an API key.

Airflow's workflow

Airflow UI

Weather API Workflow Architecture

Architecture Diagram

Architecture

About

ETL process workflow using Airflow.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published