Skip to content

surajptl/airflow_for_beginners

Β 
Β 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

32 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

This is a small example of the workflow built with Apache Airflow.

You can find slides here and watch the talk here

The goal is to set up a data pipline to get a fresh portion of Stack Overflow questions with tag pandas to our mailbox daily.

A small python script could do the job, but for the learning purposes we choose to overengineer it.

By writing this workflow we will learn the main concepts of Apache Airflow, such as:

  • Operators
  • DAG
  • Tasks
  • Hooks
  • Variables
  • Connections
  • XComs

Happy learning πŸ€“

Helpful resources

πŸ“ Apache Airflow Documentation

Apache Airflow tutorials for beginners

πŸ“ Apache Airflow Tutorial for Data Pipelines

πŸ“ Apache Airflow for the confused

πŸ“ Airflow: Tutorial and Beginners Guide

πŸ“ ETL Pipelines With Airflow

Some more

πŸ“° ETL best principles

πŸ“° Managing Dependencies in Apache Airflow

πŸ“ Getting Started with Airflow Using Docker

🎧 Putting Airflow Into Production

πŸ“ How to configure SMTP server for apache airflow

If you have any questions or would like to get in touch with me, please drop me a message to hello@varya.io

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 94.9%
  • HTML 5.1%