Skip to content
View airscholar's full-sized avatar
💭
Do hard things!
💭
Do hard things!

Highlights

  • Pro

Block or report airscholar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
airscholar/README.md

Hey there 👋, I'm Yusuf!

LinkedIn Medium Stackoverflow Dev.to

👨🏻‍🎓 Academic experience:

📝 I regularly write articles:

  • On Medium about programing, data science and AI
  • On HackerNoon about programing, data science and AI
  • On Dev.to about programing, data science and AI

📺 Latest Youtube Videos

1.2 Billion Records Per Hour High Performance Kafka and Spark - End to End Data Engineering Project The 1.2Billion Records Architecture Per Hour with #ApacheKafka and #ApacheSpark Building a High Performance Real-Time Analytics Database - End to End Data Engineering Project #Apache Frameworks for #DataEngineering - Building High Performance Realtime Systems #Apache Frameworks for End to End #DataEngineering #programming #bigdatatechnologies #dataanalysis The Supermarket Trick Every Data Pro Should Know! Realtime Stock Market Anomaly Detection using ML Models | An End to End Data Engineering Project Master Realtime Data Warehousing And Boost Your Data Skills Building Realtime #datawarehouse with #Apacheairflow, #apachepinot #redpanda for #dataengineering

📚 Latest Medium Stories

airscholar

Pinned Loading

  1. e2e-data-engineering e2e-data-engineering Public

    An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All comp…

    Python 211 96

  2. RedditDataEngineering RedditDataEngineering Public

    This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and serv…

    Python 99 51

  3. changecapture-e2e changecapture-e2e Public

    This project shows how to capture changes from postgres database and stream them into kafka

    Python 31 19

  4. RealtimeStreamingEngineering RealtimeStreamingEngineering Public

    This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from da…

    Python 32 23

  5. FootballDataEngineering FootballDataEngineering Public

    An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Fa…

    Python 17 16

  6. ApacheFlink-SalesAnalytics ApacheFlink-SalesAnalytics Public

    This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, s…

    Java 11 7