Skip to content

Real-time data engineering project designed to simulate the ingestion, processing, storage, and querying of live stock market data using Apache Kafka, Python, and AWS Cloud Services.

Notifications You must be signed in to change notification settings

manishjadhav9/Investrix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

1 Commit
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ“ˆ Investrix: Real-Time Stock Market Data Engineering with Kafka & AWS

Investrix is a complete real-time data engineering project designed to simulate the ingestion, processing, storage, and querying of live stock market data using Apache Kafka, Python, and AWS Cloud Services.


๐Ÿงฉ Project Overview

Investrix demonstrates how to build a real-time streaming data pipeline by integrating open-source and cloud-native technologies. The pipeline includes:

  • Apache Kafka for real-time message streaming
  • Python for producer/consumer logic
  • Amazon S3 for scalable object storage
  • AWS Glue for schema inference and cataloging
  • AWS Athena for serverless querying

This project is ideal for learning modern data engineering, real-time analytics, and cloud integration.


๐Ÿ” Data Flow:

  1. Kafka Producer streams CSV stock data to a Kafka topic.
  2. Kafka Consumer listens to that topic and writes the data to an Amazon S3 bucket.
  3. AWS Glue Crawler detects schema from S3 and updates the Glue Catalog.
  4. AWS Athena runs SQL queries over structured S3 data using the Glue Catalog.

โš™๏ธ Tech Stack

Layer Tool/Service
Programming Python
Stream Processing Apache Kafka
Cloud Compute AWS EC2
Cloud Storage AWS S3
Metadata Catalog AWS Glue Crawler, Catalog
Query Engine AWS Athena

About

Real-time data engineering project designed to simulate the ingestion, processing, storage, and querying of live stock market data using Apache Kafka, Python, and AWS Cloud Services.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published