- Designed a microservice architecture to perform real-time analytics on comments from technology-related subreddits (e.g., r/technology), with a cumulative analysis of over 220k comments from 76k users.
- Utilized Kafka as the message broker to decouple comment ingestion from keyword extraction, which uses named entity recognition provided by spaCy.
- Streamed data from Kafka to Elasticsearch using Kafka Connect Elasticsearch Sink and ksqlDB, and built a Kibana dashboard to identify the active Redditors and hot topics within a user-specified time range.
- Orchestrated the multi-container application (10 containers) with docker-compose.
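
The dashboard's "active Redditors and hot topics within a user-specified time range" view corresponds roughly to a range filter combined with two terms aggregations in the Elasticsearch query DSL. Kibana generates such queries itself, so the following is only a minimal sketch of an equivalent search body. The field names (`created_utc`, `author`, `keyword`) and the function name are hypothetical, as is the assumption that `author` and `keyword` are mapped as keyword fields; the project's real index mapping is not shown here.

```python
import json
from datetime import datetime, timezone


def build_dashboard_query(start, end, top_n=10):
    """Build an Elasticsearch search body for one time window.

    Assumed (hypothetical) fields: `created_utc` (date), `author` and
    `keyword` (both mapped as keyword type so terms aggregations work).
    """
    return {
        "size": 0,  # only aggregation buckets are needed, not the hits
        "query": {
            "range": {
                "created_utc": {
                    "gte": start.isoformat(),
                    "lte": end.isoformat(),
                }
            }
        },
        "aggs": {
            # most frequent comment authors in the window
            "active_redditors": {"terms": {"field": "author", "size": top_n}},
            # most frequent extracted keywords in the window
            "hot_topics": {"terms": {"field": "keyword", "size": top_n}},
        },
    }


if __name__ == "__main__":
    # The year is illustrative only; the window is otherwise arbitrary.
    body = build_dashboard_query(
        datetime(2023, 3, 26, tzinfo=timezone.utc),
        datetime(2023, 4, 5, tzinfo=timezone.utc),
    )
    print(json.dumps(body, indent=2))
```

The resulting dictionary could be sent as the body of a `_search` request against whichever index the sink connector writes to.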

During the period of March 26 to April 5:

- "AI" increased from 15.23% to 18.65% 📈
- "Google" increased from 9.35% to 10.24% 📈
- "TikTok" increased from 4.5% to 6.45% 📈
- "SVB" decreased from 4.7% to 3.24% 📉
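
To compare these movements, it can help to separate the change in percentage points from the relative change. The sketch below uses only the figures reported above and treats them as opaque shares, since what the percentages measure is not stated here. The names `shares` and `change` are introduced for illustration.

```python
# Figures copied from the results above: keyword -> (before %, after %)
shares = {
    "AI": (15.23, 18.65),
    "Google": (9.35, 10.24),
    "TikTok": (4.5, 6.45),
    "SVB": (4.7, 3.24),
}


def change(before, after):
    """Return (percentage-point change, relative change in %)."""
    points = round(after - before, 2)
    relative = round((after - before) / before * 100, 1)
    return points, relative


if __name__ == "__main__":
    for keyword, (before, after) in shares.items():
        points, relative = change(before, after)
        print(f"{keyword}: {points:+.2f} points, {relative:+.1f}% relative")
```

Read this way, a keyword with a small starting share can show a large relative change despite a modest change in percentage points.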