This project delves into leveraging CDC for PostgreSQL databases using Debezium, orchestrating data processing through NiFi, and subsequently storing the processed data in MinIO, a high-performance S3-compatible storage solution.
For this project, we used Debezium to capture real-time data from a PostgreSQL database that mimicked the basic needs of e-commerce marketing. Debezium automatically wrote changes to Kafka topics, enabling data integration while maintaining data accuracy and performance. MinIO stored processed data as an S3-compatible object storage solution. This architecture makes data accessible in real-time for analysis and stores advanced storage and analysis capabilities by adding tools like Iceberg or Delta Lake. This architecture enabled systems to respond quickly and take advantage of downloading data.
- Docker-Docker Compose
- Debezium
- Kafka
- KafkaUI
- NiFi(1.27)
- MINIO
git clone https://github.com/burakugurr/cdc-postgresql_to_minio.git
cd cdc-postgresql_to_minio/Docker
docker compose up
- Download CDC_latest.cml file.
- Import template from nifi ui.