Build real-time view in Data Lake

This repository contains Flink jobs that process data with the help of Apache Paimon.

Pre-requisites:

MySQL server 8.x
Ammonite
Apache Flink cluster 1.17 or higher
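
As a quick sanity check before running anything, the sketch below reports which command-line tools are on the PATH. It assumes the MySQL client is installed as mysql, Ammonite as amm, and the Flink launcher as flink; these names are assumptions and may differ in your setup. It does not verify the MySQL server or Flink versions.

```shell
#!/bin/sh
# Report whether a command-line tool is available on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1"
  fi
}

# Assumed tool names (not taken from this repository); adjust as needed.
for tool in mysql amm flink; do
  check_tool "$tool"
done
```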

Ingestion jobs

Create a MySQL database and apply the mysql.sql DDL script to it.
Create the Flink SQL jobs using the code in flink-sql/ingestion.sql.
Run the Flink CDC job using Ammonite: amm runCdcAction.sc
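
The three steps above can be sketched as a shell script. This is only an illustration under stated assumptions: the database name (sales), the MySQL user (root), and the FLINK_HOME location are hypothetical, and it assumes the SQL statements are submitted through the Flink SQL client rather than another interface. By default the script only prints the commands; set DRY_RUN=0 to execute them.

```shell
#!/bin/sh
# Dry-run sketch of the ingestion steps: prints commands unless DRY_RUN=0.
# Hypothetical defaults (not taken from this repository):
DRY_RUN="${DRY_RUN:-1}"
DB_NAME="${DB_NAME:-sales}"
FLINK_HOME="${FLINK_HOME:-/opt/flink}"

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# 1. Create the database and apply the DDL script (-p prompts for a password).
run mysql -u root -p -e "CREATE DATABASE IF NOT EXISTS $DB_NAME"
run mysql -u root -p "$DB_NAME" -e "source mysql.sql"

# 2. Submit the ingestion statements through the Flink SQL client.
run "$FLINK_HOME/bin/sql-client.sh" -f flink-sql/ingestion.sql

# 3. Start the CDC job with Ammonite.
run amm runCdcAction.sc
```

Keeping the script in dry-run mode first makes it easy to review the exact commands before they touch the database or the Flink cluster.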

Aggregation jobs

Create a Flink SQL job to aggregate the data using flink-sql/aggregation.sql.

Visualize data view

Run the Paimon table stream reader with Ammonite to continuously print the current data to the console: amm readStream.sc

Sync via Merge action

To sync the country_sales table with customers, run the Paimon Merge action via Ammonite: amm mergeDiff.sc