GitHub - FatihArslan-cmd/Kafka-Spark-Cassandra-Expense-Tracker: A real-time data pipeline using Kafka, Spark, and Cassandra for processing and storing credit card expenses. Includes a Spring Boot application for retrieving personnel data from MySQL, storing images in S3, and displaying employee details with expense reports on a web interface.

Kafka-Spark-Cassandra-Expense-Tracker SpringBoot Web App

A project that integrates Spring Boot, PostgreSQL, and AWS S3 Kafka Spark Cassandra to manage employee data

📘 About The Project

Key Features:

🗄️ PostgreSQL Database Integration: Employee and department data are stored in PostgreSQL, with data imported from CSV files for easy initialization.
🖼️ AWS S3 Image Storage: Employee images are stored in AWS S3 for secure and scalable image storage.
📋 Web Interface: Displays employee details (name, manager name, salary, commission, department) with a JOIN operation, allowing for easy management and viewing.
Data comes constantly from cassandra

GIFS

🚀 Getting Started

To get a local copy up and running, follow these steps.

📋 Prerequisites

Ensure you have the following software installed:

Java 17+
Maven
apache-cassandra-3.11.10
kafka_2.12-3.9.0
spark-2.4.5-bin-hadoop2.7
AWS CLI (for AWS S3 integration)
PostgreSQL

⚙️ Installation

Clone the repository:

git clone https://github.com/FatihArslan-cmd/Kafka-Spark-Cassandra-Expense-Tracker.git

Navigate to the project directory:
```
cd demo
```
Install dependencies:
```
mvn clean install
```
Run the project:
```
mvn spring-boot:run
```

Set up AWS S3 Bucket:

Create an S3 bucket and upload sample images from this link images.
Configure your AWS credentials using aws configure.

Set up PostgreSQL:

Import employee and department data from the provided CSV files into PostgreSQL from data.

Set up Kafka Spark Cassandra:

[Follow the link](https://github.com/FatihArslan-cmd/DataGenerator-Kafka-)

🔑 Configuration

Add the following keys to your application.properties file:


aws.accessKeyId=""
aws.secretAccessKey=""
aws.region=""
aws.bucketName=""

spring.cassandra.contact-points=
spring.cassandra.port=
spring.cassandra.keyspace-name=
spring.cassandra.local-datacenter=
spring.cassandra.schema-action=none

spring.datasource.url=""
spring.datasource.username=""
spring.datasource.password=""

🛠️ Usage

Once the project is running:

You gotto start kafka server spark submit cassandra server and data generator [Follow the link](https://github.com/FatihArslan-cmd/DataGenerator-Kafka-)
Open your browser and navigate to http://localhost:8080 to view the employee data

Important

Java 17
Spring Boot
PostgreSQL
AWS SDK for Java (for S3 integration)
Maven (for build management)

Additionally, Apache Kafka, Apache Spark, and Cassandra are configured to run in an environment with Java 8. These components should be executed under a dedicated user profile set up with Java 8 on Ubuntu. Meanwhile, Spring Boot applications, which require Java 17, should be executed under a separate user profile configured with Java 17 to ensure compatibility.

📦 Dependencies

Java 17
Spring Boot
PostgreSQL
AWS SDK for Java (for S3 integration)
Maven (for build management)

🤝 Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

📞 Contact

Fatih Arslan - Software Engineer

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.mvn/wrapper		.mvn/wrapper
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kafka-Spark-Cassandra-Expense-Tracker SpringBoot Web App

📖 Table of Contents

📘 About The Project

GIFS

🚀 Getting Started

📋 Prerequisites

⚙️ Installation

Set up AWS S3 Bucket:

Set up PostgreSQL:

Set up Kafka Spark Cassandra:

🔑 Configuration

🛠️ Usage

Important

📦 Dependencies

🤝 Contributing

📞 Contact

About

Releases

Packages

Languages

License

FatihArslan-cmd/Kafka-Spark-Cassandra-Expense-Tracker

Folders and files

Latest commit

History

Repository files navigation

Kafka-Spark-Cassandra-Expense-Tracker SpringBoot Web App

📖 Table of Contents

📘 About The Project

GIFS

🚀 Getting Started

📋 Prerequisites

⚙️ Installation

Set up AWS S3 Bucket:

Set up PostgreSQL:

Set up Kafka Spark Cassandra:

🔑 Configuration

🛠️ Usage

Important

📦 Dependencies

🤝 Contributing

📞 Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages