Santander Customer Transaction Prediction

This project is about solving a particular problem posted by Kaggle which involves helping Santander Bank in identifying and predicting customers who could make future transactions with the bank. For this, Decision tree classifier and Gradient boosted tree have been implemented and the results are compared to the better-performing algorithm.

Steps to run:

1)Import the project in IntellijIDEA and generate the .jar file. 2)Create and start a cluster in EMR (AWS). 3)Create a bucket in S3. 4)Upload the input files (train.csv and test.csv) and the .jar file in the S3 bucket. 5)Add step in EMR: ->Select the spark application option and specify a name. ->Specify the class name in --class ->Input the path of .jar file, input and output files. 6)Run the step.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
project		project
src/main/scala		src/main/scala
target		target
README.md		README.md
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Santander Customer Transaction Prediction

About

Releases

Packages

Languages

sup1496/Santander-prediction

Folders and files

Latest commit

History

Repository files navigation

Santander Customer Transaction Prediction

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages