Skip to content

Latest commit

 

History

History
13 lines (10 loc) · 482 Bytes

README.md

File metadata and controls

13 lines (10 loc) · 482 Bytes

CSVJoin

Use this project to join data from multiple csv files. Currently in this project we support one to one and one to many join. Along with this you can find how to use kafka producer efficiently with spark. Metdata for whole joining process is defined in #datasource.json.

Commandline arguments:

  1. Boolean value (true to unable kafka otherwise false)
  2. Kafka topic name (If one is true )

Plateforms:

  1. Spark 2.1
  2. Kafka and zookeeper (If Kafka is enable)