Using Spark Dataframes,and various other functions like tokenizer, string indexer, vector assembler, a TF-IDF Matrix was created to differentiate relevant and irrelevant advertisements.
This Repository Contains:
- Dataset Used
- PySpark Code
- Project Report