Spark implementations of data mining algorithms for analyzing very large data sets and solving real-world problems.
HW Topics | Related Algorithms |
---|---|
MapReduce & Spark Operations | MapReduce |
Finding Frequent Itemsets | A-Priori, SON |
Finding Similar Sets & Recommendation Systems | Min-Hash, LSH, Content-based Recommendation System, Item-based & User-based CF |
Detecting Communities in Social Network Graphs | Girvan-Newman |