Spark-Demontration

This repository demonstrates using Spark to explore a large dataset with PySpark and SparkR. The files cover loading the data, exploring it, and clustering the words in Shakespeare's works.
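
As a rough illustration of the loading and exploration steps, the sketch below counts word frequencies in a text file with PySpark. It is not taken from the repository's files: the function names `tokenize` and `top_words`, the app name, and the tokenization rule (lowercase alphabetic runs) are all assumptions made for this example, and it assumes `pyspark` is installed. The clustering step is not shown.

```python
import re


def tokenize(line):
    """Lowercase a line and split it into alphabetic words (hypothetical rule)."""
    return re.findall(r"[a-z]+", line.lower())


def top_words(path, n=10):
    """Return the n most frequent (word, count) pairs in the text file at `path`."""
    # Imported lazily so that tokenize() can be used without Spark installed.
    from pyspark.sql import SparkSession

    # The app name is an arbitrary label chosen for this sketch.
    spark = SparkSession.builder.appName("shakespeare-demo").getOrCreate()
    try:
        lines = spark.sparkContext.textFile(path)  # one RDD element per line
        counts = (
            lines.flatMap(tokenize)                # line -> individual words
            .map(lambda word: (word, 1))           # word -> (word, 1)
            .reduceByKey(lambda a, b: a + b)       # sum the counts per word
        )
        # Sort by descending count and keep the first n pairs.
        return counts.takeOrdered(n, key=lambda pair: -pair[1])
    finally:
        spark.stop()
```

Calling `top_words` with the path to a plain-text copy of the corpus would return the most frequent words, which is a common first look at a text dataset before clustering.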