-
Notifications
You must be signed in to change notification settings - Fork 0
Py DS_Engineer Lab Report #06
Amy Lin edited this page Jul 20, 2017
·
21 revisions
A small dataset ( 23 people ) with their names, heights and weights is used in this case. For siplicity on clustering a fiarly small dataset, one iteration of K-mean Clustering was simutated throughout the process into 4 Clusters. The labeling will be assigned back to the data so each person will know what size of the T-shirt they're having! And for the company, they'll be able to determine the quantity and size range based on customers' weights and heights.
For social data, a graph formed by distances of points will be induced.The Spectral Clustering will then look at eigenvectors of the Laplacian of the graph to attempt to find a good (low dimensional) embedding of the graph into Euclidean space.
This technique is to find a transfornation of the graph to present manifold thathe the data is assumed to land on.
* Intuitive Parameters : Clustering number must be specifyour or hopefully find a 'suitabele' one through a range of parameters.