Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Getting never more than 3 clusters, unless manual initial points were given. #4

Open
jonathanslenders opened this issue Dec 29, 2011 · 1 comment

Comments

@jonathanslenders
Copy link

Somehow, we get never more than 3 clusters. Except for a few points, all of them fall into a single cluster.

We are using postgis, using the following query:

SELECT id, kmeans(ARRAY[ ST_X(point), ST_Y(point)], 40) OVER () AS k, id, FROM our_table ORDER BY k DESC;

Is there any way to improve the initial choosen points?

@umitanuki
Copy link
Owner

Sorry for the delay. Picking up initial points automatically is a hard problem, but I thought I did something reasonable... Could you provide reproducible dataset? I think I could add more various methods for the initial points like kmeans++ if there are needs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants