Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ideal Cluster Number #104

Open
meaksu opened this issue Feb 21, 2023 · 1 comment
Open

Ideal Cluster Number #104

meaksu opened this issue Feb 21, 2023 · 1 comment

Comments

@meaksu
Copy link

meaksu commented Feb 21, 2023

Thanks again for developing this package

I was wondering if there are any more quantitative ways to determine the ideal number of clusters other than the qPlot elbow graph. I noticed in some cases picking the clear elbow at q = 6, for example, will result in clusters where cluster 6 is only a few scattered spots, indicating that one less cluster would be more natural. Could it be theoretically possible for me to use the raw NLL values from the qPlot and compute something like a Gap Statistic or other metric?

@edward130603
Copy link
Owner

Sorry I wasn't able to respond sooner. Selecting # of clusters is in general a pretty hard problem in clustering. I think the gap statistic could work in theory but may be computationally too intensive.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants