Subscribe to Vincent Granville's Weekly Digest:
Hi, I would like to know if there is merit in using Discriminant Analysis as a means to validate clusters formed from a previous clustering solution? if I feed the same clustering variables into the discrim and assign the cluster segments as my class variable (DV) in discrim, and if after running the discrim, I get a good hit ratio, can I conclude that the clusters formed are significant and good enough to be carried forward for further analysis? Is this an accepted norm in the industry when dealing with segmentation projects?

Tags: Clusters, Discriminant

Views: 145

Reply to This

Replies to This Discussion

Discriminanat analysis is the best method to validate clusters. A good hit ratio implies good clustering. Also check if the F-values for all the variables are significant, else remove the insignificant variables, and rerun the procedure.
thanks for this. can you tell me a little of the theory behind this? as to what makes dcm the best method to validate clusters?
i think logistic regression would be better ffor the same purpose. requires lesser assumptions, is more robust...
Use logistic regression instead, since the coefficients are easier to interpret; especially if you will be presenting the results to senior management.

-Ralph Winters
As I understand it, If the same variables are used in a Discriminant Function analysis to test a cluster solution then all the F ratios and significance tests are invalid since clustering (especially K-means or Ward) is optimizing essentially same criterion that DFA is based on. Thus, no independence between original solution and it's test.
So, the only significance tests are appropriate are those using variables NOT used in the original cluster analysis. Thus, DFA does not give an appropriate significance test of the quality of a cluster analysis solution. One would need a totally independent DFA that uses variables not used in the initial cluster solution.

This may be too rigid, and I would be glad to be corrected.

RSS

Follow us

© 2013   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC

Badges  |  Report an Issue  |  Terms of Service