Subscribe to Vincent Granville's Weekly Digest:
I have a set of roughly 20-30 nominal/categorical variables and would like to determine if a relationship exists between them and a continuous-scale variable?

Besides doing an ANOVA to test for differences in the levels of the variables, what are some other techniques you have found useful in doing this?

Thanks for any input

Views: 177

Reply to This

Replies to This Discussion

Anova makes the assumption that each level of a categorical variable has a normal distribution for the response variable with equal variances.   Why notoriously robust, I would look at the data on a trellis-style graph before doing an ANOVA.  The y-axis for each subplot will the response variable.  Since it is a trellis the scale will be the same for each nominal variable.      The x-axis for each subplot will be each level of the nominal variable.   Nearly all popular statistics and business intelligence software provide for these graphs now.

 

This graph will have several uses.   1)  You will see which levels of the categories are used.   2)  You will see the distribution within each nominal variable.  3) You can look across the nominal variables.   Then you can think about what type of modeling approaches to use. 

 

Georgette Asherman

Direct Effects, LLC

201 673-4301

RSS

Follow us

© 2013   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC

Badges  |  Report an Issue  |  Terms of Service