Data Intelligence, Business Analytics
Tags:
Permalink Reply by Georgette Asherman on September 19, 2011 at 1:33pm Anova makes the assumption that each level of a categorical variable has a normal distribution for the response variable with equal variances. Why notoriously robust, I would look at the data on a trellis-style graph before doing an ANOVA. The y-axis for each subplot will the response variable. Since it is a trellis the scale will be the same for each nominal variable. The x-axis for each subplot will be each level of the nominal variable. Nearly all popular statistics and business intelligence software provide for these graphs now.
This graph will have several uses. 1) You will see which levels of the categories are used. 2) You will see the distribution within each nominal variable. 3) You can look across the nominal variables. Then you can think about what type of modeling approaches to use.
Georgette Asherman
Direct Effects, LLC
201 673-4301
© 2013 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC