In SAS enterprise Miner we typically use Decision Trees or Filter nodes to do variable selection and missing values imputation before we pass them on to Neural Nets or Regressions...Not sure how they do it statistically though. Would love to know.
Aside from a statistical theory perspective, this comment applies well from an applications perspective in which the resulting regression model found from a variable selection method is difficult to interpret within the context of the problem. However, I imagine that other methods like CHAID, taking charge where usual regressions fail, would be an option as it may be more powerful in prediction and easier in interpretation.
Sean:
I am not sure about other methods. From my research, I saw not even a hint of it!
It's just another thing to put on the 204 old list of OLS' cracks.
Bruce