Data Intelligence, Business Analytics
Tags: R, forests, partitioning, random, trees
Permalink Reply by Kyriacos Chrysostomou on July 15, 2010 at 10:37am
Permalink Reply by DataMiner on July 27, 2010 at 6:21am There is a great book called "Elements of Statistical Learning" by Hastie, Tibshirani, and Friedman which describes in detail classification and regression trees and many other data mining methods. You can buy the book or download it in pdf. The link is:
http://www-stat.stanford.edu/~tibs/ElemStatLearn/
Lester Wollman
Permalink Reply by Gene Leynes on September 7, 2011 at 12:14pm
Permalink Reply by Jozo Kovac on September 7, 2011 at 2:14pm Isn't it funny that CART is one of the best algorithms 27 years after it was introduced? :-)
Logistic regression has similar story - very old and very useful for today's problems.
If you want to improve accuracy learn more about ensembles - boosting, bagging, adaboost, random forests. But don't expect miracles, they all improve CART's performance by relatively small margin. And all got some pros&cons.
http://www.salford-systems.com/doc/newhybridmethods.pdf - here's nice presentation about CART, only 13 years old, but really worth reading.
© 2013 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC