Data Intelligence, Business Analytics
Tags:
R has packages for creating decision trees. The most notable one is rpart. For a variety of add-on packages that R can use for data mining, including decision trees, look at the Machine Learning task views page.
http://cran.r-project.org/web/views/MachineLearning.html
Permalink Reply by Richard on May 4, 2011 at 10:49am Check C4.5.
C4.5 is a software extension of the basic ID3 algorithm designed by Quinlan to address the following issues not dealt with by ID3:
It is installed for use on Grendel (grendel.icd.uregina.ca), but it may be set up on a local machine as follows:
Click on the links below for examples of C4.5 usage:
Source: http://www2.cs.uregina.ca/~dbd/cs831/notes/ml/dtrees/c4.5/tutorial....
Permalink Reply by Abhinav Jain on May 5, 2011 at 10:26am You can also use WEKA. It is open source data mining software with wide variety of machine learning algorithms for data mining tasks. It's easily installable and GUI based easy to use.
Google it or here is the link:
http://www.cs.waikato.ac.nz/ml/weka/
I'm using WEKA for my data mining course work, so let me know in case you need any assistance.
Permalink Reply by Yashu Gupta on May 5, 2011 at 10:34pm thanks Abhinav ....
I will take a look and get back to you in case of any clarifications..
Permalink Reply by Yashu Gupta on May 5, 2011 at 11:11pm Hi Abhinav,
I am able to successfully download and install WEKA.
Now, I have CSV file having variables and there corresponding data. I want to build a decision tree in which in which one variable will be performance variable and other will be independent variables.
Could you please guide me steps in creating this kind of decision tree in WEKA....?
Permalink Reply by sana on May 5, 2011 at 11:36am
Permalink Reply by Yashu Gupta on May 10, 2011 at 6:17am Hi Abhinav,
I am able to convert the CSV into ARFF file.
Could you please guide me the steps that i need to follow to create decision tree. My file has 4 variables in which 3 are categorical and one is numeric (dependent variable).
I just want to split the 3 categorical variables based on this fourth numeric dependent variable
Please advice. Thanks in advance.
Regards,
Yashu
Permalink Reply by Bhupendrasinh Thakre on July 27, 2012 at 12:10pm Hi Yashu,
Actually you dont have to convert csv file at all.
When selecting the file to load just select "csv" and program will pop only csv files.
So you have 3 independent var and 1 dependent which is numeric. Regression Tree might help.
Best,
Bhupendrasinh Thakre
© 2013 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC