Data Intelligence, Business Analytics
This is a good book but lack details. Here is a question.
In Chapter 10 for identifying high risk customers, the author selects a sample of customers and brings in 300+ variables from the credit bureau. After applying logistic regression, 62 variables are retained. Then the author doesn't say how to apply this model to the whole customer base. Should the company buys 62 variables for all customers from the credit bureau in order to use the model to produce risk scores for them.
Tags:
Permalink Reply by Ravi Sangal on May 22, 2012 at 10:11am You do not have to buy any additional database. If the logistic regression model comes out to be reliable and valid, with the help of the coefficients, an equation can be formulated which gives P(Y). By default probability more than or equal to .5 would mean category 1, else, category 0. Hope it helps....Ravi Sangal
Permalink Reply by Ralph Winters on May 25, 2012 at 4:00pm Jason. The author is implying that it would be too costly and computationally inefficient to purchase and run a logistic regression on the whole population in order to determine the significant variables. So she chose a smaller sample. Once these variables are identified, they can be purchased for the entire population and then scored.
=Ralph Winters
Permalink Reply by Jason Monte on May 26, 2012 at 7:35pm Can anyone tell me how normally a credit bureau sell their data? Do they charges by # of records, by # of variables, or both?
Permalink Reply by Joe Palaia on June 24, 2012 at 7:05am Jason-
Typically by # of Records. Sometimes it can be custom pricing based on the project.
© 2013 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC