Subscribe to Vincent Granville's Weekly Digest:
RockyRambo
  • Male
  • Delhi
  • India
Share Twitter

RockyRambo's Friends

  • r chaitanyapradeep
  • Amitesh Kumar
  • kishan Verma
  • Suzy Tonini
  • dabsy
  • K.Kalyanaraman
  • Sandeep Raut
  • Jozo Kovac
  • Arun
  • Amy
  • Ralph Winters
  • Vincent Granville

RockyRambo's Groups

RockyRambo's Discussions

Weight statement and oversampling or undersampling
2 Replies

I need an understanding of the usage of Weight statement. Background - I had to build a logistic regression response model on rare event data with event rate as low as 0.008%. I increased the event…Continue

Started this discussion. Last reply by RockyRambo Oct 25, 2012.

Variable Clustering
4 Replies

I have a question that if we are using proc varclus to eliminate redundancy in the IV's, how do we go about selecting the cluster representatives? I know the lower the (1-R^2) ratio, the better is a…Continue

Started this discussion. Last reply by RockyRambo Oct 24, 2012.

WOE v/s using continuous variables as such
2 Replies

What are the advantages/disadvantages of using WOE approach viz-a-viz. using continuous variables in their original form..For e.g. Making bins of Age and then using its WOE's in the model v/s using…Continue

Started this discussion. Last reply by RockyRambo Oct 24, 2012.

 

RockyRambo's Page

Latest Activity

r chaitanyapradeep left a comment for RockyRambo
"thanks for your reply....if you get it plz share.....do you have cody's innovative sas technique book ?"
Mar 27
RockyRambo replied to RockyRambo's discussion Weight statement and oversampling or undersampling
"Thanks for the reply Ralph.. PFA two documents both of which contain statistics from models built on hypothetically created datasets. It is to be noticed that when weights are used HL test fails (high chi sq and low probability), however,…"
Oct 25, 2012
Ralph Winters replied to RockyRambo's discussion Weight statement and oversampling or undersampling
"I think your problem is that you've already oversampled your data and you want to compensate by using the weight statement.  I would use the original data and then used the weight statement to perform the…"
Oct 25, 2012
RockyRambo replied to RockyRambo's discussion WOE v/s using continuous variables as such
"Thanks Sandeep, Sorry to have missed your reply..Yeah, you're right on your inputs..I'll look deep into the best transformation through WOE in detail.. :)"
Oct 24, 2012
RockyRambo posted a discussion

Weight statement and oversampling or undersampling

I need an understanding of the usage of Weight statement. Background - I had to build a logistic regression response model on rare event data with event rate as low as 0.008%. I increased the event rate to 3% by creating two separate datasets through oversampling (increasing the number of events) and undersampling(decreasing the number of non-events). It is believed that the model equation obtained after such sampling has a change only in the intercept term, however, the coefficients remain the…See More
Oct 24, 2012
RockyRambo replied to RockyRambo's discussion Variable Clustering
"Thanks Ralph and Edmund..In fact, I used 100 such clusters and then looked at each variable in each cluster starting from the one having lowest (1-R^2) and left those variables which were 'redundant'.."
Oct 24, 2012
Edmund Freeman replied to RockyRambo's discussion Variable Clustering
"I would go with the business sense here. One of the things I like about proc varclus is that it takes a really hard problem for humans -- picking out some variables from hundreds -- into a bunch of very reasonable variables -- picking one or two…"
Oct 24, 2012
Ralph Winters replied to RockyRambo's discussion Variable Clustering
"Varun, I haven't used varclus for a while, but I would say that you could swap one variable for another if it made better business sense. The 1-R^2 ratio is only a guide. Also, look at the relationship between the 2 candidate variables.…"
Oct 20, 2012
RockyRambo replied to RockyRambo's discussion Variable Clustering
".Or else, we should go by selecting the top 5 , top 10 variables per cluster and then look at other statistics later on?"
Oct 11, 2012
RockyRambo posted a discussion

Variable Clustering

I have a question that if we are using proc varclus to eliminate redundancy in the IV's, how do we go about selecting the cluster representatives? I know the lower the (1-R^2) ratio, the better is a variable as a representative, however, if we use other factors such as business sense or univariate chi square of a variable along with (1-R^2) ratio then should we select cluster representatives that have a higher univariate chi square or making more 'business sense' even if they are having a…See More
Oct 11, 2012
Sandeep Sunkara replied to RockyRambo's discussion WOE v/s using continuous variables as such
"I assume you use logistic regression post applying WOE transformation(?)! X: Independent Variable Y: Binary Dependent Variable T(.): Transformation function  The fundamental assumption in logistic regression is 'X' and…"
Mar 24, 2012
RockyRambo posted a discussion

WOE v/s using continuous variables as such

What are the advantages/disadvantages of using WOE approach viz-a-viz. using continuous variables in their original form..For e.g. Making bins of Age and then using its WOE's in the model v/s using the values of the variable Age as such in the model..Countering the arguments in favor of WOE :-1. WOE gives us the 'riskiness' measure of an attribute - BUT, so does age as the values of age will be 'corresponding' to the dependent variable only, as in, using values of age as given in the data can…See More
Mar 14, 2012
RockyRambo commented on Minethedata's blog post Logistic regression.
"Variable transformations are usually applied if the relationship between the independent and dependent variables is not linear.."
Mar 8, 2012
RockyRambo added a discussion to the group SAS Network
Thumbnail

SAS Books

Hi,I am looking for two books in SAS - "An Array of Challenges : Test your SAS Skills" by Robert Virgile...& "SAS Workbook & Solutions by Ron Cody"..Please let me know if someone can share them.Thanks a lot in advance,VarunSee More
Mar 7, 2012
RockyRambo replied to Sharath Dandamudi's discussion Data points for sensitivity and 1-specificity in ROC
"Hi Sharath, Since sensitivity and (1- specificity) are determined using different cut off points from the confusion matrix, you can get both of them by varying the cutoff points..Let's say I have scores of 1 million observations in my model. If…"
Mar 7, 2012
RockyRambo added a discussion to the group SAS Network
Thumbnail

SAS Books

Hi,I am looking for two books in SAS - "An Array of Challenges : Test your SAS Skills" by Robert Virgile...& "SAS Workbook & Solutions by Ron Cody"..If any one here has the online pdf's then please send them to me at my email id - varunnakra1@gmail.com..Thanks a lot in advance,VarunSee More
Mar 1, 2012

Profile Information

Short Bio:
Designed analytics solutions across multiple domains (Transportation, Health Care, Marketing & Insurance) in US markets; Managed Projects & Client Relationship (independently & in teams); Led, trained & mentored new hires..
My Website or LinkedIn Profile (URL):
http://www.linkedin.com/profile/view?id=21442910&trk=tab_pro
Field of Expertise:
Business Analytics, Predictive Modeling, Data Mining, Operations Research, Quant, Econometrics, Web Analytics
Years of Experience in Analytical Role:
4
Professional Status:
Manager
Interests:
Finding a New Position, Networking, Recruiting
Your Company:
Genpact, EXL Decision Analytics (erst while Inductis)
Industry:
Analytics
How did you find out about AnalyticBridge?
Random sampling :-)

Comment Wall (1 comment)

You need to be a member of AnalyticBridge to add comments!

Join AnalyticBridge

At 10:54am on March 27, 2013, r chaitanyapradeep said…

thanks for your reply....if you get it plz share.....do you have cody's innovative sas technique book ?

 
 
 

Follow us

© 2013   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC

Badges  |  Report an Issue  |  Terms of Service