# AnalyticBridge

Subscribe to Vincent Granville's Weekly Digest:
Jason Monte
• Oakland, CA
• United States

## Jason Monte's Discussions

Suppose we want to look at which airlines have similar pricing strategies. The data set looks like this: Variables: Flight Origination, Flight Destination, Airline1 Price, Airline2 Price,…Continue

Started this discussion. Last reply by Jason Monte Aug 29, 2012.

### Hadoop and Data Mining4 Replies

Hadoop and Big Data are buzzwords these days. How does it affect data mining workers? Should it be completely transparent for people only using analytical tools such as R, SPSS, SAS etc. in their…Continue

Started this discussion. Last reply by Ingo Mierswa Aug 15, 2012.

### Independent variables need to be normally distributed in multiple regression?5 Replies

Below is a quote regarding logistic regression. It seems it is saying OLS regression requires independent variables to be normally distributed. Based on my past experience, most independent…Continue

Started this discussion. Last reply by Sean Flanigan Jul 20, 2012.

### How To Determine If A Sample Is Representative5 Replies

If the sample is obtained through simple random sampling, would it be automatically representative of the population? If not, what is the way to determine if it is representative.Continue

Started this discussion. Last reply by Lynne Mysliwiec Jul 24, 2012.

# Jason Monte's Page

## Latest Activity

"More info: What I mean by "price similarly" is when Airline1 prices high on a origination and destination pair, Airline3 and Airline8 also prices high."
Aug 29, 2012
Jason Monte's discussion was featured

### Grouping Similar Competitors

Suppose we want to look at which airlines have similar pricing strategies. The data set looks like this: Variables: Flight Origination, Flight Destination, Airline1 Price, Airline2 Price, ....Airline10 Price. Data:Origination: A, Destination: B, Airline1 Price=100, Airline2 = 120, ...., Airline10=95Origination: A, Destination: C, Airline1 Price=500, Airline2 = 450, ...., Airline10=505...... The expected outcome is like:Airline1, Airline3, Airline8 price similar.Airline2, Airline4 price…See More
Aug 28, 2012
Jason Monte posted a discussion

### Grouping Similar Competitors

Suppose we want to look at which airlines have similar pricing strategies. The data set looks like this: Variables: Flight Origination, Flight Destination, Airline1 Price, Airline2 Price, ....Airline10 Price. Data:Origination: A, Destination: B, Airline1 Price=100, Airline2 = 120, ...., Airline10=95Origination: A, Destination: C, Airline1 Price=500, Airline2 = 450, ...., Airline10=505...... The expected outcome is like:Airline1, Airline3, Airline8 price similar.Airline2, Airline4 price…See More
Aug 28, 2012
"Hi Jason, Hadoop and Big Data on itself does not really help anyone, especially not if it used on a data-management level only. So we could now store even larger data sets and we are able to retrieve them faster than before. Nice, but in principle…"
Aug 15, 2012
"Jason, You are completely right in your statement about Hadoop that it is makes data retrieval fater. But it does more than that actually. It has power of distributed computing where you have large number of CPU power to run your…"
Aug 13, 2012
"Yes - there is a great deal of demand for data mining/predictive modeling people. Not only that, but there's competition between employers for the best talent & salaries are much better for heavy-duty quant people than they are for entry…"
Jul 24, 2012
"The answer is: No. No sample is guaranteed to be representative of the entire population, although the risk of non-representative samples is reduced as sample size / total N gets larger.  The larger the % of the total population, the lower the…"
Jul 24, 2012
"I think it will foster different ways of operating on the data, to perform equivalent results.  For example if you are used to doing a regression on a large sample base, you may be forced to perform separate analyses on the various subsets of…"
Jul 22, 2012
"Although I don't have any great experience in the big data area, it looks like an exciting time to me. There are few current solutions which allow data scientists to effectively leverage big data without extensive understanding of the…"
Jul 22, 2012
Jason Monte posted a discussion

Hadoop and Big Data are buzzwords these days. How does it affect data mining workers? Should it be completely transparent for people only using analytical tools such as R, SPSS, SAS etc. in their life? I guess Hadoop and Big Data is more at the data-management level. It just makes data retrieval faster and has nothing to do with analytics.See More
Jul 21, 2012
"Sorry, I mean't the within category means of the DV, not the IV. "
Jul 20, 2012
"The fractional factorial is a higher end breed, more of a GLM traditionally , but there are choice modeling approaches as well (http://www.nobelprize.org/nobel_prizes/economics/laureates/2000/press.html) for these designs. So, the research design…"
Jul 20, 2012
"Social Research, where measures are not massive in certain studies, such as jury bias etc. Pharmaceutical research, where cost of data collection can be astronomical. "
Jul 19, 2012
"Given dummy codes are not normal, would this generalize to impact the business presentation of "on average a unit increase in x produces an increase in y" if both are not normally distributed, or at least based on some fundamental…"
Jul 19, 2012
"You are correct in saying most independent variables are not normally distributed in that (in classic statistics)  predictors are from designed experiments and are not random in that sense. But more importantly, the usual assumption of…"
Jul 19, 2012
"All you have to do is transform the shape of the distribution. There can be hidden distributions within ranges of the IV. So there are techniques to transform the whole distribution, or restrict the range of the IV and transform those. All the rules…"
Jul 19, 2012

## Profile Information

Short Bio:
A SAS Programmer
Field of Expertise:
Other
Years of Experience in Analytical Role:
10
Professional Status:
Technical
Interests:
Networking

## Comment Wall

Join AnalyticBridge

1

2

3

4

5

6

7

8

9

10