i am carrying out logistic regression over 1 set of data. for the test data should i use the model which is developed or use the test data also to build the model.
Added by Minethedata on November 28, 2010 at 12:09am —
Several job ads recently posted on DataShaping / AnalyticBridge, across United Sates and in Europe. Use the DataShaping search box to find more opportunities.… Continue
Added by Vincent Granville on November 27, 2010 at 10:14pm —
Added by Vincent Granville on November 25, 2010 at 8:51am —
Added by Vincent Granville on November 25, 2010 at 8:44am —
Added by Vincent Granville on November 25, 2010 at 8:30am —
Suppose while modeling the training data we use clustering initially to group together objects and then apply decision trees for the data which belongs to cluster 1. Then for the test data how do we conclue that this data belongs to which cluster or do we have to carry out clustering again along with the training data and how do we apply the decision trees which we used for cluster 1?
Added by Minethedata on November 25, 2010 at 5:21am —
a common problem in many organizations, but a solvable one:-)
A favorite cartoon for those working with data and analytics within an… Continue
Added by Jaime Fitzgerald on November 24, 2010 at 7:00pm —
There is a great opportunity right now for analytic solutions providers to help TSA in their mission to make airport policies and security check significantly more efficient. It is about
- Minimizing X-ray usage due to radiation exposure
- Automatically identifying frequent fliers who are not…
Added by Titus on November 24, 2010 at 4:22pm —
Plans express our expectations about the future. Companies operating on a calendar basis will be in the final stages of the annual operating and capital planning process for next year. Business intelligence teams contribute to the quantitative models underlying these plans.
At a minimum, BI supports trend analysis showing prior periods of activity of various types. More commonly, BI provides the relationships between operating measures, such as customer visits, and financial… Continue
Added by David Rader on November 19, 2010 at 2:06pm —
It is about integrating external data sources in your data warehouse, and leveraging this data to answer questions such as "why are we losing so many users last month" or "why do we have so few new users recently", or "what new product / feature should I produce". The answer (and the cure) might not come from within your internal data, but from the outside:
- what are my competitors up to?
- what do my clients / employees write on Facebook, Twitter or elsewhere?
Added by Vincent Granville on November 18, 2010 at 10:44pm —
Businesses should not deploy open source software for data mining just because it is generally cheaper, an open source consultant has advised.
"Don't focus solely on cost savings," said Jos van Dongen, an associate and principal at business intelligence (BI) consultancy DeltIQ Group at the Predictive Analytics World conference in London yesterday.
"It [open source] could turn out more expensive because it could require specialised people and more work. Other benefits could… Continue
Added by Titus on November 18, 2010 at 9:28pm —
Most SEM / SEO experts consider a drop in new members or page-views as bad news. Partly because it means decreased advertising revenue.
The reality could be different:
- If web site navigation is optimized, users will have a better experience. The drop in page-views is actually good news, as users get to the right page much faster.…
Added by Vincent Granville on November 17, 2010 at 9:00pm —
Added by Amy on November 17, 2010 at 11:33am —
Well, I have ventured into the land of Technorati and set up a blog for myself. In case you did not know, Technorati is a leading blog search engine directory.
Although I have had some technical issues, I believe that I am on the home stretch with setting up an account so that I can begin to post. The Technorati technical people have been great in helping me to arrange for a feed from my AnalyticBridge site to my Technorati one. I had to click a button called 'claim a blog'… Continue
Added by Tom Wolfer on November 17, 2010 at 10:30am —
Some of them require very serious skills to succeed, or being well known, and income is volatile. Feel free to add other jobs to my list.
- Independent statistical consultant (some travel involved)
- Statistical programmer…
Added by Titus on November 16, 2010 at 10:00pm —
Source: Google Keyword Tool
|Scale is based on the average worldwide traffic of analytics in all…
Added by Vincent Granville on November 16, 2010 at 2:17pm —
Targeted, high quality news are now aggregated and produced semi-automatically
. Can someone shed some light on how such a decision could make sense, from a business viewpoint? At the end of the day, advertisers pay for the content, will they be happy with this Yahoo solution?
The article below was published on… Continue
Added by Vincent Granville on November 15, 2010 at 2:00pm —
Trying to compare 2 methods for churn prediction . 1st method is based on churn attrition calculation by extrapolating churn attrition rate and 2nd method is based on churn prediction by logistic regression. Obvious difference is that the fiurst method gives the headcount but the second method can give individual prediction. Another is that 2nd method is more realistic as it is based on number of realistic variables. Interested to know some more comparisons.
Added by Minethedata on November 13, 2010 at 1:20am —
For all those who participated or followed the IEEE ICDM Contest: TomTom Traffic Prediction for Intelligent GPS Navigation and are interested in how the top solutions were reached -… Continue
Added by magdalena pancewicz on November 10, 2010 at 6:55am —
Dr. Ingo Mierswa is giving a talk together with Dr. Dimitre Leonidov of Ingres about new challenges related to the analysis of extremely large data sets and unstructured data sets. They will show the current state of the RapidMiner / RapidAnalytics integration with Ingres VectorWise. I am sure that many of you already know VectorWise - at least the visitors of the RCOMM 2010 got already a first impresssion about how fast and scalable data analysis becomes with this… Continue
Added by Vincent Granville on November 9, 2010 at 1:42pm —