Subscribe to DSC Newsletter

Featured Blog Posts (1,390)

Extract meta concepts through co-occurrences analysis and graph theory

So what I did is the following (be aware that is not the formal implementation of LSA!):
  1. Filter and take the base form of the words as usual.
  2. Build the multidimensional sparse matrix of the co-occurrences;
  3. I calculated for each instance the frequency to find it in the corpus;
  4. I calculated for each instance the frequency to find it in the doc;
  5. I weighted such TF-IDF considering also the distance among the…

Added by Cristian Mesiano on January 13, 2012 at 9:04am — No Comments

Using Analytics to Connect Talent Characteristics with Enterprise Performance

Economic pressure drives businesses to find every advantage for increased business performance. While enterprises have carefully chosen key performance indicators, they are housed separately from key performance indicators of the people who do the work, missing an opportunity to quantitatively link a company's talent characteristics to business operations and outcomes.

IIA's January Office Hours call will discuss the connections between innate talent characteristics and enterprise…


Added by Mike Kennedy on January 10, 2012 at 1:30pm — No Comments

Latest Article: Data Mining Lessons from the Movie "Moneyball"

My company (Discovery Corps, Inc.)  president Tim Graettinger has released a new article I think you'll like:

"If you’re a data miner, there’s a good chance you saw the movie Moneyball.  For me, the movie was entertaining, but the book was really enlightening.  I found it to be a source of numerous and familiar lessons about data mining - while couched in the intriguing context of major league baseball.  In this article, I’d like to share some of…


Added by Daniel Graettinger on January 6, 2012 at 1:30pm — No Comments

Business Technology: So, What's Your Algorithm? | Wall Street Journal

We are ruined by our own biases. When making decisions, we see what we want, ignore probabilities, and minimize risks that uproot our hopes.

What's worse, "we are often confident even when we are wrong," writes Daniel Kahneman, in his masterful new book on psychology and economics called "Thinking, Fast and Slow."

An objective…


Added by Vincent Granville on January 4, 2012 at 3:00pm — No Comments

Great apes think analytically | ScienceDaily

Great Apes Make Sophisticated Decisions: Research Suggests That Great Apes Are Capable of Calculating the Odds Before Taking Risks

ScienceDaily (Dec. 29, 2011) — Chimpanzees, orangutans, gorillas and bonobos make more sophisticated decisions than was previously thought. Great apes weigh their chances of success, based on what they know and the likelihood to…


Added by Vincent Granville on January 4, 2012 at 3:00pm — No Comments

Missing Values Imputation with Bayesian Networks

A New Approach to Missing Values Processing with Bayesian Networks

Download the white paper here (39.5 MB).

With the abundance of “big data” in the field of analytics, and all the challenges today’s immense data volume is causing, it may not be particularly fashionable or pressing to discuss missing values. After all, who cares…


Added by Vincent Granville on January 4, 2012 at 2:39pm — No Comments

Using Visual Analysis to Detect Call Center and BPO Fraud

Outsourced customer service and Back Office Processing (BPO) organizations process an ever increasing number of transactions and contacts every day. Agents have access to large amounts of sensitive customer information and access to company inventory to perform their operations. Every operation that is granted to outsourced agents allows for potential theft, abuse or breach of sensitive information or goods and services.

Threats within these industries range…


Added by Andrew Marane on December 28, 2011 at 8:30am — No Comments

More about retail analytics

I lived in California a while back, and have experienced a few instances where large companies headquartered e.g. in Minneapolis, were selling products in all their stores nationwide as if everybody in US had the same weather, seasons and body size.

In particular, some stores (maybe Target - I don't remember) that sell garden tables did not have any product available in late February, although late February in much of California is a great time to remodel your garden.



Added by Mirko Krivanek on December 26, 2011 at 9:00pm — No Comments

Get prediction intervals for your forecasts, with this easy to apply statistical model


In the words of Dr. Chris Chatfield: Predictions are often given as point forecasts with no guidance as to their likely accuracy (and perhaps even with an unreasonable high number of significant digits implying spurious accuracy!).


Our paper sets forth a synergy of existing statistical theories to obtain a clear-cut model for calculating forecasts with prediction intervals, named the…


Added by MARTIN VAN WUNNIK on December 26, 2011 at 3:39pm — No Comments

Big Data Analytics

Big data i s the new buzzword within the data warehousing and business analytics community.

According to TDWI r ecent report on BIG data, there are 3 Vs of big data – Volume which is multiple terabytes or over petabytes, Variety which is numbers, audio, video, text, streams , weblogs, Social media etc…

Added by Sandeep Raut on December 26, 2011 at 10:03am — No Comments

How to build simple, accurate, data-driven, model-free confidence intervals

An updated version with source code and detailed explanations can be found here.

If observations from a specific experiment (for instance, scores computed on 10…


Added by Vincent Granville on December 25, 2011 at 8:00pm — 5 Comments

How could Amazon increase sales by redefining relevancy?

By improving its search and relevancy engines, to include item price as a main factor. The type of optimization and ROI boosting described below applies to all digital catalogs. Here we focus on books.…


Added by Vincent Granville on December 25, 2011 at 5:00pm — 1 Comment

Sales forecasts: how to improve accuracy while simplifying models?

The solution is simple: leverage external data, and simplify your predictive model.

Back in 2000 I was working with GE's analytic team to improve sales forecasts for NBC Internet, a web portal owned by NBC. The sales / finance…


Added by Vincent Granville on December 24, 2011 at 9:00pm — 3 Comments

Real Life Example of Text Mining to Detect Fraudulent Buyers

The credit card transaction described here in details is a real example of a fraudulent transaction performed by organized criminals, undetected by all financial institutions involved, and very easy to detect with simple text mining techniques.

It was not caught by any of the financial…


Added by Vincent Granville on December 21, 2011 at 5:00pm — 1 Comment

IBM Tech Growth Predictions: Energy, Biometrics, Mind Reading, Mobile, Junk Mail | ABC News

Who would have guessed, five years ago, that by 2011 we would have mobile phones that would, in a rudimentary way, listen to our questions and give us useful answers? Or ways for doctors to get help to…


Added by Vincent Granville on December 21, 2011 at 1:08pm — No Comments

MINE: Detecting novel associations in large data sets

Tool detects patterns hidden in vast data sets

By Haley Bridger, Broad Communications, December 16th, 2011…


Added by Vincent Granville on December 20, 2011 at 1:12pm — No Comments

Analytic professionals: are we detailed-oriented or do we think "big picture", or both?

Hiring managers always assumed that I was a very detailed-oriented person. It turns out that this is not the case: I'm certainly a very analytic person, yet I always think "big picture", and everybody who knows me well would say that I am everything but detail-oriented.…


Added by Vincent Granville on December 19, 2011 at 4:30pm — 4 Comments

Full employment through inefficiency?


According to the recent landmark McKinsey Big Data study, by 2018 one in two US jobs requiring deep analytical skills will go unfilled, and there will also be an under-served demand for 1.5 million business managers and analysts “who can ask the right questions and consume the results of the analysis of big data effectively.”

McKinsey and others see it…


Added by Tim Negris on December 19, 2011 at 12:00pm — 4 Comments

Apply for the Innovation in Analytics Award


Added by Vincent Granville on December 15, 2011 at 7:29pm — No Comments

Great Greenplum resources for data scientists

Data Scientist Summit 2011 Highlights

New to the world of big data? New to the world of Greenplum? The Data Scientist Community is the platform to share ideas, network with industry peers and…


Added by Vincent Granville on December 15, 2011 at 3:30pm — No Comments

Featured Monthly Archives









© 2015 is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Terms of Service