Subscribe to DSC Newsletter

Featured Blog Posts (1,439)

Has a degree in mathematics become worthless? | IEEE

Interesting article published in IEEE Spectrum. 

The queen of the sciences may someday lose its royal status



Added by Vincent Granville on March 10, 2012 at 3:30pm — 7 Comments

50 unusual ways analytics are used to make our lives better

Many more to come soon. Let's start with 10 for today.
  1. Automated patient diagnostic and customized treatment. Instead of going to the doctor, you fill an online (decision tree type of) questionnaire. At the end of the online medical exam, customized drugs are prescribed, manufactured and delivered to you: for instance, male drug abusers receive a different mix than healthy females, for the same condition. In short,…

Added by Vincent Granville on March 10, 2012 at 3:33pm — No Comments

J.D. Opdyke, Author: A Powerful and Robust Nonparametric Statistic for Joint Mean-Variance Quality Control

For statistical process control, a number of single charts that jointly monitor both process mean and variability recently have been developed. For quality control-related hypothesis testing, however, there has been little analogous development of joint mean-variance tests: only one two-sample statistic that is not computationally intensive has been designed specifically for the one-sided test of Ho: Mean2<=Mean1 and StDev2<=StDev1 vs. Ha: Mean2>Mean1 OR StDev2>StDev1 (see…


Added by J.D. Opdyke on March 9, 2012 at 7:41am — No Comments

27 types of data scientists - where do you fit?

Three metrics can be used to segment the population of data scientists. Each metric has three levels: high, medium, low. Hence the 27 (= 3 * 3 * 3) types of scientists.

Here are the metrics in question:

  1. Soft skills: sales, business…

Added by Vincent Granville on March 8, 2012 at 8:00pm — 6 Comments

From my email inbox: lot's of interesting stuff for Quant professionals

Random Walkers, Informal Quant Meetup in the Crosse Keys Pub

Thursday 8th March

A meetup for quants, risk managers, traders, algos, quant/devs and anyoneelse who (mis)uses maths in finance.This is not a formal affair, we are taking over the balcony area at the back ofthe Crosse Keys pub near Leadenhall…


Added by Vincent Granville on March 8, 2012 at 1:00pm — No Comments

risk transfer, insurance layers

The financial crisis – risk transfer, insurance layers and (no?) reinsurance culture

Michael Fackler  freelance actuary Munich, Germany


The financial crisis of 2007 has triggered various debates, ranging from the stability of the banking system to subtle technical issues regarding the Gaussian and other copulas. All these debates are important, and it might be good to start even a further one: Credit derivatives have much in common with…


Added by John A Morrison on March 7, 2012 at 9:00pm — No Comments

Exploratory Data Analysis for Complex Models


“Exploratory” and “confirmatory” data analysis can both be viewed as methods for comparing observed data to what would be obtained under an implicit or explicit statistical model. For example, many of Tukey’s methods can be interpreted as checks against hypothetical linear models and Poisson distributions. In more complex situations, Bayesian methods can be useful for constructing reference distributions for various plots that are useful in exploratory…


Added by John A Morrison on March 7, 2012 at 10:30pm — No Comments

Warranty Analytics - Increase Product quality, Customer satisfaction & Brand perception

As a customer, when you buy any home appliances like TV, AC, Refrigerator, Home Theater or a brand new car, you get a company warranty along with it. This is the commitment from the manufacturer that if any problem arises in the product or spare parts within the warranty period, then company will repair or replace…

Added by Sandeep Raut on March 7, 2012 at 7:37pm — No Comments

Support Vector Regression (SVR): predict earthquakes through sunspots

In the last months we discussed a lot about text mining algorithms, I would like for a while focus on data mining aspects.

Today I would talk about one of the most intriguing topics related to data mining tasks: the regression  analysis.

...To read the entire post click here

Experiment: Earthquakes prediction using sunspots as…

Added by Cristian Mesiano on March 7, 2012 at 2:27pm — No Comments

Top analytic blogs and websites, with trending information


  • This analysis is based on data submitted on sign-up by 16,000 Analyticbridge members, between February 2008 and December 2011.
  • A "plus" (+) sign on the right-hand column (below) shows growth…

Added by Vincent Granville on March 3, 2012 at 4:00pm — 7 Comments

Counterparty Credit Risk Management in Industrial Corporates


Ever since the financial crisis of the banking system of 2008 - 2010 the paradigm that deposits or other exposures towards major banks are safe has been fundamentally questioned. This put industrial corporates, who to support their business usually need to manage significant cash holdings or incur counterparty credit risk via derivatives, in the situation to develop or extend their resources for counterparty credit risk management. This paper provides a…


Added by John A Morrison on March 4, 2012 at 12:30am — No Comments

Six keywords characterizing milestones in the history of analytic engineering: from 1988 to 2033

  • 1988: Artificial Intelligence. Also: Computational Statistics.
  • 1995: Web Analytics. Also: Machine Learning, Business Intelligence, Data Mining, ROI, Distributed Architecture, Data…

Added by Vincent Granville on March 3, 2012 at 10:00pm — 7 Comments

Radoop and RapidMiner are partners now!

Read below more about this disruptive technology for Big Data Analytics.

This is probably the most exciting announcement of the last months: Radoop and RapidMiner are partners now! Read below more about this disruptive technology for Big Data Analytics.…


Added by Vincent Granville on March 3, 2012 at 12:00pm — No Comments

What is a Data Scientist?

Interesting article posted in The Guardian (see below). Here's my answer to this question:

A data scientist (I like the term data wizard better) is someone who can consistently derive money out of data, e.g. working as an employee, consultant or in an other capacity, by providing value to clients or extracting…


Added by Vincent Granville on March 2, 2012 at 9:30am — 2 Comments

Visualization Databases for the Analysis of Large Complex Datasets

Saptarshi Guha / Paul Kidwell / Ryan P. Hafen / William S. Cleveland


Comprehensive visualization that preserves the information in a large complex dataset requires a visualization database (VDB): many displays, some with many pages, and with one or more panels per page. A single display using a specific display method results from partitioning the data into subsets, sampling the subsets, and applying the method to each sample, typically one per panel.…


Added by John A Morrison on March 2, 2012 at 2:39am — No Comments

Social Media Analytics Expert Interview Series: Part 3

What Is Social Media Analytics? - 4 Experts Debate

February 28, 2011Expert Speakers

As a lead-up to the Social Media Analytics Summit, Text Analytics News has partnered with Useful Social Media to publish a series of expert interviews with top Social Media Analytics…


Added by Vincent Granville on March 1, 2012 at 7:36pm — 1 Comment

Big Data for the Public Good - Seminar Series

We are generating more data than ever before. Thanks to the data scientists who organize and analyze this information, this abundance of Big Data can be harnessed to serve the public interest in innovative ways.

Big Data for the Public Good is a four-part seminar series hosted by Code for America in San…


Added by Vincent Granville on February 29, 2012 at 10:00pm — No Comments

Features Extraction: Co-occurrences and Graph clustering

In the last two post we have discussed about co - occurrences analysis to extract features  in order to classify documents and extract "meta concepts" from the corpus.

We have also noticed that this approach doesn't return better than the traditional bag of words.

I would now explore some derivation of this approach, taking advantage of the graph theory.

the graph of the co occurrences is really huge and complex, how could we reduce its complexity without big information…


Added by Cristian Mesiano on February 29, 2012 at 1:59pm — No Comments

STATISTICA Decisioning Platform™ Profiled in Review of Decision Management Solutions

StatSoft’s (www.statsoft.comSTATISTICA Decisioning Platform™ is the only enterprise predictive analytics and decision management software platform to…


Added by Vincent Granville on February 29, 2012 at 9:55pm — No Comments

New Article on Data Mining: Classifiers & the ROC Curve

Many customer behaviors have the flavor of a choice between two alternatives:  Yes or no.  Buy or sell.  Renew or cancel.  Suppose software called a “classifier” is available to predict customer choices in advance.  Would you use it?  Perhaps you’d like to test it to see how well it performs before you commit.  In this installment of my series on the nuts and bolts of data mining, I discuss the use of classifiers and questions about their performance.  Regarding performance, we specifically…


Added by Daniel Graettinger on February 27, 2012 at 10:29am — No Comments

Featured Monthly Archives










Follow Us

On Data Science Central

On DataViz

On Hadoop

© 2016 is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Terms of Service