Subscribe to DSC Newsletter

Featured Blog Posts (1,447)

Understanding the Reality of Real-Time Analytics


Added by Vincent Granville on May 15, 2012 at 1:30pm — No Comments

Machine Learning in Python has never been easier

At BigML we believe that over the next few years automated, data-driven decisions and data-driven applications are going to change the world.  In fact, we think it will be the biggest shift in business efficiency since the dawn of the office calculator, when individuals had “Computer” listed as the title on their business card.  We want to help people rapidly and easily create predictive models using their datasets, no matter what size they are. Our…


Added by Jos Verwoerd on May 15, 2012 at 3:20am — No Comments

BiG DaTa & Vectorization


It has been while when Big data entered into the market and buzz the analytics world. Now a day all analytics leaders are chanting about Big data applications. Since I have started with Hadoop technologies and with Machine learning one question has been bugging in mind:

Which is a greater innovation Big Data Or Machine Learning…


Added by Manish Bhoge on May 13, 2012 at 11:50pm — 1 Comment

SAS Global Forum: Here’s the Wrap Up!

SAS Global Forum 2012 was a success! After a whirlwind week of activities followed by a vacation and week of rest – I’m ready to give you some highlights.  It was a lot of fun! Tip: Click on any picture to enlarge it.

Day 1 – Saturday Ready for the Tweet-Up

The biggest drama was at the airport – our flight was delayed due to mechanical failure so I decided it might be better to take a later flight. Met…


Added by Tricia Aanderud on May 14, 2012 at 6:46am — No Comments

Email marketing: analytic tips to boost performance by 300% - case study

This post is part of our blog post series on data science case studies and success stories.

Analyticbridge improved open rates by 300%, and dramatically improved total clicks and click-through rates using the following strategies:…


Added by Vincent Granville on May 13, 2012 at 1:00pm — No Comments

Four different ways to solve a data science problem - case study

Here we discuss four approaches to solve the following marketing problem: identify, each day, the most popular Google groups, within a large list of target groups. You want to post in these groups only. The only information that is quickly available for each group, is the time when the last posting occured. Intuitively, the newer the last posting, the most active the group. There are some caveats such as groups…


Added by Vincent Granville on May 12, 2012 at 11:30pm — 4 Comments

The Math Behind Ticket Bargains | SeatGeek


Added by Capri on May 12, 2012 at 5:20pm — No Comments

Quickly start and optimize keyword advertising campaigns on Google in 7 days: a 11-step procedure

This is what I did, and it worked quite well. 

  1. Identify 10 top, high volume, well targeted keywords for your business. These are your seed…

Added by Vincent Granville on May 12, 2012 at 4:00pm — 2 Comments

More resources for data scientists and analytic professionals

Recently posted on DataScienceCentral and AnalyticBridge:

1. Conferences

  • Europe’s Best and Brightest come together at Analytics 2012 -…

Added by Capri on May 12, 2012 at 1:00pm — No Comments

R you ready for Big Machine Learning?

Recently, we released python bindings for our API.  We received fantastic feedback on the related

blog post from hacker news and twitter, so we started thinking about other languages that could benefit from a tighter integration with the…


Added by Justin Donaldson on May 11, 2012 at 11:40am — No Comments

SAS Enterprise Guide: Import the Excel Spreadsheet – Easy Peasy

One SAS Enterprise Guide feature I particularly like is the ability to import Microsoft Excel data quickly and easily.  SAS offers many ways to work with Excel spreadsheets but often I find I just want to extract data from Excel and get on with my job.  

Use a “Known Good” First Time

If you are trying this process for the first time, use a “known good” or simple spreadsheet so if any issues arise you can at least eliminate the data as the cause. When this process fails, I…


Added by Tricia Aanderud on May 10, 2012 at 10:00am — No Comments

Big Data Will Need 1.5 Million Data Scientists | Dice

How many of these jobs can be performed by bots (computer programs)? Here's the story:…


Added by Capri on May 9, 2012 at 2:46pm — No Comments

Government Agencies Adding a Petabyte of New Data, Making no Progress in Big Data | Yahoo Finance

IT professionals estimate that they have less…


Added by Capri on May 8, 2012 at 5:00pm — No Comments

Data visualization: example of a great, interactive chart

This chart was published in the Forbes magazine by Jon Bruner, and the data comes from the IRS tax stats tables.


More people left Phoenix in 2009 than came. The map above visualizes moves to and from Phoenix; counties that took more migrants than they sent are linked with red lines. Counties that sent more migrants than they took are linked with blue lines.


Added by Mirko Krivanek on May 8, 2012 at 9:30am — 5 Comments

Uncertainty coefficients for Features Reduction - comparison with LDA technique

Uncertainty coefficient
Consider a set of people's data labelled with two different labels, let's say blue and red, and let's assume that for this people we have a bunch of variables to describe them.
Moreover, let's assume that one of the variables is the social…

Added by Cristian Mesiano on May 5, 2012 at 1:09pm — No Comments

Three blog posts with tons of valuable information

There is at least one thing of interest to any data scientist in the following:

  • List of publicly traded analytic companies -…

Added by Vincent Granville on May 3, 2012 at 8:30pm — 1 Comment

Pentaho Introduces New Interactive Visualization and Expanded Big Data Analytics

Orlando, Fla. — April 25, 2012 — Delivering the future of business analytics, Pentaho Corporation today announced the general availability of Pentaho Business Analytics 4.5. With this release, Pentaho provides new user-driven, interactive visualization and data exploration capabilities that access all data sources, including big data, as well as a pluggable and extensible interface for software and SaaS companies to…


Added by Vincent Granville on May 3, 2012 at 4:30pm — No Comments

A new set of six great data science articles from top news outlets


Added by Capri on May 3, 2012 at 2:00pm — No Comments

SimaFore Partners With Rapid-I To Provide Cost-Effective Analytics Apps For SME’S

Online PR News – 01-May-2012…


Added by Richard on May 3, 2012 at 2:03pm — No Comments

Featured Monthly Archives











Follow Us

On Data Science Central

On DataViz

On Hadoop

© 2017 is a subsidiary and dedicated channel of Data Science Central LLC   Powered by

Badges  |  Report an Issue  |  Terms of Service