Data Intelligence, Business Analytics
Q7. What are the keys to operationalizing a machine learning ranking system from an organization and/or engineering management point of view? (Quora) - http://www.quora.com/What-are-the-keys-to-operationalizing-a-machin...
A. The very first thing would be to have the right database model / architecture from the very beginning, months before it's in testing mode. So many companies went wrong just by choosing SQL server for their production platform. It is very expensive, once you realize you have scalability issues or slowness, to switch to Hadoop or some other architecture.
Q8. Can you patent mathematical procedures? Really? (LinkedIn) - http://www.linkedin.com/groupAnswers?viewQuestionAndAnswers=&di...
A. Yes, I created and sold mathematical patents on scoring technology - used e.g. in the context of fraud detection or transaction scoring. You can patent pretty much anything if you have the right lawyers, and nothing if you do not - the system is very unfair.
Now I don't create patents anymore (not in the classical sense of the word), instead I make my inventions widely available to everyone by publishing them e.g. in my free e-Book on data science: you can call it "open patents" just like "open source", and yes I still do make money, although very indirectly.
Q9. If you could add or edit 3 features in your modeling or solution development software, what would they be? (LinkedIn) - http://www.linkedin.com/groups/If-you-could-add-edit-35222.S.113756...
A. It would be:
Q10. What is machine learning? (LinkedIn) - http://www.linkedin.com/groupAnswers?viewQuestionAndAnswers=&di...
A. I think it depends what your context it. Your context is CS (computer science), mine is CS (computational statistics) and thus to me machine learning relates to techniques (decision trees, regression, SVM, neural networks, pattern recognition) to perform what is, at the end of the day, supervised clustering.
Back in 1993 when I completed my PhD on clustering techniques applied to image analysis, in Belgium, in a stats (not computer science) research lab, supervised clustering meant (at least in our stats lab) supervised classification, and unsupervised clustering meant unsupervised classification. Maybe this was not the correct terminology (none of us was English native speaker), but that's the words that we used. Also, we never used the word "data mining", instead we used "computational statistics". We never used "vector", we used "feature" - while here vector = set of attributes (what I call a feature) while feature = variable.
We should create a synonym dictionary, or maybe a translation dictionary :-) English to English.
Q11. If I want to do Data Science, would LinkedIn or Twitter be a better place to start work? (Quora) - http://www.quora.com/If-I-want-to-do-Data-Science-would-LinkedIn-or...
What about declining both offers and becoming a start-up yourself? Since I left the corporate world, I've never been so happy. My only regret is to have waited so long before becoming independent. Not only is my revenue higher and more diversified (than when working for one company) and getting more diversified every day, but also I have a feeling of creating great things and helping people pretty much every day, without any limitation in what I do (other than the limitations imposed by the market, which all companies are also subject to).
Here some highlights about the joys of working for my startup:
Read part 1 (Q1 to Q6) of this series at http://www.analyticbridge.com/profiles/blogs/new-series-vincent-s-a...
© 2013 AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC
You need to be a member of AnalyticBridge to add comments!
Join AnalyticBridge