Executable demo at http://www.text2data.net/index.html
I am fairly new to text mining. I found the "document extraction" problem interesting, especially for SEC documents, approached in a generic way that can be applied to any document with Latin characters. I think generic text mining from documents has practical use, but I don't really know how satisfactorily it has been solved, and I would like to hear your views.
While doing this, I do not know how many conventional approaches I have rudely trampled upon, but the end result (in extraction depth and width) seemed decent enough.
Maybe posting to this forum is like carrying coal to Newcastle, but I was encouraged by the general interest in this from financial and text mining companies. When I actually visited the web sites of a few generic text mining products, I did not find any special features that were not already present, or possible in future, in the demo.
Hope you like it....