Executable demo at http://www.text2data.net/index.html
I am fairly new to text mining. I found the "document extraction" problem interesting, especially for SEC documents, approached in a generic way that can be applied to any document with Latin characters. I think the generic problem of mining text from documents has practical use, but I don't really know how satisfactorily it has been solved, and I would like to hear your views...
While doing this, I do not know how many conventional approaches I…