Text mining resources and experiences.
There is one test at the moment which unpacks the output from a manual search of TheLens. About 100 patents of which about 80 have textual "descriptions".
Contains the whole of scikit-learn tutorial. To load the data you have to
cd text_analysics/data/languages
python -m fetch_data
This downloads the data.