Skip to content

sueiras/text_mining

Repository files navigation

Text Mining

Notebooks to the Text Mining course

  • 01 NLP whit NLTK
  • 02 Text classification whit Sklearn
  • 03 Word representations whit Gensim
  • 04 RNN whit Keras
    • Sentiment model
    • Word tagging
  • 99 Homework

Instructions to use Google Colab

  • Open: https://colab.research.google.com/
  • In the load notebook screen select the tag github
  • Find: sueiras/text_mining
  • Select the notebook to open.
  • To execute the notebooks with RNN change te

If you want to make changes in the repository and save it, the best way are fork the repository in your github account and work in the forked copy as usual with git.

OPTIONAL: Create a local Anaconda environment and install text mining packages

To execute the notebooks in local you need a local python environment with the required packages.

Open an Anaconda terminal and execute

conda create -n tm python=3.6
activate tm

conda install pip

conda install graphviz
conda install pandas scikit-learn
conda install -c anaconda jupyter
conda install matplotlib
conda install pillow
conda install -c conda-forge spacy

# Spacy models (37.4Mb - 36.7Mb - 120.8Mb)
python -m spacy download en
python -m spacy download es
python -m spacy download en_core_web_md

pip install --upgrade gensim
pip install h5py
pip install pydot-ng
pip install nltk
pip install pyldavis

pip install --ignore-installed --upgrade tensorflow

OPTIONAL: To execute notebooks in local download the next linguistic resources from the web and unzip in the data directory:

About

Text mining course

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published