Skip to content
Branch: master
Find file History
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

Better NLP Better NLP License

This is a wrapper program/library that encapsulates a couple of NLP libraries that are popular among the AI and ML communities.

Examples have been used to illustrate the usage as much as possible. Not all the APIs of the underlying libraries have been covered.

The idea is to keep the API language as high-level as possible, so its easier to use and stays human-readable.

Libraries / frameworks covered:

Non-NLP related libraries / frameworks used:


  • Python 3.7.x or higher
  • Docker (optional)
  • Diskspace: 2-3GB

Get started

Linux / MacOS (Docker environment)

  • Clone the repo: git clone
  • cd /path/to/awesome-ai-ml-dl/examples/better-nlp
  • Ensure the Docker daemon is running in the background
  • Run
  • Wait for the docker image to download (first time, one-off)
  • Wait for the container to get started with the JuPyter notebook running
  • Copy the notebook's url published in the console, should be of the form http://....?token....) - you will have to do a tiny bit of amendment to make the url look like http://localhost:8888?token....
  • Go to the browser and paste the corrected url: http://localhost:8888?token...., you should see the screen as described in Jupyter Notebook.

Linux / MacOS (local environment)

  • Clone the repo: git clone
  • cd /path/to/awesome-ai-ml-dl/examples/better-nlp/build
  • Run or depending on the OS you are running on
  • Run
  • Run cd ..
  • Run jupyter-lab notebooks/jupyter/better_nlp_spacy_texacy_examples.ipynb
  • Wait for the JuPyter lab to open in the browser
  • You should see the screen as described in Jupyter Notebook.

In theory, the above should work for Windows as well, if run via git-bash or cgywin with all the necessary requirements installed and available - although it has not been tested, please provide feedback or fixes if you find any.



See Jupyter Notebook

Google Colab

You can open these notebooks directly into Google Colab:

Kaggle kernels

Utility Script Competition! (September 23rd - Oct 7th): submission, discussion topic


Setup an environment needed to be able to run these programs without having to worry about the dependencies they use.

Please be aware that even though we are install only a few components, the installation process takes some time (irrespective if you are running in via your local environment or inside a docker container). Give it about 20-30 minutes depending on network bandwidth and overall machine performance. Or you can use a pre-built docker image, see Docker environment for more details.

Local environment

For the brave at heart, install the dependencies in your local environment.





Alternatively please refer to the Docker environment section.


In principle, the script should work in the cygwin or git bash environments - although it has not been tested, please raise PR with fixes if any. Alternatively please refer to the Docker environment section.

Docker environment

See Docker environment

Example code

See Examples and results


Contributions are very welcome, please share back with the wider community (and get credited for it)!

Please have a look at the CONTRIBUTING guidelines, also have a read about our licensing (and warranty) policy.

Back to NLP page
Back to main page (table of contents)

You can’t perform that action at this time.