
Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School


Askinkaty/lxmls-toolkit

 
 


Summary

Machine learning toolkit for natural language processing, written for the Lisbon Machine Learning Summer School (lxmls.it.pt). It covers:

  • Scientific Python and Mathematical background
  • Linear Classifiers
  • Sequence Models
  • Structured Prediction
  • Syntax and Parsing
  • Feed-forward models in deep learning
  • Sequence models in deep learning


Instructions for Students

Install with Anaconda

The simplest method is to use Anaconda to handle your packages, as described on Day 0 of the lxmls-guide.

Alternative install with pip and virtualenv

If you prefer pip, create a virtual environment and install the toolkit's requirements:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Then get the pip install command for PyTorch that matches your platform from http://pytorch.org/ and run it. Finally, call

python setup.py develop

to install the toolkit in development mode, so that your changes to the source take effect without reinstalling.
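
As a quick sanity check after the steps above, the sketch below reports whether the expected modules can be found in the active environment. It assumes the toolkit installs under the import name `lxmls` and PyTorch under `torch`; the helper `check_modules` is hypothetical and not part of the toolkit.

```python
import importlib.util


def check_modules(names):
    """Map each top-level module name to True if Python can locate it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumed import names: "torch" for PyTorch, "lxmls" for this toolkit.
    for name, found in check_modules(["torch", "lxmls"]).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

If a module is reported as missing, check that the virtual environment is activated before rerunning the install commands.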

Bear in mind that the main purpose of the toolkit is educational. If you are looking for efficient implementations of the algorithms described, you may want to use other toolboxes.

Running

  • Run from the project root directory. If an import error occurs, first try adding the current directory to the PYTHONPATH environment variable, e.g.:
    • export PYTHONPATH=.
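
When changing the environment is inconvenient (for example, inside a notebook), a similar effect can be had from Python itself. This is a minimal sketch, assuming the script or notebook is started from the project root; `ensure_on_path` is a hypothetical helper, not something the toolkit provides.

```python
import os
import sys


def ensure_on_path(directory="."):
    """Put the absolute form of `directory` at the front of sys.path, once."""
    path = os.path.abspath(directory)
    if path not in sys.path:
        sys.path.insert(0, path)
    return path


# Call this before importing the toolkit when running from the project root.
ensure_on_path(".")
```

Unlike the exported variable, this only affects the current Python process.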

Development

To run all the tests, install tox and pytest

pip install tox pytest

and run

tox

To combine the coverage data from all the tox environments, run:

  • Windows
    set PYTEST_ADDOPTS=--cov-append
    tox
    
  • Other
    PYTEST_ADDOPTS=--cov-append tox
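
The two variants above differ only in how the environment variable is set. A platform-independent alternative is to set it from Python and then launch tox. The sketch below assumes tox is on the PATH and that the project's tox configuration passes PYTEST_ADDOPTS through to pytest, as the commands above imply; `coverage_env` and `run_tox` are hypothetical helper names.

```python
import os
import subprocess


def coverage_env(base=None):
    """Return a copy of the environment with --cov-append in PYTEST_ADDOPTS."""
    env = dict(os.environ if base is None else base)
    existing = env.get("PYTEST_ADDOPTS", "")
    if "--cov-append" not in existing.split():
        env["PYTEST_ADDOPTS"] = (existing + " --cov-append").strip()
    return env


def run_tox():
    """Run tox with the adjusted environment and return its exit code."""
    return subprocess.run(["tox"], env=coverage_env()).returncode
```

Calling `run_tox()` from the project root then behaves like either command above, on any platform.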
    
