Skip to content

Some experiments and datasets for natural language processing and classification

Notifications You must be signed in to change notification settings

codito/nlp-expt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A bunch of random experiments in NLP.

Usage

# create a virtualenv, I usually install pandas etc. in system
> python -m venv .venv --system-site-packages
> source .venv/bin/activate

# in case you don't have base packages, install them in venv
> pip install sklearn pandas seaborn jupterlab

> pip install cython
> pip install -r requirements.txt

# create an ipython kernel to use the virtualenv
> ipython kernel install --user --name=nlp-expt
# modify the kernel.json file to include python executable from the venv

> jupyter lab

Data

Simplewiki

A cleaned and category labeled dataset of articles/pages in https://simple.wikipedia.org.

See data/simplewiki and README.

License

Datasets are licensed similar to the upstream licenses. Check individual sections above.

About

Some experiments and datasets for natural language processing and classification

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published