Skip to content

sdabhi23/people-occupations-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

people-occupations-classifier

This project has been undertaken as a part of our undergraduate coursework, to understand generation of datasets and classification algorithms.

Getting started

  1. Clone this repo

    $ git clone https://github.com/sdabhi23/people-occupations-classifier.git
  2. Setup virtual environment (optional)

    $ virtualenv .venv
    # for windows
    $ .venv\Scripts\activate
    # for *nix
    $ .venv/Scripts/activate
  3. Installing the required libraries

    $ pip install -r requirements.txt
  4. Additional steps for nltk

    >>> import nltk
    >>> nltk.download('stopwords')
  5. Steps to configure ipython kernel

    Required only if using virtual environment

    $ ipython kernel install --user --name=people_classifier

    Then change the kernel in the jupyter interface to people_classifier.

Maintainers

References

  • Classifying Wikipedia People Into Occupations by Aleksander Gabrovski (.pdf)
  • Multi-Class Text Classification with Scikit-Learn (article)
  • Using jupyter notebooks with a virtual environment (article)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published