Skip to content

lovelyk0910/resume-parser-in-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

resume-parser-in-python

A parser to extract information from resumes in PDF and DOCX formats written in Python

##Dependencies The parser requires two Python modules for it to work as intended,

  1. PyPdf
  2. python-docx

The module used for tokenizing and stop word removal are:

  1. word_tokenize from nltk.tokenize #can be replaced with the split() which is built-in
  2. stopwords from nltk.corpus

To get both you'll to install the Python NLTK module.

The script is written in Python 2.7..6

##License The script is licensed under the General Public License (GPL), for more details do check out the LICENSE.md.

About

A parser to extract information from resumes in PDF and DOCX formats written in Python

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages