Skip to content

A simple NLP project of resume processing using python 3.0.

Notifications You must be signed in to change notification settings

PearlCoastal/ResumeProcessing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

Resume Processing

Description

A simple NLP project of resume processing using python 3.0.

Python libraries and the dataset

imgae

Show categories of resumes present in the dataset

image

Visualize the number of categories in the dataset

imgae

Visualize the distribution of categories in the dataset

image

Remove the wrong format inside the resume and make it clean and able to read

Wrong format included:

  • URLs
  • hashtags
  • mentions
  • special letters
  • puctuations

image Left column is original dataset which contains lots of wrong format informations. Right column is the resume dataset after cleaned. image

Import NLTK and visualize the most numbers of words larger and vice versa inside the resume

Its easily to read that "Details" appeared 484 times,"Experience" 446 times, as well as "company", "less", "year", "Machine Learning", and etc. These are those most numbers of words appeared in one resume. image

Train maching learning model for resume processing and here is the classification report of this dataset

Here I used the onevsrest classifier and KNN classifier. First, split the data into training and data sets. image

About

A simple NLP project of resume processing using python 3.0.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published