Skip to content

Artificial Intelligence has recently been used to automate resume parsing which deciphers various resumes using an NLP (Natural Language Processing) engine. It is important for recruiters as it may locate excellent candidates with a perfect match in qualifications, talents, and experience in a similar industry.

Notifications You must be signed in to change notification settings

riya1606/NLP_Resume_Parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

Resume Parser using Natural Language Processing

Welcome to an innovative Resume Parser project that leverages cutting-edge Artificial Intelligence techniques to automate the process of analyzing and extracting valuable information from resumes. This project harnesses the power of Natural Language Processing (NLP) and deep learning algorithms to assist recruiters in identifying ideal candidates quickly and efficiently.

Project Overview:

In today's fast-paced recruiting landscape, time is of the essence. This Resume Parser is designed to be a game-changer for recruiters, enabling them to navigate through a vast pool of resumes and pinpoint exceptional candidates with precision. By utilizing advanced technologies, including NLP, Named Entity Recognition (NER), and Optical Character Recognition (OCR), a robust system has been created to handle even the most complex resumes.

Key Features of the Resume Parser:

  • Document Preprocessing: The initial step involves converting resumes into a standardized text format. This ensures uniformity of data, making it ready for analysis.

  • Custom Named Entity Recognition (NER): Customization is taken to the next level by constructing a tailored NER model using SpaCy. This model is trained to recognize and categorize specific attributes within resumes, such as skills, experiences, and qualifications.

  • Attribute Labeling: The NER model labels each piece of information within the resume, associating it with a relevant attribute. This offers recruiters a clear and organized view of candidates' qualifications.

  • Parts of Speech Analysis: In addition to attribute labeling, the system analyzes the parts of speech within the resume text. This advanced feature enhances the parser's contextual understanding and enhances its capabilities.

  • Multiclass Classification: To further assist recruiters, TF-IDF (Term Frequency-Inverse Document Frequency) and cosine similarity techniques are employed to build a multiclass classification model. This model ranks resumes based on their relevance to the job posting, ensuring that the most suitable candidates are prioritized.

Dataset:

Dataset Available at:

  • Dataset Overview: Our training dataset comprises approximately 200 entries in pickle format ("Resume_train_data.pkl") and 300 resumes in CSV format ("UpdatedResumeDataSet.csv"). These datasets serve as the foundation for training our model.

  • Training and Testing: We use the pickle dataset to train our model, allowing it to learn from the labeled data. This trained model can then be applied to extract information from resumes in PDF or DOC format, facilitating result testing and resume parsing.

  • Named Entity Recognition (NER): Within the training data, we represent information using tuples in the form of dictionaries. These dictionaries assign labels to groups of tokens that are contiguous within the resumes. The NER labels are diverse and encompass a wide range of named or numerical entities, including but not limited to company names, locations, and organizations.

  • Default Model: Our project incorporates a default NER model that is capable of recognizing a broad spectrum of named or numerical entities. This model is instrumental in identifying and categorizing critical information within the resumes.

Flowchart:

image

Conclusion:

In conclusion, this Resume Parser project is a game-changer for recruiters and HR professionals seeking to streamline the candidate selection process. By automating the parsing and classification of resumes, it empowers recruiters to channel their time and efforts towards engaging with the most promising candidates.

About

Artificial Intelligence has recently been used to automate resume parsing which deciphers various resumes using an NLP (Natural Language Processing) engine. It is important for recruiters as it may locate excellent candidates with a perfect match in qualifications, talents, and experience in a similar industry.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published