ResumeParser

A utility that makes handling many resumes easier by automatically extracting contact information, required skills, and custom text fields. The results are written to a summary CSV.

Quick Start Guide

This assumes you've installed Anaconda (as discussed in Python Environment).

# Create Python virtual environment
conda env create -f environment.yml

# Activate Python virtual environment
source activate resume

# Retrieve language model from spaCy
python -m spacy download en

# Run code (with default configurations)
cd bin/
python main.py

# Review output
open ../data/output/resume_summary.csv
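
If you would rather inspect the summary programmatically than open it in a spreadsheet, a minimal sketch follows. It assumes a Python 3 interpreter (separate from the Python 2.7 environment this repo uses) and is run from bin/, like the commands above. The helper name load_summary is hypothetical and not part of this repo, and no column names are assumed: the script prints whatever header the CSV contains.

```python
import csv
import os

# Path used in the Quick Start Guide, relative to bin/.
SUMMARY_PATH = "../data/output/resume_summary.csv"


def load_summary(path):
    """Return (column_names, rows); each row is a dict keyed by column name.

    Hypothetical helper for illustration only, not part of ResumeParser.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        rows = list(reader)
        # fieldnames is None for an empty file, so fall back to an empty list.
        return list(reader.fieldnames or []), rows


# Only read the file if the parser has already produced it.
if os.path.exists(SUMMARY_PATH):
    columns, rows = load_summary(SUMMARY_PATH)
    print("columns:", columns)
    print("resumes parsed:", len(rows))
```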

Getting started

Repo structure

bin/main.py: Code entry point
confs/confs.yaml.template: Configuration file template
data/input/example_resumes: Example resumes, which are parsed with the default configuration
data/output/resume_summary.csv: Results from parsing example resumes

Python Environment

The Python code in this repo uses packages that are not part of the standard library. To make sure you have all of the required packages, install Anaconda (Python 2.7 version), then create the environment described in environment.yml (see the conda documentation sections Creating an environment from an environment.yml file and Activating an environment).

Once Anaconda is installed, you can follow the steps described in the Quick Start Guide.

Configuration file

This program uses a configuration file to set its parameters. You can run it with the default parameters to view sample output, but you'll probably want to create your own config file and modify it to get the most value from the program.

# Create configuration file from template
cp confs/confs.yaml.template confs/confs.yaml

# Modify confs to match your needs
open confs/confs.yaml

The configuration file has a few parameters you can tweak:

resume_directory: A directory containing the resumes you'd like to parse
summary_output_directory: Where to place the .csv file summarizing your resumes
data_schema_dir: The directory in which to store the table schema. This is mostly for development purposes
skills: A YAML list of skills. Each element in this list can be either a string (e.g. skill1 or machine learning) or a list of aliases for the same skill (e.g. [skill2_alias_A, skill2_alias_B] or [ml, machine learning, machine-learning])
universities: A YAML list of universities you'd like to search for
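
To make the skills format concrete, here is a minimal sketch of how a list that mixes plain strings and alias lists could be matched against resume text. It is an illustration under assumptions, not the matching logic this repo actually uses. The function names normalize_skills and count_skill_mentions are hypothetical, the skills value is written directly as the Python structure a YAML loader would produce, and the code assumes Python 3.

```python
import re

# Hypothetical stand-in for the parsed `skills` entry of confs/confs.yaml:
# a plain string is one skill, a nested list holds aliases of one skill.
skills = [
    "python",
    ["ml", "machine learning", "machine-learning"],
]


def normalize_skills(entries):
    """Map each skill's first name to the full list of its aliases."""
    normalized = {}
    for entry in entries:
        aliases = [entry] if isinstance(entry, str) else list(entry)
        normalized[aliases[0]] = aliases
    return normalized


def count_skill_mentions(text, entries):
    """Count case-insensitive whole-phrase matches per skill, summed over aliases."""
    counts = {}
    for name, aliases in normalize_skills(entries).items():
        total = 0
        for alias in aliases:
            # Lookarounds keep "ml" from matching inside words such as "html".
            pattern = r"(?<!\w)" + re.escape(alias) + r"(?!\w)"
            total += len(re.findall(pattern, text, flags=re.IGNORECASE))
        counts[name] = total
    return counts


resume_text = "Built ML pipelines in Python; 3 years of machine learning experience."
print(count_skill_mentions(resume_text, skills))
```

Grouping aliases under the first name in each list means a resume that mentions "ML" and one that mentions "machine learning" are credited with the same skill, which is the purpose of the alias form described above.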

Contact

Feel free to contact me at 13herger <at> gmail <dot> com

