Preprocessing package that contains methods and utility functions custom-made for our thesis tool, Job-FinDR.
- An instance of ELMo must be present in the current working directory. This will be a directory containing the model and should be named "elmo".
-
Run the following command in the terminal.
pip install git+https://github.com/conraduouou/job-findr-preprocessing
-
This should then give you the
preprocessing
package made for Job-FinDR. To confirm its functionality, you can run the following command in your terminal:python -c "import preprocessing as pre; print(pre.prepare_age([\"__12]]\"]))"
The above code should output "12" in the terminal.
All you need with this package, really, is just the prepare_features
function.
To run it as I have intended, you have to supply a dictionary argument that contains all the features as string keys:
- age
- sex
- experience
- experience_role
- experience_years
- hard_skills
- soft_skills
- certifications
- training
- degree
... with each key having a list (array) value of strings (except for sex, which should only contain the values m or f). If there is no valid value for a given feature, supply an empty array instead.
This is effective when preprocessing a single set of resume data. For preprocessing multiple resume data, it should be in the form of a
csv, where rows denote each resume and columns defining the feature. Instead of supplying a dictionary argument in the prepare_features
function, supply the path to the csv to be prepared.
age,sex,experience,experience_role,experience_years,hard_skills,soft_skills,certifications,training,degree,job_field
28,m,Software Engineer,Lead Developer,5,Java,Communication,Oracle Certified Professional,Agile Development,computer science,Technology
34,f,Data Scientist,Machine Learning Engineer,7,Python,Analytical Thinking,Data Science Professional,Deep Learning,economics,Technology
25,m,UX Designer,Interaction Designer,3,UI/UX Design,Empathy,Adobe XD Certification,Human-Computer Interaction,education,Services
31,f,Project Manager,Scrum Master,8,Project Management,Leadership,Project Management Professional,Agile Leadership,sociology,Management
29,m,Network Administrator,Systems Administrator,6,Network Security,Troubleshooting,Cisco Certified Network Associate,Network Administration,tourism,Technology