Standard-library Python solution for the Redrob AI Campus Hackathon resume matching task.
The program:
- normalizes noisy skill strings with the provided alias mapping
- deduplicates canonical skills per resume
- builds an alphabetically sorted vocabulary from resume skills only
- computes resume TF-IDF vectors with
TF = 1 / unique_resume_skillsandIDF = ln(10 / df) - builds binary JD vectors over the same vocabulary
- ranks candidates by cosine similarity, breaking ties alphabetically
python3 resume_matcher.pyJD-1 - Kakao (ML Engineer) Sneha Patel(0.57), Karan Mehta(0.53), Arjun Sharma(0.40)
JD-2 - Naver (Backend Engineer) Rahul Gupta(0.81), Ananya Krishnan(0.28), Deepika Rao(0.19)
JD-3 - Line (Frontend Engineer) Aditya Kumar(0.67), Priya Nair(0.58), Ananya Krishnan(0.35)
resume_matcher.py: complete implementation with embedded dataset and alias map.gitignore: excludes local/cache files and submission notes