A production-ready resume parser with a clean, minimalist UI and 97.7% accuracy on target files.
- High Accuracy: 97.7% accuracy on target resume files (91% overall)
- Clean Interface: Modern, minimalistic design with professional typography
- Multiple Formats: Supports PDF, DOC, DOCX, TXT files
- Real-time Processing: < 100ms average processing time
- Standard JSON Output: Industry-standard JSON format
- Drag & Drop: Modern file upload interface
- Python 3.8+
- pip
- Clone the repository:
git clone https://github.com/your-username/parserdemo.git
cd parserdemo
- Create a virtual environment:
python3 -m venv parser_env
source parser_env/bin/activate # On Windows: parser_env\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
Run the main UI server:
python3 clean_server.py
Access at: http://localhost:8001
Or run the alternative server:
python3 fixed_server.py
Access at: http://localhost:8000
Based on validation testing with real resume files:
- Perfect Score (100%): 9 out of 11 target files
- Average Score: 97.7% accuracy across the 11 target files
- Contact Extraction: Name, Email, Phone
- Experience Parsing: Job positions with dates
- Skills Detection: Technical and soft skills
- Education: Degree and institution extraction
GET /api/health
POST /api/parse
Content-Type: multipart/form-data
Body: file (PDF/DOC/DOCX/TXT)
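As a rough illustration of calling the parse endpoint, the sketch below builds the multipart request using only the Python standard library. The form field name `file` and the port come from this README; the helper names `build_multipart` and `build_parse_request` and the `application/octet-stream` part type are assumptions made for the example, not part of the project.

```python
import uuid
import urllib.request


def build_multipart(field: str, filename: str, data: bytes):
    """Encode one file as a multipart/form-data body.

    Returns (body, content_type). Hypothetical helper for illustration.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def build_parse_request(filename: str, data: bytes,
                        base_url: str = "http://localhost:8001"):
    """Build a POST /api/parse request carrying the resume in the 'file' field."""
    body, content_type = build_multipart("file", filename, data)
    return urllib.request.Request(
        f"{base_url}/api/parse",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )


# Sending the request (requires a running server):
#   import json
#   req = build_parse_request("resume.txt", open("resume.txt", "rb").read())
#   with urllib.request.urlopen(req) as resp:
#       result = json.load(resp)
```

Building the request separately from sending it keeps the encoding step easy to inspect without a server running.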
{
  "success": true,
  "ContactInformation": {
    "CandidateName": {"FormattedName": "John Doe"},
    "EmailAddresses": [{"Address": "john@example.com"}],
    "Telephones": [{"Raw": "(123) 456-7890"}]
  },
  "EmploymentHistory": {
    "Positions": [...]
  },
  "Skills": [...],
  "processing_time": 0.089,
  "standard_format": true
}
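Because the contact fields are nested, a small accessor can make the response easier to consume. The following is a minimal sketch that assumes the response has the shape shown above; `summarize_contact` is a hypothetical helper name, and it returns `None` for any field that is missing rather than raising.

```python
from typing import Any, Dict, Optional


def summarize_contact(response: Dict[str, Any]) -> Dict[str, Optional[str]]:
    """Pull the name and the first email and phone out of a parse response."""
    contact = response.get("ContactInformation") or {}
    name = (contact.get("CandidateName") or {}).get("FormattedName")
    # Keep only entries that actually carry a value.
    emails = [e.get("Address") for e in contact.get("EmailAddresses") or []
              if e.get("Address")]
    phones = [t.get("Raw") for t in contact.get("Telephones") or []
              if t.get("Raw")]
    return {
        "name": name,
        "email": emails[0] if emails else None,
        "phone": phones[0] if phones else None,
    }


sample = {
    "success": True,
    "ContactInformation": {
        "CandidateName": {"FormattedName": "John Doe"},
        "EmailAddresses": [{"Address": "john@example.com"}],
        "Telephones": [{"Raw": "(123) 456-7890"}],
    },
}
print(summarize_contact(sample))
```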
- clean_server.py: Main UI server with fixed JavaScript bugs
- fixed_resume_parser.py: Core parsing engine with enhanced accuracy
- fixed_server.py: Alternative server implementation
- validation_results.json: Accuracy test results
- Processing Speed: Sub-100ms response times
- File Support: PDF (PyMuPDF), DOC/DOCX (python-docx), TXT
- Memory Efficient: Temporary file handling with automatic cleanup
- Error Handling: Comprehensive validation and error reporting
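The temporary-file handling and validation described above could look roughly like the sketch below. It is not the project's actual code: `with_temp_upload`, the `SUPPORTED` set, and the `handler` callback are hypothetical names, and the real parser may organize this differently. The point is the pattern of validating the extension first and deleting the temporary file in a `finally` block so cleanup happens even when parsing fails.

```python
import os
import tempfile
from pathlib import Path

# Extensions listed as supported in this README.
SUPPORTED = {".pdf", ".doc", ".docx", ".txt"}


def with_temp_upload(filename: str, data: bytes, handler):
    """Write an upload to a temp file, run handler(path), always delete the file."""
    suffix = Path(filename).suffix.lower()
    if suffix not in SUPPORTED:
        raise ValueError(f"Unsupported file type: {suffix or '(none)'}")
    fd, tmp_path = tempfile.mkstemp(suffix=suffix)
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
        # The handler stands in for the format-specific parsing step.
        return handler(tmp_path)
    finally:
        os.remove(tmp_path)
```

A caller would pass the parsing function as `handler`, for example one that opens the path with PyMuPDF for PDFs or python-docx for DOCX files.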
- Modern drag & drop interface
- Real-time processing indicators
- Responsive design
- Clean professional typography
- Structured result display
- JSON output viewer
Recent fixes include:
- JavaScript variable scope issues in result display
- Contact information extraction from nested JSON
- Proper statistics calculation and display
- Network error handling improvements
MIT License - feel free to use in your projects!
- Fork the repository
- Create a feature branch
- Make your changes
- Submit a pull request