Skip to content

Comments

Job 27 cv file parsing#13

Merged
Ivon-Martinez merged 4 commits intomainfrom
JOB-27-cv-file-parsing
Dec 13, 2025
Merged

Job 27 cv file parsing#13
Ivon-Martinez merged 4 commits intomainfrom
JOB-27-cv-file-parsing

Conversation

@AnsonChung3
Copy link
Contributor

Summary/Description/Done changes

Add backend functionality to convert pdf/docx/txt into plain text.
This branch includes a demo test page, please do give it a try and let me know if it doesn't work as intended.
Extra packages are installed for parser. Please do npm i when you check out this branch or after this branch is merge into main.

Please delete this branch after merging. Thanks. 😄 🍻

Known issues (if any)

  1. parser does not accept .doc file
  • .doc is a binary file while .docx is an openXML file. They are very different in nature. Trying to parse .doc file is a sizeable task in itself. It can be added if we have enough time.
  1. the server endpoint is not inside the correct module
  • This is intentional and temporary. It can be fixed after integration test.

Time spent on this issue

Total time: 6hr 30min

  • initial run time error gave no information
  • time took to find out the error log option
  • unexpected typescript problem such as import syntax, import types, and types
  • package for pdf parsing didn't work as expected:
    • expected: package can be called as a function
    • actual implementation: package is imported as a class, to perform task, I had to go inside the package file to find out the class function that can be called

@Ivon-Martinez
Copy link
Contributor

Tested the /upload-cv endpoint on Postman and all file parsing (PDF, DOCX, TXT) works correctly

@Ivon-Martinez Ivon-Martinez merged commit e17d8c0 into main Dec 13, 2025
1 check passed
@Ivon-Martinez Ivon-Martinez deleted the JOB-27-cv-file-parsing branch December 13, 2025 14:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants