Update grobid.py #16298

Merged 2 commits on Jan 22, 2024

Commits on Jan 19, 2024

  1. Update grobid.py

    In some cases "coords" does not exist in the "sentence", so calling split(";") on it raises an error.
    
    We can fix that by adding "if sentence.get("coords") is not None:".
    
    In that scenario "sbboxes" ends up empty, which then raises an error at "sbboxes[0]["page"]".
    
    The PDF from https://pubmed.ncbi.nlm.nih.gov/23970373/ reproduces both errors.
    naarkhoo committed Jan 19, 2024
    6c46177
  2. Update grobid.py

    naarkhoo committed Jan 19, 2024
    e10f908
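
The two failure modes described in the first commit message can be illustrated with a minimal sketch. This is not the actual code in grobid.py: the helper names `parse_sentence_bboxes` and `page_span` are hypothetical, the sentence's "coords" value is passed in as a plain string or None, and the `page,x,y,w,h` field order is an assumption made for illustration.

```python
from typing import Dict, List, Optional, Tuple


def parse_sentence_bboxes(coords: Optional[str]) -> List[Dict[str, str]]:
    """Parse a "coords" attribute value into a list of bounding boxes.

    Returns an empty list when the attribute is missing (None), which is
    the case the commit guards with `if sentence.get("coords") is not None:`.
    """
    sbboxes: List[Dict[str, str]] = []
    if coords is not None:  # guard: without it, None.split(";") would fail
        for bbox in coords.split(";"):
            box = bbox.split(",")
            if len(box) < 5:
                continue  # skip malformed entries (extra safety, assumed)
            # Field order page,x,y,w,h is assumed for this sketch.
            sbboxes.append(
                {"page": box[0], "x": box[1], "y": box[2], "w": box[3], "h": box[4]}
            )
    return sbboxes


def page_span(sbboxes: List[Dict[str, str]]) -> Optional[Tuple[str, str]]:
    """Return (first_page, last_page), or None when there are no boxes.

    Guards the second failure: sbboxes[0]["page"] on an empty list.
    """
    if not sbboxes:
        return None
    return sbboxes[0]["page"], sbboxes[-1]["page"]


if __name__ == "__main__":
    print(page_span(parse_sentence_bboxes(None)))
    print(page_span(parse_sentence_bboxes("1,10.0,20.0,30.0,5.0;2,1,2,3,4")))
```

Returning an empty list and checking it before indexing keeps both guards local, so a sentence without coordinates is skipped rather than aborting the whole parse.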