Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix Broken Coordinate System in Parser #112

Conversation

JSv4
Copy link
Owner

@JSv4 JSv4 commented Sep 13, 2023

There was an issue in the app where the tokens did not seem to have the same scale / coordinate system as the page they were on. I realized my branch of PAWLs had some OpenCV image processing improvements beyond fixing the issue processing empty pages. There was some dynamic resizing and scaling that was getting the token coordinate system out of sync with the pdf coordinate system. Some of those improvements I made to processing documents were useful for bad quality docs, so those OpenCV improvements should be re-incorporated later for better parsing of bad quality docs. For now, though, the application is working 100% once again :-)

…ng the PAWLS preprocessor code. Realized my branch of PAWLs had some changes beyond fixing the issuing processing empty pages. There was some dynamic resizing and scaling that was getting the token coordinate system out of sync with the pdf coordinate system. Some of those improvements I made to processing documents were useful, so those open cv improvements should be re-incorporated later for better parsing of bad quality docs. For now, though, the application is working 100% once again :-)
@JSv4 JSv4 changed the title Fix Broken Coordinate System in Annotator Fix Broken Coordinate System in Parser Sep 13, 2023
@codecov
Copy link

codecov bot commented Sep 13, 2023

Codecov Report

Merging #112 (39c0f49) into main (c2b2902) will not change coverage.
The diff coverage is 0.00%.

❗ Current head 39c0f49 differs from pull request most recent head 7f77a0f. Consider uploading reports for the commit 7f77a0f to get more accurate results

@@           Coverage Diff           @@
##             main     #112   +/-   ##
=======================================
  Coverage   65.50%   65.50%           
=======================================
  Files          48       48           
  Lines        2128     2128           
=======================================
  Hits         1394     1394           
  Misses        734      734           
Files Changed Coverage Δ
opencontractserver/utils/pdf.py 30.00% <0.00%> (ø)

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@JSv4 JSv4 merged commit be3de1c into main Sep 13, 2023
3 checks passed
@JSv4 JSv4 deleted the JSv4/reverted-some-of-the-automagic-changes-in-parser-and-restored-full-functionality branch September 13, 2023 04:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant