Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add GUI Elements to Filter on Metadata and Thumbnails for Docs #75

Merged
merged 15 commits into from
Feb 18, 2023

Conversation

JSv4
Copy link
Owner

@JSv4 JSv4 commented Feb 16, 2023

Thumbnails are now auto-generated for docs on import. The thumbnails are processed using opencv to produce a 200x400 px snapshot of the top part of the doc. We MAY want to just take a high res snapshot of the frontpage and do image manipulation on frontend (or change the card view so you get a full thumbnail view initially. All things to be discussed.

…f middleware rechecking user on every single object being queried in a given query even though user should onyl need to be determiend once per request, not once per object needed to fulfill request.
…e graphql global id as one of the return values - useful and necessary when using an API client and creating objects, as you'll almost always want to know ID of created obj.
…dule, the migrations needed to be re-built as they had references to old python paths. Though... now, as I'm thinking about this, I think I can edit the migrations directly to prevent this from being breaking... probably best given this is released, though, admittedly with limited uptake. Also made a few tweaks in furtherance of headless crawlers being able to dump docs into an OpenContracts installation. Also removed some unwanted Affero GPL headers (this is NOT Affero licensed)
…o reference new project structure. Should allow people on old database structure to migrate still.
… in their Tesseract processor. IF they merge my change, I'll switch back to their repo, but, right now, it's causing issues with some PDFs where they're unable to complete the PAWLS pipeline where tokens in the pdf are extracted not as string type but float64. See issue I opened in the PAWLs repo and my PR.
…s (celery workers were being terminated for long pdfs). Add *very* rudimentary capabilities to interact with metadata (pretty much just filtering atm).
@codecov
Copy link

codecov bot commented Feb 16, 2023

Codecov Report

Merging #75 (deb5853) into main (4635f17) will decrease coverage by 1.70%.
The diff coverage is 59.37%.

@@            Coverage Diff             @@
##             main      #75      +/-   ##
==========================================
- Coverage   68.47%   66.77%   -1.70%     
==========================================
  Files          46       47       +1     
  Lines        1732     1794      +62     
==========================================
+ Hits         1186     1198      +12     
- Misses        546      596      +50     
Impacted Files Coverage Δ
opencontractserver/utils/cleanup.py 26.31% <ø> (ø)
opencontractserver/utils/pdf.py 44.82% <ø> (ø)
opencontractserver/tasks/doc_tasks.py 28.68% <12.06%> (-17.69%) ⬇️
opencontractserver/utils/etl.py 19.82% <50.00%> (ø)
opencontractserver/analyzer/models.py 100.00% <100.00%> (ø)
opencontractserver/analyzer/utils.py 80.95% <100.00%> (ø)
opencontractserver/analyzer/views.py 61.70% <100.00%> (+0.83%) ⬆️
opencontractserver/annotations/models.py 95.91% <100.00%> (ø)
opencontractserver/documents/signals.py 100.00% <100.00%> (ø)
opencontractserver/tasks/analyzer_tasks.py 100.00% <100.00%> (ø)
... and 12 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@JSv4 JSv4 merged commit d349fb3 into main Feb 18, 2023
@JSv4 JSv4 deleted the JSv4/add-gui-elements-to-create-metadata-labels branch February 18, 2023 06:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant