Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TLDR-538 tesseract postprocessing #388

Merged
merged 4 commits into from
Jan 31, 2024
Merged

TLDR-538 tesseract postprocessing #388

merged 4 commits into from
Jan 31, 2024

Conversation

oksidgy
Copy link
Collaborator

@oksidgy oksidgy commented Dec 20, 2023

TLDR-538 Research of ocr correction (post processing) influence on result

@oksidgy oksidgy self-assigned this Dec 20, 2023
@oksidgy oksidgy changed the title Tldr 538 tesseract TLDR-538 tesseract postprocessing Dec 20, 2023
@oksidgy oksidgy changed the title TLDR-538 tesseract postprocessing TLDR-551 tesseract postprocessing Dec 20, 2023
Base automatically changed from develop to master December 25, 2023 13:34
@oksidgy oksidgy changed the base branch from master to develop January 9, 2024 14:20
requirements.txt Outdated Show resolved Hide resolved
@NastyBoget NastyBoget changed the title TLDR-551 tesseract postprocessing TLDR-538 tesseract postprocessing Jan 31, 2024
@NastyBoget NastyBoget merged commit 6a3cfed into develop Jan 31, 2024
3 checks passed
@NastyBoget NastyBoget deleted the TLDR-538-tesseract branch January 31, 2024 10:22
NastyBoget added a commit that referenced this pull request Mar 5, 2024
* TLDR-549 delete custom loggers (#393)

* changed dedoc-utils version (#394)

* remove PdfTxtlayerReader from TxtLayerDetector (#395)

* Make train dataset API separated (#396)

* TLDR-584 words boldness for images (#397)

* TLDR-584 text boldness for words in images

* TLDR-582 fix pdf_txtlayer_reader bboxes for lines (labeling_mode="true") (#399)

* TLDR-585 added TEDS table benchmark (#398)

* TLDR-538 tesseract postprocessing (#388)

* TLDR-590 fix code style in scripts directory (#400)

* Add job to meet requirements of the develop branch (#401)

* TLDR-602 some fixes of web form (#402)

* Translate labeling web pages into English (#403)

* TLDR-556 tutorial how to add a new structure type (#405)

* TLRD-182 eml reader bug fix (#406)

* new version (#407)

---------

Co-authored-by: Nikita Shevtsov <61932814+Travvy88@users.noreply.github.com>
Co-authored-by: Alexander Golodkov <golodkov@ispras.ru>
Co-authored-by: Andrew Perminov <perminov@ispras.ru>
Co-authored-by: Oksana Belyaeva <belyaeva@ispras.ru>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants