Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(deps): update dependency tesseract.js to v4 #9

Merged
merged 1 commit into from
Apr 21, 2023

Conversation

renovate[bot]
Copy link
Contributor

@renovate renovate bot commented Mar 22, 2023

Mend Renovate

This PR contains the following updates:

Package Change Age Adoption Passing Confidence
tesseract.js ^3.0.3 -> ^4.0.0 age adoption passing confidence

Release Notes

naptha/tesseract.js

v4.0.2

Compare Source

What's Changed

  • Fixed bug breaking compatibility with certain devices (#​701)

Full Changelog: naptha/tesseract.js@v4.0.1...v4.0.2

v4.0.1

Compare Source

What's Changed

  • Running recognize or detect with invalid image argument now throws error message (#​699)
  • Fixed bug with custom langdata paths (#​697)

New Contributors

Full Changelog: naptha/tesseract.js@v4.0.0...v4.0.1

v4.0.0

Compare Source

Breaking Changes

  1. createWorker is now async
    1. In most code this means worker = Tesseract.createWorker() should be replaced with worker = await Tesseract.createWorker()
    2. Calling with invalid workerPath or corePath now produces error/rejected promise (#​654)
  2. worker.load is no longer needed (createWorker now returns worker pre-loaded)
  3. getPDF function replaced by pdf recognize option (#​488)
    1. This allows PDFs to be created when using a scheduler
    2. See browser and node examples for usage

Major New Features

  1. Processed images created by Tesseract can be retrieved using imageColor, imageGrey, and imageBinary options (#​588)
    1. See image-processing.html example for usage
  2. Image rotation options rotateAuto and rotateRadians have been added, which significantly improve accuracy on certain documents
    1. See Issue #​648 example of how auto-rotation improves accuracy
    2. See image-processing.html example for usage of rotateAuto option
  3. Tesseract parameters (usually set using worker.setParameters) can now be set for single jobs using worker.recognize options (#​665)
    1. For example, a single job can be set to recognize only numbers using worker.recognize(image, {tessedit_char_whitelist: "0123456789"})
    2. As these settings are reverted after the job, this allows for using different parameters for specific jobs when working with schedulers
  4. Initialization parameters (e.g. load_system_dawg, load_number_dawg, and load_punc_dawg) can now be set (#​613)
    1. The third argument to worker.initialize now accepts either (1) an object with key/value pairs or (2) a string containing contents to write to a config file
    2. For example, both of these lines set load_number_dawg to 0:
      1. worker.initialize('eng', "0", {load_number_dawg: "0"});
      2. worker.initialize('eng', "0", "load_number_dawg 0");

Other Changes

  1. loadLanguage now resolves without error when language is loaded but writing to cache fails
    1. This allows for running in Firefox incognito mode using default settings (#​609)
  2. detect returns null values when OS detection fails rather than throwing error (#​526)
  3. Memory leak causing crashes fixed (#​678)
  4. Cache corruption should now be much less common (#​666)

New Contributors

Full Changelog: naptha/tesseract.js@v3.0.3...v4.0.0


Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR has been generated by Mend Renovate. View repository job log here.

@renovate renovate bot force-pushed the renovate/tesseract.js-4.x branch from 68316b2 to 0b9f888 Compare April 21, 2023 02:42
@renovate renovate bot force-pushed the renovate/tesseract.js-4.x branch from 0b9f888 to 86d81a7 Compare April 21, 2023 02:43
@pdx1204 pdx1204 merged commit 7a27b4c into main Apr 21, 2023
@renovate renovate bot deleted the renovate/tesseract.js-4.x branch April 21, 2023 02:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant