-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix OCR errors option ? #41
Comments
@Tentacule Just bumping this one. I've run a bit of a test with the latest version of PgsToSrt and Subtitle Edit w/ Tesseract 5.3.3 Command I'm using for PgsToSrt is: Any ideas here? |
I won't add "Fix OCR errors" for now because this functionality is not included in LibSE. I have done some tests, it looks like an issue on windows, it's working fine when run on linux. I'll investigate. |
I just flicked you an email. I don't think "Fix OCR errors" will make a difference anyway as I had it disabled in SE (see first screenshot) and it still converted the PGS subs almost perfectly. Issue is something else. Thanks for looking into it. |
There was an isssue in windows Tesseract dll, I tried another one and it looks good now. Here is a new release with this change: PgsToStr-1.4.5.zip |
Can confirm that 1.4.5 fixes it. Does a much better job at conversion with no random gibberish to be seen. Command: dotnet "PgsToSrt-1.4.5\\PgsToSrt.dll" --input "file.mkv" --tracklanguage eng --tesseractdata "C:\\Program Files\\Tesseract-OCR\\tessdata" --tesseractlanguage eng
dotnet "PgsToSrt-1.4.5\\PgsToSrt.dll" --input "file.mkv" --tracklanguage eng --tesseractdata "C:\\Program Files\\Tesseract-OCR\\tessdata_best" --tesseractlanguage eng On the test I ran with the english subtitles for the movie Blade, using |
Do you plan on adding "Fix OCR errors" like subtitle edit option to resolve badly OCRd text ?
The text was updated successfully, but these errors were encountered: