When globally aligning sequences that differ substantially, combinatorial explosion can quickly lead to excessive runtime memory consumption in the current implementation. And it is not always easy to detect those cases by score heuristics in a prior backtrace=False pass.
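To illustrate the kind of blow-up meant here, the following is a minimal, self-contained sketch that counts co-optimal global alignments during the scoring pass. It does not use the library; it assumes the memory growth comes from the backtrace enumerating every equally scored alignment. The function name `count_optimal_alignments` and the scoring values are hypothetical, chosen only for illustration.

```python
def count_optimal_alignments(a, b, match=2, mismatch=-1, gap=-2):
    """Return (best score, number of co-optimal global alignments).

    Plain Needleman-Wunsch with linear gap costs. Alongside each score
    cell we keep the number of optimal paths that reach it, so the count
    is known without materializing any alignment.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    count = [[0] * (m + 1) for _ in range(n + 1)]
    count[0][0] = 1
    # First column and first row: only gaps are possible, one path each.
    for i in range(1, n + 1):
        score[i][0] = i * gap
        count[i][0] = 1
    for j in range(1, m + 1):
        score[0][j] = j * gap
        count[0][j] = 1
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            diag = score[i - 1][j - 1] + sub
            up = score[i - 1][j] + gap
            left = score[i][j - 1] + gap
            best = max(diag, up, left)
            score[i][j] = best
            # Sum the path counts of every predecessor that ties for best.
            count[i][j] = (
                (count[i - 1][j - 1] if diag == best else 0)
                + (count[i - 1][j] if up == best else 0)
                + (count[i][j - 1] if left == best else 0)
            )
    return score[n][m], count[n][m]


if __name__ == "__main__":
    print(count_optimal_alignments("aaaa", "aa"))
```

The score alone says nothing about how many alignments share it, which is one reason a score-only pass is a weak predictor. A count like this one could be a cheaper signal, since it needs the same two-dimensional table as the scoring pass.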
I believe these should be added:
a package-exposed variable with a default limit (perhaps relative to the sequences' length)
an optional parameter with an override limit to be able to control the quality-performance trade-off.
(The limit could be based on stack depth or number of alternatives, for example.)
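As a rough sketch of what a limit on the number of alternatives could look like, here is a standalone global aligner whose backtrace stops after a fixed number of alignments. This is not the library's code or API: `align_capped`, `max_alignments` and the returned `truncated` flag are hypothetical names, and the scoring values are arbitrary.

```python
def align_capped(a, b, max_alignments=10, match=2, mismatch=-1, gap=-2):
    """Global alignment returning at most `max_alignments` optimal results.

    Returns (score, alignments, truncated), where `truncated` is True if
    the backtrace found more optimal alignments than the cap allowed.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    alignments = []
    truncated = False
    # Explicit stack instead of recursion: depth-first, so the number of
    # pending states stays small and we can stop as soon as the cap is hit.
    stack = [(n, m, "", "")]
    while stack:
        i, j, top, bottom = stack.pop()
        if i == 0 and j == 0:
            if len(alignments) >= max_alignments:
                truncated = True
                break
            alignments.append((top, bottom))
            continue
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                stack.append((i - 1, j - 1, a[i - 1] + top, b[j - 1] + bottom))
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            stack.append((i - 1, j, a[i - 1] + top, "-" + bottom))
        if j > 0 and score[i][j] == score[i][j - 1] + gap:
            stack.append((i, j - 1, "-" + top, b[j - 1] + bottom))
    return score[n][m], alignments, truncated


if __name__ == "__main__":
    best, found, cut = align_capped("aaaa", "aa", max_alignments=3)
    print(best, len(found), cut)
```

A package-level default could then supply `max_alignments`, for instance derived from the sequence lengths, with the per-call argument acting as the override. Reporting `truncated` lets the caller tell a complete result from a capped one.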
Example: I am trying to align OCRed images of German Fraktur script with their corresponding ground truth text. Sometimes the OCR fails miserably, like so:
Mitreden andrer 274. Günſtiger Eindruck der Staatsrathsſitzungen 274. (original line)
*0obe-ondrer '? '-änſiger Eindrue der Torerotheflgg,, (OCR result)
In this case, StrictGlobalSequenceAligner tries to take more than 20 GB RSS (at which point I aborted the run).