Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TextGrid: multi-line parsing, escaped quote parsing #55

Merged
merged 3 commits into from
Jan 23, 2024

Conversation

myrix
Copy link
Contributor

@myrix myrix commented Jan 22, 2024

These are fixes for some problem encountered while parsing some of our TextGrids:

  1. Sometimes broken line-by-line decoding of n-byte encodings like UTF-16.
  2. Inability to parse texts with newlines in them.
  3. Inner text quotes escaped by doubling remaining doubled.

PR a) reworks line-by-line decoding of text format TextGrids to whole file decoding, b) enables parsing of multiline texts containing arbitrary number of newlines by repeatedly looking at more and more lines until the whole text is completed, and c) properly turns double-escaped quotes inside each read text back into single quotes after the text is read.

@dopefishh
Copy link
Owner

Thank you very much. This may fix #52 also.

@dopefishh dopefishh merged commit ebb0f90 into dopefishh:master Jan 23, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants