Major structural changes (#16)
The major change here is that I updated RTFDE to only work with bytes and leave all encoding up to the downstream library. Please update your code as necessary! 

* Updated to only work with bytes.
* Added far greater Unicode support.
* Fixed various whitespace issues.
* Added proper htmlrtf support.
* Added support for extracting (but not parsing) binary data.
* Added support for surrogates which use raw Unicode values instead of 16-bit signed encoding.
* Added better handling when parsing Unicode HH replacement characters.
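
To illustrate the surrogate item above: RTF normally writes the argument of a `\u` control word as a signed 16-bit decimal, so code units above 32767 appear as negative numbers, while some writers emit the raw unsigned value instead. The sketch below shows one way to accept both forms and join a surrogate pair into a single character. The helper names `to_unsigned_16` and `combine_surrogates` are hypothetical and are not taken from RTFDE; this is a minimal illustration of the idea, not the library's implementation.

```python
def to_unsigned_16(value: int) -> int:
    """Map an RTF \\uN argument to an unsigned 16-bit code unit.

    Hypothetical helper: negative arguments are treated as the signed
    16-bit form of values above 32767; non-negative ones pass through.
    """
    return value + 0x10000 if value < 0 else value


def combine_surrogates(high: int, low: int) -> str:
    """Join a high/low surrogate pair (signed or raw) into one character."""
    high, low = to_unsigned_16(high), to_unsigned_16(low)
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid surrogate pair")
    # Standard UTF-16 decoding: 10 bits from each half, offset by 0x10000.
    code_point = 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)
    return chr(code_point)


# U+1F600 written with signed 16-bit arguments (\u-10179 \u-8704)
signed_form = combine_surrogates(-10179, -8704)
# The same character written with raw unsigned arguments (\u55357 \u56832)
raw_form = combine_surrogates(55357, 56832)

# Encoding the result is left to the caller, in keeping with the
# bytes-only approach described in this commit.
utf8_bytes = signed_form.encode("utf-8")
```

Both calls produce the same character, so a parser built this way does not need to know in advance which convention the RTF writer used.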
seamustuohy committed Jun 18, 2023
1 parent 06ce9b3 commit 1b29704
Showing 52 changed files with 10,877 additions and 563 deletions.
5 changes: 5 additions & 0 deletions .gitattributes
@@ -0,0 +1,5 @@
tests/test_data/html/multiple-encodings.rtf text eol=crlf
tests/test_data/rtf_parsing/surrogate_pairs.rtf text eol=crlf
tests/test_data/rtf_parsing/encapsulated_example.html text eol=crlf
tests/test_data/rtf_parsing/surrogates.rtf text eol=crlf
tests/test_data/rtf_parsing/small_template.rtf text eol=crlf
5 changes: 5 additions & 0 deletions .gitignore
@@ -129,3 +129,8 @@ dmypy.json

# Pyre type checker
.pyre/


# Testing
tests/test_data/personal_rtf_test_files/*.rtf
tests/test_data/personal_rtf_test_output_files/*.html
