Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cleanxml #1259

Merged
merged 5 commits into from Mar 20, 2022
Merged

Cleanxml #1259

merged 5 commits into from Mar 20, 2022

Conversation

AngledLuffa
Copy link
Contributor

Add cleanxml as an option to the tokenize annotator.

Will do ssplit next

@AngledLuffa AngledLuffa force-pushed the cleanxml branch 3 times, most recently from 7ca5fdc to 1742849 Compare March 16, 2022 21:49
@AngledLuffa AngledLuffa force-pushed the cleanxml branch 2 times, most recently from aa71022 to e57afc1 Compare March 16, 2022 22:30
@AngledLuffa AngledLuffa force-pushed the cleanxml branch 2 times, most recently from 55595d3 to 4011cfa Compare March 17, 2022 20:00
For the TokenizerAnnotatorITest, this uppdates the behavior of the test
to reflect that the newlines are now being consumed by the inner ssplit
@AngledLuffa AngledLuffa merged commit b0d1e46 into dev Mar 20, 2022
@AngledLuffa AngledLuffa deleted the cleanxml branch March 20, 2022 01:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant