Updated documentation
tsproisl committed Jun 17, 2020
1 parent e1ea43f commit 01dc552
Showing 2 changed files with 8 additions and 2 deletions.
5 changes: 4 additions & 1 deletion doc/build/markdown/somajo.md
@@ -32,7 +32,10 @@ Tokenization and sentence splitting.


* **xml_sentences** (*str, (default=None)*) – Delimit sentences by XML tags of this name
-(`xml_sentences='s'` → <s>…</s>).
+(`xml_sentences='s'` → <s>…</s>). When used with XML input,
+this might lead to minor changes to the original tags to
+guarantee well-formed output (tags might need to be closed and
+re-opened at sentence boundaries).


#### tokenize_text(paragraphs, \*, parallel=1)
5 changes: 4 additions & 1 deletion somajo/somajo.py
@@ -24,7 +24,10 @@ class SoMaJo:
Perform sentence splitting in addition to tokenization.
xml_sentences : str, (default=None)
Delimit sentences by XML tags of this name
-(``xml_sentences='s'`` → <s>…</s>).
+(``xml_sentences='s'`` → <s>…</s>). When used with XML input,
+this might lead to minor changes to the original tags to
+guarantee well-formed output (tags might need to be closed and
+re-opened at sentence boundaries).
"""

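The docstring added in this commit says that tags "might need to be closed and re-opened at sentence boundaries". The following is a minimal, standard-library-only sketch of that idea. It is not SoMaJo's actual implementation: the function `wrap_sentences` and its input format (pre-split sentences given as lists of token strings, with tags as separate tokens) are hypothetical and chosen only for illustration. It assumes the input markup is already well-formed.

```python
import re

# Matches an opening or closing tag token; group 1 is "/" for closing tags,
# group 2 is the tag name. Illustrative only, not a full XML tokenizer.
TAG = re.compile(r"<(/?)([^\s<>/]+)[^<>]*>")


def wrap_sentences(sentences, tag="s"):
    """Wrap each sentence in <tag>…</tag> and keep the result well-formed.

    Hypothetical helper: `sentences` is a list of token lists. Tags that are
    still open at a sentence boundary are closed before </tag> and re-opened
    after the next <tag>.
    """
    open_tags = []  # original opening tags that are currently open
    out = []
    for sentence in sentences:
        out.append(f"<{tag}>")
        out.extend(open_tags)  # re-open tags carried over the boundary
        for token in sentence:
            m = TAG.fullmatch(token)
            if m and m.group(1):
                open_tags.pop()  # closing tag (input assumed well-formed)
            elif m and not token.endswith("/>"):
                open_tags.append(token)  # opening tag, not self-closing
            out.append(token)
        # Close whatever is still open, innermost first.
        for opening in reversed(open_tags):
            out.append(f"</{TAG.fullmatch(opening).group(2)}>")
        out.append(f"</{tag}>")
    return out


# An <i> element that spans a sentence boundary.
sentences = [
    ["This", "<i>", "is", "one", "."],
    ["And", "</i>", "another", "."],
]
print(" ".join(wrap_sentences(sentences)))
```

In this sketch the single `<i>…</i>` element of the input ends up as two `<i>` elements, one per sentence, which is the kind of "minor change to the original tags" the docstring warns about.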
