PTB Tokenizer deletes last token #108

Closed
@garfieldnate

Description

(apologies for accidentally submitting before finishing typing)

The MaxentTagger is deleting the last token in the following input:

List<List<HasWord>> sentences = MaxentTagger
            .tokenizeText(new StringReader("1. Narakattaramukku"));

The resulting sentences variable contains only [[1, .]].
