Skip to content

module__SimpleProjector2

Robert Bossy edited this page Jul 27, 2017 · 1 revision

#org.bibliome.alvisnlp.modules.trie.SimpleProjector2

Synopsis

Deprecated alias for SimpleProjector.

This module is obsolete, superceded by org.bibliome.alvisnlp.modules.trie.TabularProjector

Description

Parameters

Optional

Type: SourceStream

Source of the dictionary.

Optional

Type: String

Name of the layer that contains the match annotations.

Optional

Type: String[]]

Target features in match annotations. The values are the columns in the matched entry line.

Optional

Type: Mapping

Constant features to add to each annotation created by this module

Optional

Type: OutputFile

If set, org.bibliome.alvisnlp.modules.trie.SimpleProjector2 writes the compiled dictionary to the specified file.

Optional

Type: InputFile

If set, read the compiled dictionary from the specified files. Compiled dictionaries are generally faster for large dictionaries.

Default value: false

Type: Boolean

Either the match allows case substitution on all characters in words that are all upper case.

Default value: false

Type: Boolean

Either the match allows arbitrary suppression of whitespace characters in the subject. For instance, the contents aminoacid matches the entry amino acid.

Default value: false

Type: Boolean

Either the match allows case substitutions on all characters.

Default value: true

Type: Expression

Only process document that satisfy this filter.

Default value: false

Type: Boolean

Either the match allows dicacritics substitutions on all characters. For instance the contents acide amine matches the entry acide aminé.

Default value: false

Type: Boolean

Either to treat dash characters (-) as whitespace characters if allowJoined is true. For instance, the contents aminoacid matches the entry amino-acid.

Default value: 0

Type: Integer[]]

Specifies the key column index (starting at 0).

Default value: false

Type: Boolean

Either the match allows case substitution on the first character of the entry key.

Default value: all

Type: MultipleEntryBehaviour

Specifies the behavious of org.bibliome.alvisnlp.modules.trie.SimpleProjector2 if dictFile contains several entries with the same key.

Default value: true

Type: Expression

Process only sections that satisfy this filter.

Default value:

Type: Character

Specifies the character that separates columns in dictFile.

Default value: false

Type: Boolean

In dictFile, skip lines that contain only whitespace characters.

Default value: false

Type: Boolean

Either the match allows insertion of consecutive whitespace characters in the subject. For instance, the contents amino acid matches the entry amino acid.

Default value: false

Type: Boolean

In dictFile, skip empty lines.

Default value: false

Type: Boolean

Either the match allows arbitrary insertion of whitespace characters in the subject. For instance, the contents amino acid matches the entry aminoacid.

Default value: true

Type: Boolean

Either to check that every line in dictFile has the same number of columns as the number of features specified in entryFeatureNames.

Default value: WORD

Type: Subject

Specifies the contents to match.

Default value: false

Type: Boolean

Either to trim column values in dictFile from leading and trailing whitespace characters.

Default value: false

Type: Boolean

Either the match allows case substitution on the first character of words.

Clone this wiki locally