Skip to content

Latest commit

 

History

History
53 lines (37 loc) · 976 Bytes

features.rst

File metadata and controls

53 lines (37 loc) · 976 Bytes

Feature Extraction

Feature Extraction Utilitites

webstruct.feature_extraction

HtmlToken

HtmlTokenizer

HtmlFeatureExtractor

Predefined Feature Functions

webstruct.features

webstruct.features.token_features

webstruct.features.data_features

webstruct.features.block_features

webstruct.features.global_features

Gazetteer Support

webstruct.gazetteers.features

webstruct.gazetteers.geonames