Hungarian NLP pipeline for social media text analysis (TrendMiner project)

mmihaltz/trendminer-hunlp

Trendminer Hungarian Processing Pipeline (trendminer-hunlp): a suite of scripts that perform Hungarian NLP processing steps (tokenization, POS tagging, morphological analysis, lemmatization). The scripts extend existing tools (huntoken, hunmorph, hunpos) to handle some of the challenges posed by the language of social media messages, which differs from the standard-language texts (generally newswire) used to develop and train the existing tools.
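To illustrate the kind of challenge social media text poses for a newswire-trained tokenizer, here is a minimal sketch that separates URLs, @mentions, hashtags and simple emoticons from ordinary text before any further processing. It is not taken from this repository: the function name `protect_special_tokens`, the regular expressions, and the category labels are all hypothetical, and the actual scripts may handle these cases differently.

```python
import re

# Hypothetical patterns for social-media-specific tokens.
# These are illustrative assumptions, not the rules used by trendminer-hunlp.
SPECIAL = re.compile(
    r"(?P<url>https?://\S+)"
    r"|(?P<mention>@\w+)"
    r"|(?P<hashtag>#\w+)"
    r"|(?P<emoticon>[:;=][-']?[)(DPp])"
)


def protect_special_tokens(text):
    """Split text into (chunk, kind) pairs.

    kind is 'url', 'mention', 'hashtag' or 'emoticon' for special tokens,
    and 'text' for everything in between. Only the 'text' chunks would be
    passed on to a standard tokenizer.
    """
    pieces = []
    pos = 0
    for match in SPECIAL.finditer(text):
        # Ordinary text preceding the special token, if any.
        chunk = text[pos:match.start()].strip()
        if chunk:
            pieces.append((chunk, "text"))
        # lastgroup is the name of the alternative that matched.
        pieces.append((match.group(), match.lastgroup))
        pos = match.end()
    # Ordinary text after the last special token, if any.
    tail = text[pos:].strip()
    if tail:
        pieces.append((tail, "text"))
    return pieces


if __name__ == "__main__":
    for chunk, kind in protect_special_tokens("Szia @anna nézd: http://example.com :) #jó"):
        print(kind, chunk, sep="\t")
```

Keeping the special tokens apart means a downstream tokenizer and tagger only see ordinary text, while the special tokens can be given fixed labels instead of being split on punctuation.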

Trendminer Project:

Author: Márton Miháltz mmihaltz@gmail.com
