dsd-sztaki-hu/huSocC
HuSocC is a multilabel classifier for Hungarian social science interviews.
Labels are assigned from the taxonomy in taxonomy.tsv.
The method uses textacy with HuSpaCy and a mapping of keywords to topics (tax-kw.tsv).
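The keyword-to-topic step can be illustrated with a minimal sketch. It is not the code in husocc.py: it assumes a two-column `keyword<TAB>topic` layout for the mapping file (the real layout of tax-kw.tsv may differ), takes an already extracted keyword list instead of running textacy and HuSpaCy, and uses hypothetical names (`load_keyword_map`, `classify`, `min_hits`) and invented sample rows.

```python
import csv
import io
from collections import Counter, defaultdict


def load_keyword_map(tsv_file):
    """Read 'keyword<TAB>topic' rows into a keyword -> set-of-topics mapping.

    The two-column layout is an assumption, not the documented format.
    """
    mapping = defaultdict(set)
    for row in csv.reader(tsv_file, delimiter="\t"):
        if len(row) >= 2:
            mapping[row[0].strip().lower()].add(row[1].strip())
    return mapping


def classify(keywords, mapping, min_hits=1):
    """Return every topic matched by at least min_hits keywords (multilabel)."""
    hits = Counter()
    for kw in keywords:
        for topic in mapping.get(kw.lower(), ()):
            hits[topic] += 1
    return sorted(topic for topic, n in hits.items() if n >= min_hits)


# Invented sample rows standing in for tax-kw.tsv.
sample_map = io.StringIO("munka\tEmployment\niskola\tEducation\ntanár\tEducation\n")
mapping = load_keyword_map(sample_map)

# In the real tool the keywords would come from textacy/HuSpaCy extraction.
labels = classify(["iskola", "tanár", "munka", "család"], mapping)
print(labels)
```

Because each topic is scored independently, a text can receive several labels at once; raising `min_hits` trades recall for precision.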
To use it directly on interview text, run:
python husocc.py sample.txt
To use it with Annif, install husocc-backend.py as an Annif backend and add a new project configuration to projects.cfg like this:
[husocc]
name=HuSocC
language=hu
backend=husocc
analyzer=snowball(hungarian)
vocab=husocc-vocab
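Before starting Annif, the project section can be sanity-checked with Python's standard `configparser`, since the configuration above uses INI syntax. This is only a sketch: the `required` key list is an assumption made for illustration, not Annif's own validation logic.

```python
import configparser

# The project section from above, inlined so the example is self-contained.
PROJECT_CFG = """
[husocc]
name=HuSocC
language=hu
backend=husocc
analyzer=snowball(hungarian)
vocab=husocc-vocab
"""

parser = configparser.ConfigParser()
parser.read_string(PROJECT_CFG)
project = parser["husocc"]

# Keys assumed to be needed for this project; adjust to your setup.
required = ("name", "language", "backend", "vocab")
missing = [key for key in required if key not in project]

print("missing keys:", missing)
print("backend:", project["backend"], "| vocab:", project["vocab"])
```

To check a real file, replace `parser.read_string(PROJECT_CFG)` with `parser.read("projects.cfg")`.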
Then load the vocabulary; after that, the other Annif commands (suggest, eval, etc.) can be used:
annif load-vocab --language=hu husocc-vocab husocc-vocab.tsv