Using LDA to uncover proto-registers = pregisters (SFB1412/A04 Humboldt-Universität)

rsling/pregisters

Pregisters

Using LDA to uncover proto-registers = pregisters (SFB1412/A04 Humboldt-Universität)

In this project, we use Latent Dirichlet Allocation (LDA) to uncover proto-registers (pregisters) in large corpora using lexical and grammatical surface features. This is a much more plausible approach than Douglas Biber's Multi-Dimensional Analysis (MDA), because it allows for a probabilistic many-to-many mapping between documents and pregisters as well as between pregisters and features.
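The many-to-many mapping can be illustrated with a minimal sketch of the generative assumption behind LDA. This is not the project's code and it does not fit a model: it only shows that each document gets its own mixture over pregisters and each pregister its own distribution over features. The feature names, the function names `sample_dirichlet` and `generate_corpus`, and the Dirichlet parameter 0.5 are hypothetical choices made for illustration, and treating surface features as countable tokens is a simplifying assumption.

```python
import random

# Hypothetical surface features, invented for illustration only.
FEATURES = ["passive", "first_person_pronoun", "past_tense", "modal_verb"]


def sample_dirichlet(alpha, rng):
    """Draw one probability vector from a Dirichlet via normalized Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def generate_corpus(n_docs, n_pregisters, features, doc_length, seed=0):
    """Sample documents from the LDA generative story (no inference involved)."""
    rng = random.Random(seed)
    # phi[k]: distribution over features for pregister k
    # (the pregister-to-feature mapping).
    phi = [sample_dirichlet([0.5] * len(features), rng)
           for _ in range(n_pregisters)]
    thetas, docs = [], []
    for _ in range(n_docs):
        # theta: this document's mixture over pregisters
        # (the document-to-pregister mapping).
        theta = sample_dirichlet([0.5] * n_pregisters, rng)
        doc = []
        for _ in range(doc_length):
            # Pick a pregister for this feature occurrence, then a feature from it.
            k = rng.choices(range(n_pregisters), weights=theta)[0]
            doc.append(rng.choices(features, weights=phi[k])[0])
        thetas.append(theta)
        docs.append(doc)
    return phi, thetas, docs


phi, thetas, docs = generate_corpus(
    n_docs=3, n_pregisters=2, features=FEATURES, doc_length=20
)
for theta in thetas:
    print([round(p, 3) for p in theta])
```

Every row of `thetas` and of `phi` is a probability vector, so a document can belong to several pregisters to different degrees, and a feature can be characteristic of several pregisters. Fitting an LDA model to real data amounts to estimating these two sets of distributions from observed feature counts.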

The project builds on previous work from the COW initiative, where we developed CoREX, a surface feature extractor, which in turn is based on COWTek16 – the COW annotation pipeline – for German.

Investigators (alphabetically):

Felix Bildhauer

Elizabeth Pankratz

Roland Schäfer
