Code Switching

Aim: detect the language of each word and use this information for POS tagging, so that words are tagged according to their own language even when they occur within text in another language.

Language detection

Training and testing are done in a 10-fold cross-validation setting. Results so far: 95% accuracy.
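A minimal sketch of how a 10-fold cross-validation accuracy could be computed. It is not the code from this repository: the names `k_fold_accuracy`, `train_fn` and `majority_baseline` are hypothetical, the accuracy is assumed to be measured per token, and the folds are assumed to be split at sentence level. The majority-label baseline only stands in for the real model.

```python
from collections import Counter
from typing import Callable, List, Tuple

Sentence = List[Tuple[str, str]]           # (token, language label) pairs
Predictor = Callable[[List[str]], List[str]]


def k_fold_accuracy(sentences: List[Sentence],
                    train_fn: Callable[[List[Sentence]], Predictor],
                    k: int = 10) -> float:
    """Mean per-fold token accuracy over k folds (assumes non-empty sentences)."""
    if len(sentences) < k:
        raise ValueError("need at least k sentences")
    fold_scores = []
    for fold in range(k):
        # Every k-th sentence goes to the test fold, the rest is training data.
        test = sentences[fold::k]
        train = [s for i, s in enumerate(sentences) if i % k != fold]
        predict = train_fn(train)
        correct = total = 0
        for sent in test:
            tokens = [tok for tok, _ in sent]
            gold = [lab for _, lab in sent]
            pred = predict(tokens)
            correct += sum(p == g for p, g in zip(pred, gold))
            total += len(gold)
        fold_scores.append(correct / total)
    return sum(fold_scores) / k


def majority_baseline(train: List[Sentence]) -> Predictor:
    """Placeholder model: always predicts the most frequent training label."""
    counts = Counter(lab for sent in train for _, lab in sent)
    label = counts.most_common(1)[0][0]
    return lambda tokens: [label] * len(tokens)
```

Usage: `k_fold_accuracy(data, majority_baseline)` returns the baseline score; a real model would be plugged in by passing a different `train_fn` that trains on the given sentences and returns a prediction function.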

Algorithm used: CRF
Features: output and confidence of the Middle English tree tagger, output and confidence of the Latin tree tagger, character bigrams, character trigrams
Context: window of 5 on all features
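The feature set described above can be sketched as follows. This is an illustration under stated assumptions, not the repository's implementation: the two taggers are represented by hypothetical callables returning a `(tag, confidence)` pair, the "window of 5" is assumed to mean the current token plus two tokens on each side, and all function names are invented for the example. The output is one feature dictionary per token, a format that common CRF toolkits accept.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tagger interface: token -> (predicted tag, confidence).
Tagger = Callable[[str], Tuple[str, float]]


def char_ngrams(word: str, n: int) -> List[str]:
    """Character n-grams of a lowercased word, with ^ and $ as boundary marks."""
    padded = f"^{word.lower()}$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def token_features(token: str, me_tagger: Tagger, la_tagger: Tagger) -> Dict[str, float]:
    """Features of one token: both tagger outputs plus character bi-/trigrams."""
    me_tag, me_conf = me_tagger(token)
    la_tag, la_conf = la_tagger(token)
    feats = {
        f"me_tag={me_tag}": 1.0,
        "me_conf": me_conf,
        f"la_tag={la_tag}": 1.0,
        "la_conf": la_conf,
    }
    for n in (2, 3):
        for gram in char_ngrams(token, n):
            feats[f"c{n}={gram}"] = 1.0
    return feats


def sentence_features(tokens: List[str], me_tagger: Tagger, la_tagger: Tagger,
                      window: int = 5) -> List[Dict[str, float]]:
    """Copy every token's features into its neighbours, prefixed by the offset."""
    half = window // 2  # assumption: window of 5 means offsets -2 .. +2
    per_token = [token_features(t, me_tagger, la_tagger) for t in tokens]
    result = []
    for i in range(len(tokens)):
        feats: Dict[str, float] = {}
        for off in range(-half, half + 1):
            j = i + off
            if 0 <= j < len(tokens):
                for name, value in per_token[j].items():
                    feats[f"{off:+d}:{name}"] = value
            else:
                feats[f"{off:+d}:PAD"] = 1.0  # position outside the sentence
        result.append(feats)
    return result
```

Applying the window to all features at once, as here, keeps the extraction simple but multiplies the number of features by the window size; whether the original setup does the same is not stated in this README.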

POS tagging

Waiting for training data.

sarschu/Code_Switching