Skip to content

Which one of five German authors can text be attributed to?

Notifications You must be signed in to change notification settings

taylorhawks/deutsch-nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 

Repository files navigation

deutsch-nlp

Overview

I took five german authors from the Kaggle German Literature dataset. Using a Multinomial Naive Bayes classifier with TF-IDF vectorization, I built a pipeline that takes in German text and produces a prediction.

Resources

To Do

  • add translation API to pipeline
  • improve recall for Kafka texts