The first file uploaded was for predicting tags on StackOverflow using Linear Models
The second file uploaded was NER(Named Entity Recognition) on twitter using BiDirectional LSTM. It is not the best, achieving around 90% precision on train data set and 50% on test data set, with similar on validation. However implementing a hybrid model of LSTM, CNN and CRF(Conditional Random Field) may prove beneficial and I will try it soon (following the paper https://arxiv.org/abs/1603.01354)
The third file uploaded was to find duplicate questions on StackOverflow. It utilised the StarSpace embeddings (by Facebook) for this task.
-
Notifications
You must be signed in to change notification settings - Fork 0
EeshaanJain/natural-language-processing-hse
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published