
Given an IMDB dataset with reviews, we need to estimate sentiment


NikitaGordia/SentimentAnalysis


Sentiment analysis

Problem: Given an IMDB dataset of reviews, estimate the sentiment of each review.

Project:

root
├─── data
│   ├─── dev
│   │   │   feed_count.npz
│   │   │   feed_tfidf_lite.npz
│   │   submission.csv
│   │   test.csv
│   │   train.csv
└─── notebooks
    │   data.ipynb
    │   model.ipynb
    │   model_ensamble.ipynb

Notebooks

  1. data.ipynb - data cleaning and efficient preprocessing.
  2. model.ipynb - model selection.
  3. model_ensemble.ipynb - estimates the performance improvement gained from ensemble techniques.
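
To illustrate what comparing single models against an ensemble can look like, here is a minimal sketch. It assumes scikit-learn; the README does not say which models or ensemble techniques the notebooks use, so logistic regression, naive Bayes, soft voting, and the toy reviews below are all hypothetical stand-ins.

```python
from sklearn.ensemble import VotingClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy reviews (hypothetical data, not taken from train.csv).
positive = [
    "great film", "loved the acting", "wonderful story",
    "brilliant and moving", "a great and wonderful cast",
    "loved every brilliant scene",
]
negative = [
    "terrible film", "hated the acting", "boring story",
    "dull and tedious", "a terrible and boring cast",
    "hated every dull scene",
]
texts = positive + negative
labels = [1] * len(positive) + [0] * len(negative)

# Candidate estimators: two single models and a soft-voting ensemble of both.
candidates = {
    "logreg": LogisticRegression(max_iter=1000),
    "naive_bayes": MultinomialNB(),
    "soft_vote": VotingClassifier(
        estimators=[
            ("logreg", LogisticRegression(max_iter=1000)),
            ("naive_bayes", MultinomialNB()),
        ],
        voting="soft",  # average the predicted class probabilities
    ),
}

# Score every candidate with the same 3-fold cross-validation so the
# ensemble is compared against its members on equal terms.
scores = {}
for name, model in candidates.items():
    pipeline = make_pipeline(TfidfVectorizer(), model)
    scores[name] = cross_val_score(pipeline, texts, labels, cv=3).mean()

for name, score in scores.items():
    print(f"{name}: mean accuracy = {score:.3f}")
```

Wrapping the vectorizer and the model in one pipeline keeps the vocabulary fitted on the training folds only, which avoids leaking test-fold text into the features.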

Data

  1. feed_count.npz - preprocessed documents vectorized with CountVectorizer.
  2. feed_tfidf_lite.npz - preprocessed documents vectorized with TF-IDF.
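
Files like these could be produced roughly as follows. This is a sketch under assumptions: it uses scikit-learn's `CountVectorizer` and `TfidfVectorizer` with default settings and stores the sparse matrices with `scipy.sparse.save_npz`. The README does not state the vectorizer parameters or how the `.npz` files were written, and the toy documents are hypothetical.

```python
import os
import tempfile

from scipy import sparse
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

# Toy stand-ins for cleaned review texts (hypothetical data).
docs = [
    "a wonderful and moving film",
    "a dull and boring film",
    "wonderful acting wonderful story",
]

# Raw term counts: one row per document, one column per vocabulary term.
count_vec = CountVectorizer()
X_count = count_vec.fit_transform(docs)

# TF-IDF weights: counts rescaled so terms shared by many documents weigh less.
tfidf_vec = TfidfVectorizer()
X_tfidf = tfidf_vec.fit_transform(docs)

# Persist the sparse matrices; the file names mirror those under data/dev,
# but the storage format is an assumption.
out_dir = tempfile.mkdtemp()
count_path = os.path.join(out_dir, "feed_count.npz")
tfidf_path = os.path.join(out_dir, "feed_tfidf_lite.npz")
sparse.save_npz(count_path, X_count)
sparse.save_npz(tfidf_path, X_tfidf)

# Reload to confirm the round trip preserves the matrix.
X_count_loaded = sparse.load_npz(count_path)
print(X_count.shape, X_tfidf.shape)
```

Saving the vectorized matrices once means the modelling notebooks can load features directly instead of repeating the preprocessing on every run.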
