Skip to content

Deep learning & NLP based project, using word2vec model for creating word embeddings and using it to feed to LSTM neural network to predict the essay score awarded to a student

Notifications You must be signed in to change notification settings

taozeze/Automated_Essay_Scorer

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Automated_Essay_Scorer

Deep learning & NLP based project, using word2vec model for creating word embeddings and using it to feed to LSTM neural network to predict the essay score awarded to a student.

Pre-requisites

  • Pandas
  • Numpy
  • Natural language toolkit (NLTK)
  • Gensim 3.5
  • Scikit-learn
  • Keras 2.2.2
  • Tensorflow 1.9

Models used in training

  • Linear Regressor
  • Gradient Boosting Regressor
  • Support Vector Regressor
  • Long Short Term Memory(LSTM)

Evaluation metric

  • Mean squared error
  • Variance
  • Cohen's Kappa score

Approach

Here, Word2vec model is used to generate word embeddings on which the whole model relies to predict the scores. Before generating word vectors, essays given in the essay sets are cleaned(removing stopwords, punctuations, special symbols etc.) and then word vectors are created so as to feed it to the word2vec model for generating word embeddings.After that, model is trained on various regression algorithms and each is avaluated on the basis of certain evaluation metric.

Installation

  1. Install all the packages which are mentioned above using the following command:
pip install keras
pip install numpy
pip install pandas
pip install sklearn
pip install gensim
pip install nltk
  1. Run the essay_scorer.py file in jupyter notebook.
  2. Have a cup of coffee and wait for the models to generate their respective mean squared error values, variances, cohen kappa scores.

Conclusion

Out of all the models that were trained and tested, LSTMs outperform all other models and hence the same is used to generate scores for the final dataset.

References

About

Deep learning & NLP based project, using word2vec model for creating word embeddings and using it to feed to LSTM neural network to predict the essay score awarded to a student

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 85.0%
  • Python 15.0%