NLP_learning

一些传统NLP技术学习代码仓库

拼写纠错(斯坦福NLP公开课)

使用Markov Chain，结合贝叶斯概率，运用统计的方法实现英语单词的拼写纠错，该部分代码参考自github作者Rshcaroline的Spell Correction项目，我在该代码上做了一些重构和修改。

基于机器学习的文本分类(Kaggle竞赛Sentiment Analysis on Movie Reviews)

使用N-Gram和Bag-of-words实现对自然语言文本的特征提取，使用numpy实现线性回归模型，并实验验证了使用shuffle、batch、mini-batch等权重更新方法的区别。

词向量的计算(斯坦福的stanfordSentimentTreebank数据集)

使用pytorch实现Skip-Gram模型，并借此训练word2vec词向量。

基于深度学习的文本分类(Kaggle竞赛Sentiment Analysis on Movie Reviews)

使用预训练的GloVe词向量提取文本特征，使用pytorch实现了TextCNN和TextRNN两种模型。TextCNN部分参考了论文Convolutional Neural Networks for Sentence Classification，TextRNN模型使用了RNN、LSTM、Bi-LSTM实现。

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
CNN_RNN_text_classification_with_GloVe		CNN_RNN_text_classification_with_GloVe
ML_text_classification_with_N_Gram		ML_text_classification_with_N_Gram
Word2Vec		Word2Vec
spell_correction		spell_correction
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP_learning

About

Releases

Packages

Languages

xiningin/NLP_learning

Folders and files

Latest commit

History

Repository files navigation

NLP_learning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages