- toxic_comment_classification_CNN_Keras.ipynb
- Approach: CNN with 3 convolutional layers, 3 pooling layers, and 1 dense layer (sketch below)
- Result: (10 epochs) training data: loss = 0.0503, accuracy = 0.9820; validation set: loss = 0.0898, accuracy = 0.9718
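A minimal Keras sketch of this architecture. The vocabulary size, sequence length, filter settings, and the 6-label sigmoid output are assumptions, not taken from the notebook:

```python
from tensorflow import keras
from tensorflow.keras import layers

max_words, max_len, n_labels = 20000, 200, 6  # assumed vocabulary size, padded length, label count

model = keras.Sequential([
    keras.Input(shape=(max_len,)),
    layers.Embedding(max_words, 128),
    layers.Conv1D(64, 3, activation="relu"),
    layers.MaxPooling1D(2),
    layers.Conv1D(64, 3, activation="relu"),
    layers.MaxPooling1D(2),
    layers.Conv1D(64, 3, activation="relu"),
    layers.GlobalMaxPooling1D(),                   # third pooling layer collapses the time axis
    layers.Dense(n_labels, activation="sigmoid"),  # one independent probability per toxicity label
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```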
- toxic_comment_classification_LSTM_Keras.ipynb
- Approach: 1 LSTM layer, 1 pooling layer, and 1 dense layer (sketch below)
- Result: (2 epochs) training data: loss = 0.0573, accuracy = 0.9800; validation set: loss = 0.0576, accuracy = 0.9800
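A minimal sketch, with the same assumed hyperparameters as the CNN sketch above:

```python
from tensorflow import keras
from tensorflow.keras import layers

max_words, max_len, n_labels = 20000, 200, 6  # assumed, as in the CNN sketch

model = keras.Sequential([
    keras.Input(shape=(max_len,)),
    layers.Embedding(max_words, 128),
    layers.LSTM(64, return_sequences=True),  # keep all timesteps so pooling has a sequence to reduce
    layers.GlobalMaxPooling1D(),             # the single pooling layer
    layers.Dense(n_labels, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```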
- toxic_comment_classification_BidirectionalLSTM_Keras.ipynb
- Approach: 1 Bidirectional-LSTM layer, 1 pooling layer, and 1 dense layer (sketch below)
- Result: (2 epochs) training data: loss = 0.0550, accuracy = 0.9805; validation set: loss = 0.0551, accuracy = 0.9803
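A minimal sketch; per the approach described above, the only change from the LSTM version is wrapping the recurrent layer in `Bidirectional`:

```python
from tensorflow import keras
from tensorflow.keras import layers

max_words, max_len, n_labels = 20000, 200, 6  # assumed, as in the sketches above

model = keras.Sequential([
    keras.Input(shape=(max_len,)),
    layers.Embedding(max_words, 128),
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),  # reads the text in both directions
    layers.GlobalMaxPooling1D(),
    layers.Dense(n_labels, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
```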
- MNSIT_Bidirectional-LSTM_Keras.ipynb
- Approach: 1 permute-dimension layer, 1 Bidirectional-LSTM layer, 1 pooling layer, and 1 dense layer (sketch below)
- Result: (5 epochs) training data: loss = 0.1219, accuracy = 0.9646; validation set: loss = 0.1239, accuracy = 0.9627
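A minimal sketch (unit counts are assumptions). Each 28x28 image is treated as a 28-step sequence; the `Permute` layer swaps the time and feature axes so the Bidirectional-LSTM reads pixel columns instead of rows:

```python
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(28, 28)),  # each image as a 28-step sequence of 28-pixel rows
    layers.Permute((2, 1)),       # the permute-dimension layer: read columns instead of rows
    layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
    layers.GlobalMaxPooling1D(),
    layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```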
- neural-machine-translation_seq2seq_Keras.ipynb
- Approach: a plain sequence-to-sequence model with an encoder-decoder architecture (sketch below)
- LSTM for encoder and decoder
- teacher forcing
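A minimal sketch of the training model (layer sizes and vocabularies are assumptions). Teacher forcing means the decoder receives the ground-truth previous target token at each step rather than its own prediction:

```python
from tensorflow import keras
from tensorflow.keras import layers

latent_dim, src_vocab, tgt_vocab = 256, 8000, 8000  # assumed sizes

# Encoder: keep only the final LSTM states as the summary of the source sentence
enc_inputs = keras.Input(shape=(None,))
enc_emb = layers.Embedding(src_vocab, latent_dim)(enc_inputs)
_, state_h, state_c = layers.LSTM(latent_dim, return_state=True)(enc_emb)

# Decoder: teacher forcing, conditioned on the encoder's final states
dec_inputs = keras.Input(shape=(None,))
dec_emb = layers.Embedding(tgt_vocab, latent_dim)(dec_inputs)
dec_outputs, _, _ = layers.LSTM(latent_dim, return_sequences=True, return_state=True)(
    dec_emb, initial_state=[state_h, state_c])
outputs = layers.Dense(tgt_vocab, activation="softmax")(dec_outputs)

model = keras.Model([enc_inputs, dec_inputs], outputs)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```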
- neural-machine-translation_seq2seq_attention_Keras.ipynb
- Approach: sequence-to-sequence model with an encoder-decoder architecture and attention (sketch below)
- Bidirectional-LSTM for encoder
- LSTM for decoder
- Attention with 2 dense layers
- teacher forcing
- Conclusion: Compared to the plain seq2seq model, adding attention increases translation accuracy
- It utilizes all of the encoder's hidden states instead of only the last one
- For each output word, attention tells the model which part of the input sequence to attend to
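A sketch of the training model. The alignment here uses Luong-style dot-product scoring, which may differ from the notebook's exact formulation; the two dense layers combine the attention context with the decoder state and produce the output distribution:

```python
from tensorflow import keras
from tensorflow.keras import layers

latent_dim, src_vocab, tgt_vocab = 128, 8000, 8000  # assumed sizes

# Bidirectional-LSTM encoder: return ALL hidden states, not just the last one
enc_inputs = keras.Input(shape=(None,))
enc_emb = layers.Embedding(src_vocab, latent_dim)(enc_inputs)
enc_outputs = layers.Bidirectional(
    layers.LSTM(latent_dim, return_sequences=True))(enc_emb)   # (batch, t_enc, 2*latent_dim)

# LSTM decoder with teacher forcing (ground-truth previous tokens as input)
dec_inputs = keras.Input(shape=(None,))
dec_emb = layers.Embedding(tgt_vocab, 2 * latent_dim)(dec_inputs)
dec_outputs = layers.LSTM(2 * latent_dim, return_sequences=True)(dec_emb)  # (batch, t_dec, 2*latent_dim)

# Attention: align each decoder step with every encoder step
scores = layers.Dot(axes=[2, 2])([dec_outputs, enc_outputs])   # (batch, t_dec, t_enc)
weights = layers.Activation("softmax")(scores)                 # weights over encoder positions
context = layers.Dot(axes=[2, 1])([weights, enc_outputs])      # (batch, t_dec, 2*latent_dim)

combined = layers.Concatenate()([context, dec_outputs])
combined = layers.Dense(2 * latent_dim, activation="tanh")(combined)  # dense layer 1
outputs = layers.Dense(tgt_vocab, activation="softmax")(combined)     # dense layer 2

model = keras.Model([enc_inputs, dec_inputs], outputs)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```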
- bABI_memory_network_Keras.ipynb
- Approach: a memory network for the two-supporting-facts task (sketch below)
- Conclusion: With 10,000 training samples and 1,000 testing samples, the model trained quickly for 30 epochs with high accuracy. Training data: loss = 0.1848, accuracy = 0.9438; testing data: loss = 0.3510, accuracy = 0.8900
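A compact sketch in the spirit of the Keras bAbI memory-network example (all sizes are assumptions): the story is embedded twice, matched against the question, and the attended memory is read out by an LSTM:

```python
from tensorflow import keras
from tensorflow.keras import layers

vocab_size, story_len, query_len = 40, 100, 6  # assumed sizes for the two-supporting-facts task

story = keras.Input(shape=(story_len,))
question = keras.Input(shape=(query_len,))

m = layers.Embedding(vocab_size, 64)(story)         # input memory       (story_len, 64)
c = layers.Embedding(vocab_size, query_len)(story)  # output memory      (story_len, query_len)
u = layers.Embedding(vocab_size, 64)(question)      # question embedding (query_len, 64)

# match each memory slot against the question, normalized over story positions
match = layers.Dot(axes=(2, 2))([m, u])             # (story_len, query_len)
match = layers.Softmax(axis=1)(match)

response = layers.Add()([match, c])                 # (story_len, query_len)
response = layers.Permute((2, 1))(response)         # (query_len, story_len)

answer = layers.Concatenate()([response, u])        # (query_len, story_len + 64)
answer = layers.LSTM(32)(answer)
answer = layers.Dense(vocab_size, activation="softmax")(answer)

model = keras.Model([story, question], answer)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
```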
- word2vec_tensorflow.py
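The script's exact formulation isn't described here; this sketch assumes the canonical skip-gram objective with NCE loss, which is how word2vec is usually written in TensorFlow:

```python
import tensorflow as tf

vocab_size, embed_dim, num_neg = 10000, 128, 64  # assumed hyperparameters

# trainable input embeddings plus output-side (context) weights and biases
embeddings = tf.Variable(tf.random.uniform([vocab_size, embed_dim], -1.0, 1.0))
nce_weights = tf.Variable(tf.random.truncated_normal([vocab_size, embed_dim], stddev=0.1))
nce_biases = tf.Variable(tf.zeros([vocab_size]))
optimizer = tf.keras.optimizers.Adam()

@tf.function
def train_step(center_ids, context_ids):
    # center_ids: (batch,) int; context_ids: (batch, 1) int64 true-context labels
    with tf.GradientTape() as tape:
        embed = tf.nn.embedding_lookup(embeddings, center_ids)
        # NCE: learn to distinguish the true context word from num_neg sampled noise words
        loss = tf.reduce_mean(tf.nn.nce_loss(
            weights=nce_weights, biases=nce_biases,
            labels=context_ids, inputs=embed,
            num_sampled=num_neg, num_classes=vocab_size))
    variables = [embeddings, nce_weights, nce_biases]
    optimizer.apply_gradients(zip(tape.gradient(loss, variables), variables))
    return loss
```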
- glove_tensorflow.py
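A sketch of the GloVe objective, a weighted least-squares fit to log co-occurrence counts: f(X_ij) (w_i · w~_j + b_i + b~_j - log X_ij)^2. The weighting constants are the GloVe paper's defaults; the training-loop details are assumptions:

```python
import tensorflow as tf

vocab_size, embed_dim, x_max, alpha = 10000, 100, 100.0, 0.75  # assumed sizes; paper-default weighting

W = tf.Variable(tf.random.uniform([vocab_size, embed_dim], -0.5, 0.5))      # word vectors
W_ctx = tf.Variable(tf.random.uniform([vocab_size, embed_dim], -0.5, 0.5))  # context vectors
b = tf.Variable(tf.zeros([vocab_size]))
b_ctx = tf.Variable(tf.zeros([vocab_size]))
optimizer = tf.keras.optimizers.Adam()

@tf.function
def train_step(i, j, x_ij):
    # i, j: (batch,) word/context indices; x_ij: (batch,) float co-occurrence counts
    with tf.GradientTape() as tape:
        pred = (tf.reduce_sum(tf.gather(W, i) * tf.gather(W_ctx, j), axis=1)
                + tf.gather(b, i) + tf.gather(b_ctx, j))
        weight = tf.minimum(1.0, (x_ij / x_max) ** alpha)  # f(X_ij) caps the impact of frequent pairs
        loss = tf.reduce_mean(weight * tf.square(pred - tf.math.log(x_ij)))
    variables = [W, W_ctx, b, b_ctx]
    optimizer.apply_gradients(zip(tape.gradient(loss, variables), variables))
    return loss
```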
- pos_seq2seq_prediction_tf.py
- sequence-to-sequence prediction using a GRU: mapping a sentence to its POS tags (sketch below)
- TensorFlow imposes strict requirements on input dimensions, so the data needs many shape transformations along the way
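The script itself works at a lower level of TensorFlow; this tf.keras sketch (sizes assumed) shows the same idea with the tensor shapes annotated, since getting those dimensions right is the fiddly part mentioned above:

```python
from tensorflow import keras
from tensorflow.keras import layers

vocab_size, n_tags, max_len = 10000, 45, 50  # assumed: word vocabulary, tag set, padded length

model = keras.Sequential([
    keras.Input(shape=(max_len,)),                      # (batch, time) word indices
    layers.Embedding(vocab_size, 128, mask_zero=True),  # -> (batch, time, 128); masks padding
    layers.GRU(128, return_sequences=True),             # -> (batch, time, 128); one state per word
    layers.TimeDistributed(
        layers.Dense(n_tags, activation="softmax")),    # -> (batch, time, n_tags)
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
# targets: (batch, time) integer tag indices, one per input word
```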