runtime-error786/text-vectorization

Text Vectorization Techniques using NLTK, Gensim, and Scikit-Learn

This repository demonstrates several text vectorization techniques, including Bag of Words (BoW), TF-IDF, N-grams, and Word2Vec (CBOW and Skip-gram), using NLTK, Gensim, and Scikit-Learn. The steps outlined here show how to convert textual data into numerical vectors, which machine learning models need as input. Word2Vec is a popular word embedding technique that uses either the Continuous Bag of Words (CBOW) model or the Skip-gram model to learn vector representations of words from their context. The CBOW model predicts a target word from its context words, while the Skip-gram model does the reverse, using a word to predict its surrounding context.
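To make these techniques concrete, here is a minimal, dependency-free sketch of what each one computes. It is not the repository's code: the notebooks rely on NLTK, Gensim, and Scikit-Learn, whereas the function names (`ngrams`, `bag_of_words`, `tf_idf`, `cbow_pairs`, `skipgram_pairs`) and the toy documents below are hypothetical and chosen only for illustration. The TF-IDF variant shown is the plain unsmoothed form, which is an assumption; library implementations usually differ in detail.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the contiguous n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(docs):
    """Map each tokenized document to a count vector over a sorted vocabulary."""
    vocab = sorted({tok for doc in docs for tok in doc})
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def tf_idf(docs):
    """Weight raw counts by idf = ln(N / df). Unsmoothed, illustrative variant."""
    vocab, counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents each vocabulary term occurs.
    df = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    weights = [
        [row[j] * math.log(n_docs / df[j]) for j in range(len(vocab))]
        for row in counts
    ]
    return vocab, weights


def cbow_pairs(tokens, window):
    """Build (context words, target) pairs: CBOW predicts the target from its context."""
    pairs = []
    for i, target in enumerate(tokens):
        context = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
        pairs.append((context, target))
    return pairs


def skipgram_pairs(tokens, window):
    """Build (target, context word) pairs: Skip-gram predicts each context word from the target."""
    return [
        (target, ctx)
        for context, target in cbow_pairs(tokens, window)
        for ctx in context
    ]


# Toy, hand-tokenized documents (hypothetical data, not from the repository).
docs = [["i", "feel", "happy"], ["i", "feel", "sad", "sad"]]

vocab, bow = bag_of_words(docs)
_, weights = tf_idf(docs)
bigrams = ngrams(docs[0], 2)
cbow = cbow_pairs(docs[0], window=1)
skip = skipgram_pairs(docs[0], window=1)
```

Terms that occur in every document ("i" and "feel" here) get a TF-IDF weight of zero in this unsmoothed form, while a term concentrated in one document ("sad") is weighted up. Scikit-Learn's `TfidfVectorizer`, by contrast, smooths the idf term and L2-normalizes each row by default, so its numbers will not match this sketch exactly. The pair builders only show how training examples are framed for CBOW and Skip-gram; learning the embeddings themselves is what a library such as Gensim handles.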

Folders and files (14 commits)

.gitignore
Fake_news_classification_Bidirectional_LSTM.ipynb
Fake_news_classification_LSTM_.ipynb
README.md
emotion_detection_Bow.ipynb
emotion_detection_Ngrams.ipynb
emotion_detection_TF-IDF.ipynb
emotion_detection_word2vec(CBOW,SKIPGRAM).ipynb
emotion_detection_word2vec(pre-trained).ipynb
