An introduction of the development from encoding(One hot, tf-idf......) to embedding(Word2vec,Glove,Fasttext......) and language model(Elmo, GPT2, BERT......)
python3.6 git clone https://github.com/aron3312/embedding2lm.git pip install -r requirements.txt Open FromEmbeddingToLM.pptx to learn with ppt
word_encoding.ipynb
An example of title recommendation, using single news title to find the most related one from other news. Including three method:
-
one hot encoding
- DTM
- TDM
-
TF-IDF encoding
-
sequence level related encoding
word_embedding.ipynb
An example of word embedding, using word2vec, GloVe, Fasttext Including three method:
- Word2Vec
- GloVe
- FastText