tad

Text as data:

    - cleaning;
    - pre-processing;
    - post processing:
    - Topic Modelling: LDA, seeded LDA
    - Word2Vec and other static embedding
    - RNN, LSTM, and Seq2Seq
    - Attention and Transformers

Python Version: 3.7.16

Scikit-learn versions that are supported: 0.20 to 0.24 and 1.0, 1.0.1, and 1.0.2

To install the packages:

pip install -r requirements.txt

Ad-hoc materials:

Along with the lecture slides, we can also refer to the resources below:

Speech and Language Processing:
- Book: Speech and Language Processing by Dan Jurafsky and James H. Martin
- Book: Machine Learning for Text by Charu C. Aggarwal
Text Processing:
- Blog: Jaccard vs Cosine Distance
- Blog: Understanding the Levenshtein Distance Equation
- Blog: Stemming vs Lemmatization
Fundamentals:
- Tf-IDf:
  - Blog: TF-IDF
- Zipf's Law:
  - Blog: Zipf's Law
  - YT: Zipf's Law
- Heaps' Law:
  - YT: Heaps' law
Clustering:
- SVD:
  - Basics, YT: Singular Value Decomposition (the SVD)
  - High Level Overview, YT: Singular Value Decomposition (SVD): Overview
  - Mathematical Overview: YT: Singular Value Decomposition (SVD): Mathematical Overview
  - Rank R Approximation, YT: Singular Value Decomposition (SVD): Matrix Approximation
- LSA:
  - YT: LSA
Topic Modelling:
- Data Pre-processing for Topic Modelling:
  - Blog: NLP Preprocessing and Latent Dirichlet Allocation (LDA) Topic Modeling with Gensim
- LDA:
Word2Vec:
Doc2Vec:
RNN and LSTM:
Attention and Transformers:

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Week1		Week1
Week10		Week10
Week3		Week3
Week4		Week4
Week5		Week5
Week6		Week6
Week7		Week7
Week8		Week8
Week9		Week9
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tad

About

Releases

Packages

Contributors 2

Languages

alcatraz47/tad

Folders and files

Latest commit

History

Repository files navigation

tad

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages