ig-perez/nlp-roadmap

What is this?

In late 2020 I began my exploration of AI. After the excellent introductory course at Columbia University and the Deep Learning lessons at MIT, it became clear to me that I wanted to pursue my career in this area.

I always enjoyed mathematics, statistics, and the other pure sciences while in college. Sadly, I rarely used their concepts during my professional career as a CS engineer. So it was a pleasant surprise to see calculus, linear algebra, and probability distributions again in class!

After some time exploring the applications of ML, I became very interested in NLP, since my other passions are knowledge, learning, and data/information management. That is why I decided to go back to the classroom. This time I chose Stanford's excellent CS224N (Natural Language Processing with Deep Learning), taught by the charismatic Professor Manning.

This repository contains my solutions to the assignments for the 2021 class. Please take these materials as a reference only; I hope they are helpful for your learning experience.

Enjoy!

A short description of each project

  • L01_01_Exploring_Word_Vectors: This Jupyter notebook explores word vectors using Python, Gensim, and pre-trained GloVe word embeddings (see the first sketch after this list). Read the accompanying article here.

  • L02-01-exploring-word2vec: Check this project if you want to understand and implement the word2vec algorithm; it builds the skip-gram model from scratch with NumPy (second sketch below). Read the accompanying article here.

  • L03-01-dependency-parsing: This project implements the training algorithm for a dependency parser, using the Adam optimizer and the dropout regularization technique (third sketch below). Read the accompanying article here.

  • L04-01-neural-machine-translation: This is an implementation of a neural machine translation system built on a BiLSTM with an attention mechanism (fourth sketch below). Read the accompanying article here.

  • L05-01-transformer: This project implements a Transformer model with multi-headed self-attention to predict a person's birthplace. It uses pretraining to improve performance and explores alternatives to the scaled dot-product scoring function (fifth sketch below). Read the accompanying article here.
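
The five sketches below illustrate the core technique behind each project. They are minimal examples written for this README, not the repository's actual code; every name, dimension, and hyperparameter in them is an illustrative assumption.

First, exploring pre-trained word vectors with Gensim. The model name glove-wiki-gigaword-100 comes from Gensim's public downloader catalog; any other entry from that catalog would work the same way:

```python
import gensim.downloader as api

# Downloads the vectors on first use (~130 MB), then loads them as KeyedVectors
glove = api.load("glove-wiki-gigaword-100")

# Nearest neighbors in embedding space
print(glove.most_similar("language", topn=5))

# The classic analogy: king - man + woman ~ queen
print(glove.most_similar(positive=["king", "woman"], negative=["man"], topn=1))
```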
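
Second, the heart of the word2vec project: a single skip-gram update with the naive-softmax loss, in plain NumPy. The matrices W (center vectors) and U (outside vectors), the toy vocabulary, and the learning rate are assumptions for this sketch, not the assignment's interface:

```python
import numpy as np

def skipgram_step(W, U, center_idx, outside_idx, lr=0.05):
    """One SGD update for a (center word, outside word) pair under naive softmax."""
    v_c = W[center_idx]                            # (d,) center-word vector
    scores = U @ v_c                               # (V,) dot products with all outside vectors
    scores -= scores.max()                         # numerical stability
    y_hat = np.exp(scores) / np.exp(scores).sum()  # softmax over the vocabulary
    loss = -np.log(y_hat[outside_idx])

    delta = y_hat.copy()
    delta[outside_idx] -= 1.0                      # y_hat - y for the one-hot target
    grad_vc = U.T @ delta                          # gradient w.r.t. the center vector
    grad_U = np.outer(delta, v_c)                  # gradient w.r.t. all outside vectors
    W[center_idx] -= lr * grad_vc                  # in-place updates of the caller's arrays
    U -= lr * grad_U
    return loss

rng = np.random.default_rng(0)
V, d = 1000, 50                                    # toy vocabulary size and embedding dimension
W = rng.normal(scale=0.1, size=(V, d))
U = rng.normal(scale=0.1, size=(V, d))
for _ in range(200):
    loss = skipgram_step(W, U, center_idx=3, outside_idx=17)
print(f"final loss: {loss:.4f}")                   # approaches 0 for this single repeated pair
```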
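
Third, the training setup the dependency-parsing project revolves around: a feed-forward classifier over parser-state features, regularized with dropout and optimized with Adam. This PyTorch sketch uses made-up dimensions and a toy three-way transition set; the actual parser has a richer feature set and transition system:

```python
import torch
import torch.nn as nn

class ParserModel(nn.Module):
    def __init__(self, n_features=36, embed_dim=50, hidden=200, n_classes=3):
        super().__init__()
        self.embeddings = nn.Embedding(5000, embed_dim)   # toy vocabulary of 5000 tokens
        self.hidden = nn.Linear(n_features * embed_dim, hidden)
        self.dropout = nn.Dropout(p=0.5)                  # the Dropout regularization
        self.out = nn.Linear(hidden, n_classes)           # shift / left-arc / right-arc

    def forward(self, feature_ids):                       # feature_ids: (batch, n_features)
        x = self.embeddings(feature_ids).flatten(1)       # (batch, n_features * embed_dim)
        h = torch.relu(self.hidden(x))
        return self.out(self.dropout(h))                  # transition logits

model = ParserModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3) # the Adam optimizer
loss_fn = nn.CrossEntropyLoss()

# One training step on a random toy batch
feats = torch.randint(0, 5000, (32, 36))
labels = torch.randint(0, 3, (32,))
optimizer.zero_grad()
loss = loss_fn(model(feats), labels)
loss.backward()
optimizer.step()
```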
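
Fourth, the two building blocks of the NMT project: a bidirectional LSTM encoder and attention over its hidden states. For brevity this sketch uses plain dot-product attention against a single decoder state and omits the decoder itself; the full model wires these pieces into a sequence-to-sequence system:

```python
import torch
import torch.nn as nn

embed_dim, hidden = 64, 128
encoder = nn.LSTM(embed_dim, hidden, bidirectional=True, batch_first=True)

src = torch.randn(8, 20, embed_dim)            # (batch, src_len, embed_dim) source embeddings
enc_states, _ = encoder(src)                   # (batch, src_len, 2 * hidden)

dec_state = torch.randn(8, 2 * hidden)         # one decoder hidden state, assumed given

# Dot-product attention: score every source position against the decoder state
scores = torch.bmm(enc_states, dec_state.unsqueeze(2)).squeeze(2)  # (batch, src_len)
alpha = torch.softmax(scores, dim=1)                               # attention weights
context = torch.bmm(alpha.unsqueeze(1), enc_states).squeeze(1)     # (batch, 2 * hidden)
```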
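
Finally, the scaled dot-product attention at the core of the Transformer project. This is the standard softmax(Q K^T / sqrt(d_k)) V scoring function, the baseline that the project's alternative scoring functions are compared against:

```python
import math
import torch

def scaled_dot_product_attention(Q, K, V):
    """Q, K, V: (batch, heads, seq_len, d_k) tensors, one slice per attention head."""
    d_k = Q.size(-1)
    scores = Q @ K.transpose(-2, -1) / math.sqrt(d_k)  # (batch, heads, seq, seq)
    weights = torch.softmax(scores, dim=-1)            # attention distribution per position
    return weights @ V                                 # (batch, heads, seq, d_k)

# Toy self-attention: batch of 2, 4 heads, sequence length 10, d_k = 16
x = torch.randn(2, 4, 10, 16)
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # torch.Size([2, 4, 10, 16])
```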

Prerequisites to consider

You don't need to be an expert, but I'd recommend some background in the following fields:

  • Coding with Python and NumPy
  • Multivariate calculus
  • Linear algebra
  • Probability and statistics
  • Foundations of machine learning

About

An exploration of the main NLP concepts in Jupyter notebooks and Python code
