Skip to content

abhilash1910/NLP-Workshop-ML-India

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

NLP-Workshop-ML-India 🚀

This repository contains the codes and the notebooks for NLP Workshop which was organized by ML India on October 10-11.

Contents 📙

The first notebook- Notebook 1 contains the contents for Day-1 of the session. Quora Binary Classification was chosen as the main topic.

The contents include:

  1. Statistical Analysis or Words
  2. Word Based Frequency /Gram analysis
  3. Vectorization
  4. Statistical Models
  5. Dimensionality Reduction
  6. Embeddings
  7. Neural Network architectures
  8. LSTM/CNN based models

The second notebook- Notebook 2 contains the contents for Day-2 of the session. This mainly relies on Transformer models.

The contents include:

  1. Encoder Decoder Architecture
  2. Disadvantages of Encoder Decoders
  3. Transformer Architectures
  4. Attention Mechanism
  5. Bahdanau,Luong Attention
  6. Self and Multi Head Attention
  7. Designing a Keras Transformer
  8. Finetuning Bert
  9. Finetuning and Training DistilBert,Roberta,XLM-Roberta
  10. Finetuning GPT-2,BART and Transformer-XL
  11. Evaluating via TPU Clusters

Guidelines

This code has been released under Apache License. The resources for the notebooks is present inside Kaggle,particularly embedding files. These can be used locally by either downloading them from kaggle manually or can be used in kaggle notebooks by using the "Add Data" tab in kaggle notebooks.