My Pytorch implementation of BERT. Some resources that helped me: Official BERT Paper Jay Alammar's Blog Post Don't forget to give it a ⭐