Liked our work? Give us a ⭐!
This repository contains a minimalistic PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers), introduced in the paper BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.
This repository is accompanied by a YouTube tutorial titled Implement BERT From Scratch - PyTorch.
We need two classes to implement BERT: `BERTEmbedding`, which builds the input embeddings, and `BERT`, which combines them with the Transformer encoder.
```python
import torch
from torch import nn


class BERT(nn.Module):
    def __init__(self,
                 vocab_size,
                 n_segments,
                 max_len,
                 embed_dim,
                 n_layers,
                 attn_heads,
                 dropout):
        super().__init__()
        self.embedding = BERTEmbedding(vocab_size, n_segments, max_len, embed_dim, dropout)
        # Standard Transformer encoder with a 4x feed-forward expansion;
        # batch_first=True so the encoder matches the (batch_size, max_len)
        # inputs expected by the embedding layer below
        self.encoder_layer = nn.TransformerEncoderLayer(embed_dim, attn_heads, embed_dim * 4, batch_first=True)
        self.encoder_block = nn.TransformerEncoder(self.encoder_layer, n_layers)

    def forward(self, seq, seg):
        out = self.embedding(seq, seg)
        out = self.encoder_block(out)
        return out


class BERTEmbedding(nn.Module):
    def __init__(self,
                 vocab_size,
                 n_segments,
                 max_len,
                 embed_dim,
                 dropout):
        super().__init__()
        self.tok_embed = nn.Embedding(vocab_size, embed_dim)  # token embeddings
        self.seg_embed = nn.Embedding(n_segments, embed_dim)  # segment (sentence A/B) embeddings
        self.pos_embed = nn.Embedding(max_len, embed_dim)     # learned positional embeddings
        self.drop = nn.Dropout(dropout)
        # Position indices 0..max_len-1; registered as a buffer so they
        # move with the module on .to(device)
        self.register_buffer("pos_inp", torch.arange(max_len))

    def forward(self, seq, seg):
        # seq, seg: (batch_size, max_len) integer tensors
        embed_val = self.tok_embed(seq) + self.seg_embed(seg) + self.pos_embed(self.pos_inp)
        embed_val = self.drop(embed_val)
        return embed_val
```
You can edit the parameters as you like to match your input dimensions, and you can run the `BERT.py` file directly.
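As a quick sanity check, here is a minimal usage sketch. The hyperparameter values below are illustrative assumptions (roughly BERT-base sized), not values fixed by this repository:

```python
import torch

# Illustrative hyperparameters (assumed; adjust to your data)
VOCAB_SIZE = 30000
N_SEGMENTS = 2      # sentence A / sentence B
MAX_LEN = 512
EMBED_DIM = 768
N_LAYERS = 12
ATTN_HEADS = 12
DROPOUT = 0.1

model = BERT(VOCAB_SIZE, N_SEGMENTS, MAX_LEN, EMBED_DIM, N_LAYERS, ATTN_HEADS, DROPOUT)

# Dummy batch of token ids and segment ids, shape (batch_size, max_len)
seq = torch.randint(0, VOCAB_SIZE, (8, MAX_LEN))
seg = torch.randint(0, N_SEGMENTS, (8, MAX_LEN))

out = model(seq, seg)
print(out.shape)  # torch.Size([8, 512, 768])
```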
You can contact me at this email address: uygarsci@gmail.com