This repository contains PyTorch code for training a character-level language model using the Transformer architecture. The trained model can generate text character-by-character in a similar style to the training data.
-
The Transformer model is implemented in several sub-modules.
-
The Head class implements one head of self-attention.
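For reference, a minimal sketch of what such a head might look like in PyTorch is shown below. The constructor arguments (n_embd, head_size, block_size, dropout) are assumed names, not necessarily the ones used in this repository.

```python
import torch
import torch.nn as nn
from torch.nn import functional as F

class Head(nn.Module):
    """One head of causal self-attention (illustrative sketch)."""

    def __init__(self, n_embd, head_size, block_size, dropout=0.1):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        # causal mask so each position only attends to earlier positions
        self.register_buffer('tril', torch.tril(torch.ones(block_size, block_size)))
        self.dropout = nn.Dropout(dropout)

    def forward(self, x):
        B, T, C = x.shape
        k = self.key(x)    # (B, T, head_size)
        q = self.query(x)  # (B, T, head_size)
        # scaled dot-product attention scores
        wei = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5   # (B, T, T)
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float('-inf'))
        wei = F.softmax(wei, dim=-1)
        wei = self.dropout(wei)
        v = self.value(x)  # (B, T, head_size)
        return wei @ v     # (B, T, head_size)
```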
-
The MultiHeadAttention class runs several attention heads in parallel, which allows the network to attend over a sequence in multiple ways at once. The input is projected into sub-queries, sub-keys, and sub-values, each of which is passed through scaled dot-product attention independently. Afterward, the head outputs are concatenated and combined with a final weight matrix.
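A sketch of how the heads could be combined, assuming the Head sketch above; the argument names are again assumptions.

```python
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    """Several attention heads in parallel (illustrative sketch)."""

    def __init__(self, n_embd, num_heads, head_size, block_size, dropout=0.1):
        super().__init__()
        self.heads = nn.ModuleList(
            [Head(n_embd, head_size, block_size, dropout) for _ in range(num_heads)]
        )
        self.proj = nn.Linear(num_heads * head_size, n_embd)  # final weight matrix
        self.dropout = nn.Dropout(dropout)

    def forward(self, x):
        # each head attends independently; concatenate along the channel dimension
        out = torch.cat([h(x) for h in self.heads], dim=-1)
        return self.dropout(self.proj(out))
```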
-
The FeedFoward class implements a simple linear layer followed by a non-linearity.
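A possible implementation is sketched below; the 4x hidden expansion and the ReLU non-linearity are assumptions, not necessarily what this repository uses.

```python
import torch.nn as nn

class FeedFoward(nn.Module):
    """Position-wise feed-forward layer: a linear layer followed by a non-linearity (sketch)."""

    def __init__(self, n_embd, dropout=0.1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd),  # assumed 4x expansion factor
            nn.ReLU(),
            nn.Linear(4 * n_embd, n_embd),
            nn.Dropout(dropout),
        )

    def forward(self, x):
        return self.net(x)
```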
-
The Block class implements the main Transformer block, which consists of communication (multi-head self-attention) followed by computation (feed-forward network).
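A sketch of such a block, assuming the MultiHeadAttention and FeedFoward sketches above and pre-norm residual connections (an assumption about this implementation):

```python
import torch.nn as nn

class Block(nn.Module):
    """Transformer block: communication (self-attention) then computation (feed-forward)."""

    def __init__(self, n_embd, n_head, block_size, dropout=0.1):
        super().__init__()
        head_size = n_embd // n_head
        self.sa = MultiHeadAttention(n_embd, n_head, head_size, block_size, dropout)
        self.ffwd = FeedFoward(n_embd, dropout)
        self.ln1 = nn.LayerNorm(n_embd)
        self.ln2 = nn.LayerNorm(n_embd)

    def forward(self, x):
        # residual connections around each sub-layer, with layer norm applied first
        x = x + self.sa(self.ln1(x))
        x = x + self.ffwd(self.ln2(x))
        return x
```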
-
The TransformerModel class has an encoder-decoder structure, where the encoder takes an input text and generates an attention-based representation.
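For illustration, here is one common way to assemble the blocks above into a character-level language model. This is a compact decoder-only sketch and may not match the repository's encoder-decoder description exactly; the hyperparameter names and the generate method are assumptions.

```python
import torch
import torch.nn as nn
from torch.nn import functional as F

class TransformerModel(nn.Module):
    """Character-level language model built from the blocks above (illustrative sketch)."""

    def __init__(self, vocab_size, n_embd, n_head, n_layer, block_size, dropout=0.1):
        super().__init__()
        self.block_size = block_size
        self.token_embedding = nn.Embedding(vocab_size, n_embd)
        self.position_embedding = nn.Embedding(block_size, n_embd)
        self.blocks = nn.Sequential(
            *[Block(n_embd, n_head, block_size, dropout) for _ in range(n_layer)]
        )
        self.ln_f = nn.LayerNorm(n_embd)
        self.lm_head = nn.Linear(n_embd, vocab_size)

    def forward(self, idx, targets=None):
        B, T = idx.shape
        tok = self.token_embedding(idx)                                     # (B, T, n_embd)
        pos = self.position_embedding(torch.arange(T, device=idx.device))  # (T, n_embd)
        x = self.ln_f(self.blocks(tok + pos))
        logits = self.lm_head(x)                                            # (B, T, vocab_size)
        loss = None
        if targets is not None:
            loss = F.cross_entropy(logits.view(B * T, -1), targets.view(B * T))
        return logits, loss

    @torch.no_grad()
    def generate(self, idx, max_new_tokens):
        # sample one character at a time, conditioning on at most block_size characters
        for _ in range(max_new_tokens):
            logits, _ = self(idx[:, -self.block_size:])
            probs = F.softmax(logits[:, -1, :], dim=-1)
            idx = torch.cat([idx, torch.multinomial(probs, num_samples=1)], dim=1)
        return idx
```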
-
The estimate_loss function computes the loss on batches of data for both the training and validation sets. It is used to estimate the performance of the model during training.
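A sketch of such a function, assuming a get_batch(split) helper that returns (inputs, targets) pairs and an eval_iters setting that controls how many batches are averaged; both names are assumptions.

```python
import torch

@torch.no_grad()
def estimate_loss(model, get_batch, eval_iters=200):
    """Average the loss over eval_iters random batches for each split (sketch)."""
    out = {}
    model.eval()  # switch to evaluation mode (disables dropout)
    for split in ('train', 'val'):
        losses = torch.zeros(eval_iters)
        for k in range(eval_iters):
            xb, yb = get_batch(split)
            _, loss = model(xb, yb)
            losses[k] = loss.item()
        out[split] = losses.mean()
    model.train()  # back to training mode
    return out
```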
-
Finally, the code trains the model for a given number of iterations, printing the training and validation loss every eval_interval steps, with each estimate averaged over eval_iters batches. The trained model is then saved to disk.
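A minimal sketch of such a training loop, reusing the model, get_batch, and estimate_loss sketches above; the hyperparameter values are placeholders, not the repository's settings.

```python
import torch

# Placeholder hyperparameters; the real values live in the training script.
max_iters = 5000
eval_interval = 500
learning_rate = 3e-4

optimizer = torch.optim.AdamW(model.parameters(), lr=learning_rate)

for it in range(max_iters):
    # periodically estimate and print the train/val loss
    if it % eval_interval == 0 or it == max_iters - 1:
        losses = estimate_loss(model, get_batch)
        print(f"step {it}: train loss {losses['train']:.4f}, val loss {losses['val']:.4f}")

    xb, yb = get_batch('train')  # sample a batch of training data
    _, loss = model(xb, yb)
    optimizer.zero_grad(set_to_none=True)
    loss.backward()
    optimizer.step()

# save the trained parameters to disk
torch.save(model.state_dict(), 'model.pt')
```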
This code was developed using Python 3.7 and PyTorch 1.7.1.
The code can be run from the command line as follows: python demo.py. This will train a model on the input text file specified in train.py (input.txt by default), save the trained model parameters to a file named model.pt, and generate text using the trained model.
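For illustration, reloading the saved parameters and sampling from the model might look roughly like this. The hyperparameter values and the vocabulary reconstruction are assumptions based on the sketches above, not the repository's actual demo.py.

```python
import torch

# Hypothetical post-training usage; exact names and settings depend on the scripts.
text = open('input.txt', encoding='utf-8').read()
chars = sorted(set(text))
itos = {i: ch for i, ch in enumerate(chars)}  # index -> character mapping

model = TransformerModel(vocab_size=len(chars), n_embd=384, n_head=6,
                         n_layer=6, block_size=256)  # assumed hyperparameters
model.load_state_dict(torch.load('model.pt'))
model.eval()

context = torch.zeros((1, 1), dtype=torch.long)       # start from a single initial token
sample = model.generate(context, max_new_tokens=500)[0]
print(''.join(itos[int(i)] for i in sample))          # map indices back to characters
```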
This code is based on the karpathy/char-rnn repository, which implements a character-level language model using a recurrent neural network (RNN). The Transformer architecture used in this repository was introduced in the paper "Attention Is All You Need" by Vaswani et al.