This project was completed as part of the Honors portion of the Sequence Models course on Coursera.
Credit to DeepLearning.AI and the Coursera platform for providing the course materials and guidance.
In this notebook, my primary objective is to explore and understand the Transformer architecture, a neural network design that processes entire sequences in parallel rather than step by step, which substantially speeds up training compared to recurrent models. Throughout this assignment, I will work through the architecture's key components. First, I will learn how to create positional encodings, which inject information about token order into the model; because the Transformer has no recurrence, these encodings are what allow it to capture the order and context of input sequences.
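As a rough sketch of what the sinusoidal positional encodings from the original Transformer paper look like in code (using NumPy here for simplicity rather than the course's framework; the function name `positional_encoding` is just illustrative):

```python
import numpy as np

def positional_encoding(max_len, d_model):
    """Sinusoidal positional encodings as in "Attention Is All You Need".

    Returns an array of shape (max_len, d_model) where even columns hold
    sin terms and odd columns hold cos terms at geometrically spaced
    frequencies, so each position gets a unique, smoothly varying code.
    """
    positions = np.arange(max_len)[:, np.newaxis]            # (max_len, 1)
    dims = np.arange(d_model)[np.newaxis, :]                 # (1, d_model)
    angle_rates = 1.0 / np.power(10000.0, (2 * (dims // 2)) / d_model)
    angles = positions * angle_rates                          # (max_len, d_model)

    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])                     # even indices: sin
    pe[:, 1::2] = np.cos(angles[:, 1::2])                     # odd indices: cos
    return pe

# Example: encodings for a 50-token sequence with model dimension 16
pe = positional_encoding(50, 16)
print(pe.shape)  # (50, 16)
```

These encodings are added to the token embeddings before the first attention layer, so nearby positions receive similar codes while distant positions remain distinguishable.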
Next, I will implement scaled dot-product self-attention, which computes attention weights between every pair of words in a sentence so the model can focus on the most relevant words and learn richer representations. Building on this, I will implement masked multi-head attention, a fundamental building block of the Transformer: masking lets the model ignore padding in variable-length sequences and, in the decoder, prevents a position from attending to future tokens. Finally, I will build and train a complete Transformer model, taking advantage of its parallel processing to speed up training and achieve improved performance.
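A minimal sketch of scaled dot-product attention with an optional mask, again in plain NumPy with illustrative names (the course assignment uses its own framework and helper signatures):

```python
import numpy as np

def scaled_dot_product_attention(q, k, v, mask=None):
    """Compute softmax(Q K^T / sqrt(d_k)) V, optionally masking positions.

    q, k, v: arrays of shape (..., seq_len, depth).
    mask:    broadcastable to (..., seq_len_q, seq_len_k); positions where
             mask == 0 are blocked by a large negative score before the
             softmax, so they receive (near-)zero attention weight.
    """
    d_k = q.shape[-1]
    scores = q @ k.swapaxes(-1, -2) / np.sqrt(d_k)    # (..., seq_q, seq_k)
    if mask is not None:
        scores = np.where(mask == 0, -1e9, scores)
    # numerically stable softmax over the key dimension
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights

# Example: a look-ahead mask keeps each position from attending to later ones
seq_len, depth = 4, 8
rng = np.random.default_rng(0)
q = k = v = rng.normal(size=(seq_len, depth))
look_ahead = np.tril(np.ones((seq_len, seq_len)))    # lower-triangular mask
out, attn = scaled_dot_product_attention(q, k, v, mask=look_ahead)
print(attn.round(2))  # upper triangle is ~0: future tokens are masked out
```

Multi-head attention runs several of these attention computations in parallel on learned linear projections of Q, K, and V, then concatenates the results, which lets the model attend to different kinds of relationships at once.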