Skip to content

This is my implementation of decoder part of transformer.

License

Notifications You must be signed in to change notification settings

CptNemo0/MyFirstTransformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stargazers Issues MIT License LinkedIn


Logo

Transformer Decoder Implementation

My 'tiny-scale' implementation of decoder (ChatGPT)
Description/Docs

About The Project

I have spent most of last 2 summers (2022, 2023) studying artificial intelligence. For the past 2 months ([6/7]/2023) I was studying deep learning, especially very popular transformer architecture. This project is inauguration of my studies. It's simple, small scale implementation of the decoder part of the transformer.

It can be trained on .txt files. This is very primitive approach, nontheless it results in visible results. I have trained a small 100M model on old polish texts, and the output was striking. It wasn't perfectly logical, or beautiful, but:

  • it was polish
  • it sounded like archaic polish (which was the goal)
  • proof that my code works

(back to top)

Built With

I used Python, and it's machine learning library PyTorch, although it's important to notice, that i didn't use stock implmentation for everything, mainly so I could get more experience and a better understanding.

(back to top)

Installation

To tinker with my code you will first need to clone to repository

git clone https://github.com/CptNemo0/MyFirstTransformer

Than install requirements from the txt file

pip install -r requirements.txt

You're good to go!!

(back to top)

Usage

Work in progress. TBA

(back to top)

Roadmap

  • Barebones logic
  • Working small scale models
  • Designed gui
  • Large Language Model
  • Research more pretrainig techniques
  • Research finetuning (chain-of-thought, intruction finetuning, meta-learning (few shot))
  • Working gui
    • Training gui
    • Inferece gui
  • Documentation
  • Usage section of README

(back to top)

Contact

Paweł Stus - pawel.j.stus@gmail.com Project Link: https://github.com/CptNemo0/MyFirstTransformer

(back to top)

Resources

Scientific

Others

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

About

This is my implementation of decoder part of transformer.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages