DLite

Code used to train DLite, a large language model from AI Squared

DLite primarily demonstrates that even though model size does have an impact on performance, smaller models can perform relatively well on a wide range of tasks, including acting as a chatbot. We have utilized the Alpaca dataset to fine-tune the 124 million parameter GPT-2 model from OpenAI to create DLite.

DLite is not a state-of-the-art language model, and it is not expected to compete with more modern language models trained on more comprehensive datasets. Instead, the main contribution of DLite is that such a small model which is (at the time of this work) nearly four years old can be effectively trained to act as a chat-based agent.

Please note that the GPT-2 model is licensed under the MIT license, and the Alpaca dataset is licensed under the Creative Commons Noncommercial (CC BY-NC 4.0) license.

DLite is intended only for research purposes and is not licensed for commercial use.

Training DLite

There are two ways to train DLite using this repository. The first is to simply run the train_dlite.ipynb notebook in the top level of this repository. Additionally, if you would like to run the training script from the command line, the train.py file in the train directory of this repository is complete with a command line interface.

Limitations

DLite is an experimental technology and is not designed for use in any environment other than for research purposes. Furthermore, the model can sometimes exhibit undesired behaviors. Some of these behaviors include, but are not limited to: factual inaccuracies, biases, offensive responses, toxicity, and hallucinations. Just as with any other LLM, we advise users of this technology to exercise good judgment when applying this technology.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
train		train
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
requirements.txt		requirements.txt
train_dlite.ipynb		train_dlite.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train

train

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

init.py

init.py

requirements.txt

requirements.txt

train_dlite.ipynb

train_dlite.ipynb

Repository files navigation

DLite

Training DLite

Limitations

About

Releases

Packages

Languages

License

AISquaredInc/DLiteV1

Folders and files

Latest commit

History

Repository files navigation

DLite

Training DLite

Limitations

About

Resources

License

Stars

Watchers

Forks

Languages