Try it out in ICGPT!
The LLMs of this repo run in its back-end canisters.
A step-by-step guide to deploying your first LLM to the Internet Computer is provided in llama2_c/README.md.
Canisters on the Internet Computer have certain constraints: memory is limited, and there is a cap on the number of instructions that can be executed per message, as discussed here.
This might lead one to question the rationale for running an LLM inside an Internet Computer canister.
For me, the primary incentive is the unparalleled simplicity of using the IC in comparison to conventional cloud platforms. You develop, deploy & test using a local replica of the cloud, and when everything is ready, you deploy it to the IC with just one command. Everything becomes instantly and securely accessible online. You can very easily restrict access to the endpoints in case you don't want to make it fully public yet and want to share it with a smaller group instead.
Thanks to the Internet Computer's foundational cryptographic and blockchain technologies, concerns related to IT and security vanish. It's truly remarkable.
With such user-friendliness, the IC canister runtime serves as an ideal environment for my research pursuits. It complements the type of research presented in this paper that offers a dataset designed to boost the creation, examination, and study of Language Models for areas with scarce resources or specific niches:
TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Besides the ease of use and the enhanced security, running LLMs directly on-chain also makes it straightforward to integrate tokenomics, without having to juggle a complex blend of web3 and web2 components. I believe it will lead to a new category of Generative AI based dApps.
We use MiniConda and run the QA locally like this:
- Create a conda environment, and install icpp-pro and other python dependencies:

      conda create --name icpp_llm python=3.11
      conda activate icpp_llm
      pip install -r requirements.txt

- This installs icpp-pro. Next install wasi-sdk, dfx & clang++ as explained in icpp-pro Installation

- Run the full QA via the Makefile:

      make all-tests
You can also peek in .github/workflows/cicd.yml to see how the QA is run as part of a GitHub Actions workflow.
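Since the QA depends on several command-line tools being installed, a quick preflight check before running it can save a failed run. The following is only a minimal sketch and not part of this repo: the function name `missing_tools` is hypothetical, and it assumes the tools are reachable on your `PATH` under the command names you pass in (for example, that icpp-pro is available as `icpp`). It does not check wasi-sdk, which is not necessarily a command on `PATH`.

```shell
# Hypothetical helper, not part of icpp-pro or this repo.
# Prints the names of the given tools that are NOT found on PATH
# (space-separated, on one line) and returns non-zero if any is missing.
missing_tools() {
    missing=""
    for tool in "$@"; do
        # `command -v` is the POSIX way to test whether a command exists.
        if ! command -v "$tool" >/dev/null 2>&1; then
            # Append with a separating space, but no leading space.
            missing="${missing:+$missing }$tool"
        fi
    done
    printf '%s\n' "$missing"
    [ -z "$missing" ]
}
```

One possible use, with the command names again being assumptions: `missing_tools conda icpp dfx clang++ make || echo "install the tools listed above first"`.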
More details are provided in the README of the sub-folders, which are standalone icpp-pro projects.
For support, kindly create a GitHub Issue as outlined in the Support documentation page.