# Inference API for LLaMA

## Installation
Install from PyPI:

```shell
pip install llama-inference
```

or install the latest version from GitHub:

```shell
pip install git+https://github.com/aniketmaurya/llama-inference-api.git@main
```
> **Note:** You need to manually install and set up [Lit-LLaMA](https://github.com/Lightning-AI/lit-llama) to use this project:

```shell
pip install lit-llama@git+https://github.com/Lightning-AI/lit-llama.git@main
```
## Inference

```python
import os

from llama_inference import LLaMAInference

# Paths to the converted Lit-LLaMA checkpoint and tokenizer,
# resolved from the WEIGHTS environment variable.
WEIGHTS_PATH = os.environ["WEIGHTS"]
checkpoint_path = f"{WEIGHTS_PATH}/lit-llama/7B/state_dict.pth"
tokenizer_path = f"{WEIGHTS_PATH}/lit-llama/tokenizer.model"

# Load the model once; each call then runs text generation.
model = LLaMAInference(
    checkpoint_path=checkpoint_path,
    tokenizer_path=tokenizer_path,
    dtype="bfloat16",
)

print(model("New York is located in"))
```

## Serve as an API

Create a Python file `app.py` and initialize the ServeLLaMA App:
```python
# app.py
import lightning as L

from llama_inference.serve import PromptRequest, Response, ServeLLaMA

# Serve the model as an API that accepts a prompt and returns a response.
component = ServeLLaMA(input_type=PromptRequest, output_type=Response)
app = L.LightningApp(component)
```

Run the app locally:

```shell
lightning run app app.py
```
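Once the app is running, you can query it over HTTP. The sketch below is a minimal client using only the standard library; the endpoint URL, port, and `{"prompt": ...}` payload shape are assumptions for illustration and may differ from what your running app actually exposes (check the app's startup logs or Swagger page for the real route).

```python
import json
import urllib.request

def build_request(prompt: str, url: str = "http://127.0.0.1:7501/predict"):
    """Build a JSON POST request for the (assumed) prediction endpoint."""
    payload = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
    )

if __name__ == "__main__":
    # Send the prompt to the running server and print the JSON response.
    req = build_request("New York is located in")
    with urllib.request.urlopen(req) as resp:
        print(json.load(resp))
```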