
Deploying Smaller Open Source Large Language Model on AWS Lambda

Overview

Large Language Models (LLMs) are a technology I'm experimenting with. While managed services like OpenAI offer cost-effective LLM usage, there are scenarios where running a model yourself becomes necessary, for example when handling sensitive data or when you need high-quality output in languages other than English. Open-source LLMs can rival the quality of offerings from major players like OpenAI, but they often demand significant compute resources. Deploying smaller models on platforms like AWS Lambda can be a cost-effective alternative.

Project Goal

My goal with this project is to deploy a smaller open-source LLM, specifically Microsoft Phi-2, a 2.7-billion-parameter model whose output rivals that of much larger open-source models. Along the way I'll explore LLMs and Docker-based Lambda functions, evaluate performance, and assess costs for real-world applications.

Steps

1. Environment Setup (AWS, Docker, and Python)

Ensure you have an AWS account and that the necessary tools are installed: the AWS CLI, Docker, and Python.
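A quick sanity check before starting; each command should print a version, and `aws configure list` should show valid credentials and a region:

```bash
aws --version        # AWS CLI installed
docker --version     # Docker installed
python3 --version    # Python installed
aws configure list   # credentials and region the CLI will use
```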

2. Set up Lambda Function Locally with Docker

  • Create a basic Python Lambda function handler in a lambda_function.py file (minimal sketches of these files follow the list).
  • Define dependencies in requirements.txt, starting with the AWS library (boto3).
  • Create a Dockerfile specifying how the image is built.
  • Set up docker-compose.yml for building and running the container.
  • Build and start the container locally using docker-compose up.
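Minimal sketches of these files, assuming the official AWS Lambda Python base image (contents are illustrative; the repo's actual versions may differ). The handler simply echoes the prompt for now, so the plumbing can be verified before any model is involved:

```python
# lambda_function.py -- basic handler, extended in step 3
import json

def handler(event, context):
    # Invocations deliver the HTTP payload as a JSON string in event["body"]
    body = json.loads(event.get("body") or "{}")
    prompt = body.get("prompt", "")
    return {
        "statusCode": 200,
        "body": json.dumps({"echo": prompt}),
    }
```

```dockerfile
# Dockerfile -- built on the official AWS Lambda Python base image
FROM public.ecr.aws/lambda/python:3.11

COPY requirements.txt .
RUN pip install -r requirements.txt

COPY lambda_function.py ${LAMBDA_TASK_ROOT}

# module.function that Lambda invokes
CMD ["lambda_function.handler"]
```

```yaml
# docker-compose.yml -- build the image and expose it locally
services:
  lambda:
    build: .
    ports:
      - "9000:8080"  # the base image's Runtime Interface Emulator listens on 8080
```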

3. Run an LLM Inside the Container

  • Add llama-cpp-python to requirements.txt.
  • Introduce a Docker build stage that installs llama-cpp and downloads the model.
  • Modify the Lambda handler to run LLM inference (see the sketch after this list).
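A sketch of the handler after this step, assuming the build stage downloaded a quantized GGUF build of Phi-2 into the image (the model path and generation parameters are assumptions, not taken from this repo):

```python
# lambda_function.py -- Phi-2 inference via llama-cpp-python
import json
from llama_cpp import Llama

# Load the model once per container, outside the handler, so warm
# invocations reuse it instead of paying the load cost every time.
# /opt/phi-2.Q4_K_M.gguf is a placeholder path; point it at wherever
# the Docker build stage put the downloaded model file.
llm = Llama(model_path="/opt/phi-2.Q4_K_M.gguf", n_ctx=2048)

def handler(event, context):
    body = json.loads(event.get("body") or "{}")
    prompt = body.get("prompt", "")
    output = llm(prompt, max_tokens=256)
    return {
        "statusCode": 200,
        "body": json.dumps({"completion": output["choices"][0]["text"]}),
    }
```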

4. Test Locally

Rebuild the container and test with a real prompt using curl.
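With the docker-compose setup above, the container exposes AWS's Runtime Interface Emulator on port 9000. A local test posts a full Lambda event, wrapping the HTTP body the way a function URL would (the prompt text is just an example):

```bash
curl -s "http://localhost:9000/2015-03-31/functions/function/invocations" \
  -d '{"body": "{\"prompt\": \"Explain AWS Lambda in one sentence.\"}"}'
```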

5. Deploy to AWS Lambda

Execute the deployment using the provided deploy.sh script. The script creates or verifies the ECR repository and the IAM role, authenticates Docker with ECR, builds the Docker image, pushes it to ECR, retrieves the IAM role ARN, and then creates, configures, or updates the Lambda function.
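For orientation, the core Docker and AWS CLI calls the script automates look roughly like this (the account ID, region, and names below are placeholders; deploy.sh in the repo is the authoritative version):

```bash
ACCOUNT_ID=123456789012          # placeholder: your AWS account ID
REGION=us-east-1                 # placeholder: your region
REPO=llm-on-lambda               # placeholder: your ECR repository name
IMAGE=$ACCOUNT_ID.dkr.ecr.$REGION.amazonaws.com/$REPO:latest

# Authenticate Docker with ECR, then build and push the image
aws ecr get-login-password --region $REGION |
  docker login --username AWS --password-stdin $ACCOUNT_ID.dkr.ecr.$REGION.amazonaws.com
docker build -t $IMAGE .
docker push $IMAGE

# Point the Lambda function at the freshly pushed image
aws lambda update-function-code --function-name llm-on-lambda --image-uri $IMAGE
```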

6. Test Remotely

Use the Lambda function URL obtained during deployment to test with a prompt.
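The request mirrors the local test, but a function URL unwraps the HTTP body for you, so the prompt is posted directly (the URL below is a placeholder for the one the deployment prints):

```bash
curl -s "https://<your-function-url>.lambda-url.us-east-1.on.aws/" \
  -d '{"prompt": "Explain AWS Lambda in one sentence."}'
```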

Prerequisites

Working knowledge of programming, Docker, AWS, and Python.

Notes

  • All files should be stored in a single project directory without subfolders.

Feel free to explore, modify, and run the provided scripts to deploy and test an open-source LLM on AWS Lambda.

References

  1. Hugging Face
  2. AWS
  3. Horosin - Deploy a Language Model (LLM) on AWS Lambda
  4. Medium - Deploying the Hugging Face LLM in Amazon SageMaker
