[Feature] Adding Lambda LLMs #53
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Overview
This PR introduces a new
LambdaLLMimplementation that allows routing LLM requests through AWS Lambda functions. It also enhances the AWS utilities with async support and improves error handling with a robust retry mechanism.Key Changes
LambdaLLMimplementation for routing LLM requests through AWS Lambda functionsBotoFactorywith async session support viaaioboto3glomandaioboto3dspyto version 2.6.11Implementation Details
LambdaLLMuses both synchronous and asynchronous AWS clients for Lambda invocationTesting
The PR includes updates to the local test script to demonstrate both Bedrock and Lambda inference options.