-
Notifications
You must be signed in to change notification settings - Fork 0
Design Documentation
Michael Frederic edited this page Jun 10, 2024
·
12 revisions
Nodejs w/ Javascript + Express
AWS + EC2 for the API
AWS Bedrock
Smaller model for pre-parsing potential use: AWS Bedrock
Might experiment with sagemaker during testing phase.
The initial model for summarizing/parsing information
The primary model for inferring key points and conducting transactions to the API
Calculated assuming 13,500 input tokens per hour of speaking and 1,350 output tokens per hour of speaking
Model | Price per 1000 input tokens | Price per 1000 output tokens | Price for 1 hour of transcript |
---|---|---|---|
Amazon Titan Text Lite | $0.00015 | $0.0002 | $2.30 |
LLama 3 (8B) | $0.0004 | $0.0006 | $6.21 |
LLama 2 (13B) | $0.00075 | $0.001 | $11.48 |
Command-Light | $0.0003 | $0.0006 | $4.86 |
Command R | $0.0005 | $0.0015 | $8.78 |
Claude 3 Haiku | $0.00025 | $0.00125 | $5.06 |
Claude Instant | $0.0008 | $0.0024 | $14.04 |
AWS offers regex power output filtering and keyword filtering on a free tier.
AWS also offers topic filtering at a paid tier. AWS Bedrock Pricing