Serverless MCP Enterprise Artifacts Platform

A fully serverless, cost-efficient AWS architecture for exposing enterprise artifacts (coding guidelines, documentation) via semantic search to IDE-integrated APIs.

Architecture Overview

Components

Amazon S3:
- my-enterprise-artifacts-bucket: Raw artifacts (PDFs, Markdown)
- my-enterprise-vectors-bucket: Vector embeddings and indexes (S3 Vectors)
AWS Lambda:
- Ingestion Lambda: Processes uploads, chunks text, generates embeddings
- Query Lambda: Handles API requests, performs vector search
Application Load Balancer: HTTPS API endpoint with authentication
Amazon Bedrock (Optional): Titan Embeddings fallback
Amazon Cognito (Optional): API authentication

Cost Optimization

S3 Vectors: 70-95% savings vs traditional vector DBs (~$0.023/GB/month)
Open-source embeddings (sentence-transformers) to avoid Bedrock costs
Low-memory Lambda functions (128-512MB)
ALB instead of API Gateway (~$0.0225/hour + data)
S3 Intelligent-Tiering for storage optimization

Project Structure

.
├── lambda/
│   ├── ingestion/
│   │   ├── handler.py          # Ingestion Lambda handler
│   │   ├── requirements.txt    # Dependencies
│   │   └── Dockerfile          # Container image (optional)
│   └── query/
│       ├── handler.py          # Query Lambda handler
│       ├── requirements.txt    # Dependencies
│       └── Dockerfile          # Container image (optional)
├── utils/
│   ├── s3_vectors.py           # S3 vector operations
│   ├── embedding.py            # Embedding generation
│   └── text_processing.py     # Text chunking utilities
├── infrastructure/
│   ├── app.py                  # AWS CDK main app
│   ├── stacks/
│   │   ├── storage_stack.py    # S3 buckets
│   │   ├── compute_stack.py    # Lambda functions
│   │   └── network_stack.py    # ALB, VPC
│   └── cdk.json                # CDK configuration
├── scripts/
│   ├── deploy.sh               # Deployment script
│   └── upload_artifact.py      # Sample upload script
└── requirements.txt            # Root dependencies

Prerequisites

AWS CLI configured with credentials
Python 3.9+
AWS CDK installed (npm install -g aws-cdk)
Docker (for Lambda container images)

Setup Instructions

1. Install Dependencies

pip install -r requirements.txt

2. Deploy Infrastructure

cd infrastructure
cdk bootstrap aws://ACCOUNT-ID/us-east-1
cdk deploy --all

3. Upload Sample Artifacts

python scripts/upload_artifact.py --file path/to/coding-guidelines.pdf

4. Test the API

curl -X POST https://YOUR-ALB-ENDPOINT/query \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the guidelines for error handling in Java?"}'

API Endpoints

POST /query

Performs semantic search on enterprise artifacts.

Request:

{
  "query": "What are the guidelines for error handling in Java?",
  "top_k": 5
}

Response:

{
  "results": [
    {
      "document_id": "coding-guidelines.pdf",
      "chunk": "Error handling in Java should use try-catch blocks...",
      "score": 0.89,
      "metadata": {
        "page": 15,
        "s3_uri": "s3://my-enterprise-artifacts-bucket/coding-guidelines.pdf"
      }
    }
  ]
}

Monitoring

CloudWatch Logs: Lambda execution logs
CloudWatch Metrics: Lambda invocations, duration, errors
S3 Metrics: Storage, request counts

Cost Estimation (Monthly)

S3 Storage (100GB): ~$2.30
S3 API Calls (1M): ~$0.40
Lambda (1M invocations): ~$0.20 + compute
ALB: ~$16.20 (720 hours)
Total: ~$20-30/month for moderate usage

Security

IAM roles with least-privilege access
ALB with HTTPS/TLS
Optional: Amazon Cognito for authentication
S3 bucket encryption at rest
VPC isolation for Lambda functions

Development

Local Testing

# Test ingestion locally
cd lambda/ingestion
python -m pytest tests/

# Test query locally
cd lambda/query
python -m pytest tests/

Adding New Artifacts

Upload to S3 bucket - EventBridge will trigger ingestion automatically:

aws s3 cp my-document.pdf s3://my-enterprise-artifacts-bucket/

Documentation

QUICKSTART.md - Get started in 15 minutes
ARCHITECTURE.md - Detailed architecture documentation
DIAGRAMS.md - Visual flow diagrams and architecture
TROUBLESHOOTING.md - Common issues and solutions
IDE_INTEGRATION.md - IDE integration examples
PROJECT_SUMMARY.md - Complete project overview

Troubleshooting

See TROUBLESHOOTING.md for common issues.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
infrastructure		infrastructure
lambda		lambda
mcp-server		mcp-server
scripts		scripts
terraform		terraform
utils		utils
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
DEPLOYMENT_ISSUES.md		DEPLOYMENT_ISSUES.md
DEPLOYMENT_PROGRESS.md		DEPLOYMENT_PROGRESS.md
DEPLOYMENT_STATUS.md		DEPLOYMENT_STATUS.md
DEPLOYMENT_SUCCESS.md		DEPLOYMENT_SUCCESS.md
DIAGRAMS.md		DIAGRAMS.md
DIRECT_DEPLOYMENT.md		DIRECT_DEPLOYMENT.md
IDE_INTEGRATION.md		IDE_INTEGRATION.md
LICENSE		LICENSE
MCP_INTEGRATION_GUIDE.md		MCP_INTEGRATION_GUIDE.md
MCP_TEST_COMMANDS.md		MCP_TEST_COMMANDS.md
MCP_TEST_RESULTS.md		MCP_TEST_RESULTS.md
PROJECT_SUMMARY.md		PROJECT_SUMMARY.md
QUICKSTART.md		QUICKSTART.md
README.md		README.md
TROUBLESHOOTING.md		TROUBLESHOOTING.md
coding-guidelines.md		coding-guidelines.md
ingest.py		ingest.py
ingest.zip		ingest.zip
notif.json		notif.json
query.py		query.py
query.zip		query.zip
requirements.txt		requirements.txt
s3.json		s3.json
trust.json		trust.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Serverless MCP Enterprise Artifacts Platform

Architecture Overview

Components

Cost Optimization

Project Structure

Prerequisites

Setup Instructions

1. Install Dependencies

2. Deploy Infrastructure

3. Upload Sample Artifacts

4. Test the API

API Endpoints

POST /query

Monitoring

Cost Estimation (Monthly)

Security

Development

Local Testing

Adding New Artifacts

Documentation

Troubleshooting

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Serverless MCP Enterprise Artifacts Platform

Architecture Overview

Components

Cost Optimization

Project Structure

Prerequisites

Setup Instructions

1. Install Dependencies

2. Deploy Infrastructure

3. Upload Sample Artifacts

4. Test the API

API Endpoints

POST /query

Monitoring

Cost Estimation (Monthly)

Security

Development

Local Testing

Adding New Artifacts

Documentation

Troubleshooting

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages