# Amazon Bedrock: Comprehensive Overview

## Table of Contents
1. [Introduction to Amazon Bedrock](#introduction-to-amazon-bedrock)
2. [Architecture](#architecture)
3. [Foundation Models](#foundation-models)
4. [Inference Parameters](#inference-parameters)
5. [Pricing](#pricing)
6. [Console Walkthrough](#console-walkthrough)
7. [Key Takeaways](#key-takeaways)

## Introduction to Amazon Bedrock
<span style="color:blue">#introduction</span>

Amazon Bedrock is a fully managed, serverless AWS service that exposes foundation models from Amazon and third-party providers through a single API. Key features include:

- Serverless and fully managed
- Unified API for multiple foundation models
- Pay-per-use pricing model
- Access to third-party foundation models

## Architecture
<span style="color:blue">#architecture</span>

- Bedrock service deployed in AWS-managed Bedrock Service Account
- Foundation models hosted in AWS-owned Model Provider Escrow Account
- Request flow:
  1. User initiates request via Console, CLI, or SDK
  2. Bedrock service routes the request to the foundation model named in the request (the model ID)
  3. Request sent to specific model provider account
  4. Response returned to user
- Networking options:
  - API requests over the internet
  - Private connectivity via VPC interface endpoints
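
The request flow above can be sketched from the SDK side. This is a minimal illustration, not a reference implementation: it assumes the boto3 `bedrock-runtime` client and its `invoke_model` call, and it assumes the JSON request and response shapes used by Amazon Titan text models (`inputText`, `textGenerationConfig`, `results[0].outputText`). Other providers expect different body formats. The helper names `build_titan_request` and `invoke_text_model` are hypothetical.

```python
import json


def build_titan_request(prompt, max_tokens=256, temperature=0.5, top_p=0.9):
    """Build a JSON body in the shape assumed for Amazon Titan text models."""
    return json.dumps({
        "inputText": prompt,
        "textGenerationConfig": {
            "maxTokenCount": max_tokens,
            "temperature": temperature,
            "topP": top_p,
        },
    })


def invoke_text_model(client, model_id, prompt):
    """Send one request through the runtime API and return the generated text.

    `client` is expected to behave like boto3.client("bedrock-runtime").
    """
    response = client.invoke_model(
        modelId=model_id,  # tells Bedrock which foundation model to route to
        body=build_titan_request(prompt),
        contentType="application/json",
        accept="application/json",
    )
    # The response body is a stream; read it fully before parsing the JSON.
    payload = json.loads(response["body"].read())
    return payload["results"][0]["outputText"]
```

With AWS credentials and model access granted, a call might look like `invoke_text_model(boto3.client("bedrock-runtime"), "amazon.titan-text-express-v1", "Say hello.")`. The model ID shown is an example and should be checked against the models enabled in your account.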

## Foundation Models
<span style="color:blue">#foundationModels</span>

Bedrock provides access to various foundation models:

- Amazon Titan: General purpose model
- AI21 Labs Jurassic-2: Multilingual text generation
- Anthropic Claude: Question answering and workflow automation
- Stability AI Stable Diffusion: Image generation
- Cohere: Text and embedding models
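
The list of available models can also be retrieved programmatically. The sketch below assumes the boto3 `bedrock` control-plane client and its `list_foundation_models` call, whose response is assumed to contain a `modelSummaries` list with `providerName` and `modelId` fields. The helper name `models_by_provider` is hypothetical.

```python
from collections import defaultdict


def models_by_provider(bedrock_client):
    """Group model IDs from ListFoundationModels under their provider name.

    `bedrock_client` is expected to behave like boto3.client("bedrock").
    """
    response = bedrock_client.list_foundation_models()
    grouped = defaultdict(list)
    for summary in response["modelSummaries"]:
        grouped[summary["providerName"]].append(summary["modelId"])
    return dict(grouped)
```

The actual set of providers and model IDs returned depends on the region and on what has been released since these notes were written.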

## Inference Parameters
<span style="color:blue">#inferenceParameters</span>

### Randomness and Diversity
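- Temperature: Controls output randomness; lower values make output more deterministic, higher values more varied (range depends on the model, typically 0-1 or 0-5)
- Top K: Limits selection to the K most probable next tokens
- Top P: Limits selection to the smallest set of most probable tokens whose cumulative probability reaches P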
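
To make the three parameters concrete, here is a toy sketch of how they are commonly applied to a next-token distribution. It is a generic illustration over made-up logits, not the implementation used by any specific Bedrock model, and all function names are hypothetical. Temperature must be greater than zero in this sketch.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities; lower temperature sharpens the distribution."""
    scaled = [value / temperature for value in logits.values()]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return {tok: e / total for tok, e in zip(logits, exps)}


def top_k_filter(probs, k):
    """Keep only the k most probable tokens, then renormalize."""
    kept = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    total = sum(p for _, p in kept)
    return {tok: p / total for tok, p in kept}


def top_p_filter(probs, p):
    """Keep the smallest set of top tokens whose cumulative probability reaches p."""
    kept, cumulative = [], 0.0
    for tok, prob in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        kept.append((tok, prob))
        cumulative += prob
        if cumulative >= p:
            break
    total = sum(prob for _, prob in kept)
    return {tok: prob / total for tok, prob in kept}


def sample(probs, rng):
    """Draw one token according to the (filtered) probabilities."""
    tokens = list(probs)
    return rng.choices(tokens, weights=[probs[t] for t in tokens], k=1)[0]


# Made-up logits for four candidate next tokens.
logits = {"cat": 2.0, "dog": 1.5, "fish": 0.5, "rock": -1.0}
probs = softmax_with_temperature(logits, temperature=0.7)
candidates = top_p_filter(top_k_filter(probs, k=3), p=0.9)
next_token = sample(candidates, random.Random(0))
```

Applying Top K before Top P is one common ordering; providers may combine the filters differently.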

### Length
- Max Length: Caps the number of tokens in the generated response (limit depends on the model, typically 1-4,096 tokens)
- Stop Sequence: Stops token generation when a specified character sequence is encountered
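
The two length controls can be illustrated with a toy generation loop. This is a sketch under simplifying assumptions: tokens arrive as plain strings from a hypothetical `token_stream`, and the stop sequence is dropped from the returned text. Whether a real model includes the stop sequence in its output varies by provider.

```python
def generate(token_stream, max_tokens, stop_sequences=()):
    """Collect tokens until the length budget or a stop sequence ends generation.

    Returns the text and the reason generation stopped.
    """
    text = ""
    for count, token in enumerate(token_stream):
        if count >= max_tokens:
            return text, "max_length"
        text += token
        for stop in stop_sequences:
            cut = text.find(stop)
            if cut != -1:
                # Truncate at the stop sequence and end generation early.
                return text[:cut], "stop_sequence"
    return text, "end_of_stream"


tokens = ["Hello", ",", " world", ".", " Bye", "."]
short_text, short_reason = generate(tokens, max_tokens=3)
stopped_text, stopped_reason = generate(tokens, max_tokens=10, stop_sequences=["."])
```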

### Repetition
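- Presence Penalty: Reduces the probability of any token that has already appeared, regardless of how many times
- Count Penalty: Reduces the probability of a token in proportion to the number of times it has already appeared
- Frequency Penalty: Reduces the probability of repeated tokens in proportion to their frequency, normalized to text length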
- Penalize Special Token: Reduces repetition of special characters
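
The difference between the presence, count, and frequency penalties is easier to see in code. The sketch below subtracts each penalty from a token's logit; the exact formulas used by a given model provider are not stated in these notes, so the arithmetic here is an assumption chosen to match the descriptions above. The function name is hypothetical.

```python
from collections import Counter


def apply_repetition_penalties(logits, generated, presence=0.0, count=0.0, frequency=0.0):
    """Lower the logits of tokens that already appear in the generated text."""
    seen = Counter(generated)
    length = max(len(generated), 1)  # avoid dividing by zero on empty text
    adjusted = {}
    for token, logit in logits.items():
        n = seen[token]
        penalty = 0.0
        if n > 0:
            penalty += presence                # flat, applied once if the token appeared
            penalty += count * n               # grows with the number of appearances
            penalty += frequency * n / length  # appearances normalized by text length
        adjusted[token] = logit - penalty
    return adjusted


logits = {"the": 1.0, "cat": 1.0, "sat": 1.0}
history = ["the", "cat", "the", "the"]
with_presence = apply_repetition_penalties(logits, history, presence=0.5)
with_count = apply_repetition_penalties(logits, history, count=0.2)
with_frequency = apply_repetition_penalties(logits, history, frequency=1.0)
```

With the presence penalty, "the" and "cat" are lowered by the same amount. With the count and frequency penalties, "the" is lowered more because it appeared three times.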

## Pricing
<span style="color:blue">#pricing</span>

### On-Demand Pricing
- Pay-as-you-go model
- Charged for input and output tokens (text models)
- Per-image pricing for image generation models
- Input token pricing for embeddings
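
The on-demand charges above reduce to simple arithmetic. The sketch below assumes prices are quoted per 1,000 tokens and per image; the prices in the example are placeholders for illustration only, not actual Bedrock rates.

```python
def text_cost(input_tokens, output_tokens, price_in_per_1k, price_out_per_1k):
    """Text models: input and output tokens are billed at separate rates."""
    return (input_tokens / 1000) * price_in_per_1k + (output_tokens / 1000) * price_out_per_1k


def image_cost(images, price_per_image):
    """Image generation models: billed per generated image."""
    return images * price_per_image


def embedding_cost(input_tokens, price_in_per_1k):
    """Embedding models: only input tokens are billed."""
    return (input_tokens / 1000) * price_in_per_1k


# Placeholder prices, for illustration only.
request_cost = text_cost(2000, 500, price_in_per_1k=0.001, price_out_per_1k=0.002)
```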

### Provisioned Throughput
- For large, consistent workloads or custom models
- Requires a time commitment (1-month or 6-month terms)
- Significantly more expensive than on-demand pricing
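
A rough way to compare the two pricing modes is to estimate a monthly provisioned cost and set it against expected on-demand spend. The sketch assumes provisioned throughput is billed at an hourly rate per model unit and uses about 730 hours per month; the rate and spend figures are placeholders, not actual Bedrock prices.

```python
HOURS_PER_MONTH = 730  # approximate average number of hours in a month


def provisioned_monthly_cost(hourly_rate_per_unit, model_units, hours=HOURS_PER_MONTH):
    """Estimate the monthly cost of reserved capacity (assumed hourly billing)."""
    return hourly_rate_per_unit * model_units * hours


def cheaper_option(on_demand_monthly, provisioned_monthly):
    """Return which pricing mode costs less for the estimated workload."""
    return "on-demand" if on_demand_monthly <= provisioned_monthly else "provisioned"


# Placeholder numbers, for illustration only.
reserved = provisioned_monthly_cost(hourly_rate_per_unit=20.0, model_units=1)
choice = cheaper_option(on_demand_monthly=500.0, provisioned_monthly=reserved)
```

Cost is only one input to the decision: custom models and workloads that need guaranteed throughput may call for provisioned capacity regardless of the comparison.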

## Console Walkthrough
<span style="color:blue">#consoleWalkthrough</span>

- Accessing Bedrock via AWS console
- Requesting model access
- Exploring available foundation models
- Using the playground for model testing
- Creating custom models (fine-tuning)
- Managing provisioned throughput

## Key Takeaways
<span style="color:blue">#keyTakeaways</span>

1. Amazon Bedrock provides serverless access to powerful foundation models
2. Inference parameters let you adjust model outputs (randomness, length, repetition) without retraining the model
3. On-demand pricing is cost-effective for most use cases
4. Provisioned throughput offers guaranteed performance for production workloads
5. The service is designed for easy integration and experimentation