feat(providers): add Amazon Bedrock provider by SantiagoDePolonia · Pull Request #316 · ENTERPILOT/GoModel

SantiagoDePolonia · 2026-05-10T17:28:33Z

Summary

add Amazon Bedrock provider registration and AWS SDK dependencies
support Bedrock model listing, chat, streaming, tools, usage, and endpoint normalization
document BEDROCK_* config/env defaults

Tests

go test ./...
pre-commit hooks: make test-race, go mod tidy, make lint

Summary by CodeRabbit

New Features
- Amazon Bedrock support for chat completions, streaming (OpenAI-compatible SSE), and tool/function-style interactions; API-keyless AWS credential chain supported; embeddings rejected.
Documentation
- Added Bedrock config examples, credential/env guidance (including BEDROCK_BASE_URL and optional model allowlist), and updated supported LLM providers list with example model IDs.
Tests
- Comprehensive unit tests covering Bedrock initialization, request/response conversions, tool handling, and streaming behavior.

coderabbitai · 2026-05-10T17:28:47Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 1d948531-4baa-4b4d-9ff5-b0f0c63387da

📥 Commits

Reviewing files that changed from the base of the PR and between 622219c and 3fd5f3a.

📒 Files selected for processing (2)

internal/providers/bedrock/bedrock_test.go
internal/providers/bedrock/chat.go

📝 Walkthrough

Walkthrough

This PR adds Amazon Bedrock as a supported LLM provider. It integrates AWS SDK v2, registers the provider, implements synchronous and streaming OpenAI-compatible chat adapters (Converse), centralizes AWS error mapping, adds tests for conversion/streaming, and updates config and docs for Bedrock credentials and base URL.

Changes

Amazon Bedrock Provider Implementation

Layer / File(s)	Summary
Dependencies & Registration `go.mod`, `cmd/gomodel/main.go`	AWS SDK v2 modules added and Bedrock provider registered at startup.
Core Provider `internal/providers/bedrock/bedrock.go`	Provider type/Registration and New() constructor; BaseURL parsing and region/endpoint rewriting; AWS clients created via credential chain; readiness probe via ListFoundationModels; ListModels filters for text-capable models; Embeddings returns invalid-request; Responses/StreamResponses delegate to chat bridge; mapAWSError centralizes AWS→gateway error mapping.
Chat Completion `internal/providers/bedrock/chat.go`	ChatCompletion builds Bedrock ConverseInput from core.ChatRequest (messages, system, inference config, tool config), calls runtime.Converse, maps AWS errors, and converts ConverseOutput to core.ChatResponse (aggregates text, maps tool calls, normalizes finish_reason, and includes usage).
Streaming Chat `internal/providers/bedrock/chat_stream.go`	StreamChatCompletion wraps ConverseStream into an OpenAI-compatible SSE io.ReadCloser; streamConverter emits role/content/tool_call chunks, defers final finish chunk until metadata/usage arrives, supports Close(), and serializes stable SSE framing.
Unit Tests `internal/providers/bedrock/bedrock_test.go`	Tests cover BaseURL parsing, plane endpoint derivation, stop-reason mapping, buildConverseParts behaviors (max token fallback/precedence, merging tool results, top_p forwarding, max_tokens overflow), tool conversion, Converse output conversion (text, tool args, usage), provider init with bearer token, registration metadata, and streaming SSE framing/finish deferral/usage formatting.
Configuration & Documentation `.env.template`, `config/config.example.yaml`, `README.md`	Adds Bedrock configuration notes and examples: BEDROCK_BASE_URL (region or endpoint), BEDROCK_MODELS allowlist, AWS credential chain guidance, and README table entry listing Bedrock as a supported provider.

Sequence Diagram(s)

sequenceDiagram
  participant Client
  participant GatewayProvider as Provider
  participant BedrockRuntime as BedrockRuntime
  participant BedrockControl as ControlPlane
  Client->>GatewayProvider: ChatRequest / StreamRequest
  GatewayProvider->>BedrockRuntime: Converse / ConverseStream
  BedrockRuntime-->>GatewayProvider: ConverseOutput / Events
  GatewayProvider->>Client: ChatResponse / SSE chunks
  GatewayProvider->>BedrockControl: ListFoundationModels (availability, discovery)

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

ENTERPILOT/GoModel#285: Adds a different LLM provider and updates provider registration, sharing provider-registration patterns with this PR.
ENTERPILOT/GoModel#170: Earlier provider integration and provider registration refactor that this Bedrock registration aligns with.

Poem

I hopped through code and found a rock,
Bedrock listens — chat will talk.
Streams that sing in tiny frames,
Tokens counted, named by names,
Rabbit cheers — deploy with a hop. 🐰✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 31.91% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: adding Amazon Bedrock as a new LLM provider, which is reflected across all modified files including provider implementation, configuration, documentation, and dependencies.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/amazon-bedrock

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codecov-commenter · 2026-05-10T17:31:36Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

❌ Patch coverage is 50.25467% with 293 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
internal/providers/bedrock/chat_stream.go	27.51%	105 Missing and 3 partials ⚠️
internal/providers/bedrock/bedrock.go	29.62%	90 Missing and 5 partials ⚠️
internal/providers/bedrock/chat.go	70.72%	63 Missing and 26 partials ⚠️
cmd/gomodel/main.go	0.00%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

coderabbitai

Actionable comments posted: 7

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@internal/providers/bedrock/bedrock_test.go`:
- Around line 575-577: Remove the misleading comment and the unused shim
variable by deleting the `// trimRightAll silences staticcheck for unused
strings import in some local` comment and the declaration `var _ =
strings.TrimSpace`; these are unnecessary because the `strings` package is
actually used elsewhere in this test file (so no staticcheck suppression is
needed). Locate the `var _ = strings.TrimSpace` symbol in `bedrock_test.go` and
remove that line and its preceding comment, leaving the existing `import
"strings"` and all real usages intact.
- Around line 557-573: Replace the custom errorsAs helper with the standard
library errors utilities: remove the errorsAs function, add "errors" to the
imports, and update all call sites that relied on errorsAs (checks for
*core.GatewayError) to use errors.As(err, &gw) or, if building with Go 1.26+,
errors.AsType[*core.GatewayError](err); ensure targets use a variable of type
**core.GatewayError or *core.GatewayError as appropriate so the standard
errors.As/AsType assigns correctly.

In `@internal/providers/bedrock/bedrock.go`:
- Around line 98-145: parseBaseURL, regionFromHost, controlPlaneEndpoint and
runtimePlaneEndpoint are currently inferring/rewriting hosts too broadly; change
them to only operate on known AWS Bedrock hostnames (e.g. hosts that follow the
AWS pattern like "bedrock.<region>.amazonaws.com" or
"bedrock-runtime.<region>.amazonaws.com"). Specifically: have parseBaseURL parse
the input with net/url and call regionFromHost only if the parsed host matches
the AWS bedrock pattern, otherwise return the original endpoint unchanged and an
empty region; update regionFromHost to validate the host ends with the AWS
domain (for example ".amazonaws.com") and extract the region only when parts
match the exact "bedrock" or "bedrock-runtime" + region structure; and modify
controlPlaneEndpoint and runtimePlaneEndpoint to parse the URL and swap only the
service label in the host for validated AWS bedrock hosts (not via blind string
Replace), returning the input unchanged for any non-AWS/custom hosts.

In `@internal/providers/bedrock/chat_stream.go`:
- Around line 153-177: Replace the ad-hoc map[string]any payloads passed into
s.formatChunk with small, strongly-typed Go structs (e.g., ToolCallChunk,
FunctionSpec, DeltaChunk) that have explicit JSON tags and fields matching the
streamed contract; update the code paths that build tool payloads (the block
creating toolStreamState and calling s.append(s.formatChunk(...))) and the other
similar spots referenced (around the s.formatChunk calls near the other ranges)
to instantiate and pass those structs instead of nested maps, and adjust
formatChunk/serialization if needed to accept/encode the typed structs while
keeping the same output shape; use toolStreamState.openAIIndex, id, and name to
populate the typed fields.

In `@internal/providers/bedrock/chat.go`:
- Around line 79-83: resolveMaxTokens returns an int but we currently cast its
result to int32 when populating brtypes.InferenceConfiguration.MaxTokens (in the
chat handler functions using resolveMaxTokens), which can overflow and produce
invalid/negative values; update the code to check the returned int against
math.MaxInt32 before casting, and if it exceeds that bound return an
invalid_request_error describing the overflow; apply the same
bound-check-and-error logic to the other occurrence that sets
InferenceConfiguration.MaxTokens (the second chat path around the other
resolveMaxTokens usage) so both code paths refuse oversized token limits rather
than silently wrapping.

In `@README.md`:
- Line 107: Update the Amazon Bedrock table row to mark BEDROCK_BASE_URL as
optional (instead of required) to match the documented fallback to
AWS_REGION/AWS_DEFAULT_REGION; edit the README table cell for the "Amazon
Bedrock" row referencing the BEDROCK_BASE_URL variable and adjust its
wording/indicator to show it is optional and mention that
AWS_REGION/AWS_DEFAULT_REGION will be used when unset (keep the rest of the row
values unchanged).

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 2e9e7138-4ed1-4c80-9f52-6ac5699f454a

📥 Commits

Reviewing files that changed from the base of the PR and between 2a90459 and de8d2e1.

⛔ Files ignored due to path filters (1)

go.sum is excluded by !**/*.sum

📒 Files selected for processing (9)

.env.template
README.md
cmd/gomodel/main.go
config/config.example.yaml
go.mod
internal/providers/bedrock/bedrock.go
internal/providers/bedrock/bedrock_test.go
internal/providers/bedrock/chat.go
internal/providers/bedrock/chat_stream.go

coderabbitai · 2026-05-10T17:35:31Z

+		s.append(s.formatChunk(map[string]any{"role": role}, nil, nil))
+
+	case *brtypes.ConverseStreamOutputMemberContentBlockStart:
+		if e.Value.Start == nil {
+			return
+		}
+		if start, ok := e.Value.Start.(*brtypes.ContentBlockStartMemberToolUse); ok {
+			idx := awssdk.ToInt32(e.Value.ContentBlockIndex)
+			state := &toolStreamState{
+				openAIIndex: len(s.toolByIndex),
+				id:          awssdk.ToString(start.Value.ToolUseId),
+				name:        awssdk.ToString(start.Value.Name),
+			}
+			s.toolByIndex[idx] = state
+			s.append(s.formatChunk(map[string]any{
+				"tool_calls": []map[string]any{{
+					"index": state.openAIIndex,
+					"id":    state.id,
+					"type":  "function",
+					"function": map[string]any{
+						"name":      state.name,
+						"arguments": "",
+					},
+				}},
+			}, nil, nil))


🛠️ Refactor suggestion | 🟠 Major | ⚡ Quick win

Replace the ad-hoc SSE payload maps with typed structs.

This path is constructing public response payloads with nested map[string]any, so field names and JSON shape are only checked at runtime. Please switch this to small typed chunk/delta structs to keep the streamed contract explicit and safer to evolve. Based on learnings "Do not use interface{} or map[string]interface{} for API request/response payload types. Prefer strongly-typed structs for API payload definitions to improve type safety, serialization, and documentation."

Also applies to: 199-206, 237-257

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@internal/providers/bedrock/chat_stream.go` around lines 153 - 177, Replace the ad-hoc map[string]any payloads passed into s.formatChunk with small, strongly-typed Go structs (e.g., ToolCallChunk, FunctionSpec, DeltaChunk) that have explicit JSON tags and fields matching the streamed contract; update the code paths that build tool payloads (the block creating toolStreamState and calling s.append(s.formatChunk(...))) and the other similar spots referenced (around the s.formatChunk calls near the other ranges) to instantiate and pass those structs instead of nested maps, and adjust formatChunk/serialization if needed to accept/encode the typed structs while keeping the same output shape; use toolStreamState.openAIIndex, id, and name to populate the typed fields.

coderabbitai · 2026-05-10T17:35:31Z

+func convertMessages(messages []core.Message) ([]brtypes.SystemContentBlock, []brtypes.Message, error) {
+	system := make([]brtypes.SystemContentBlock, 0)
+	out := make([]brtypes.Message, 0, len(messages))
+
+	for _, msg := range messages {
+		switch msg.Role {
+		case "system", "developer":
+			text := core.ExtractTextContent(msg.Content)
+			if text == "" {
+				continue
+			}
+			system = append(system, &brtypes.SystemContentBlockMemberText{Value: text})
+		case "tool":
+			block, err := convertToolResultMessage(msg)
+			if err != nil {
+				return nil, nil, err
+			}
+			out = append(out, brtypes.Message{
+				Role:    brtypes.ConversationRoleUser,
+				Content: []brtypes.ContentBlock{block},
+			})
+		case "user":
+			text := core.ExtractTextContent(msg.Content)
+			if text == "" {
+				continue
+			}
+			out = append(out, brtypes.Message{
+				Role:    brtypes.ConversationRoleUser,
+				Content: []brtypes.ContentBlock{&brtypes.ContentBlockMemberText{Value: text}},
+			})
+		case "assistant":
+			blocks, err := convertAssistantMessage(msg)
+			if err != nil {
+				return nil, nil, err
+			}
+			if len(blocks) == 0 {
+				continue
+			}
+			out = append(out, brtypes.Message{
+				Role:    brtypes.ConversationRoleAssistant,
+				Content: blocks,
+			})
+		default:
+			return nil, nil, core.NewInvalidRequestError("unsupported message role: "+msg.Role, nil)
+		}
+	}
+
+	return system, out, nil


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Do not silently drop unsupported message parts.

This converter only forwards extracted text for system / developer / user messages and ignores every other content-part type. A multimodal request can therefore lose non-text input entirely, or even end up sending an empty user turn. Until those parts are mapped to Bedrock content blocks, this path should fail fast with invalid_request_error instead of stripping them.

What about Postel's law?

@SantiagoDePolonia — that's a fair principle to raise, but I'd argue it doesn't straightforwardly apply here, for a couple of reasons.

Postel's Law is about tolerating variant input from external parties, not about silently transforming a well-formed caller request in ways the caller cannot detect.

The "be conservative in what you send" half of the principle actually works against the current behaviour: the gateway is sending a degraded or entirely absent message to Bedrock without any signal to the caller. The caller believes the model received their full request; it did not.

The most acute failure mode is a user message that contains only an image (no text). ExtractTextContent returns "", the continue fires, and the message is silently dropped from the conversation. Bedrock then sees a conversation with a missing user turn and may error or hallucinate. This is closer to silent data corruption than liberal acceptance.

The modern read of Postel's Law has also moved away from silent-drop as a pattern. Several IETF documents (e.g., the IAB Statement on Robust Extensibility and RFC 9110's guidance) have specifically noted that liberal-accept-without-feedback creates invisible bugs and makes debugging across system boundaries very difficult — the exact situation here where a multimodal request arrives, silently degrades, and produces a confusing or wrong model response.

A middle ground that honours both goals:

For content-part types the provider can't map (images, audio, files), return an explicit invalid_request_error — this is still "accepting" the request at the HTTP layer, just rejecting it with a clear explanation.

Continue silently dropping truly ignorable parts (e.g., empty text blocks), since those genuinely carry no information.

This gives callers actionable feedback while still being as permissive as the underlying provider allows. Happy to keep the comment open or close it if you prefer the current behaviour — just wanted to explain the reasoning.

🧠 Learnings used

Learnt from: SantiagoDePolonia Repo: ENTERPILOT/GOModel PR: 33 File: internal/auditlog/auditlog_test.go:272-279 Timestamp: 2026-01-15T23:07:26.646Z Learning: Go code guideline: Avoid using interface{} and map[string]interface{} for API request/response payload types. This should be restricted to public API surfaces and data payloads. It is acceptable to use interface{} in internal infrastructure packages (e.g., storage abstractions) only to break import cycles, and only when type assertions occur at initialization (not on every request) to minimize runtime type checks and improve readability. Apply this rule broadly to files under internal/ that deal with payload-like types, but not to internal components that are pure infrastructure or have runtime-per-request type assertions.

Learnt from: SantiagoDePolonia Repo: ENTERPILOT/GOModel PR: 33 File: internal/auditlog/factory.go:112-143 Timestamp: 2026-01-15T23:07:37.652Z Learning: Guideline: Do not use interface{} or map[string]interface{} for API request/response payload types. Prefer strongly-typed structs for API payload definitions to improve type safety, serialization, and documentation. Allow interface{} only in internal infrastructure code paths where pragmatic flexibility is necessary (e.g., to avoid import cycles or to handle highly dynamic internal contracts). In internal/auditlog/factory.go and similar non-API implementation files, applying this restriction is optional and should be evaluated on a case-by-case basis based on whether the type remains internal and does not define API boundary shapes.

Learnt from: SantiagoDePolonia Repo: ENTERPILOT/GOModel PR: 99 File: internal/providers/responses_adapter.go:128-131 Timestamp: 2026-02-25T16:55:22.071Z Learning: In internal/providers/responses_adapter.go (and related files), Do and DoStream should already normalize provider errors using core.ParseProviderError(). Call sites (e.g., ChatCompletion, StreamChatCompletion) receive normalized GatewayError types and must not call ParseProviderError again. When reviewing code, ensure no downstream calls duplicate Normalize error handling and that all provider-error surfaces rely on the GatewayError type as the canonical representation. If you find code that re-parses errors with ParseProviderError, suggest removing that duplication and relying on the existing normalization path.

Learnt from: SantiagoDePolonia Repo: ENTERPILOT/GoModel PR: 294 File: internal/admin/handler_usage.go:243-244 Timestamp: 2026-04-30T07:12:57.910Z Learning: In Go code targeting Go 1.26+ (e.g., this repo uses Go 1.26.x), do not flag `errors.AsType[T](err)` as a missing/non-existent API. `errors.AsType[E error](err error)` returns `(E, bool)` as a type-safe, generic alternative to `errors.As`. For example, code like `if val, ok := errors.AsType[*SomeErrorType](err); ok { ... }` is valid standard-library usage and should be treated as such during review.

Learnt from: SantiagoDePolonia Repo: ENTERPILOT/GoModel PR: 294 File: internal/admin/handler_usage.go:243-244 Timestamp: 2026-04-30T07:12:57.910Z Learning: In Go 1.26+ code, `errors.AsType[T](err)` is a standard-library, generic alternative to `errors.As`. Do not flag `errors.AsType[...]` as a non-existent function or compilation error when the repository’s Go version is 1.26 or newer. The expected usage pattern is `if val, ok := errors.AsType[*SomeErrorType](err); ok { ... }`.

coderabbitai · 2026-05-10T17:35:31Z

 | Oracle        | `ORACLE_API_KEY` + `ORACLE_BASE_URL`                              | `openai.gpt-oss-120b`              |  ✅  |      ✅      |  ❌   |  ❌   |   ❌    |    ❌    |
 | Ollama        | `OLLAMA_BASE_URL`                                                 | `llama3.2`                         |  ✅  |      ✅      |  ✅   |  ❌   |   ❌    |    ❌    |
 | vLLM          | `VLLM_BASE_URL` (`VLLM_API_KEY` optional)                         | `meta-llama/Llama-3.1-8B-Instruct` |  ✅  |      ✅      |  ✅   |  ❌   |   ❌    |    ✅    |
+| Amazon Bedrock | `BEDROCK_BASE_URL` (region or endpoint) + AWS credentials        | `anthropic.claude-3-5-haiku-20241022-v1:0` |  ✅  |      ✅      |  ❌   |  ❌   |   ❌    |    ❌    |


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Mark BEDROCK_BASE_URL as optional in the table.

This row currently makes BEDROCK_BASE_URL look required, but the rest of this PR documents fallback to AWS_REGION / AWS_DEFAULT_REGION when it is unset. Please reflect that here so the setup guidance stays consistent. As per coding guidelines "Documentation should be concise, practical, and user-focused. Show defaults, explain when to change them, and include minimal examples when useful."

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@README.md` at line 107, Update the Amazon Bedrock table row to mark BEDROCK_BASE_URL as optional (instead of required) to match the documented fallback to AWS_REGION/AWS_DEFAULT_REGION; edit the README table cell for the "Amazon Bedrock" row referencing the BEDROCK_BASE_URL variable and adjust its wording/indicator to show it is optional and mention that AWS_REGION/AWS_DEFAULT_REGION will be used when unset (keep the rest of the row values unchanged).

greptile-apps · 2026-05-10T17:37:36Z

Greptile Summary

This PR introduces a new Amazon Bedrock provider that wraps the Converse / ConverseStream APIs and normalises the result into OpenAI-compatible chat completions, streaming, and tool calls. The three previously flagged issues (missing TopP mapping, controlPlaneEndpoint hostname corruption, and flushToolResults slice aliasing) are all resolved and verified by dedicated regression tests.

chat.go builds the Converse request shape including message-role merging for parallel tool results, tool-choice normalisation, and deferred finish/usage for streaming.
chat_stream.go converts ConverseStreamOutput events into OpenAI SSE chunks, deferring the terminal chunk until the trailing metadata event so usage lands on the right frame.
Two minor gaps remain: the OpenAI stop parameter is not forwarded to InferenceConfiguration.StopSequences, and a user message whose entire content is non-text is silently dropped rather than producing a clear error.

Confidence Score: 5/5

Safe to merge; the three previously identified defects are all fixed and well-tested, and no new runtime-breaking issues were found.

The provider is a self-contained new addition with no changes to shared infrastructure. The critical message-merging and endpoint-rewriting bugs from earlier review rounds are fixed and covered by regression tests. The only remaining gaps are unimplemented pass-through of stop sequences and a missing explicit error for non-text user messages — both edge cases that don’t affect the primary happy path.

internal/providers/bedrock/chat.go — stop-sequence forwarding and empty-content user message handling

Important Files Changed

Filename	Overview
internal/providers/bedrock/bedrock.go	Provider registration, AWS SDK initialization, endpoint rewriting, model listing, error mapping — well-structured with anchored string replacements guarding custom hostnames
internal/providers/bedrock/chat.go	Converse request/response conversion; all three previously flagged issues (TopP, controlPlane hostname, flushToolResults aliasing) are fixed; stop-sequences from ExtraFields and empty-text user messages are silently dropped
internal/providers/bedrock/chat_stream.go	ConverseStream → OpenAI SSE bridge with correctly deferred finish/usage chunk; event handling and Close() are safe
internal/providers/bedrock/bedrock_test.go	Comprehensive table-driven tests covering message merging, aliasing regression, tool choice normalization, streaming deferred-finish, and bearer-token initialization
cmd/gomodel/main.go	Bedrock provider registered alongside other providers — change is minimal and correct

Sequence Diagram

sequenceDiagram
    participant Client as OpenAI Client
    participant GW as GoModel Gateway
    participant Chat as bedrock/chat.go
    participant Stream as bedrock/chat_stream.go
    participant BR as AWS Bedrock Runtime

    Client->>GW: POST /v1/chat/completions
    GW->>Chat: ChatCompletion(req)
    Chat->>Chat: buildConverseParts(req)
    Chat->>BR: Converse(ConverseInput)
    BR-->>Chat: ConverseOutput
    Chat->>Chat: convertConverseOutput()
    Chat-->>GW: ChatResponse
    GW-->>Client: OpenAI-compatible JSON

    Client->>GW: "POST /v1/chat/completions (stream=true)"
    GW->>Stream: StreamChatCompletion(req)
    Stream->>Stream: buildConverseParts(req)
    Stream->>BR: ConverseStream(ConverseStreamInput)
    BR-->>Stream: ConverseStreamOutput (event channel)
    loop Events
        BR-)Stream: contentBlockDelta / messageStop / metadata
        Stream->>Stream: handleEvent() → formatChunk()
        Stream-->>GW: SSE data chunk
    end
    Stream->>Stream: flushFinish() on metadata (with usage)
    Stream-->>GW: SSE [DONE]
    GW-->>Client: OpenAI SSE stream

_{Reviews (3): Last reviewed commit: "fix(bedrock): merge same-role neighbors ..." | Re-trigger Greptile}

Bedrock's Converse API requires strictly alternating user/assistant turns. The previous convertMessages emitted one user-role message per tool-role input, so any caller sending N parallel tool results in a single turn (the natural follow-up to N parallel tool_calls) hit a ValidationException at the Bedrock API boundary. Fold consecutive tool messages into one user message holding multiple ToolResult blocks. Verified against live AWS Bedrock — Warsaw + Tokyo parallel-tool roundtrip now returns a clean two-city answer where it previously failed. Other correctness fixes from the same review: - Forward top_p to InferenceConfiguration.TopP (via ExtraFields, since core.ChatRequest has no typed TopP field — matches gemini/native.go). - Reject max_tokens > math.MaxInt32 with invalid_request_error before casting int -> int32, so oversized values fail loudly instead of silently wrapping to a negative cap. - Anchor controlPlaneEndpoint's substring match to "://bedrock-runtime." for symmetry with runtimePlaneEndpoint, so custom hostnames like https://my-bedrock-runtime.internal.example.com aren't corrupted. - Tighten regionFromHost to require a ".amazonaws.com" suffix, so https://bedrock.internal.example.com no longer mis-extracts a region. Test cleanup: replace the custom errorsAs shim with stdlib errors.As, delete the now-unused strings.TrimSpace placeholder, and add focused unit tests for each fix above. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (2)

internal/providers/bedrock/chat.go (1)
136-156: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Don't silently discard unsupported non-text content.

core.ExtractTextContent(...) plus the empty-text skip path strips image/audio/file-only content from system / developer / user / assistant turns, and can remove a user turn entirely. That changes the caller’s prompt without any signal. Until those parts are mapped to Bedrock blocks, return invalid_request_error instead of mutating the conversation.

As per coding guidelines "Accept requests generously (e.g., allow max_tokens for any model) and adapt them to each provider's specific requirements before forwarding."

Also applies to: 178-182
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@internal/providers/bedrock/chat.go` around lines 136 - 156, The code
currently uses core.ExtractTextContent and silently skips messages with empty
text (in the switch cases handling "system"/"developer"/"user" and similarly at
lines 178-182), which drops non-text-only turns; change the behavior to detect
when core.ExtractTextContent(msg.Content) returns empty while msg.Content
contains non-text blocks and return an invalid_request_error instead of
continuing/omitting the message. Locate the switch handling roles and the code
that appends brtypes.SystemContentBlockMemberText or
brtypes.ContentBlockMemberText and, before skipping, validate whether the
original msg.Content includes non-text parts; if so, return a descriptive
invalid_request_error; only skip when the content is truly empty/no blocks.
Ensure this applies to both the system/developer and user/assistant handling
paths and preserve existing convertToolResultMessage usage for "tool" messages.
internal/providers/bedrock/bedrock.go (1)
140-153: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Custom full endpoints are still being rewritten.

runtimePlaneEndpoint("https://bedrock.internal.example.com") still returns https://bedrock-runtime.internal.example.com, and controlPlaneEndpoint does the inverse for https://bedrock-runtime.internal.example.com. That breaks the custom-endpoint flow this change is trying to preserve. Please parse the URL and swap the service label only when the hostname matches an AWS Bedrock host, then add both exact hostnames as regressions in TestPlaneEndpoint.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@internal/providers/bedrock/bedrock.go` around lines 140 - 153, The current
string-replace logic in controlPlaneEndpoint and runtimePlaneEndpoint rewrites
arbitrary hostnames like "bedrock.internal.example.com"; instead parse the input
using net/url, inspect the hostname, and only swap the service label ("bedrock"
↔ "bedrock-runtime") when the hostname exactly matches an AWS Bedrock pattern
(e.g. host starts with "bedrock." or "bedrock-runtime." and ends with the
expected AWS Bedrock domain suffix such as ".amazonaws.com" or whatever your
Bedrock hosts use); if parsing fails or the host doesn't match the AWS Bedrock
host pattern return the original endpoint unchanged. Update controlPlaneEndpoint
and runtimePlaneEndpoint to use this URL-parse + host-match approach and add two
precise regression cases to TestPlaneEndpoint that assert no rewrite for custom
full hostnames like "https://bedrock.internal.example.com" and
"https://bedrock-runtime.internal.example.com".

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@internal/providers/bedrock/chat.go`:
- Around line 131-168: The converter currently only folds consecutive "tool"
messages but still appends adjacent "user" or "assistant" messages as separate
Bedrock turns; update the loop that builds out so it merges consecutive messages
with the same role before appending (or rejects invalid sequences), i.e., when
handling cases for "user" and "assistant" (and when flushing pendingToolResults
into out via flushToolResults/pendingToolResults), detect if the last element in
out has the same brtypes.ConversationRole (check the last out entry's Role) and
if so concatenate the new Content blocks into that last Message's Content
instead of appending a new Message; ensure this logic is applied consistently
for both convertAssistantMessage and the user text branch and still preserves
system handling and existing tool folding via
convertToolResultMessage/flushToolResults.

---

Duplicate comments:
In `@internal/providers/bedrock/bedrock.go`:
- Around line 140-153: The current string-replace logic in controlPlaneEndpoint
and runtimePlaneEndpoint rewrites arbitrary hostnames like
"bedrock.internal.example.com"; instead parse the input using net/url, inspect
the hostname, and only swap the service label ("bedrock" ↔ "bedrock-runtime")
when the hostname exactly matches an AWS Bedrock pattern (e.g. host starts with
"bedrock." or "bedrock-runtime." and ends with the expected AWS Bedrock domain
suffix such as ".amazonaws.com" or whatever your Bedrock hosts use); if parsing
fails or the host doesn't match the AWS Bedrock host pattern return the original
endpoint unchanged. Update controlPlaneEndpoint and runtimePlaneEndpoint to use
this URL-parse + host-match approach and add two precise regression cases to
TestPlaneEndpoint that assert no rewrite for custom full hostnames like
"https://bedrock.internal.example.com" and
"https://bedrock-runtime.internal.example.com".

In `@internal/providers/bedrock/chat.go`:
- Around line 136-156: The code currently uses core.ExtractTextContent and
silently skips messages with empty text (in the switch cases handling
"system"/"developer"/"user" and similarly at lines 178-182), which drops
non-text-only turns; change the behavior to detect when
core.ExtractTextContent(msg.Content) returns empty while msg.Content contains
non-text blocks and return an invalid_request_error instead of
continuing/omitting the message. Locate the switch handling roles and the code
that appends brtypes.SystemContentBlockMemberText or
brtypes.ContentBlockMemberText and, before skipping, validate whether the
original msg.Content includes non-text parts; if so, return a descriptive
invalid_request_error; only skip when the content is truly empty/no blocks.
Ensure this applies to both the system/developer and user/assistant handling
paths and preserve existing convertToolResultMessage usage for "tool" messages.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: d367d5c6-2228-4e8e-bc78-4a8c2dd82ca9

📥 Commits

Reviewing files that changed from the base of the PR and between de8d2e1 and 622219c.

📒 Files selected for processing (3)

internal/providers/bedrock/bedrock.go
internal/providers/bedrock/bedrock_test.go
internal/providers/bedrock/chat.go

Two related correctness fixes in convertMessages: 1) flushToolResults previously returned blocks[:0], a zero-length slice sharing the backing array of the just-emitted message's Content. In a multi-turn conversation the next pending tool-result append wrote into that same array, silently overwriting the first tool-result block of the earlier turn. Return nil instead so subsequent appends allocate fresh storage. 2) Adjacent same-role inputs were appended as separate Bedrock turns, producing invalid sequences for the common pattern of a tool result followed by another user text message ([user, asst-tool-call, tool, user_text] became [user, asst, user_tool_result, user_text] — two consecutive user turns, rejected by Bedrock with ValidationException). New appendOrMerge helper folds same-role content into the previous message; flushToolResults and the user/assistant branches all route through it. Verified live against AWS Bedrock: a [user, asst-tool-call, tool, user_text] roundtrip now returns a coherent answer referencing both the tool result and the follow-up question, where it would previously have failed at the API boundary. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

feat(providers): add Amazon Bedrock provider

de8d2e1

coderabbitai Bot reviewed May 10, 2026

View reviewed changes

greptile-apps Bot reviewed May 10, 2026

View reviewed changes

Comment thread internal/providers/bedrock/chat.go

Comment thread internal/providers/bedrock/bedrock.go

coderabbitai Bot reviewed May 10, 2026

View reviewed changes

Comment thread internal/providers/bedrock/chat.go Outdated

greptile-apps Bot reviewed May 10, 2026

View reviewed changes

Comment thread internal/providers/bedrock/chat.go

SantiagoDePolonia merged commit d47c61c into main May 10, 2026
19 checks passed

Uh oh!

Conversation

SantiagoDePolonia commented May 10, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tests

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

codecov-commenter commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SantiagoDePolonia May 10, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 10, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SantiagoDePolonia commented May 10, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 10, 2026 •

edited

Loading

codecov-commenter commented May 10, 2026 •

edited

Loading

coderabbitai Bot May 10, 2026 •

edited

Loading

greptile-apps Bot commented May 10, 2026 •

edited

Loading