Improve metadata validation in message handlers and add test for inva… by Subhajitdas99 · Pull Request #335 · GetBindu/Bindu

Subhajitdas99 · 2026-03-05T23:10:53Z

Summary

Improves metadata validation in MessageHandlers.

Changes

Normalize metadata when it is None or not a dictionary
Prevents runtime errors when checking _payment_context
Added unit test for invalid metadata type

Verification

Full test suite passes locally

…lid metadata

Subhajitdas99 · 2026-03-06T00:16:34Z

##Summary

Problem: message.metadata may be None or a non-dictionary value, which caused a TypeError when checking for _payment_context inside _submit_and_schedule_task.

Why it matters: This could break message processing and cause runtime failures when metadata is missing or malformed.

What changed: Added defensive normalization to ensure metadata is always a dictionary before accessing _payment_context, and added a unit test for invalid metadata types.

What did NOT change (scope boundary): No changes to task scheduling logic, message processing flow, or streaming behavior.

##Change Type
[+] Bug fix
[ ] Feature
[ ] Refactor
[ ] Documentation
[ ] Security hardening
[+] Tests
[ ] Chore/infra

##Scope
[+] Server / API endpoints
[ ] Extensions (DID, x402, etc.)
[ ] Storage backends
[ ] Scheduler backends
[ ] Observability / monitoring
[ ] Authentication / authorization
[ ] CLI / utilities
[+] Tests
[ ] Documentation
[ ] CI/CD / infra

##Linked Issue/PR
Closes #
Related #

##User-Visible / Behavior Changes
None.

##Security Impact
-New permissions/capabilities? No
-Secrets/credentials handling changed? No
-New/changed network calls? No
-Database schema/migration changes? No
-Authentication/authorization changes? No

##Verification
#Environment
-OS: Windows 11
-Python version: 3.11
-Storage backend: InMemoryStorage
-Scheduler backend: Mock scheduler for unit tests

##Steps to Test
1.Run full unit test suite
2.Execute send_message handler with messages containing invalid metadata
3.Confirm handler processes request without runtime errors

##Expected Behavior
Message handlers should safely normalize metadata and continue processing without errors.

##Actual Behavior
After fix, invalid metadata inputs are safely handled and tests pass.

##Evidence
[+]Test output / logs

-Example:
634 passed, 18 skipped, 1 deselected

##Human Verification
#Verified scenarios:
-message with metadata=None
-message with invalid metadata type
-normal message flow

#Edge cases checked:
_payment_context extraction
-metadata normalization

#What you did NOT verify:
-production storage backends

##Compatibility / Migration
-Backward compatible? Yes
-Config/env changes? No
-Database migration needed? No

##Failure Recovery
-How to disable/revert this change quickly: revert the commit affecting message_handlers.py
-Files/config to restore: bindu/server/handlers/message_handlers.py

Known bad symptoms reviewers should watch for: unexpected metadata mutation

##Risks and Mitigations
None.

##Checklist
[+] Tests pass (uv run pytest)
[ ] Pre-commit hooks pass (uv run pre-commit run --all-files)
[ ] Documentation updated (if needed)
[+] Security impact assessed
[+] Human verification completed
[+]Backward compatibility considered

raahulrahl · 2026-03-06T09:32:59Z

PR Review: #335 - Message Metadata Validation & Error Handling

Summary

This PR improves robustness in the message handling code by adding validation for message metadata and better error logging. The changes prevent potential crashes when invalid metadata types are received and ensure consistent data structures throughout the message processing pipeline.

Changes Overview

1. Enhanced Metadata Validation

File: bindu/server/handlers/message_handlers.py

Before:

message_metadata = message.get("metadata", {})
if (
    isinstance(message_metadata, dict)
    and "_payment_context" in message_metadata
):
    scheduler_params["payment_context"] = message_metadata["_payment_context"]
    del message_metadata["_payment_context"]

After:

message_metadata = message.get("metadata")
# Normalize metadata to a dictionary
if message_metadata is None:
    message_metadata = {}
    message["metadata"] = message_metadata

elif not isinstance(message_metadata, dict):    
    logger.warning(
        "Invalid metadata type received in message",
        extra={"type": type(message_metadata).__name__}
    )
    message["metadata"] = {}
    message_metadata = message["metadata"]

if "_payment_context" in message_metadata:
    scheduler_params["payment_context"] = message_metadata["_payment_context"]
    del message_metadata["_payment_context"]

What this improves:

Defensive Programming: The original code assumed metadata would always be a dictionary or None. If a client sent invalid data (e.g., metadata: "string" or metadata: 123), the code would fail when trying to check "_payment_context" in message_metadata.
Explicit Normalization: The new code explicitly handles three cases:
- metadata is None → create empty dict
- metadata is not a dict → log warning and replace with empty dict
- metadata is a dict → proceed normally
Data Integrity: By normalizing invalid metadata to an empty dictionary and updating the message object itself (message["metadata"] = message_metadata), we ensure downstream code (storage, task processing) always receives valid data structures.
Observability: The warning log includes the actual type received, making debugging easier when invalid data is sent.

2. Empty Payload Guard in SSE Event Serialization

Before:

@staticmethod
def _sse_event(payload: dict[str, Any]) -> str:
    """Serialize an SSE event payload."""
    return f"data: {json.dumps(MessageHandlers._to_jsonable(payload))}\n\n"

After:

@staticmethod
def _sse_event(payload: dict[str, Any]) -> str:
    """Serialize an SSE event payload."""
    if not payload:
         return ""

    return f"data: {json.dumps(MessageHandlers._to_jsonable(payload))}\n\n"

What this improves:

Prevents Empty Events: If payload is an empty dictionary or falsy value, we now return an empty string instead of sending data: {}\n\n over SSE. This avoids unnecessary network traffic and potential client-side confusion.
Cleaner SSE Stream: Clients won't receive meaningless empty JSON objects in the event stream.

3. Improved Error Logging

Before:

logger.error(
    f"Unhandled stream error for task {task['id']}: {e}", exc_info=True
)

After:

logger.error(
     "Unhandled stream error",
      extra = {task['id': str(task["id"])]},
      exc_info = True,
)

What this improves:

Structured Logging: Moving the task ID to the extra parameter makes it easier for log aggregation systems (like ELK, Datadog, etc.) to parse and index the task ID as a structured field.
Consistency: This follows Python logging best practices where contextual data goes in extra rather than being interpolated into the message string.

Note: There appears to be a syntax error in the diff. The correct code should be:

logger.error(
    "Unhandled stream error",
    extra={"task_id": str(task["id"])},
    exc_info=True,
)

The diff shows {task['id': str(task["id"])]} which has mismatched brackets and quotes. This would cause a syntax error.

4. Test Coverage

File: tests/unit/test_message_handlers.py

A new test was added:

@pytest.mark.asyncio
async def test_send_message_invalid_metadata_type_handled():
    """
    If metadata is not a dict, the handler should not crash and should normalize metadata safely.
    """
    storage = InMemoryStorage()
    handlers = _make_handlers(storage)

    message = create_test_message(
        text = "invalid metadata",
        metadata = "this_should_be_a_dict"
    )
    request = _send_request(message)

    response = await handlers.send_message(request)
    assert_jsonrpc_success(response)

    stored_task = await storage.load_task(response["result"]["id"])
    stored_metadata = (stored_task["history"] or [{}])[-1].get("metadata", {})
    #Metadata should be normalized to a dictionary
    assert isinstance(stored_metadata, dict)

What this validates:

Non-Crash Behavior: Verifies that sending invalid metadata (a string instead of a dict) doesn't crash the handler.
Normalization: Confirms that the stored metadata is normalized to an empty dictionary, ensuring data consistency in storage.
Regression Prevention: This test will catch any future changes that might break this defensive behavior.

Issues Found

Critical: Syntax Error in Logging Code

The error logging change has a syntax error:

extra = {task['id': str(task["id"])]},  # Wrong: mismatched brackets and quotes

Should be:

extra={"task_id": str(task["id"])},

This will cause the code to fail at runtime when an exception occurs during streaming.

Minor: Code Style Inconsistencies

Spacing: Inconsistent spacing around = in the logger call:

extra = {task['id': str(task["id"])]},  # Space before =
exc_info = True,                         # Space before =

Should be:

extra={"task_id": str(task["id"])},
exc_info=True,

Indentation: The SSE event guard has inconsistent indentation (extra space):
```
if not payload:
     return ""  # Extra space
```

Recommendations

Must Fix

Fix the syntax error in the logging statement:

logger.error(
    "Unhandled stream error",
    extra={"task_id": str(task["id"])},
    exc_info=True,
)

Should Consider

Add a test for the SSE empty payload case to verify the new guard works correctly:

def test_sse_event_empty_payload():
    result = MessageHandlers._sse_event({})
    assert result == ""

Consider adding type hints to make the metadata validation more explicit:
```
message_metadata: dict[str, Any] | None = message.get("metadata")
```
Document the metadata normalization behavior in the function docstring so future developers understand why this validation exists.

Verdict

REQUEST CHANGES - The PR has good intentions and improves code robustness, but the syntax error in the logging code must be fixed before merging. Once corrected, this will be a valuable improvement to error handling and data validation.

Positive Aspects

Excellent defensive programming approach
Good test coverage for the new validation logic
Improves observability with structured logging
Prevents potential crashes from malformed input
Maintains backward compatibility

Thank you for working on improving the robustness of the message handling code. The metadata validation is a great addition that will prevent subtle bugs. Please fix the syntax error and this will be ready to merge!

Subhajitdas99

Thanks for the review!

Good catch on the syntax error in the structured logging block.
I've fixed the issue and pushed the update.

Please let me know if anything else needs adjustment.

Subhajitdas99

Summary

This PR fixes and enables the bindufy validation test suite which was previously skipped due to broken imports and monkeypatching.

Changes

Correctly import the bindufy module for monkeypatching
Restore bindufy function import for test execution
Fix incorrect exception expectations in tests
Remove global skip marker and enable validation tests

Result

All validation tests now run successfully:

12 passed in 1.15s

These tests verify required configuration validation for:

author
name
deployment.url

This improves reliability of the agent configuration validation logic.

Subhajit Das added 2 commits March 6, 2026 00:47

Improve metadata validation in message handlers and add test for inva…

6fcf6e9

…lid metadata

Improve structured logging for stream_message error handling

39523df

raahulrahl added the inprogress label Mar 6, 2026

Fix syntax error in structured logging for stream_message

3fc13f2

Subhajitdas99 commented Mar 6, 2026

View reviewed changes

Fix bindufy validation tests and enable previously skipped test suite

807c396

Subhajitdas99 commented Mar 6, 2026

View reviewed changes

Merge branch 'main' into improve-message-handler-tests

4bac728

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve metadata validation in message handlers and add test for inva…#335

Improve metadata validation in message handlers and add test for inva…#335
Subhajitdas99 wants to merge 5 commits intoGetBindu:mainfrom
Subhajitdas99:improve-message-handler-tests

Subhajitdas99 commented Mar 5, 2026

Uh oh!

Subhajitdas99 commented Mar 6, 2026

Uh oh!

raahulrahl commented Mar 6, 2026

Uh oh!

Subhajitdas99 left a comment

Uh oh!

Subhajitdas99 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Subhajitdas99 commented Mar 5, 2026

Summary

Changes

Verification

Uh oh!

Subhajitdas99 commented Mar 6, 2026

Uh oh!

raahulrahl commented Mar 6, 2026

PR Review: #335 - Message Metadata Validation & Error Handling

Summary

Changes Overview

1. Enhanced Metadata Validation

2. Empty Payload Guard in SSE Event Serialization

3. Improved Error Logging

4. Test Coverage

Issues Found

Critical: Syntax Error in Logging Code

Minor: Code Style Inconsistencies

Recommendations

Must Fix

Should Consider

Verdict

Positive Aspects

Uh oh!

Subhajitdas99 left a comment

Choose a reason for hiding this comment

Uh oh!

Subhajitdas99 left a comment

Choose a reason for hiding this comment

Summary

Changes

Result

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants