
Replace curl-download with paginated inline results + widget JSON endpoint#232

Merged
RafaelPo merged 20 commits into main from
feat/curl-download-results
Feb 26, 2026

Conversation

@RafaelPo
Contributor

Summary

  • Replace curl-download mode with paginated inline results (default 50 rows in LLM context)
  • Add ?format=json to download endpoint so the widget can fetch the full dataset independently via results_url
  • Tool docstring instructs the agent to ask the user how many rows to ingest before calling
  • Fix docker-compose.local.yaml to use variable substitution for MCP_SERVER_URL (was hardcoded, breaking tunnel setups)
  • Remove curl_mode, _build_curl_result_response, curl_preview_rows setting, and Redis-unavailable fallback
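Conceptually, the inline pagination is just offset/page_size slicing of the stored result set. A minimal sketch (hypothetical helper name, and a plain list of dicts standing in for the DataFrame; the real logic lives in the MCP tool):

```python
import math

def paginate_inline(rows: list[dict], offset: int = 0, page_size: int = 50) -> dict:
    """Return one page of rows plus metadata the agent can relay to the user."""
    total = len(rows)
    page = rows[offset : offset + page_size]
    return {
        "rows": page,
        "offset": offset,
        "page_size": page_size,
        "total_rows": total,
        "total_pages": math.ceil(total / page_size) if page_size else 0,
    }

rows = [{"id": i, "value": f"row-{i}"} for i in range(120)]
first = paginate_inline(rows)             # default first page: 50 rows
last = paginate_inline(rows, offset=100)  # final partial page: 20 rows
```

The default of 50 rows is what lands in the LLM context; the widget bypasses this entirely via the `?format=json` endpoint.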

Test plan

  • 366 unit tests pass
  • E2E: connect via Claude Desktop, run a task, verify agent asks about row count and widget shows full data

🤖 Generated with Claude Code

RafaelPo and others added 2 commits February 25, 2026 20:17
In HTTP mode, everyrow_results now returns a small schema preview (5 rows)
plus a curl command to download the full CSV, instead of paginating up to
200 rows of JSON into the context window. This saves tokens and lets Claude
use code execution to analyse data with pd.read_csv().

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…N endpoint

- Default page_size=50 rows returned inline (was curl command)
- Widget fetches full dataset via results_url (?format=json endpoint)
- Remove curl_mode, _build_curl_result_response, curl_preview_rows setting
- Remove Redis-unavailable fallback (Redis always available)
- Fix docker-compose.local.yaml to use variable substitution for MCP_SERVER_URL
- Tool docstring instructs agent to ask user how many rows to ingest

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@RafaelPo
Contributor Author

@claude code review

@github-actions

This comment was marked as outdated.

RafaelPo and others added 2 commits February 25, 2026 21:54
The progress tool's completion message now tells the agent to ask the
user how many rows they'd like to review before fetching results.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Comment on lines 925 to 932
session_url,
mcp_server_url=mcp_server_url,
)


Bug: The function try_store_result incorrectly returns None when its type hint specifies list[TextContent], and the caller does not handle the None case.
Severity: HIGH

Suggested Fix

Update try_store_result to handle the expired token case without returning None. For example, it could raise an appropriate HTTP exception or return an empty list. Alternatively, change the function's return type hint to list[TextContent] | None and ensure the calling function everyrow_results_http correctly handles the None case.

Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI
agent.
Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not
valid.

Location: everyrow-mcp/src/everyrow_mcp/tools.py#L925-L932

Potential issue: The function `try_store_result` is type-hinted to return
`list[TextContent]`, but it can return `None` if a poll token has expired (typically
after 24 hours). The calling function, `everyrow_results_http`, does not handle the
`None` case and returns it directly. This mismatch between the declared return type and
the actual return value will lead to a runtime error when a client attempts to retrieve
results for a task whose poll token has expired, breaking the results retrieval flow for
older tasks.


… endpoint

- try_store_result: raise RuntimeError instead of returning None when
  poll token expires (matches tightened return type)
- JSON download endpoint: use pandas + _sanitize_records instead of
  csv.DictReader to preserve numeric types and handle NaN/Inf

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
RafaelPo and others added 7 commits February 26, 2026 09:02
- Add size guard (413) to ?format=json to prevent memory exhaustion
- Add X-Content-Type-Options: nosniff header to JSON responses
- Widget retry button now mints fresh download token instead of reusing consumed one
- Strengthen progress completion message to prevent premature everyrow_results calls
- Tell agent it can only see N of total rows (full dataset visible in widget)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1. Atomic pop_download_token via Lua script (TOCTOU race fix)
2. Re-enable network policy in values.yaml
3. Lower JSON conversion threshold to 10MB (memory amplification)
4. Validate format query param — reject unknown values with 400
5. Use dtype=str in pd.read_csv for JSON conversion path
6. Widget uses fresh token on every full-results fetch (no stale URL)
7. Remove pre-minted results_url — widget mints tokens on demand

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
All four instruction locations now consistently require asking the user
how many rows to load — no more "if small, use default" loopholes:
- System instructions (app.py)
- Progress completion message (tool_helpers.py)
- Tool docstring (tools.py)
- Parameter description (models.py)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Skip widget JSON for offset > 0 — the widget already has all data
  from the first page, so subsequent pages only need the text summary
- Revert Lua script for pop_download_token back to simple TTL + GETDEL;
  the TOCTOU race has no exploitable consequence

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The model was telling users "load into our conversation" which is
confusing. Updated all 5 instruction locations to say "load into my
context so I can read them" and clarify the user always has full
access via the widget.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@RafaelPo
Contributor Author

@claude code review

@github-actions
Contributor

github-actions bot commented Feb 26, 2026

Claude finished @RafaelPo's task in 1m 51s


Code Review

  • Read CLAUDE.md and repo guidelines
  • Review PR diff and changed files
  • Evaluate Sentry bot findings
  • Post review feedback

Clean PR — the architecture shift from curl-download to paginated inline results + widget JSON endpoint is well-motivated and the test coverage is solid. A few items worth addressing:


1. Unhandled exception from try_store_result in everyrow_results_http (High)

tools.py:927-933 — try_store_result can now raise (Redis failure or poll-token expiration), but the caller has no try/except around it. Both Sentry findings point at this. The other code paths in everyrow_results_http (ownership check, API fetch) are all wrapped in try/except with user-friendly TextContent error messages — this one should follow the same pattern.

I agree that removing the silent inline-results fallback is a reasonable design choice (Redis is a required dependency), but the error should still be caught and returned as a TextContent error message rather than propagating as an unhandled exception through the MCP framework.

# tools.py:926-934 — suggested change
try:
    return await try_store_result(
        task_id, df, params.offset, params.page_size,
        session_url, mcp_server_url=mcp_server_url,
    )
except Exception:
    logger.exception("Failed to store/retrieve results for task %s", task_id)
    return [
        TextContent(
            type="text",
            text=f"Error preparing results for task {task_id}. Please try again.",
        )
    ]



2. Download token consumed before format validation (Low)

routes.py:271-301 — The download token is atomically popped (consumed, single-use) on line 271, but format validation happens on line 297. If a client passes ?format=xml, the token is wasted and they get a 400. Not a security issue (widget always sends format=json), but moving the format check before the token pop would be more defensive:

# Validate format before consuming the token
fmt = request.query_params.get("format", "csv")
if fmt not in ("csv", "json"):
    return JSONResponse(
        {"error": "Unsupported format"}, status_code=400, headers=cors
    )

# Atomically consume the download token
token_task_id, remaining_ttl = (
    await redis_store.pop_download_token(provided) if provided else (None, 0)
)



3. _sanitize_records is now a cross-module public API (Nit)

routes.py:19 — _sanitize_records (underscore-prefixed, private) is now imported by routes.py. Consider renaming it to sanitize_records to reflect its actual visibility, or moving it to a shared utils module.


What looks good

  • Widget-first architecture: Having the widget mint fresh download tokens on demand (fetchFullResultsWithFreshToken) instead of baking URLs into the tool result is a solid improvement — eliminates stale-token issues on re-render.
  • Widget JSON only on first page (offset=0): Smart optimization with a well-documented design note explaining the trade-off.
  • Memory guard on JSON conversion (routes.py:305-320): Good defensive boundary with the max_json_size check.
  • docker-compose.local.yaml fix: Variable substitution for MCP_SERVER_URL — small but important for tunnel setups.
  • Test coverage: Good coverage of the new JSON format endpoint, fetch_full_results widget flag, and the Redis failure behavior change.

Branch: feat/curl-download-results

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
RafaelPo and others added 3 commits February 26, 2026 09:58
Case-insensitive search across all columns with 150ms debounce.
Input expands on focus and clears when new data loads.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
dtype=str forced all values to strings in the JSON response, breaking
numeric sorting/filtering in the widget. _sanitize_records already
handles NaN/Inf safely.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Search input now sits in the session-link row, right-aligned
- Widget hides itself entirely when it receives non-widget data
  (e.g. text summary from offset > 0 page requests)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
RafaelPo and others added 4 commits February 26, 2026 10:24
Body starts with display:none and is only shown when ontoolresult
receives recognized widget data (fetch_full_results, preview, or
array). Eliminates the brief flash on subsequent page requests.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
ontoolresult fires per TextContent item. The second item (summary
text) was hiding the widget that the first item (JSON) just showed.
Now non-JSON events are simply ignored instead of hiding the body.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Setting display="" removes the inline style, falling back to the
CSS rule (display:none). Must explicitly set display:block.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
display:none hides content but the iframe still takes up space.
Using height:0 + overflow:hidden makes the iframe auto-collapse
to zero. Restored to height:auto with padding when widget has data.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Comment on lines +308 to +309
max_json_size = settings.max_upload_size_bytes // 5
if len(csv_text) > max_json_size:


Bug: The code compares the character count of csv_text using len() against max_json_size, a limit in bytes. This can underestimate the true memory size for multi-byte UTF-8 characters.
Severity: MEDIUM

Suggested Fix

The check should compare the byte length of the string, not the character count. Encode the string to bytes before checking its length: if len(csv_text.encode('utf-8')) > max_json_size:.

Prompt for AI Agent
Review the code at the location below. A potential bug has been identified by an AI
agent.
Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not
valid.

Location: everyrow-mcp/src/everyrow_mcp/routes.py#L308-L309

Potential issue: A check intended to prevent memory exhaustion compares the character
count of `csv_text` with a byte-based limit, `max_json_size`. Since `len()` on a string
returns the number of characters, not bytes, this check is inaccurate for data
containing multi-byte UTF-8 characters (e.g., accented or CJK characters). A string that
passes the character count check could be significantly larger in bytes, potentially
bypassing the intended memory limit and leading to excessive memory usage or
Out-of-Memory errors during the subsequent JSON conversion.
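The mismatch is easy to demonstrate: `len()` on a `str` counts characters, while the limit is expressed in bytes, so multi-byte UTF-8 text undercounts:

```python
text = "café" * 1000            # 4,000 characters ("é" is 2 bytes in UTF-8)
encoded = text.encode("utf-8")

char_count = len(text)      # 4000 characters
byte_count = len(encoded)   # 5000 bytes

# With a 4,500-byte limit, the character-count check lets the payload
# through while the byte-length check correctly rejects it.
max_json_size = 4500
passes_char_check = char_count <= max_json_size
passes_byte_check = byte_count <= max_json_size
```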

RafaelPo merged commit 91cd58f into main on Feb 26, 2026. 9 checks passed.
RafaelPo deleted the feat/curl-download-results branch on February 26, 2026 at 10:45.