Skip to content

Clarifying the scope of the proposal #43

@43081j

Description

@43081j

Ultimately, an end-to-end WebMCP setup consists of these:

  • Browser implementation (i.e. the window.agent API)
  • Client (e.g. an MCP server capable of using CDP to call the browser API)

Right now, this repo seems to be both of these tangled up into one. Most likely that is a result of all prior art existing this way (e.g. "MCP-B" implements both).

This doesn't feel like how a web standard should work to me though. The client could be any of the following, for example:

  • An MCP server which uses CDP to call the browser API
  • A regular LLM tool which uses CDP to call the browser API
  • A Chrome extension which interacts directly with the browser API
  • A library which uses puppeteer to interact with the browser API
  • etc etc

If we accept that the client isn't our concern (community and companies will build these clients inevitably), that means the only thing this proposal needs to solve is the browser API.

If that was the case, this would be an extremely small proposal I think. Since we just need to bikeshed the name and the shape of the tools/results.

If I'm missing something here, please do let me know.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions