-
Notifications
You must be signed in to change notification settings - Fork 18
Open
Description
Ultimately, an end-to-end WebMCP setup consists of these:
- Browser implementation (i.e. the
window.agentAPI) - Client (e.g. an MCP server capable of using CDP to call the browser API)
Right now, this repo seems to be both of these tangled up into one. Most likely that is a result of all prior art existing this way (e.g. "MCP-B" implements both).
This doesn't feel like how a web standard should work to me though. The client could be any of the following, for example:
- An MCP server which uses CDP to call the browser API
- A regular LLM tool which uses CDP to call the browser API
- A Chrome extension which interacts directly with the browser API
- A library which uses puppeteer to interact with the browser API
- etc etc
If we accept that the client isn't our concern (community and companies will build these clients inevitably), that means the only thing this proposal needs to solve is the browser API.
If that was the case, this would be an extremely small proposal I think. Since we just need to bikeshed the name and the shape of the tools/results.
If I'm missing something here, please do let me know.
Metadata
Metadata
Assignees
Labels
No labels