You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In #185@bwalderman converted both content/explainer_*.png images to mermaid diagrams directly embedded in the markdown. However, in the second diagram I had some questions/concerns, so I held off on merging that directly so we could discuss it here.
graph TD
AI["<b><i>AI Platform</i></b>"]
AF["Agent Frontends (web site, app, etc)"]
subgraph WB["<b><i>Web Browser</i></b>"]
BIA["Browser-integrated agent"]
RP["Running Page <index.html>"]
end
subgraph TP["<b><i>Third-party service<br>(example.com)</i></b>"]
MCP[("MCP Server")]
end
AI <--> AF
AI <--> BIA
AI <-->|"Agents interact with the service directly using MCP. UI support would be provided either by the agent or manually by the service"| MCP
AF <--> BIA
RP <-->|HTTP| TP
Loading
I think the mermaid diagram pretty faithfully represents the picture, but I think the overall flow is not super clear. For example:
Does the "agent frontend" ever communicate directly to the "Browser-integrated agent"? This part confused me... I don't think when I'm on chatgpt.com, Gemini-in-the-sidebar or Copilot-in-the-sidebar is ever talking directly to the chatgpt front-end, without WebMCP, right? There are kind of two distinct use cases:
In one, the built-in agent is helping me browse websites, through actuating the site directly, or through WebMCP tools
In the other, the agent I'm interacting with is the agent frontend (chatgpt.com for example). I'm not juggling two agents (built-in Gemini andchatgpt.com).
But the diagram seems to kind of merge the two above? Thoughts @bwalderman?
In #185 @bwalderman converted both
content/explainer_*.pngimages to mermaid diagrams directly embedded in the markdown. However, in the second diagram I had some questions/concerns, so I held off on merging that directly so we could discuss it here.See https://github.com/webmachinelearning/webmcp#direct-backend-mcp-flow and https://github.com/webmachinelearning/webmcp/blob/main/content/explainer_mcp.png.
graph TD AI["<b><i>AI Platform</i></b>"] AF["Agent Frontends (web site, app, etc)"] subgraph WB["<b><i>Web Browser</i></b>"] BIA["Browser-integrated agent"] RP["Running Page <index.html>"] end subgraph TP["<b><i>Third-party service<br>(example.com)</i></b>"] MCP[("MCP Server")] end AI <--> AF AI <--> BIA AI <-->|"Agents interact with the service directly using MCP. UI support would be provided either by the agent or manually by the service"| MCP AF <--> BIA RP <-->|HTTP| TPI think the mermaid diagram pretty faithfully represents the picture, but I think the overall flow is not super clear. For example:
chatgpt.com, Gemini-in-the-sidebar or Copilot-in-the-sidebar is ever talking directly to thechatgptfront-end, without WebMCP, right? There are kind of two distinct use cases:chatgpt.comfor example). I'm not juggling two agents (built-in Gemini andchatgpt.com).But the diagram seems to kind of merge the two above? Thoughts @bwalderman?