add explainer for the declarative api #26

MiguelsPizza · 2025-09-17T03:24:36Z

This draft proposal outlines a declarative WebMCP API that enables web pages to expose tools via HTML, using minimal attributes like tool-name and standard form semantics. Currently, it's a compilation of my notes and ideas from developing this approach, and I'm sharing it to gather feedback before the September 18th working group meeting. I'm particularly interested in your thoughts on the open questions (e.g., JSON vs. HTML responses, elicitation flows), tradeoffs, and overall API design.

The proposed API was shaped by building a real application and polyfill during the MCP enterprise hackathon, where our team successfully implemented it (and took home the win, which was exciting validation!). You can see a video of a Rails app using declarative WebMCP tools to enable complex browser automation without client-side JavaScript: link.

Based on your feedback, I'll refine this draft to align more closely with the structured format and narrative style of other explainers in the repo.

bwalderman · 2025-10-08T19:50:27Z

This is great work. I do have one general question. Was reusing ARIA attributes instead of introducing new tool-* attributes considered?

There are already attributes such as aria-label and aria-description and others for labelling and describing elements and so it might be helpful to define WebMCP mappings/behaviors for these instead of introducing entirely new HTML attributes.

One benefit of using these existing attributes is that they are also surfaced in native accessibility APIs, so assistive tools that already use these APIs to access the page's accessibility tree would be able to access WebMCP tools declarations as well.

MiguelsPizza · 2025-10-15T16:13:55Z

@bwalderman This is a good idea, I'll put the PR in draft while I re-implement the ARIA based polyfill.

The only thing I can think is that we still need a way to make exposing tools to the agent opt-in (or opt-out)

Maybe we still tag elements with a tool-name to expose them to the agent? This will help prevent duplicate tool names which causes errors in most inference providers

vsakaria · 2025-11-07T21:55:29Z

The concern with the HTML method is of course the iFrame. Realistically speaking its stood up well for some years now. Would an iFrame in a browser be more trustworthy. I would prefer JSON and rendering on client. I am sure web components can be distributed with framework payloads and CSS. But the build process for this type of architecture would have to change. I would prefer that design.

The trade off really is that payload would need to be disputed more frequently.

anssiko · 2025-11-11T03:59:12Z

@matatk to review for the accessibility group's perspective (aka APA WG).

anssiko · 2025-11-25T13:55:45Z

A new paper and implementation experience:

https://arxiv.org/abs/2511.11287v1
https://svenschultze.github.io/VOIX/

@svenschultze & team, this W3C community group is developing a WebMCP API that is complemented with a declarative mechanism explored in this PR.

Let’s join forces to explore this space. Here’s how to join:
https://webmachinelearning.github.io/community/#join

anssiko · 2025-11-25T14:46:13Z

That was fast. I’m excited to welcome @svenschultze to the WebML Community Group! 🎉

svenschultze · 2025-11-25T16:10:08Z

Hi @anssiko, thank you for making me aware of this project! It is great to see the community converging on this. I'm happy to share some insights from our work on VOIX, where we implemented a similar declarative framework and tested it with developers.

We established a more explicit interface where MCP tools are separated from standard UI HTML elements. This ensures the agent only accesses data and actions the developer specifically intended to share. I think this is also relevant for the discussion about including ARIA attributes. I think it is important not to just reuse ARIA since this could lead to conflicts of interest between optimizing for accessibility or agents.
Is there an equivalent idea for declarative context/resources in this spec? We found that it was really helpful to explicitly set agent-only text elements (in our case, specific <context name="mouse_position"> elements). This avoids long context inputs of the full html text, hides potentially sensitive data like credit card numbers, and enables high-fidelity synergetic multimodal interaction where UI hover/selection states can be explicitly exposed to the agent. This way, you can interact with websites using commands like "move this to here" without requiring long chains of tool calls.

add wip explainer for the declarative api

e3c23c0

MiguelsPizza force-pushed the declarative branch from 4558fa1 to e3c23c0 Compare September 17, 2025 04:07

MiguelsPizza marked this pull request as ready for review September 17, 2025 04:08

anssiko mentioned this pull request Sep 18, 2025

Declarative API Equivalent #22

Open

MiguelsPizza marked this pull request as draft October 15, 2025 16:13

anssiko mentioned this pull request Oct 16, 2025

WebML WG/CG F2F Agenda - TPAC 2025 (Kobe, Japan) webmachinelearning/meetings#35

Closed

anssiko added the Agenda+ label Oct 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add explainer for the declarative api #26

add explainer for the declarative api #26

MiguelsPizza commented Sep 17, 2025 •

edited

Loading

Uh oh!

bwalderman commented Oct 8, 2025

Uh oh!

MiguelsPizza commented Oct 15, 2025

Uh oh!

vsakaria commented Nov 7, 2025

Uh oh!

anssiko commented Nov 11, 2025

Uh oh!

anssiko commented Nov 25, 2025

Uh oh!

anssiko commented Nov 25, 2025

Uh oh!

svenschultze commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

add explainer for the declarative api #26

Are you sure you want to change the base?

add explainer for the declarative api #26

Conversation

MiguelsPizza commented Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bwalderman commented Oct 8, 2025

Uh oh!

MiguelsPizza commented Oct 15, 2025

Uh oh!

vsakaria commented Nov 7, 2025

Uh oh!

anssiko commented Nov 11, 2025

Uh oh!

anssiko commented Nov 25, 2025

Uh oh!

anssiko commented Nov 25, 2025

Uh oh!

svenschultze commented Nov 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

MiguelsPizza commented Sep 17, 2025 •

edited

Loading