What it does
Renders a screenshot of the current page, overlays a numbered label on every interactive element, and returns the annotated image plus a stable objectId for each label. The agent picks an element by its @N label; vulpine-mark resolves that label back to a DOM target when the next interaction tool is called.
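
The label round trip described above can be pictured with a small sketch. This is not vulpine-mark's actual code or API: `MarkedElement`, `assign_labels`, and `resolve` are hypothetical names, and the object IDs and bounding boxes are made-up placeholders. It only illustrates the idea of numbering elements, keeping a label-to-objectId table, and looking an @N label up again later.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class MarkedElement:
    """One labelled interactive element (hypothetical shape)."""
    label: int                        # the N in @N
    object_id: str                    # assumed stable identifier for the DOM node
    bbox: tuple[int, int, int, int]   # x, y, width, height in screenshot pixels


def assign_labels(elements):
    """Number elements 1..N in the order given and index them by label.

    `elements` is a list of (object_id, bbox) pairs.
    """
    return {
        n: MarkedElement(label=n, object_id=object_id, bbox=bbox)
        for n, (object_id, bbox) in enumerate(elements, start=1)
    }


_LABEL_RE = re.compile(r"@(\d+)")


def resolve(label_text, marks):
    """Map an '@N' string chosen by the agent back to its objectId."""
    match = _LABEL_RE.fullmatch(label_text.strip())
    if match is None:
        raise ValueError(f"not a label of the form @N: {label_text!r}")
    n = int(match.group(1))
    if n not in marks:
        raise KeyError(f"no element was labelled @{n}")
    return marks[n].object_id


# Placeholder data standing in for what a real page scan would produce.
marks = assign_labels([
    ("obj-a", (10, 10, 80, 24)),
    ("obj-b", (10, 50, 120, 24)),
])
target = resolve("@2", marks)
```

Keeping the table keyed by the integer label means a stale or mistyped label fails loudly instead of silently hitting the wrong element, which seems like the safer default for an agent-driven tool.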