This is the abridged developer documentation for ggui

# Introduction

> ggui is an open, MCP-native protocol for AI agents to render interactive UI on demand.

GGUI Preview — self-hosted via `ggui serve`.

**ggui** is an open protocol that lets AI agents render interactive UI on the fly. Your agent describes what it needs in natural language; ggui compiles a typed component and returns it as an MCP-Apps resource (`ui://ggui/render/`) that your host mounts. The UI reports back typed events the next time your agent calls `ggui_consume`.

You run the whole protocol yourself with **`ggui serve`** — no account, no cloud, no API key. A managed hosted endpoint that speaks the same wire is coming after the preview.

[Quickstart — local in 5 minutes](/oss-quickstart/) [How ggui works](/how-it-works/) [Reading as an LLM?](/agents/)

## Choose your path

### Agent Builder — wire ggui into an MCP server

Start with the [Quickstart](/oss-quickstart/) (5 min, local). Then read the [MCP protocol reference](/api/mcp-protocol/), browse the [Cookbook](/cookbook/feedback-form/), or copy a worked [example agent](/examples/claude-agent/).

### Host — connect a client to your server

[Connect Claude Desktop](/clients/claude-desktop/) to a `ggui serve` you run yourself. [Other MCP hosts](/clients/connect-other-hosts/) use the same self-hosted endpoint with their own config shape.

### Operator — self-host the stack

[`ggui serve`](/cli/serve/) is the local deployment guide. [Reference deploys](/self-hosted/reference-deploys/) covers Docker, Fly.io, and Render.
The [Self-hosted Registry](/sdk/self-hosted-registry/) is the artifact layer for private gadgets and blueprints.

### Agentic App Builder — make your SaaS agent-drivable

If you ship a SaaS or webapp and want agents to drive it without rewriting the frontend, see [Agentic App Builders](/agentic-app-builders/).

### LLM agent reading docs

Every page is also served as raw markdown at the same slug with a `.md` extension. Start with [`/llms.txt`](/llms.txt) (index, per the [llms.txt convention](https://llmstxt.org/)), `/llms-full.txt` (single-file dump of the whole site), or `/llms-small.txt` (compact variant). See [the LLM-agent track](/agents/) for the full machine-readable surface.

## How ggui works

A typical exchange is four moments — **handshake → render → interact → consume**. See [How ggui works](/how-it-works/) for the full walk-through with code.

## Key concepts

A few terms recur across the docs:

* **GguiSession** — one rendered UI, minted by `ggui_render` (*render* is the verb; the object it creates is a GguiSession). Each GguiSession carries a stable `sessionId`.
* **Contract** — the typed agreement between agent and renderer for one GguiSession.
* **Tool** — an agent-side action (`ggui_render`, `ggui_consume`, …) — the MCP surface.
* **Gadget** — a renderer-side capability (Leaflet map, Stripe Checkout, …) the LLM can compose with.
* **Blueprint** — a cached recipe — a UI promoted from one-shot to “use this exact screen next time.”

→ [Glossary](/glossary/) for everything else.
## What’s on this site

* **[How ggui works](/how-it-works/)** — narrative walk-through for builders
* **[Quickstart](/oss-quickstart/)** — zero to a running local server in 5 minutes
* **Protocol** — [Overview](/protocol/overview/) · [Envelopes](/protocol/envelopes/) · [Bootstrap](/protocol/bootstrap-handshake/) · [Conformance](/protocol/conformance/) · [Version policy](/protocol/version-policy/)
* **API** — [MCP](/api/mcp-protocol/) · [WebSocket](/api/websocket-protocol/) · [MCP Apps](/api/mcp-apps/) · [OAuth (self-hosted)](/api/oauth/) · [Ops MCP](/api/ops-mcp/) · [Rate limits](/api/rate-limits/)
* **SDK** — [React](/sdk/react/) · [Gadgets](/sdk/gadgets/) · [Marketplace](/sdk/marketplace/) · [Self-hosted Registry](/sdk/self-hosted-registry/)
* **CLI** — [Overview](/cli/) · [`ggui dev`](/cli/dev/) · [`ggui serve`](/cli/serve/)
* **Connect a host** — [Claude Desktop](/clients/claude-desktop/) · [Other MCP hosts](/clients/connect-other-hosts/)
* **Self-hosted** — [Pair a client app](/self-hosted/pairing/) · [Reference deploys](/self-hosted/reference-deploys/)
* **Cookbook** — [Feedback form](/cookbook/feedback-form/) · [Multi-step wizard](/cookbook/multi-step-wizard/) · [Real-time dashboard](/cookbook/real-time-dashboard/) · [Auth-gated UI](/cookbook/auth-gated-ui/) · [Theming](/cookbook/custom-theming/) · [Error handling](/cookbook/error-handling/) · [Chat](/cookbook/chat-own-storage/) · [Testing](/cookbook/testing/)
* **Architecture** — [Overview](/architecture/overview/) · [Agent backend](/architecture/agent-backend/) · [Audience routes](/architecture/audience-routes/) · [MCP services](/architecture/mcp-services/) · [Event System](/architecture/event-system/) · [UI Generator](/architecture/ui-generator/)
* **Design System** — [Design Tokens](/design/tokens/)
* **Examples** — [Claude Agent](/examples/claude-agent/) · [OpenAI](/examples/openai-agent/) · [Gemini](/examples/gemini-agent/) ·
[OpenClaw](/examples/openclaw-agent/) · [Generic MCP](/examples/generic-mcp/)
* **Glossary** — [terminology reference](/glossary/)
* **Troubleshooting** — [common errors and what they mean](/troubleshooting/)

# For Agentic App Builders

> Make your existing SaaS or webapp something an AI agent can drive — without rewriting it. Vision page + waitlist for the agentic-app-builders track on ggui.

Vision page

This describes a direction, not a shipped product. If you’re building agentic apps **today**, take the [Agent Builder track](/oss-quickstart/) — that path is live. The waitlist at the bottom is for early access to this track when its first adapter lands.

## The shift

AI agents drive existing SaaS apps through three increasingly unreliable layers:

1. **Reading screenshots** — vision-language models reason about rendered pixels. Brittle, slow, no typed I/O.
2. **Browser automation** — Playwright / Puppeteer scripts wrapping flows. Snap the moment a button moves; the agent has no way to know the contract changed.
3. **Reading official APIs** — when they exist, and only for the slice the API covers. Most production apps keep a third of their behavior in the UI, not the API.

What’s missing: a **typed contract** between the agent and an existing app’s surface, so the agent can drive flows safely without parsing pixels or scraping markup. That’s the gap this track fills.

## What gguifying an app looks like

You add a small ggui adapter to your existing app. The adapter:

1. Exposes navigable routes as **renders** with typed contracts
2. Mirrors form fields as **`actionSpec`** (typed inbound actions that drive turns)
3. Mirrors visible state as **`contextSpec`** (read-only observable state the agent reacts to)
4. Wraps any browser-only library you use (Stripe Checkout, Mapbox, calendar pickers) as **gadgets** so the LLM knows how to compose with them

The agent then drives your app the same way it drives a freshly-generated ggui UI — via `ggui_handshake`, `ggui_render`, `ggui_consume`. Same MCP wire. Same contract guarantees.

**Critically:** the human-facing UI doesn’t change. End-users keep clicking your buttons. Agents get a parallel typed surface onto the same flows.

## What you’d write

The provisional shape is a `ggui.app.json` next to your existing app config:

```json
{
  "name": "my-saas",
  "routes": [
    {
      "path": "/invoices/new",
      "intent": "create an invoice",
      "agentCapabilities": {
        "tools": {
          "createInvoice": {
            "toolInfo": {
              "inputSchema": { "$ref": "./schemas/invoice.json" }
            }
          }
        }
      },
      "actionSpec": {
        "submit": {
          "label": "Create invoice",
          "schema": { "$ref": "./schemas/invoice.json" },
          "nextStep": "createInvoice"
        }
      },
      "contextSpec": {
        "currentDraft": {
          "schema": { "$ref": "./schemas/invoice-draft.json" }
        }
      }
    }
  ]
}
```

Your app code stays as-is. An agent runtime can now discover `/invoices/new` from the adapter’s catalog, `ggui_handshake` against its contract, and `ggui_render` with a typed payload — no clicking around. (Exact wire shape for app-catalog lookup is part of the SDK’s in-design surface.)

## Three scenarios this unlocks

1. **Your support agent drives the app for the customer.** “Refund last month’s invoice for customer X” → agent navigates to `/invoices/`, finds the row, calls `refund` with the typed payload. No human-in-the-loop browsing.
2. **Your sales engineer runs an agent-narrated demo.** During a prospect call, the agent narrates while driving the form in real-time.
The prospect sees the same UI any user sees, but the agent’s typed actions land like a polished guided tour.
3. **Power users hand work off to their AI.** “I have 40 of these to fill in — can my AI do it?” The AI has a typed surface to drive, not pixels to scrape.

## What’s shipping when

| Component                                                 | Status    |
| --------------------------------------------------------- | --------- |
| ggui protocol (this site)                                 | Live      |
| Agent Builder track (build a ggui-native agent)           | Live      |
| Agentic App SDK (wrap an existing app)                    | In design |
| Hosted gguifier service on [guuey.com](https://guuey.com) | In design |
| Reference adapters for Next.js / Rails / Django           | Planned   |
| Reference adapters for legacy stacks (Java EE, Drupal, …) | Planned   |

No dates. Real engineering work, not a marketing roadmap.

## Waitlist

If this is the track you actually need, drop your email. We’ll write when the first adapter ships and we have an early-access slot to fill.

Joining the waitlist opens your mail client with a pre-filled message. We'll wire up a real intake form before the first adapter lands — until then this stays a direct mailto so nothing intermediates the signal.

## Related

* [How ggui works](/how-it-works/) — the protocol’s four moments. The gguify pattern reuses the same handshake → render → interact → consume loop, just driven by your app’s routes instead of LLM-generated UI.
* [Glossary](/glossary/) — `actionSpec`, `contextSpec`, `gadget`, `tool`, `blueprint`.
* [For LLM agents](/agents/) — machine-readable resources, including this page at [`/agentic-app-builders.md`](/agentic-app-builders.md).

# For LLM agents

> Machine-readable resources for LLMs and coding-assistant devtools reading ggui docs programmatically — aggregated dumps, per-page .md companions, stable anchors, wire schemas.


This page is for **non-human readers** — LLM agents, coding-assistant devtools (Claude Code, Cursor, Cline, Continue, Codeium), evaluators, and scrapers. Humans, the rest of the site is for you.

## What’s available

| Resource              | URL                                  | When to fetch                                                                                        |
| --------------------- | ------------------------------------ | ---------------------------------------------------------------------------------------------------- |
| Site index            | [`/llms.txt`](/llms.txt)             | First contact. Every page in [llms.txt format](https://llmstxt.org/) with one-line summaries.        |
| Whole-site dump       | [`/llms-full.txt`](/llms-full.txt)   | One-shot context loading. \~400 KB. Drop into your window for cross-topic tasks.                     |
| Compact dump          | [`/llms-small.txt`](/llms-small.txt) | Smaller one-shot context when `/llms-full.txt` is too big. Custom subsets live at `/_llms-txt/.txt`. |
| Per-page raw markdown | `/.md`                               | Reading one specific page. No HTML, no chrome.                                                       |
| Stable anchors        | `/#`                                 | Deep-linking to a section. Every H2/H3 has a Starlight-derived id.                                   |

## Per-page `.md` companions

Every page is also served as raw markdown at the same slug with a `.md` extension. Examples:

* [`/how-it-works.md`](/how-it-works.md) — the narrative walk-through
* [`/api/mcp-protocol.md`](/api/mcp-protocol.md) — the wire reference
* [`/protocol/envelopes.md`](/protocol/envelopes.md) — live-channel envelope shapes
* [`/glossary.md`](/glossary.md) — terminology lookup
* [`/cli/serve.md`](/cli/serve.md) — `ggui serve` command reference

The `.md` response is the source markdown with a small `---\ntitle: ...\n---` envelope and **no other transformation**.

Fetch from any origin:

```bash
curl https://docs.ggui.ai/protocol/envelopes.md
```

```ts
const res = await fetch("https://docs.ggui.ai/protocol/envelopes.md");
const body = await res.text();
```

CORS is open (`Access-Control-Allow-Origin: *`); `Cache-Control` permits 5-minute CDN caching.

## Search docs via MCP

Any LLM agent will be able to connect to `mcp.ggui.ai/docs` (coming soon) and search / read these docs programmatically — no auth required. See [Docs MCP route](/api/mcp-docs/) for the tool catalog and connection details.

## When to use which

```plaintext
You want →                                   Fetch
───────────────────────────────────────────  ──────────────────────────────────
Orient quickly, plan a session               /llms.txt
Drop everything into context (one-shot)      /llms-full.txt
Read one specific page                       /.md
Deep-link to a section in conversation       //#
```

## Stable anchors

Every H2 and H3 has a stable `id` derived by Starlight from its text. Anchors don’t change between releases unless the heading text changes. Examples:

* [`/glossary/#gguisession-a-render`](/glossary/#gguisession-a-render) — definition of the GguiSession (the “render”)
* [`/protocol/envelopes/#actionenvelope`](/protocol/envelopes/#actionenvelope) — inbound live-channel envelope
* [`/api/mcp-protocol/#ggui_render`](/api/mcp-protocol/#ggui_render) — the `ggui_render` MCP method

The same anchors work on the `.md` companions: [`/protocol/envelopes.md#actionenvelope`](/protocol/envelopes.md#actionenvelope) — markdown clients with anchor-scroll support honor them.

## Wire schemas (roadmap)

The wire envelopes (`ActionEnvelope`, `StreamEnvelope`) and the MCP method shapes (`ggui_handshake`, `ggui_render`, `ggui_consume`, …) live in TypeScript in `@ggui-ai/protocol`.
Protocol version: see `PROTOCOL_VERSION` in `@ggui-ai/protocol` (currently `draft-2026-06-12`).

Standalone JSON-Schema endpoints at `/api/schemas/.json` are planned but not yet shipped. Until then, the canonical wire shapes live at:

* [`/protocol/envelopes/`](/protocol/envelopes/) — envelope shapes, fields, validation rules
* [`/api/mcp-protocol/`](/api/mcp-protocol/) — MCP methods, request/response shapes
* [`/api/websocket-protocol/`](/api/websocket-protocol/) — live-channel framing

For machine-parseable types, install the npm package:

```bash
npm install @ggui-ai/protocol
```

## Site conventions

Things to know if you’re writing code against these docs:

* **`ggui`** = the open protocol. Self-host with `ggui serve`; a hosted endpoint at `mcp.ggui.ai` is coming soon. Open source at `github.com/ggui-ai/ggui`. Documented on this site.
* **`guuey`** = a separate SaaS platform at `guuey.com`. Different surface, different docs. Don’t conflate the two.
* **`gadget`** = renderer-side capability — a wrapped 3rd-party library (`Leaflet`, `Stripe`, …). Formerly called `clientLibraries`.
* **`tool`** = agent-side action (an MCP tool the agent invokes).
* **`blueprint`** = cached UI recipe (matched at `ggui_handshake` by intent + contract similarity).

The last three nouns (`gadget`, `tool`, `blueprint`) are not interchangeable; mixing them in generated code will confuse readers.

## Page-level metadata

Every page has frontmatter with at least `title` and `description`. Most also carry `audience` (one of `agent-builder`, `host`, `operator`, `llm-agent`, `agentic-app-builder`, `all`) and optionally `prereqs`. The `.md` companion re-emits `title` and `description` at the top of the response; other frontmatter fields (`audience`, `prereqs`) are only in the repo source.
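
As a rough illustration of consuming that metadata, the sketch below splits a `.md` companion into its frontmatter fields and its markdown body. It assumes the envelope is a leading block delimited by `---` lines containing single-line `key: value` pairs; the `parseCompanion` helper and its `Companion` type are hypothetical names for this example, not part of any ggui package.

```ts
// Hypothetical helper: not part of any ggui package.
interface Companion {
  meta: Record<string, string>; // e.g. title, description
  body: string; // markdown after the envelope
}

function parseCompanion(raw: string): Companion {
  const text = raw.replace(/\r\n/g, "\n");
  const lines = text.split("\n");

  // Assumption: the envelope, when present, starts on the very first line.
  if (lines[0] !== "---") return { meta: {}, body: text };
  const end = lines.indexOf("---", 1);
  if (end === -1) return { meta: {}, body: text };

  const meta: Record<string, string> = {};
  for (const line of lines.slice(1, end)) {
    const sep = line.indexOf(":");
    if (sep <= 0) continue; // skip lines that are not `key: value`
    const key = line.slice(0, sep).trim();
    let value = line.slice(sep + 1).trim();
    // Drop one pair of matching surrounding quotes, if any.
    const quoted =
      (value.startsWith('"') && value.endsWith('"')) ||
      (value.startsWith("'") && value.endsWith("'"));
    if (quoted && value.length >= 2) value = value.slice(1, -1);
    meta[key] = value;
  }
  return { meta, body: lines.slice(end + 1).join("\n") };
}
```

A response without an envelope is returned unchanged as `body`, so the helper can be applied to any fetched page. Multi-line YAML values are out of scope for this sketch.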
## Telemetry

The site sends pageview events to PostHog and tags requests from known LLM-agent user-agents (`claude-bot`, `gptbot`, `perplexitybot`, `cursor`, `cline`, `continue`, `codeium`, …) with `is_bot: true`. No PII, no fingerprinting. Set a useful user-agent and you’ll show up in the bot-traffic dashboard — that helps us prioritize docs machine readers actually use.

## Reporting issues

Found a page that’s hard for LLM consumption, or want a `.md` companion that’s missing something? Open an issue at [`github.com/ggui-ai/ggui/issues`](https://github.com/ggui-ai/ggui/issues) tagged `docs/llm-readability`.

# MCP Apps support

> How ggui implements the io.modelcontextprotocol/ui capability so hosts like Claude Desktop, claude.ai, Goose, and VS Code Copilot render generative UIs inline.

[MCP Apps](https://modelcontextprotocol.io/extensions/apps/overview) is the protocol extension that lets MCP servers ship interactive UI alongside structured data, and lets MCP hosts render those UIs inline in the chat surface. The OSS [`@ggui-ai/mcp-server`](/oss-quickstart/) implements the wire format on both sides — as will the hosted ggui server (`mcp.ggui.ai`, coming soon) — so generative UIs render directly in chat instead of forcing a “click this link” detour to a browser tab.

This page documents the protocol pieces ggui implements. For end-user setup, see [Connect Claude Desktop](/clients/claude-desktop/). For the underlying transport, see [WebSocket protocol](/api/websocket-protocol/).

## What MCP Apps adds

Without MCP Apps, an MCP tool that produces UI has to choose between:

1. Returning structured data and hoping the host formats it (no interactivity), or
2. Returning a URL the user clicks out to (interactive, but chat and UI live in separate windows).


MCP Apps adds a third option: the server declares a UI resource alongside the tool result, the host sandboxes it in an iframe inside the chat, and a WebSocket channel carries data both ways — host to UI for live updates, UI to server for actions.

## What ggui ships

When the server boots with `mcpApps` enabled, three things happen:

1. **`io.modelcontextprotocol/ui` is advertised** in the server’s `initialize` capabilities (under `experimental`). MCP-Apps-aware hosts read this and switch on inline rendering.
2. **`ui://ggui/render` is served** as a resource via `resources/read` — a minimal HTML shell that loads [`@ggui-ai/iframe-runtime`](https://www.npmjs.com/package/@ggui-ai/iframe-runtime) and opens the WebSocket channel.
3. **Every `ggui_render` tool result carries the `_meta["ai.ggui/render"]` slice** — `sessionId`, `appId`, `runtimeUrl`, `wsUrl`, a short-TTL `wsToken`, and `expiresAt` (ISO 8601 string) as top-level fields, alongside capability + render-state fields, plus theme fields (`themeId`, `themeMode`, `theme` — a validated `--ggui-*` CSS-variable overlay the iframe applies at `:root`), a `pollingUrl` fallback for WS-blocked environments, and `lastSequence` to seed replay cursors. The iframe consumes the slice, opens the WebSocket, and trades the `wsToken` for a longer-lived `sessionToken` for reconnects.

## Capability declaration

On `initialize`, ggui returns:

```json
{
  "capabilities": {
    "tools": { "listChanged": true },
    "resources": { "subscribe": false, "listChanged": false },
    "experimental": {
      "io.modelcontextprotocol/ui": {}
    }
  }
}
```

Hosts that recognize `io.modelcontextprotocol/ui` flip into inline-render mode. Hosts that don’t simply ignore the capability and skip inline rendering — the render is still delivered as a resource (`ui://ggui/render/` on `_meta.ui.resourceUri`), but without MCP-Apps support there is nothing to mount it.
There is no agent-returned URL to open instead.

## Tool result shape

Every UI-producing tool (today: `ggui_render`) declares a meta-resource on the result so the host knows where to load the UI from:

```json
{
  "content": [{ "type": "text", "text": "Created render render_abc123" }],
  "_meta": {
    "ui": { "resourceUri": "ui://ggui/render/render_abc123" },
    "ai.ggui/render": {
      "sessionId": "render_abc123",
      "appId": "app_abc",
      "runtimeUrl": "https://your-server.example.com/_ggui/iframe-runtime.js",
      "wsUrl": "wss://your-server.example.com/ws",
      "wsToken": "btkn_…",
      "expiresAt": "2099-01-01T00:00:00.000Z"
    }
  }
}
```

The tool **declaration** (returned by `tools/list`) is what carries `_meta.ui.visibility: ["model"]` — the MCP Apps signal that this tool ships a renderable UI surface. Each per-call **result** stamps `_meta.ui.resourceUri` (the per-render URI) plus the `_meta["ai.ggui/render"]` slice. The host fetches `ui://ggui/render` (or the per-render form `ui://ggui/render/`) once, sandboxes it in an iframe, and forwards the `_meta["ai.ggui/render"]` slice to it.

## The shell at `ui://ggui/render`

Reading the resource returns a small HTML document — paper-themed, full-bleed, no chrome — whose only job is:

1. Receive the `ai.ggui/render` slice from the host (via `postMessage`).
2. Dynamically load `runtimeUrl` (the iframe-runtime bundle).
3. Hand the slice to the runtime, which opens the WebSocket and starts rendering.

The shell is intentionally minimal. The actual rendering work — component resolution, contract validation, action dispatch, gadget loading — lives in [`@ggui-ai/iframe-runtime`](https://www.npmjs.com/package/@ggui-ai/iframe-runtime), which the shell loads on demand. This keeps the shell payload tiny and lets the runtime version-bump independently of host caches.
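
On the host side, the first step is pulling the resource URI and the render slice out of a tool result shaped like the example above. The sketch below is one way to do that defensively. The `readRenderTarget` helper and its types are hypothetical names for this example; treating all six listed fields as required strings and rejecting an already-expired slice are assumptions of the sketch, not documented host behavior.

```ts
// Hypothetical host-side helper: field names come from the tool result
// example; the validation policy is an assumption of this sketch.
interface RenderSlice {
  sessionId: string;
  appId: string;
  runtimeUrl: string;
  wsUrl: string;
  wsToken: string;
  expiresAt: string; // ISO 8601
}

interface RenderTarget {
  resourceUri: string;
  slice: RenderSlice;
}

const REQUIRED_FIELDS = [
  "sessionId", "appId", "runtimeUrl", "wsUrl", "wsToken", "expiresAt",
] as const;

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null;
}

function readRenderTarget(toolResult: unknown, now: Date = new Date()): RenderTarget | null {
  if (!isRecord(toolResult)) return null;
  const meta = toolResult["_meta"];
  if (!isRecord(meta)) return null;

  const ui = meta["ui"];
  const slice = meta["ai.ggui/render"];
  if (!isRecord(ui) || !isRecord(slice)) return null;

  const resourceUri = ui["resourceUri"];
  if (typeof resourceUri !== "string") return null;

  // Every required field must be present as a string.
  for (const field of REQUIRED_FIELDS) {
    if (typeof slice[field] !== "string") return null;
  }

  // Assumption: a slice whose expiresAt is unparseable or in the past is unusable.
  const expiresAtMs = Date.parse(slice["expiresAt"] as string);
  if (Number.isNaN(expiresAtMs) || expiresAtMs <= now.getTime()) return null;

  return { resourceUri, slice: slice as unknown as RenderSlice };
}
```

A host would call this once per `ggui_render` result and, on a non-null return, read `resourceUri` and forward `slice` to the iframe. Optional fields such as the theme overlay or `pollingUrl` pass through untouched.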
## Bootstrap token exchange

The `wsToken` is short-lived (default 180s) and reusable within its TTL, so a transient WebSocket drop reconnects without a fresh handshake. The iframe trades it for a longer-lived `sessionToken` (default 4h) on the first wsToken-authed `subscribe` frame, then uses the `sessionToken` for reconnects. After the `wsToken` expires, the iframe swaps the envelope via `ggui_runtime_refresh_ws_token` (within the refresh window) or re-bootstraps.

Consequences:

* Hosts can cache the resource document, but the bootstrap is per-call — every render mints a fresh token.
* An iframe that loses connection reconnects with its `sessionToken` without re-fetching the resource or re-running OAuth.
* A leaked bootstrap token is useless after the TTL expires.

The bootstrap is HMAC-signed with a server-side `wsTokenSecret`. Multi-pod deployments MUST share a deterministic secret (typically from a secrets manager) so any pod accepts any other pod’s tokens.

The handshake details are documented in [Bootstrap handshake](/protocol/bootstrap-handshake/).

## Self-hosted: enabling MCP Apps in your own server

```typescript
import { createGguiServer } from "@ggui-ai/mcp-server";

const server = createGguiServer({
  // ...
  renderChannel: true, // required — MCP Apps needs the WS channel
  mcpApps: {
    wsUrl: "wss://your-server.example.com/ws",
  },
  runtime: true, // serve the iframe-runtime bundle
  wsTokenSecret: process.env.WS_TOKEN_SECRET, // required for multi-pod
});
```

For local dev, `wsUrl: "ws://127.0.0.1:6781/ws"` is the conventional loopback URL. Hosted ggui (coming soon) will use `wss://mcp.ggui.ai/ws`.

What each option does:

* **`renderChannel: true`** — mounts the live-channel WebSocket at `/ws`.
MCP Apps requires it; the iframe has nowhere to connect without one.
* **`mcpApps.wsUrl`** — the publicly-reachable WebSocket URL written onto every bootstrap. Don’t ship `ws://localhost:…` to internet-accessible servers — clients must be able to reach it.
* **`runtime: true`** (default when `mcpApps` is on) — mounts the iframe-runtime bundle at `/_ggui/iframe-runtime.js`. Pass `runtime: { url: "https://your-cdn/…" }` to point at an externally-hosted bundle.
* **`wsTokenSecret`** — HMAC secret. If omitted, the server mints a random secret at boot — fine for single-process dev, wrong for multi-pod (pods would reject each other’s tokens).

`mcpApps` requires `renderChannel: true`; the factory throws at construction if you enable one without the other.

## Compatibility matrix

Host capabilities below describe what each MCP host supports when connected to your self-hosted server (a hosted ggui connector is coming soon):

| Host                | OAuth | MCP Apps | Notes                                                                                             |
| ------------------- | ----- | -------- | ------------------------------------------------------------------------------------------------- |
| Claude Desktop      | Yes   | Yes      | Inline rendering, full UX. ([install](/clients/claude-desktop/))                                  |
| claude.ai (web)     | Yes   | Yes      | Same as Desktop.                                                                                  |
| Goose               | Yes   | Yes      | Inline rendering in TUI mode varies by terminal.                                                  |
| VS Code Copilot     | Yes   | Yes      | UI renders in a side panel.                                                                       |
| Cursor              | Yes   | Partial  | OAuth works; MCP Apps support depends on version.                                                 |
| Generic MCP runtime | No    | No       | Static `Authorization: Bearer …`; no inline render — resolve the `resourceUri` resource yourself. |

If your host doesn’t yet implement MCP Apps, the underlying render still works — you just lose inline rendering. Each render is delivered as an MCP-Apps resource (`ui://ggui/render/` on `_meta.ui.resourceUri`); resolve it with `resources/read`. There is no render-viewer URL the agent receives.
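
For a host in that last position, resolving the resource by hand comes down to one MCP `resources/read` request. The sketch below builds that JSON-RPC request and pulls the text payload out of the response. The helper names are hypothetical; the `ui://` scheme check is an assumption drawn from the URIs shown on this page, and sending the request over your MCP transport is left out.

```ts
// Hypothetical helpers for a host without MCP Apps support.
interface JsonRpcRequest<P> {
  jsonrpc: "2.0";
  id: number;
  method: string;
  params: P;
}

// Shape of an MCP resources/read result entry (text or base64 blob).
interface ResourceContent {
  uri: string;
  mimeType?: string;
  text?: string;
  blob?: string;
}

function buildResourceRead(resourceUri: string, id: number): JsonRpcRequest<{ uri: string }> {
  // Assumption: render resources always use the ui:// scheme shown in this page.
  if (!resourceUri.startsWith("ui://")) {
    throw new Error(`not a UI resource URI: ${resourceUri}`);
  }
  return { jsonrpc: "2.0", id, method: "resources/read", params: { uri: resourceUri } };
}

function firstTextContent(result: { contents: ResourceContent[] }): string | null {
  // Return the first textual payload (the HTML shell), if any.
  const hit = result.contents.find((c) => typeof c.text === "string");
  return hit?.text ?? null;
}
```

The returned text is the HTML shell; a host that cannot sandbox an iframe can still log or inspect it, but rendering the UI needs the render slice and a WebSocket-capable surface.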
## Reference

* MCP Apps protocol:
* ggui server factory: [`@ggui-ai/mcp-server`](https://www.npmjs.com/package/@ggui-ai/mcp-server)
* Iframe runtime: [`@ggui-ai/iframe-runtime`](https://www.npmjs.com/package/@ggui-ai/iframe-runtime)
* Wire envelopes: [Envelopes](/protocol/envelopes/)
* Glossary terms: [gadget, tool, blueprint](/glossary/)

# Docs MCP service

> Anonymous MCP service at mcp.ggui.ai/docs — three tools (docs_search, docs_read, docs_list) for any LLM to query ggui documentation programmatically.

Coming soon

This page describes a **managed hosted surface** (`mcp.ggui.ai/docs`), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today: start with the [Quickstart](/oss-quickstart/). This page is kept as forward documentation of the wire surface and goes live when hosted ggui ships.

The Docs MCP service is a public, anonymous-auth Model Context Protocol surface that lets any LLM agent query the ggui documentation corpus from inside its tool loop. Three read-only tools, one in-memory index, no token required.

One clarification on what’s open vs. hosted: the `McpService` mount primitive this service is built on **is OSS** (it ships in `@ggui-ai/mcp-server` — you can mount your own anonymous docs-style service today), but the hosted docs corpus service itself launches with `mcp.ggui.ai`.

If you’re building a coding assistant, an IDE plugin, or any agent that needs to ground its answers in ggui’s docs, point it at `https://mcp.ggui.ai/docs` and the model gets `docs_search` / `docs_read` / `docs_list` for free.

## What it is

A first-party MCP service mounted at `https://mcp.ggui.ai/docs` exposing three tools that wrap an in-memory index of the docs corpus:

* `docs_search` — keyword search, ranked by occurrence count with a 3× title weighting.
* `docs_read` — fetch the raw markdown body of one doc by path.
* `docs_list` — enumerate every doc in the corpus, optionally filtered by path prefix.

The service is built on the same `McpService` primitive that hosts the main agent API at `mcp.ggui.ai`. See [MCP services architecture](/architecture/mcp-services/) for the anonymous-mode mechanics and how multiple services are composed into one `createGguiServer` boot.

## Why anonymous

The ggui docs corpus is public — every page on this site is reachable without a login, and every byte the service returns is something a browser could fetch by following a URL. There is no per-user state, no rate-shaped quota tied to identity, no privileged content behind the corpus. Forcing OAuth on a read-only public surface adds setup friction (Dynamic Client Registration ceremony, token refresh, secret storage) for zero security gain.

Anonymous mode is the intentional default for surfaces that meet **all** of:

* The data is already public.
* The service has no side effects (no writes, no mutating tool calls, no external API spend).
* Per-user state would be a lie (the same query produces the same answer for everyone).

The agent surface at `mcp.ggui.ai` does **not** meet these criteria — sessions, BYOK credentials, and per-app rate limits all require an identified caller. The docs surface does, so it skips auth.

## Endpoint

```plaintext
POST https://mcp.ggui.ai/docs
```

Served as a **separate** MCP server, not folded into `/mcp`. The two surfaces have different auth modes (anonymous vs. OAuth) and different tool catalogs; keeping them on distinct paths means a docs-only client never has to discover or skip past agent-loop tools, and the agent loop never has to surface read-only docs tools alongside its session lifecycle.

No `Authorization` header is required.
`Content-Type: application/json` and the JSON-RPC 2.0 envelope are still required — this is MCP over HTTP, identical wire shape to `/mcp`, only the auth handshake is skipped.

The same three tools are also co-hosted on the unified `/dev` developer endpoint (coming soon with the hosted platform), alongside the executable `ggui_protocol_*` tools — one `.mcp.json` line for learn + author + do.

## The three tools

### `docs_search`

Keyword search over the corpus. The query is tokenized on whitespace and lowercased; matches are case-insensitive. Each hit’s score is the sum of token-occurrence counts in the doc — title occurrences are weighted **3×**, body occurrences **1×**.

The server clamps results to a maximum of **50 hits** regardless of the requested `limit`. Default `limit` is 10.

| Field   | Type     | Required | Description                                             |
| ------- | -------- | -------- | ------------------------------------------------------- |
| `q`     | `string` | Yes      | Search terms — whitespace-separated; case-insensitive.  |
| `limit` | `number` | No       | Cap on returned hits. Default 10. Server-clamped to 50. |

**Returns:** `{ hits: SearchHit[] }`

```ts
interface SearchHit {
  path: string; // corpus-relative path, e.g. "principles/strict-typing.md"
  title: string; // first H1, or filename if none
  summary: string; // first ~200 chars of prose after the title
  score: number; // keyword score, higher is better
}
```

Hits are sorted by `score` descending, with a stable secondary sort on `path` ascending so identical-score hits land in deterministic order. Follow up with `docs_read` to fetch the full body of any hit.

### `docs_read`

Fetch the full markdown body of one doc by its corpus-relative path. Pair with `docs_search` (search → read top hit) or `docs_list` (browse → read).

| Field  | Type     | Required | Description                                                                                      |
| ------ | -------- | -------- | ------------------------------------------------------------------------------------------------ |
| `path` | `string` | Yes      | Path relative to the corpus root, forward slashes. Leading `./` or `/` is trimmed before lookup. |

**Returns:** `{ found, path, title, summary, body, bytes }`

```ts
interface DocsReadOutput {
  found: boolean; // false when the path isn't in the corpus
  path: string;
  title: string;
  summary: string;
  body: string; // full raw markdown, including any frontmatter
  bytes: number; // utf-8 byte length of body
}
```

When the path doesn’t match any doc, the tool returns `found: false` with empty `body` / `title` / `summary` rather than throwing — agents see a clean “no such doc” result and can fall back to `docs_search` or ask the user to refine.

### `docs_list`

Enumerate every doc in the corpus, optionally filtered by a path prefix. Returns metadata only (path / title / summary); bodies are not included.

| Field    | Type     | Required | Description                                                                                                  |
| -------- | -------- | -------- | ------------------------------------------------------------------------------------------------------------ |
| `prefix` | `string` | No       | Case-sensitive path-prefix filter. When set, only entries whose `path` starts with this prefix are returned. |

**Returns:** `{ entries: DocMeta[], total }`

```ts
interface DocMeta {
  path: string;
  title: string;
  summary: string;
}
```

Entries are sorted by `path` ascending. `total` is the count after the prefix filter — use it to detect a typo’d prefix (e.g. `total: 0` when you expected dozens) without scanning `entries`.

The full unfiltered list weighs roughly 50 KB for the current corpus, comfortably under any sane MCP response budget. Use `prefix` to scope to one section (`principles/`, `protocol/`, `architecture/`, etc.) when you want a focused slice.
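The `docs_search` ranking rules described earlier (whitespace tokenization, lowercasing, 3× title weight, the 50-hit clamp, and score-then-path ordering) can be sketched in a few lines. This is an illustration only, not the service's implementation: `Doc`, `Hit`, `countOccurrences`, `scoreDoc`, and `searchDocs` are hypothetical names, and both substring-based occurrence counting and dropping zero-score docs are assumptions.

```ts
// Hypothetical in-memory doc shape; the real corpus entry type is not shown in these docs.
interface Doc {
  path: string;
  title: string;
  body: string;
}

interface Hit {
  path: string;
  score: number;
}

// Count non-overlapping occurrences of `token` in `text` (both already lowercased).
// Assumption: occurrences are counted by substring match.
function countOccurrences(text: string, token: string): number {
  if (token.length === 0) return 0;
  let count = 0;
  let index = text.indexOf(token);
  while (index !== -1) {
    count += 1;
    index = text.indexOf(token, index + token.length);
  }
  return count;
}

// Title occurrences weigh 3, body occurrences weigh 1, summed over all query tokens.
function scoreDoc(doc: Doc, tokens: string[]): number {
  const title = doc.title.toLowerCase();
  const body = doc.body.toLowerCase();
  return tokens.reduce(
    (sum, token) => sum + 3 * countOccurrences(title, token) + countOccurrences(body, token),
    0,
  );
}

function searchDocs(docs: Doc[], q: string, limit = 10): Hit[] {
  const tokens = q.toLowerCase().split(/\s+/).filter((t) => t.length > 0);
  const capped = Math.min(Math.max(limit, 0), 50); // clamp to the documented 50-hit cap
  return docs
    .map((doc) => ({ path: doc.path, score: scoreDoc(doc, tokens) }))
    .filter((hit) => hit.score > 0) // assumption: docs with no match are not returned
    .sort((a, b) => b.score - a.score || (a.path < b.path ? -1 : a.path > b.path ? 1 : 0))
    .slice(0, capped);
}
```

Two docs that tie on score come back in `path` order, which is what makes repeated queries deterministic.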
## Corpus loading

The corpus is loaded once at service boot via `loadDocsCorpus(rootDir)`, which walks the docs directory recursively, reads every `.md` file into RAM, extracts the title (first `# Heading` or filename fallback) and a short prose summary, and returns an immutable `DocsCorpus` handle.

```ts
interface DocsCorpus {
  list(): readonly DocMeta[];
  read(docPath: string): DocEntry | null;
  search(query: string, limit?: number): readonly SearchHit[];
}
```

Total weight: roughly **10 MB across \~400 files**, all held in process memory. Every request is served from RAM — no filesystem hit per call, no vector DB, no Algolia. Symlinks are skipped (prevents loops), non-`.md` files are skipped (no images, no frontmatter sidecars, no generated artifacts).

The `DocsCorpus` interface is the upgrade seam. A future v2 can drop in an embedding-backed implementation — precomputed vectors, semantic ranking — without changing the tool surface or the wire shape. Agents written against today’s keyword search keep working; quality improves under their feet.

## Connecting from agents

### Claude Agent SDK

Add the service to the `mcpServers` block of your agent config:

```ts
import { query } from "@anthropic-ai/claude-agent-sdk";

// `query` returns an async iterable of SDK messages; iterate it to run the turn.
const stream = query({
  prompt: "How do I write a blueprint for a contact form?",
  options: {
    mcpServers: {
      docs: {
        type: "http",
        url: "https://mcp.ggui.ai/docs",
      },
    },
    // No `authToken` needed — service is anonymous.
  },
});

for await (const message of stream) {
  if (message.type === "result") console.log(message);
}
```

Tools surface to the model as `mcp__docs__docs_search`, `mcp__docs__docs_read`, and `mcp__docs__docs_list`. The SDK auto-discovers them on the first turn and injects them into the model’s tool list.
### Raw `@modelcontextprotocol/sdk`

For non-Claude agents (Gemini, GPT, local models) talking MCP directly:

```ts
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js";

const client = new Client({ name: "my-agent", version: "1.0.0" });
const transport = new StreamableHTTPClientTransport(new URL("https://mcp.ggui.ai/docs"));
await client.connect(transport);

// listTools() resolves to an object; the tool descriptors live under `tools`.
const { tools } = await client.listTools();
// tools.map((t) => t.name) lists docs_search, docs_read, docs_list

const hits = await client.callTool({
  name: "docs_search",
  arguments: { q: "audience routes principle", limit: 5 },
});
```

### cURL

For one-off probes or wiring into a shell pipeline:

```bash
# Search
curl -X POST https://mcp.ggui.ai/docs \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{"name":"docs_search","arguments":{"q":"blueprint first","limit":5}}}'

# Read one doc
curl -X POST https://mcp.ggui.ai/docs \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","id":2,"method":"tools/call","params":{"name":"docs_read","arguments":{"path":"principles/blueprint-first-architecture.md"}}}'

# List one section
curl -X POST https://mcp.ggui.ai/docs \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","id":3,"method":"tools/call","params":{"name":"docs_list","arguments":{"prefix":"principles/"}}}'
```

## What it pairs with

The Docs MCP service is the **runtime** half of ggui’s LLM-agent surface. The static half lives in the [Agents track](/agents/) — narrative guides, recipe cookbooks, and step-by-step walkthroughs of agent patterns. Use the static docs to learn the protocol; use the Docs MCP service inside your agent to look up the specifics on demand.
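As a sketch of that on-demand lookup, the search → read pairing reduces to two `tools/call` envelopes with the same shape as the cURL probes. The helper below is hypothetical (`buildToolCall` and the `DocsTool` / `ToolCallRequest` types are not part of any ggui package) and only builds request bodies; it does not send them.

```ts
type DocsTool = "docs_search" | "docs_read" | "docs_list";

interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: DocsTool; arguments: Record<string, unknown> };
}

// Hypothetical helper: builds the JSON-RPC 2.0 body only; sending it is left to the caller.
function buildToolCall(id: number, name: DocsTool, args: Record<string, unknown>): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Step 1: search. Step 2: read one hit by path (here, the path used in the cURL examples).
const searchCall = buildToolCall(1, "docs_search", { q: "blueprint first", limit: 5 });
const readCall = buildToolCall(2, "docs_read", { path: "principles/blueprint-first-architecture.md" });
```

Each body would be POSTed to the docs endpoint as JSON with `Content-Type: application/json` and no `Authorization` header.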
It also pairs naturally with the main agent API at [`/mcp`](/api/mcp-protocol/). An agent that’s already authenticated for session work can add the docs service as a second `mcpServers` entry — the two are independent endpoints with independent auth.

## Limits

* **Search hit cap: 50.** Any `limit` above 50 is silently clamped server-side. For corpus-wide enumeration use `docs_list`, not a huge `docs_search` limit.
* **No streaming.** All three tools are request/response. There is no `tools/streaming` variant — bodies are small enough that streaming would add latency without benefit.
* **No live refresh.** The corpus is loaded once at service boot and held immutable for the life of the process. Docs changes land in the corpus at the **next deploy** of the `@ggui-private/mcp-docs` service — typically when `main` ships to `mcp.ggui.ai`. Expect a lag of minutes to hours between a docs PR merging and the new content appearing in tool responses.
* **Markdown only.** The corpus walker reads `.md` files. Images, code samples in separate files, and any non-markdown content are not indexed. The agent gets the markdown the docs site renders from; it does **not** get screenshots, generated diagrams, or compiled HTML.
* **Anonymous.** Anyone can call these tools without identifying themselves. Do not assume there is a per-user audit trail on this surface — there isn’t. (Rate limiting at the edge still applies; behavior under high load is best-effort.)

## See also

* [MCP services architecture](/architecture/mcp-services/) — how anonymous services compose with auth’d services on one host
* [MCP Protocol Reference](/api/mcp-protocol/) — the main agent API at `mcp.ggui.ai`
* [LLM agents](/agents/) — narrative track for building agents against ggui

# MCP Protocol Reference

> Wire-level reference for the ggui MCP HTTP API — tools, inputs, return shapes, and a worked curl flow against self-hosted `ggui serve`.
The ggui MCP API is [Model Context Protocol](https://modelcontextprotocol.io/) over HTTP with JSON-RPC 2.0. This page is the wire reference: tool names, input shapes, return shapes, error codes, and a complete curl walkthrough. Three-noun vocab in play on this page: **blueprint** (a cached recipe routed by `BlueprintSearch` over the draft’s contract + variance axes), **tool** (an agent-side MCP method the LLM invokes), **gadget** (a renderer-side capability the generated component imports). If those terms are new, skim the [glossary](/glossary/) first. ## Endpoint [Section titled “Endpoint”](#endpoint) **Self-hosted (`ggui serve`):** ```plaintext POST http://127.0.0.1:6781/mcp ``` ## Authentication [Section titled “Authentication”](#authentication) Local dev: start the server with `ggui serve --dev-allow-all` and any bearer (conventionally `dev`) authenticates as the `builder` identity: ```plaintext Authorization: Bearer dev Content-Type: application/json ``` Default `ggui serve` is **strict** — only pairing-minted bearers authenticate `/mcp`. Pair a key via the pair code the server prints at boot, or mint one locally with `ggui keys create --keys-file ` (the same file `ggui serve --keys-file` reads). The bare `createGguiServer` factory defaults to dev-allow-all until you pass `auth` — swap in a real `AuthAdapter` before exposing the port beyond `127.0.0.1`. Hosted ggui (coming soon) will use OAuth 2.0 with Dynamic Client Registration — Claude Desktop and other MCP-Apps hosts run the ceremony for you; raw-HTTP callers present the issued bearer token on every call. See [OAuth on mcp.ggui.ai](/api/oauth/) for the ceremony. ## Render Lifecycle [Section titled “Render Lifecycle”](#render-lifecycle) ```plaintext 1. initialize → MCP handshake (one-shot per connection) 2. ggui_list_gadgets → Optional: fetch the gadget catalog 3. ggui_handshake → Negotiate the wire (handshakeId + suggestion) 4. ggui_render → Materialize the UI (mints sessionId) 5. 
ggui_consume → Long-poll for user events (keyed by sessionId) 6. ggui_update → Mutate props in place (never re-render) 7. ggui_emit → Optional: push frames on a streamSpec channel ``` Renders decay implicitly via TTL — there is no explicit close ceremony. `ggui_update` and `ggui_consume` are keyed by `sessionId` (globally unique); the server tenancy-checks via the bearer token. ## Rendering Pipeline [Section titled “Rendering Pipeline”](#rendering-pipeline) The rendering decision is made during `ggui_handshake`. The negotiator runs `BlueprintSearch` plus contract validation in parallel and returns a routed `suggestion` whose `origin` tag tells the agent which branch fired: 1. **`origin: 'cache'`** — exact or semantic match against a registered blueprint. Free, deterministic reuse on the paired render. 2. **`origin: 'agent'`** — no cache hit, but the agent’s draft passed validation. Gen runs against the agent’s contract verbatim. 3. **`origin: 'synth'`** — no cache hit AND validation surfaced amendments. The server amends the draft and gen runs against the amended contract. The handshake response carries `handshakeId`, `action`, and `suggestion` (always with a provisional `blueprintMeta`). The agent then calls `ggui_render` with the `handshakeId` plus `props`: omit `override` to reuse the suggestion’s provisional `blueprintId` as-is, or pass `override: {contract?, variance?}` to mint a fresh `blueprintId` against a re-aimed draft. *** ## Agent system prompt [Section titled “Agent system prompt”](#agent-system-prompt) The canonical posture-only system prompt for any agent calling these tools is exported as a string constant: ```ts import { GGUI_AGENT_SYSTEM_PROMPT } from "@ggui-ai/protocol"; ``` Use it as-is. It teaches the wire flow (handshake → render → consume), the three rendering origins, and when to call which tool — without baking in any product-specific persona. 
Per-tool `description` strings on each `ggui_*` MCP tool reinforce the same flow at the tool layer, so the agent has two consistent signals during planning. Roll your own system prompt only when you have a domain-specific persona to layer on top. In that case, **concatenate**, don’t replace: keep `GGUI_AGENT_SYSTEM_PROMPT` first, then append your additions. Replacing it removes the wire-flow teaching, and the agent will misuse the toolset. The prompt source lives at `packages/protocol/src/recommended-prompts.ts` and ships with `@ggui-ai/protocol` for every consumer language the protocol package targets. *** ## Tools [Section titled “Tools”](#tools) | Tool | Purpose | | ----------------------------------------------------------------- | -------------------------------------------------------------------------------------------- | | [`ggui_handshake`](#ggui_handshake) | Negotiate the wire surface before rendering — returns `handshakeId` + a routed `suggestion`. | | [`ggui_render`](#ggui_render) | Materialize the UI; mints `sessionId`. | | [`ggui_consume`](#ggui_consume) | Long-poll buffered user events on one GguiSession. | | [`ggui_update`](#ggui_update) | Mutate props on a delivered GguiSession in place. | | [`ggui_emit`](#ggui_emit) | Push a delivery onto a declared `streamSpec` channel. | | [`ggui_get_session`](#ggui_get_session) | Read GguiSession state + activity timestamps. | | [`ggui_list_sessions`](#ggui_list_sessions) | Enumerate GguiSessions by host conversation (resume flows). | | [`ggui_list_gadgets`](#ggui_list_gadgets) | Fetch the renderer-side gadget catalog before authoring a contract. | | [`ggui_list_themes`](#ggui_list_themes) | List the theme presets usable via `ggui_render({themeId})`. | | [`ggui_list_featured_blueprints`](#ggui_list_featured_blueprints) | Enumerate builder-curated featured blueprints. | | [`ggui_search_blueprints`](#ggui_search_blueprints) | Semantic search across this app’s blueprints. 
| | [`ggui_render_blueprint`](#ggui_render_blueprint) | Resolve a registered blueprint id to its compiled bundle. | | [`ggui_discover`](#ggui_discover) | Platform capability discovery (hosted-only, coming soon). | | [`ggui_request_credential`](#ggui_request_credential) | OAuth consent proxy (hosted-only, coming soon). | ### `ggui_handshake` [Section titled “ggui\_handshake”](#ggui_handshake) Negotiate the wire surface for a UI. Call BEFORE `ggui_render`. The agent posts a draft; the server runs blueprint-search + contract-validation in parallel and returns a routed `suggestion` the agent then accepts or overrides on render. **Top-level fields:** | Field | Type | Required | Description | | ---------------- | --------- | -------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | `intent` | `string` | Yes | Concise semantic identity — same intent across calls = same component reused. Example: `"Gmail inbox for email triage"`. | | `blueprintDraft` | `object` | Yes | Single-field draft wrapping the agent’s `contract` (required) plus optional `variance` and optional `generator` slug hint. The contract drives blueprint-search embed/structural axes; variance feeds the variance axis. | | `forceCreate` | `boolean` | No | Skip blueprint-search and route straight to validation + agent-mode suggestion against the draft. Use after a prior handshake returned an unwanted cache suggestion. | **Returns:** `{ handshakeId, action, suggestion, nextStep? }` | Field | Type | Description | | ------------- | -------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `handshakeId` | `string` | Stable id — pass to `ggui_render`. 
Records are SINGLE-USE and expire after 10 minutes. | | `action` | `enum` | One of `create` / `reuse` / `update` / `replace` / `declined`. | | `suggestion` | `object` | Routed suggestion. Carries `origin: 'cache' \| 'agent' \| 'synth'`, an always-present provisional `blueprintMeta` (incl. `blueprintId`), and conditional `amendments` (synth-only) / `validationFindings` (soft on cache). | | `nextStep` | `object` | Wire-shape recovery hint — `{tool: 'ggui_render', example}` worked-literal of the next call. | The agent branches the paired `ggui_render` on `suggestion.origin`: any origin can be accepted (reuse the provisional `blueprintId` verbatim) or overridden (mint fresh against a new draft). ### `ggui_render` [Section titled “ggui\_render”](#ggui_render) Materialize the UI. Step 3 of the three-step handshake protocol. `handshakeId` and `props` are REQUIRED. Commit relative to the handshake’s suggestion by PRESENCE of `override`: omit it to ACCEPT the suggestion as-is, or provide `override: {contract?, variance?}` to re-aim (PATCH semantics). ```json // ACCEPT the suggestion as-is { "handshakeId": "h_…", "props": {…} } // re-draft the contract (cold-gen) { "handshakeId": "h_…", "props": {…}, "override": { "contract": {…} } } // re-aim the variant axis { "handshakeId": "h_…", "props": {…}, "override": { "variance": {…} } } ``` | Field | Type | Required | Description | | ------------- | -------- | -------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | `handshakeId` | `string` | Yes | From a prior `ggui_handshake` response. | | `props` | `object` | Yes | Runtime prop values for THIS render. Validated against the effective contract’s `propsSpec`; failures fail the render with a recoverable `ContractViolationError`. Pass `{}` when the contract declares no `propsSpec`. 
| | `themeId` | `string` | No | Per-render theme preset override — wins over `App.defaultThemeId` for THIS render. Discover ids via `ggui_list_themes`. Omit to inherit the app theme. | | `infra` | `object` | No | `{model?}` — provider-prefixed per-render model override (e.g. `anthropic/claude-haiku-4-5`). Strict — unknown keys are rejected. | | `override` | `object` | No | Omit to ACCEPT the suggestion as-is. Provide `{contract?, variance?}` to re-aim: `override.contract` re-drafts the contract (STRICT — must already conform) and cold-gens; `override.variance` re-aims the variant axis. | **Returns:** `{ sessionId, resourceUri, action, contractHash, blueprintId, variantKey, cache, nextStep? }` | Field | Type | Description | | -------------- | -------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `sessionId` | `string` | Globally-unique id (UUID) for the delivered render. Use for `ggui_consume` / `ggui_update`. | | `resourceUri` | `string` | Spec-canonical MCP-Apps entry point — `ui://ggui/render/`, mirrored on the tool result’s `_meta.ui.resourceUri`. A host mounts the render from this; there is no clickable URL on the wire. | | `action` | `enum` | One of `create` / `reuse` / `update` / `replace` / `declined`. | | `contractHash` | `string` | Canonical hash of the rendered data contract (shape only — fields, types, specs). Same hash ⟺ same data flow. | | `blueprintId` | `string` | Opaque id of the materialised component. Equal across two renders ⟺ the same cached component was served (a fresh gen mints a new id). | | `variantKey` | `string` | Canonical hash of the design-time variance. With `contractHash` it forms the reuse key. | | `cache` | `object` | Reuse outcome — `{ hit, similarity?, cachedBlueprintId?, llmCallsAvoided, kind?, reason? }`. 
| | `nextStep` | `object` | Emitted ONLY when the rendered contract has a non-empty `actionSpec`. Points at `ggui_consume({sessionId})` for the inbound action loop. Pure-display renders get no `nextStep`. |

The render consumes the handshake record. Bootstrap credentials (`wsUrl`, `wsToken`, `expiresAt`) reach the iframe via the `_meta["ai.ggui/render"]` slice, not via this response.

### `ggui_consume`

Long-poll for buffered user events on one render. Events drain on read (consume-once semantics). Call this right after every `ggui_render` whose response carries `nextStep.tool === 'ggui_consume'`.

| Field       | Type     | Required | Description |
| ----------- | -------- | -------- | ----------- |
| `sessionId` | `string` | Yes      | Render to consume from. Globally unique. |
| `timeout`   | `number` | No       | Long-poll seconds — integer in `[0, 25]`; `0` = immediate (default). Values outside `[0, 25]` are rejected with `INVALID_PARAMS`. Returns on the first event or at timeout; re-call on empty to keep waiting — a longer wait is your loop, not a bigger timeout (pick 5–15s typical, 25 max). |

**Returns:** `{ events: ConsumeEventEntry[], status: "active" | "expired", client? }`

Keyed by `sessionId`. THE LOOP: when `events` is non-empty, react (commonly via `ggui_update` to refresh the iframe), then re-call `ggui_consume`. The render stays `active` until its TTL elapses — `status: "expired"` means no more events will arrive, so the long-poll loop terminates. Exit when you have the events you need, or when `status` is `expired`.

The optional `client` field echoes mid-render host observations (window resize, fullscreen toggle, etc.) without forcing a fresh handshake.
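The loop described above can be sketched as follows. This is an illustration under stated assumptions: `CallTool` is a hypothetical transport function standing in for whatever MCP client you use, `consumeLoop` and `onEvent` are hypothetical names, and only the `ggui_consume` input and return fields come from this reference.

```ts
// Trimmed view of the documented return shape; event entries are left opaque here.
interface ConsumeResult {
  events: unknown[];
  status: "active" | "expired";
}

// Hypothetical transport: sends one MCP tool call and resolves with its structured result.
type CallTool = (name: string, args: Record<string, unknown>) => Promise<ConsumeResult>;

async function consumeLoop(
  callTool: CallTool,
  sessionId: string,
  onEvent: (event: unknown) => Promise<boolean> | boolean, // return true to stop early
  timeout = 10, // long-poll seconds; must stay within the documented [0, 25] range
): Promise<"done" | "expired"> {
  for (;;) {
    const { events, status } = await callTool("ggui_consume", { sessionId, timeout });
    // Events drain on read, so handle every entry in the batch before polling again.
    for (const event of events) {
      if (await onEvent(event)) return "done";
    }
    // An expired render will never deliver more events: stop polling.
    if (status === "expired") return "expired";
    // Otherwise (empty or fully handled batch on an active render) re-call to keep waiting.
  }
}
```

Keeping the per-call `timeout` short and letting the loop supply the long wait matches the guidance in the `timeout` field description.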
### `ggui_update`

Mutate props on a delivered render in place. Targets `sessionId` directly. Discriminated on `kind`.

```json
// FULL replacement
{ "sessionId": "…", "kind": "replace", "props": { … } }

// RFC 7396 JSON Merge Patch
{ "sessionId": "…", "kind": "merge", "patch": { … } }
```

| Field       | Type     | Required   | Description |
| ----------- | -------- | ---------- | ----------- |
| `sessionId` | `string` | Yes        | The render to mutate (UUID from `ggui_render` response). |
| `kind`      | `enum`   | Yes        | `'replace'` — the `props` map IS the new state. `'merge'` — apply RFC 7396 JSON Merge Patch (null deletes a key; arrays fully replace, NOT element-wise). |
| `props`     | `object` | If replace | Full replacement props map. Required when `kind: 'replace'`. |
| `patch`     | `object` | If merge   | RFC 7396 patch. Required when `kind: 'merge'`. |

**Returns:** `{ sessionId, updated, resourceUri }`

`resourceUri` is unchanged from the initial render — the same `ui://ggui/render/{sessionId}` the mount stamped. Both modes validate the final props state (post-merge for `merge`) against the render’s `propsSpec` and reject on violation. Use for partial UI state changes; for a structurally different surface, handshake + render a fresh one. Post-update the iframe receives the new props via the live-channel `props_update` WS frame.

### `ggui_emit`

Emit a new delivery on a declared `streamSpec` channel of the render. The agent describes WHAT new data exists; the server stamps the canonical `StreamEnvelope` (mode derived from `streamSpec[channel].mode`, `seq` + `timestamp` server-assigned). Validates `payload` against the channel’s declared `schema` and rejects undeclared channels at call time.
| Field       | Type      | Required | Description |
| ----------- | --------- | -------- | ----------- |
| `sessionId` | `string`  | Yes      | Render to stream to. Server enforces app-ownership. |
| `channel`   | `string`  | Yes      | Channel name declared on the render’s `streamSpec`. Undeclared channels reject. |
| `payload`   | `unknown` | Yes      | Delivery payload. Validated against `streamSpec[channel].schema`. |
| `complete`  | `boolean` | No       | Terminal-delivery marker. Only valid when the channel was declared with `complete: true` on the streamSpec; setting it on a non-completable channel rejects. |

**Returns:** `{ accepted }`

`accepted: true` means the server validated and enqueued the envelope at the boundary. No-subscriber is NOT an error — buffered retention and live fan-out happen independently. The server-assigned `seq` is observable on the delivered `StreamEnvelope` (the live-channel `data` WS frame), not on this tool result.

### `ggui_get_session`

Retrieve GguiSession state — id, appId, event sequence, activity timestamps. Bumps the activity heartbeat on every successful read. Omits `componentCode` + `sourceCode` (those live on the renderable surface, not the agent-visible one).

| Field       | Type     | Required | Description             |
| ----------- | -------- | -------- | ----------------------- |
| `sessionId` | `string` | Yes      | GguiSession to inspect. |

**Returns:** `{ id, appId, eventSequence, createdAt, lastActivityAt, expiresAt }`

`createdAt` / `lastActivityAt` / `expiresAt` are epoch milliseconds (numbers).

### `ggui_list_sessions`

Enumerate this app’s GguiSessions by host conversation — the lookup behind resume flows.
Matches on the `_meta["ai.ggui/host-session"]` pair (`hostName` + `hostSessionId`) captured at render creation; sessions created without that slice never match host-scoped queries. | Field | Type | Required | Description | | --------------- | -------- | -------- | ----------------------------------------------------------------------------------------------------------- | | `hostName` | `string` | No | Filter by host identifier (`claude.ai`, `sample`, …). Pair with `hostSessionId` to target one conversation. | | `hostSessionId` | `string` | No | The host’s opaque conversation-grouping key (e.g. a claude.ai thread id). Typically paired with `hostName`. | | `limit` | `number` | No | Max rows, 1–200. Default 50. Newest-last ordering matches the conversation timeline. | **Returns:** `{ sessions: [{ sessionId, hostName?, hostSessionId?, createdAt, lastActivityAt, status, wsToken?, wsTokenExpiresAt? }] }` `createdAt` / `lastActivityAt` are ISO 8601 strings here. The `wsToken` pair is present only when the deployment wires a `mintWsToken` seam — resume flows use it to remount each iframe without a fresh handshake. ### `ggui_list_gadgets` [Section titled “ggui\_list\_gadgets”](#ggui_list_gadgets) Return the catalog of renderer-side **gadgets** the UI may import via the package-keyed `clientCapabilities.gadgets` map of a `DataContract`. Call this BEFORE authoring a contract so the catalog you seed only references gadgets the renderer will actually serve. Returns the per-app catalog: the 7-hook stdlib package (`@ggui-ai/gadgets` 0.3.0) is the structural floor; gadgets declared in `ggui.json#app.gadgets` layer on top (declared wins on a package collision). | Field | Type | Required | Description | | ------- | -------- | -------- | ---------------------------------------------------------------------------------------------------------------------------------------------- | | `appId` | `string` | No | The app whose catalog to fetch. 
Defaults to the caller-resolved appId from the auth header. Explicit mismatch surfaces as `app_access_denied`. | **Returns:** `{ gadgets: GadgetDescriptor[] }` Each `GadgetDescriptor` is a gadget PACKAGE: `{ package, version, exports: GadgetExport[], … }` — package-level identity plus transport metadata (`bundleUrl` / `bundleHost` / `bundleSri` / `styleUrl` / `connect` / `requires` / `typesUrl` / `typesSri` for non-stdlib packages). Each `GadgetExport` is a field-presence-discriminated union — a hook export `{ hook, description?, usage?, example?, gotchas?, permission?, required? }` or a component export `{ component, description?, usage?, example?, gotchas?, permission?, required? }`. Full entry shape: [SDK gadgets guide](/sdk/gadgets/). ### `ggui_list_themes` [Section titled “ggui\_list\_themes”](#ggui_list_themes) Return the theme presets an agent may apply per render via `ggui_render({ themeId })`. When the app configures an `availableThemeIds` allowlist, the catalog is filtered to it (catalog order preserved; unregistered ids silently dropped). | Field | Type | Required | Description | | ------- | -------- | -------- | ---------------------------------------------------------------------------------------------------------------------------------------------------- | | `appId` | `string` | No | The app whose theme catalog to fetch. Defaults to the caller-resolved appId from the auth header. Explicit mismatch surfaces as `app_access_denied`. | **Returns:** `{ themes: [{ id, name, description, modes }] }` `modes` lists the variants each preset ships (`light` / `dark`). ### `ggui_list_featured_blueprints` [Section titled “ggui\_list\_featured\_blueprints”](#ggui_list_featured_blueprints) Enumerate the builder-curated featured blueprints declared via the server’s blueprint catalog (typically `ggui.json#blueprints.include` for OSS deployments). Returns an empty list when no catalog is wired. **Inputs:** none. 
**Returns:** `{ blueprints: BlueprintEntry[], total }` Pair with `ggui_search_blueprints` for semantic lookup or `ggui_render_blueprint` to materialize one directly. ### `ggui_search_blueprints` [Section titled “ggui\_search\_blueprints”](#ggui_search_blueprints) Semantic search across this app’s blueprints — both manifest-declared UIs (`ggui.json#blueprints.include`) and previously cached generations. Matches by name/description against the manifest source and by cosine similarity against the semantic vector index; results merge + dedupe by id (manifest wins on collision) and sort by score descending. | Field | Type | Required | Description | | ------- | -------- | -------- | ---------------------------------------------------------- | | `query` | `string` | Yes | Natural-language description of the UI you’re looking for. | | `limit` | `number` | No | Max results. Default 10. Maximum 100. | **Returns:** `{ results, total, query }` Each result row carries `{ id, name, description, category, props, callbacks, featured, relevance: 'match', score }`. `score` is 0–1 (cosine similarity for semantic hits; `1.0` for exact manifest-name matches, `0.7` for manifest substring matches). Agents use `score` to decide whether to reuse a blueprint or generate from scratch. ### `ggui_discover` [Section titled “ggui\_discover”](#ggui_discover) Return platform capabilities (protocol version, supported content types, shell types, adapter types, component-capability catalog) and — when the bearer token resolves to a known app — that app’s enabled adapters / granted capabilities / auth mode / rate limit. Call BEFORE the first handshake when you need to branch on what this deployment supports. **Inputs:** none. **Returns:** `{ protocolVersion, contentTypes, shellTypes, adapterTypes, componentCapabilities, app? 
}`

| Field | Type | Description |
| --- | --- | --- |
| `protocolVersion` | `string` | ggui protocol revision (prelaunch drafts use `draft-YYYY-MM-DD`; first frozen release will be `1.0.0`). |
| `contentTypes` | `string[]` | Bundle content types this deployment serves (e.g. `application/javascript+react`). |
| `shellTypes` | `string[]` | Available shell flavors (`chat`, `fullscreen`, `spatial`). |
| `adapterTypes` | `string[]` | Adapter families wired on this deployment (`voice`, `camera`, `location`, `bluetooth`). |
| `componentCapabilities` | `string[]` | Informational capability vocabulary. The load-bearing per-app grant lives on the operator-registered `GadgetExport.permission` in `App.gadgets` (registry side) — not on the contract wire. |
| `app` | `object?` | Present when the bearer token resolves to a known app. `{ enabledAdapters?, grantedCapabilities?, defaultShellType?, authMode?, rateLimitPerMinute? }`. |

### `ggui_request_credential`

Request OAuth consent from the end user via the Portal’s consent overlay. Blocks up to 25 seconds polling for the user’s choice (Allow once / Always allow / Deny). Short-circuits with `granted: true` when a prior grant already exists for this user + app + service.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `serviceId` | `string` | Yes | OAuth service identifier (matches an `McpServiceConfig` entry — e.g. `"bashdoor"`, `"ubot"`). |
| `reason` | `string` | No | One-line rationale shown to the user inside the consent overlay. |
| `sessionId` | `string` | No | Existing render to surface the consent UI into. Required to actually surface the overlay. |

**Returns:** `{ granted, mode?, service?, reason? }`

| Field | Type | Description |
| --- | --- | --- |
| `granted` | `boolean` | Whether the user (or a prior grant) approved. |
| `mode` | `'once' \| 'always'` | Grant mode when `granted: true`. Absent on denial / timeout. |
| `service` | `{ name, icon }` | Display info pulled from `McpServiceConfig`. Echoed back for UI parity. |
| `reason` | `string` | Denial / timeout / error rationale when `granted: false`. |

### `ggui_render_blueprint`

Resolve a registered blueprint id to its compiled JS bundle, inline. The OSS handler reads the manifest entry via the server’s `UiRegistry`, compiles on demand from the colocated TSX (`@ggui-ai/dev-stack::LocalUiRegistry` is the reference impl), and returns the bundle as a single JSON field. Fails with a clear error when the id is unknown or no bundle is available. Only registered when the server boots with a `UiRegistry` seam — otherwise the tool is omitted from `tools/list` entirely.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `blueprintId` | `string` | Yes | Stable blueprint id declared via `ggui.ui.json#id`. Must match an entry in this server’s UI registry. |

**Returns:** `{ blueprintId, blueprintName, code, contentType }`

`code` is the compiled JS bundle as a string (ESM `export default` producing the component to mount). `contentType` is typically `'application/javascript+react'` — pinned by the server’s compile pipeline. The caller mounts `code` directly; no second round-trip is required.

***

## Events

Two distinct shapes are in play.
Don’t conflate them:

* **`ActionEnvelope`** — live-channel inbound on the WebSocket subscribe seam. Used by browser/SDK consumers that listen to live events (e.g. `@ggui-ai/wire`’s `useRender`). See [WebSocket Protocol](/api/websocket-protocol/).
* **`ConsumeEventEntry`** — per-gesture row on the render-keyed consume pipe, returned by `ggui_consume`. This is what agents read.

### `ActionEnvelope` (live-channel inbound)

```typescript
// The type parameter was lost in extraction and is restored here;
// the `unknown` default is an assumption.
interface ActionEnvelope<TPayload = unknown> {
  sessionId: string;
  type: EventType;
  payload?: TPayload; // For `data:submit`: { action, data?, tool? }
  clientSeq?: number; // client-monotonic, for at-least-once dedup
}
```

### `ConsumeEventEntry` (consume pipe)

```typescript
interface ConsumeEventEntry {
  readonly type: "action";
  readonly sessionId: string;
  readonly intent: string; // which actionSpec[*] fired
  readonly actionData: JsonValue | null; // matches actionSpec[intent].schema
  readonly uiContext: JsonObject; // contextSpec slot snapshot at gesture time
  readonly actionId: string; // 8-hex FNV-1a correlation id
  readonly firedAt: string; // ISO 8601 UTC
}
```

Both envelopes are flat — no nested `event` / `context` / `meta` blocks. Diagnostic render metadata (device info, interface context) lives on the render at subscribe time, not per-delivery.

### Event Types

`EventType` has exactly one member, `data:submit`.

| Type | Category | Description |
| --- | --- | --- |
| `data:submit` | Data | User gesture surfaced as a consume event |

The pre-actionSpec multi-event vocabulary (`data:change`, `lifecycle:*`, `interaction:*`, `error:*`) was deleted in draft-2026-06-12 — it never had a first-party producer.
Today’s actionSpec-driven flow surfaces every user gesture as a `data:submit` `ConsumeEventEntry`. There is no other event vocabulary — agent code reads from `ggui_consume` and only needs to recognize the `data:submit` shape.

***

## Error Codes

Names mirror the `MCP_ERROR_CODES` / `PLATFORM_ERROR_CODES` constants exported from `@ggui-ai/protocol`. The `-32010` range is the ggui platform-extension block, not part of the core protocol.

| Code | Name | Description |
| --- | --- | --- |
| `-32700` | `PARSE_ERROR` | Invalid JSON in request |
| `-32600` | `INVALID_REQUEST` | Not a valid JSON-RPC object |
| `-32601` | `METHOD_NOT_FOUND` | Unknown method name |
| `-32602` | `INVALID_PARAMS` | Missing or invalid tool arguments |
| `-32603` | `INTERNAL_ERROR` | Server-side failure |
| `-32001` | `UNAUTHORIZED` | Invalid token or app ID |
| `-32002` | `SESSION_NOT_FOUND` | Session expired or deleted |
| `-32003` | `APP_NOT_FOUND` | App ID does not exist |
| `-32004` | `PRODUCTION_FAILED` | UI production failed |
| `-32005` | `CAPABILITY_DENIED` | Requested capability not granted |
| `-32010` | `GENERATION_QUOTA_EXCEEDED` | Platform: generation quota exhausted |
| `-32011` | `APP_LIMIT_EXCEEDED` | Platform: app-count ceiling reached |
| `-32012` | `CONCURRENT_SESSION_LIMIT` | Platform: too many live sessions |
| `-32013` | `RATE_LIMIT_EXCEEDED` | Platform: reserved rate-limit code |
| `-32020` | `CONTRACT_VIOLATION` | Platform: contract validation failed |

***

## Supported Models

There is no fixed model menu. The operator sets the per-app default via `ggui.json#generation.model` — any provider-prefixed route, written `provider:model` (canonical) or LiteLLM-style `provider/model`, e.g. `anthropic:claude-haiku-4-5-20251001`.
Providers on the self-hosted BYOK path: `anthropic`, `openai`, `google`, `openrouter` (`bedrock` routes are hosted-runtime-only and rejected by `ggui serve`). Agents can override the model per render via `ggui_render({ infra: { model } })` with a provider-prefixed id (e.g. `anthropic/claude-haiku-4-5`).

***

## Example: Full Render Flow (curl)

This walkthrough runs against self-hosted `ggui serve` (started with `--dev-allow-all`). Hosted `mcp.ggui.ai` (coming soon) will speak the same wire — only the URL and bearer change.

```bash
# 1. Initialize
curl -X POST http://127.0.0.1:6781/mcp \
  -H "Authorization: Bearer dev" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "id": 1,
    "method": "initialize",
    "params": {
      "protocolVersion": "2025-06-18",
      "clientInfo": { "name": "curl", "version": "1.0" },
      "capabilities": {}
    }
  }'

# 2. Handshake — negotiate the wire surface
#    (blueprintDraft carries the agent's contract)
curl -X POST http://127.0.0.1:6781/mcp \
  -H "Authorization: Bearer dev" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "id": 2,
    "method": "tools/call",
    "params": {
      "name": "ggui_handshake",
      "arguments": {
        "intent": "Contact form",
        "blueprintDraft": {
          "contract": { "propsSpec": {}, "actionSpec": {} }
        }
      }
    }
  }'

# 3. Render — accept the handshake suggestion verbatim (mints sessionId)
curl -X POST http://127.0.0.1:6781/mcp \
  -H "Authorization: Bearer dev" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "id": 3,
    "method": "tools/call",
    "params": {
      "name": "ggui_render",
      "arguments": { "handshakeId": "hs_…", "props": {} }
    }
  }'

# 4. Poll for events (keyed by sessionId from step 3)
curl -X POST http://127.0.0.1:6781/mcp \
  -H "Authorization: Bearer dev" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "id": 4,
    "method": "tools/call",
    "params": {
      "name": "ggui_consume",
      "arguments": { "sessionId": "", "timeout": 15 }
    }
  }'

# Renders decay implicitly via TTL — no explicit close.
```

***

## See Also

* [Protocol overview](/protocol/overview/) — the three channels at a glance
* [WebSocket Protocol](/api/websocket-protocol/) — live-channel events
* [MCP Apps](/api/mcp-apps/) — inline rendering inside MCP hosts
* [OAuth on mcp.ggui.ai](/api/oauth/) — hosted auth ceremony (coming soon)
* [Ops MCP route](/api/ops-mcp/) — operator agent tools at `/ops`
* [Docs MCP route](/api/mcp-docs/) — anonymous docs search at `/docs`
* [Playground · Todos](/clients/playground-todos/) — hosted demo service at `/playground/todos` (coming soon)
* [Playground · MDH](/clients/playground-mdh/) — hosted demo service at `/playground/mdh` (coming soon)
* [MCP services](/architecture/mcp-services/) — mounting standalone services on one server
* [Glossary](/glossary/) — gadget / tool / blueprint, plus every other term

# OAuth (self-hosted)

> OAuth 2.1 + PKCE + Dynamic Client Registration wire format for a self-hosted @ggui-ai/mcp-server (enable with `oauth: true` / `ggui serve --oauth`), with the access-token-IS-the-API-key trade-off explained.
A self-hosted [`@ggui-ai/mcp-server`](/oss-quickstart/) — enabled with `oauth: true` on the factory or `ggui serve --oauth` — implements [OAuth 2.1](https://datatracker.ietf.org/doc/html/draft-ietf-oauth-v2-1) with [PKCE](https://datatracker.ietf.org/doc/html/rfc7636), [Dynamic Client Registration](https://datatracker.ietf.org/doc/html/rfc7591) (RFC 7591), the [Protected Resource Metadata](https://datatracker.ietf.org/doc/html/rfc9728) discovery profile (RFC 9728), and the [Resource Indicators](https://datatracker.ietf.org/doc/html/rfc8707) extension (RFC 8707) for per-app token scoping. MCP-aware hosts can connect with just a server URL — no manually-issued `client_id`, no shared secret. To onboard a remote host (Claude Desktop, claude.ai, Goose) against your local server, front it with a tunnel (ngrok / cloudflared) and pass `--public-base-url=` so the discovery URLs resolve from the host’s browser.

This page is the wire reference. If you only want to **connect a client** ([Claude Desktop](/clients/claude-desktop/), claude.ai, Goose, VS Code Copilot), the host handles everything — skip this page. Read on if you’re building a host, debugging a custom client, or operating a deployment of `@ggui-ai/mcp-server`.

## The flow at a glance

```plaintext
Client                            Server                           User
│                                    │                               │
│── GET /mcp (no auth) ─────────────→│                               │
│← 401 WWW-Authenticate:             │                               │
│   Bearer realm="mcp",              │                               │
│   resource_metadata="" ────────────│                               │
│                                    │                               │
│── GET /.well-known/                │                               │
│   oauth-protected-resource ───────→│                               │
│← { authorization_servers } ────────│                               │
│                                    │                               │
│── GET /.well-known/                │                               │
│   oauth-authorization-server ─────→│                               │
│← { authorization_endpoint, ... } ──│                               │
│                                    │                               │
│── POST /oauth/register ───────────→│  (RFC 7591 DCR — no client_secret)
│← { client_id } ────────────────────│                               │
│                                    │                               │
│── open /oauth/authorize ──────────→│── in-process consent form ───→│
│                                    │                               │── paste key / pair code + Approve
│                                    │← form POST {api_key, params} ─│
│← 302 → redirect_uri?code=… ────────│                               │
│                                    │                               │
│── POST /oauth/token                │                               │
│   {code, code_verifier} ──────────→│                               │
│← { access_token: } ────────────────│                               │
│                                    │                               │
│── GET /mcp                         │                               │
│   Authorization: Bearer ──────────→│                               │
│← MCP session ──────────────────────│                               │
```

## The access token IS the API key

The simplest possible bridge between MCP’s OAuth-required client UX and ggui’s existing API-key model:

* **No parallel token table.** The `access_token` returned at `/oauth/token` is verbatim the bearer key your `AuthAdapter` accepts on `/mcp` — pasted (or, with `ggui serve`, exchanged from the terminal pair code) during consent.
* **No translation in the request hot path.** An authenticated `/mcp` request looks identical to a static-bearer request — `Authorization: Bearer `.
* **No refresh dance.** The access token TTL equals the API key TTL. Revoke the key in your store and the client’s next request returns `401`. Re-auth is one OAuth round trip — one paste from the user.
* **One audit surface.** Every connected client appears as a single row in your keys list, labelled with the `client_name` from DCR.

The trade-off: there’s no “this token came from OAuth” flag — the key works identically whether minted by your operator tooling or the OAuth flow. If that distinction matters to your operator policy, plug in your own `OAuthStorage` + `AuthAdapter` (see [Storage seam](#storage-seam) below).
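The authorize and token steps in the flow above depend on a PKCE verifier/challenge pair: the client sends `code_challenge` to `/oauth/authorize` and later proves possession with `code_verifier` at `/oauth/token`. A minimal client-side sketch, assuming a Node.js runtime with `node:crypto`; the helper names `pkceChallenge` and `newPkcePair` are hypothetical and not part of any ggui package:

```typescript
import { createHash, randomBytes } from "node:crypto";

// S256 challenge: base64url(SHA256(verifier)), the value sent as `code_challenge`.
function pkceChallenge(verifier: string): string {
  return createHash("sha256").update(verifier, "ascii").digest("base64url");
}

// Hypothetical helper: mint a fresh verifier/challenge pair for one authorization attempt.
function newPkcePair(): { verifier: string; challenge: string } {
  // 32 random bytes encode to 43 URL-safe characters, the RFC 7636 minimum length.
  const verifier = randomBytes(32).toString("base64url");
  return { verifier, challenge: pkceChallenge(verifier) };
}

const pair = newPkcePair();
console.log(pair.verifier.length, pair.challenge.length);
```

The client keeps `verifier` private until the token exchange; only `challenge` travels through the browser redirect.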
## Endpoints

### `GET /.well-known/oauth-protected-resource` (RFC 9728)

Tells the client where to find the authorization server. Same origin in our case — your server (here a tunnel exposing the local `ggui serve`) is both the resource and the auth server.

```json
{
  "resource": "https://your-mcp.example.com/mcp",
  "authorization_servers": ["https://your-mcp.example.com"],
  "bearer_methods_supported": ["header"],
  "resource_documentation": "https://modelcontextprotocol.io/extensions/apps/overview"
}
```

### `GET /.well-known/oauth-authorization-server` (RFC 8414)

Authorization-server metadata.

```json
{
  "issuer": "https://your-mcp.example.com",
  "authorization_endpoint": "https://your-mcp.example.com/oauth/authorize",
  "token_endpoint": "https://your-mcp.example.com/oauth/token",
  "registration_endpoint": "https://your-mcp.example.com/oauth/register",
  "response_types_supported": ["code"],
  "grant_types_supported": ["authorization_code"],
  "code_challenge_methods_supported": ["S256"],
  "token_endpoint_auth_methods_supported": ["none"],
  "scopes_supported": ["mcp"]
}
```

`token_endpoint_auth_methods_supported: ["none"]` is intentional: PKCE is the only supported client authentication, and no `client_secret` is ever issued.

### `POST /oauth/register` (RFC 7591)

Dynamic Client Registration. Issues a random `client_id`, no secret. Accepts arbitrary `redirect_uris` from the client without an allowlist — the trade-off matches the MCP spec’s pragmatism: any client willing to do PKCE + paste-key gets registered.
```bash
curl -X POST https://your-mcp.example.com/oauth/register \
  -H "Content-Type: application/json" \
  -d '{
    "redirect_uris": ["http://localhost:33418/callback"],
    "client_name": "Claude Desktop"
  }'
```

```json
{
  "client_id": "mcp_client_AbCdEf...",
  "redirect_uris": ["http://localhost:33418/callback"],
  "grant_types": ["authorization_code"],
  "response_types": ["code"],
  "token_endpoint_auth_method": "none",
  "client_name": "Claude Desktop"
}
```

### `GET /oauth/authorize`

The user-facing approval entry point. Query params (required unless marked optional):

| Param | Value |
| --- | --- |
| `response_type` | `code` |
| `client_id` | from DCR |
| `redirect_uri` | one of the URIs registered with DCR |
| `code_challenge` | base64url(SHA256(verifier)) |
| `code_challenge_method` | `S256` |
| `state` | opaque, echoed back on redirect (CSRF defense) |
| `scope` | `mcp` (advertised; not gated server-side today) |
| `resource` | optional — RFC 8707 indicator naming the target MCP endpoint (per-app routing); omit for universal scoping |

With no `consentUrl` configured (the typical self-hosted posture), the server renders an unbranded paste-key HTML form in-process — no external origin needed. When the operator does configure a `consentUrl`, the server 302s the browser there with every OAuth param forwarded plus an `mcp_origin` param the consent page uses to know where to POST back. (The hosted endpoint will point `consentUrl` at its own console — coming soon.)

### `POST /oauth/authorize` (form-encoded)

Called by the consent UI (the in-process form by default) after the user approves.
Same OAuth params echoed as hidden inputs, plus one of two credential fields:

* `pair_code` — the 6-digit code printed on the terminal banner by `ggui serve`. The server exchanges it through `PairingService.completePairing` to mint a per-server bearer. This is the easiest self-hosted path.
* `api_key` — a freshly-minted (or pasted) bearer key your `AuthAdapter` accepts.

The server validates the resulting key against the same `AuthAdapter` that gates `/mcp`, mints a 5-minute auth code, and 302s to the client’s `redirect_uri?code=…&state=…`.

### `POST /oauth/token`

Exchange the auth code for an access token.

```bash
curl -X POST https://your-mcp.example.com/oauth/token \
  -H "Content-Type: application/x-www-form-urlencoded" \
  -d "grant_type=authorization_code&code=AUTH_CODE&redirect_uri=http://localhost:33418/callback&client_id=mcp_client_…&code_verifier=ORIGINAL_VERIFIER"
```

```json
{
  "access_token": "",
  "token_type": "Bearer",
  "scope": "mcp"
}
```

No `refresh_token`, no `expires_in` — the access token’s lifetime is the underlying API key’s lifetime. Re-auth to get a new one.

If the client passed a `resource` at `/authorize` and includes one at `/token`, the two MUST match (RFC 8707 §2.2); mismatch returns `400 invalid_target`. Omitting `resource` at `/token` is tolerated even when the auth code captured one — RFC 8707 only mandates the constraint when the client opts in.

## Resource indicators (RFC 8707)

The server supports per-app token scoping via the optional `resource` query parameter at `/oauth/authorize`. Two shapes are accepted:

* **Universal** — `${issuer}` (cloud bare root) or `${issuer}/mcp` (OSS default path). Tokens bind to the user’s full app surface.
* **Per-app** — `${issuer}/apps/` where `` matches the deployment’s app-id pattern.
  The auth code, and any token minted from it, is captured against that single app target. Per-app deployments also surface their own `/.well-known/oauth-protected-resource` at the app-scoped path so RFC 9728 discovery resolves correctly when a client starts from the per-app URL.

Absent `resource`, universal scoping applies. Unknown resources are rejected at `/authorize` with `invalid_target` (RFC 8707 §2) before the consent step so the user sees the failure before pasting a key. OSS deployments can plug a `validateResource(issuer, resource)` callback into the server factory to gate which targets are accepted.

The captured resource is snapshotted onto the DCR client record (`lastResource`) so an operator console can label rows “Connected to: \” rather than “Universal”.

## OSS deployment notes

**Running the CLI?** `ggui serve --oauth` mounts this whole surface — the `/.well-known/oauth-protected-resource` and `/.well-known/oauth-authorization-server` discovery endpoints plus `/oauth/{authorize,token,register}`. It’s required for OAuth-discovery hosts (claude.ai / ChatGPT “Add custom connector”); pure-bearer clients work without it. The paste-key form accepts either a `ggui_user_*` key or the 6-digit pair code from the serve banner.

If you’re embedding [`@ggui-ai/mcp-server`](/oss-quickstart/) programmatically, pass `oauth: true` (or an `OAuthConfig`) to the server factory:

```typescript
import { createGguiServer } from "@ggui-ai/mcp-server";

const server = createGguiServer({
  // ...
  oauth: {
    issuerUrl: "https://your-mcp.example.com",
    consentUrl: "https://your-console.example.com/oauth/consent", // optional
    storage: new InMemoryOAuthStorage(), // or your own
  },
  auth: yourAuthAdapter,
});
```

### Storage seam

Auth codes and DCR clients live in `OAuthStorage`.
The default `InMemoryOAuthStorage` is fine for single-replica dev and any deployment with sticky sessions on the load balancer. Multi-replica deployments without sticky sessions need a shared backend — Redis or DynamoDB both work. The interface is two collections (auth codes, DCR clients) with `put` / `consume` / `get` methods; see `packages/mcp-server/src/oauth.ts` in the [public repo](https://github.com/ggui-ai/ggui) for the full type.

### Consent UI

`consentUrl` delegates the user-facing approval step to a separate origin. The consent UI sees both the user’s session (whatever auth model it uses) and the freshly-minted API key in plaintext — same trust boundary as the MCP server itself. Validate the `mcp_origin` param against an allowlist before posting back, otherwise an attacker can craft a URL that exfiltrates a key to a third-party origin.

Without `consentUrl`, the server renders an in-process paste-key HTML form. Functional but unbranded — fine for OSS deployers who don’t want to stand up a separate origin.

### Disable OAuth entirely

`oauth: false` (or simply omitting the option) skips the OAuth surface — `/oauth/*` and the well-known endpoints return 404, and `/mcp` falls back to the bearer-only `AuthAdapter` model. Hosts that don’t speak OAuth-DCR can still authenticate by setting a static `Authorization: Bearer …` header.

## Hosted ggui (coming soon)

The managed endpoint at `mcp.ggui.ai` will run this identical flow with two deployment-specific choices: `consentUrl` points at its own console’s consent page (so approval happens on a branded origin that mints a `ggui_user_*` key for you), and the universal MCP endpoint mounts at the bare root rather than `/mcp`. Neither is live yet — hosted ggui is not part of GGUI Preview 0.1.0, and nothing on this page depends on it.
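Returning briefly to the storage seam under OSS deployment notes: the auth-code collection with `put` / `consume` can be pictured as below. This is a sketch only. `SketchAuthCodeStore` and `AuthCodeRecord` are hypothetical names, not the real `OAuthStorage` type from the repo, and it assumes that `consume` means read-and-delete and that the 5-minute auth-code lifetime is enforced at read time.

```typescript
// Hypothetical record shape; the real fields live in packages/mcp-server/src/oauth.ts.
interface AuthCodeRecord {
  readonly clientId: string;
  readonly redirectUri: string;
  readonly codeChallenge: string;
  readonly expiresAt: number; // epoch milliseconds
}

const AUTH_CODE_TTL_MS = 5 * 60 * 1000; // 5-minute auth code lifetime

// In-memory stand-in for the auth-code half of a storage backend (assumption-laden sketch).
class SketchAuthCodeStore {
  private readonly codes = new Map<string, AuthCodeRecord>();

  put(code: string, record: Omit<AuthCodeRecord, "expiresAt">, now: number = Date.now()): void {
    this.codes.set(code, { ...record, expiresAt: now + AUTH_CODE_TTL_MS });
  }

  // Single use: the entry is deleted on first read, whether or not it has expired.
  consume(code: string, now: number = Date.now()): AuthCodeRecord | null {
    const record = this.codes.get(code);
    if (!record) return null;
    this.codes.delete(code);
    return record.expiresAt > now ? record : null;
  }
}
```

A shared backend would need to make the delete-on-read step atomic so that two replicas cannot both redeem the same code.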
## Non-goals

* **Not refresh-token compatible.** The OAuth ceremony is one-time per credential. Adding a refresh-token grant requires decoupling the access token from the API key (separate token store + lifetime).
* **Not OIDC.** No ID tokens, no userinfo endpoint, no nonce. The MCP spec doesn’t ask for them, and the access-token-IS-the-API-key model doesn’t need them.
* **Not a replacement for static API keys.** If your client can hold a static `Authorization: Bearer ` header (CLI agents, server-side runtimes), keep doing that. For local dev, `Authorization: Bearer dev` works against `ggui serve --dev-allow-all`. OAuth exists for hosts that need DCR + browser-mediated approval.

# Ops MCP route

> MCP tools on the /ops route — operator actions (apps, orgs, connector keys, coupons, blueprints, provider keys, credits) exposed for operator agents.

The `/ops` route surfaces operator-class MCP tools — the same actions the [console UI](/clients/console/) (coming soon) will expose to a human (create an app, rename it, mint a connector key, redeem a coupon, …), exposed as MCP tools so an LLM acting as an operator agent can perform them on the user’s behalf.

This page is the wire reference for the 24 ops-audience handlers across seven domains. The agent-loop surface (handshake / render / consume / …) lives on [`/mcp`](/api/mcp-protocol/) and is documented separately — `/ops` is a strictly disjoint route with no overlap.

## What’s on /ops

`/ops` is the destination for an **operator agent** — an LLM acting as the console’s hands. Typical caller: a Claude conversation that the user opens from `console.ggui.ai` (coming soon) and gives natural-language instructions like “create a new app called Inbox Triage and lock a connector key to it.” The agent calls `ggui_ops_create_app` followed by `ggui_ops_issue_connector_key`, never touching the AppSync GraphQL layer directly.
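The two-call sequence from that example can be sketched as a pair of JSON-RPC `tools/call` envelopes, the same request shape the `/mcp` curl walkthrough uses. Nothing is sent over the network here. `toolCall` is a hypothetical helper, and the `appId` argument on `ggui_ops_issue_connector_key` is an assumption, since that tool's input schema is not covered in this part of the reference.

```typescript
interface ToolCallRequest {
  readonly jsonrpc: "2.0";
  readonly id: number;
  readonly method: "tools/call";
  readonly params: { readonly name: string; readonly arguments: Record<string, unknown> };
}

// Hypothetical helper: wrap a tool name and its arguments in a JSON-RPC envelope.
function toolCall(id: number, name: string, args: Record<string, unknown>): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Step 1: create the app. `displayName` is the documented input of ggui_ops_create_app.
const createApp = toolCall(1, "ggui_ops_create_app", { displayName: "Inbox Triage" });

// Step 2: issue a connector key locked to that app.
// ASSUMPTION: the `appId` argument name is hypothetical; the placeholder value
// stands in for the `appId` returned by step 1.
const issueKey = toolCall(2, "ggui_ops_issue_connector_key", { appId: "APP_ID_FROM_STEP_1" });

console.log(JSON.stringify([createApp, issueKey], null, 2));
```

Each envelope would be POSTed to the `/ops` route with the same bearer the agent presents on `/mcp`.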
Every tool here mirrors a UI action the console (coming soon) will expose. The handler files in `@ggui-ai/mcp-server-handlers` are pure over typed seams (`AppsSource`, `OrgsSource`, `OrgInvitesSource`, `ConnectorKeysSource`, `CouponRedeemSource`) — the cloud pod binds AppSync-backed adapters; OSS deployments leave the seams unwired and the surface stays narrow.

## Endpoint

```plaintext
POST http://127.0.0.1:6781/ops
```

Self-hosted servers register tools on `/ops` when the operator seams are wired into `createGguiServer({opsApps, opsOrgs, opsConnectorKeys, opsCoupon})` (the ops-blueprint family additionally hangs off the `opsBlueprint` dep bundle on `defaultHandlers`). With nothing wired, the route still mounts but `tools/list` rejects with JSON-RPC `Method not found` — no tools capability is advertised when zero handlers are registered. Hosted ggui (coming soon) will serve the same route at `https://mcp.ggui.ai/ops`.

## Authentication

Identical to [`/mcp`](/api/mcp-protocol/#authentication) — bearer-token auth via the same upstream `AuthAdapter`:

```plaintext
Authorization: Bearer dev
Content-Type: application/json
```

Self-hosted: with `ggui serve --dev-allow-all`, any bearer authenticates as the `builder` identity; default serve requires a pairing-minted bearer. Hosted ggui (coming soon) will run the OAuth 2.0 Dynamic Client Registration ceremony (see [OAuth on mcp.ggui.ai](/api/oauth/)). The bearer presented on `/ops` is the same bearer presented on `/mcp` — there is no separate “ops token”.

## Identity model

Every handler resolves the calling identity through a single helper:

```typescript
function resolveOwnerSub(toolName: string, ctx: HandlerContext): string {
  const sub = ctx.userId ?? ctx.appId;
  if (!sub) throw new Error(`${toolName}: missing caller identity`);
  return sub;
}
```

* **Hosted (multi-tenant):** `ctx.userId` is the caller’s Cognito sub, populated by the upstream auth adapter.
* **OSS (single-tenant):** `ctx.userId` is undefined; `ctx.appId` (resolved by the auth adapter via `defaultAppIdFromIdentity` — typically `workspaceId ?? userId` for kind=user identities) serves as the identity.
* **Neither set:** the handler throws — that means an unauthenticated caller slipped past auth, surfaced as a 5xx rather than masked as an empty list.

### Tenancy posture

Every read and write is scoped by the resolved identity at the seam layer (`AppsSource.list(ownerSub)` returns only the caller’s rows; `AppsSource.get` returns `null` for foreign rows). Cross-tenant probes never reveal whether a given id exists on another user’s account — the handlers translate “row exists but you don’t own it” to the same shape as “no such row”:

| Operation | Cross-tenant probe |
| --- | --- |
| `ggui_ops_list_apps` | Returns only caller’s rows; foreign rows invisible. |
| `ggui_ops_rename_app` / `ggui_ops_set_default_app` / `ggui_ops_update_app_system_prompt` | Throws `app_not_found` — same as a genuinely missing id. |
| `ggui_ops_delete_app` | Returns `{deleted: true}` without touching the foreign row. Uniform with “row didn’t exist.” |
| `ggui_ops_invite_to_org` / `ggui_ops_revoke_invite` | Throws `org_invite_access_denied` for orgs the caller doesn’t administer. |
| `ggui_ops_revoke_connector_key` | Throws `connector_key_access_denied` for keys owned by other users. |
| `ggui_ops_redeem_coupon` | Throws `coupon_access_denied` for `targetOrgId` orgs the caller isn’t a member of. |

### One-time secret reveal

**Plaintext keys appear exactly once.** `ggui_ops_issue_connector_key` returns the plaintext `ggui_user_*` secret on its result — this is the **only call** that ever surfaces it. The adapter persists `sha256(plaintext)` hex plus the first \~8 plaintext characters (`apiKeyPrefix`); the plaintext is not stored anywhere. Subsequent `ggui_ops_list_connector_keys` responses carry the prefix and the metadata but never the full secret.

The MCP caller (Claude Desktop conversation, console) is responsible for surfacing the plaintext to the user immediately. There is no recovery if it’s lost — the user must revoke and reissue.

***

## Tools by domain

Seven domains, 24 handlers total. Each domain is optional: the four console-style domains (apps / orgs / connector keys / coupons) hang off `CreateGguiServerOptions`; ops-blueprint hangs off the `opsBlueprint` dep bundle on `defaultHandlers`; provider-keys and credits are bound by the hosted cloud pod (coming soon). Leaving a domain unwired removes its tools from `tools/list` at registration time.

## Apps (`ops-apps`, 6 handlers)

Operator actions on `GguiApp` rows — the rows the universal MCP route resolves per-request to scope sessions. Each row carries `appId` (server-minted base62), `displayName`, optional `systemPrompt` override, `createdAt`, `updatedAt`. Bound on the cloud pod via the AppSync `provisionGguiApp` mutation + the `GguiApp` model.

### `ggui_ops_list_apps`

Enumerate every `GguiApp` row owned by the calling user. Returns metadata only — same data the console’s Apps section renders. Use to discover ids before calling the mutating tools.

**Inputs:** none.
**Returns:** `{ apps: AppRecord[] }`

```typescript
interface AppRecord {
  readonly appId: string;
  readonly displayName: string;
  readonly systemPrompt?: string;
  readonly createdAt: string;
  readonly updatedAt: string;
}
```

**Tenancy:** scope is `ownerSub` from the bearer token. Cross-user listings are impossible by construction.

### `ggui_ops_create_app`

Provision a fresh `GguiApp` owned by the calling user. Wraps the cloud’s `provisionGguiApp` mutation — opaque base62 `appId` is minted server-side; argument-supplied `appId` is NEVER honored (tenant-takeover vector).

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `displayName` | `string` (1–120 chars) | No | Human-friendly label. Defaults to `'My ggui app'` when absent — matches the auto-create path in `useGguiUser`. |

**Returns:** the full `AppRecord` shape as above.

**Follow-up:** call `ggui_ops_set_default_app({appId})` to promote the new app to the user’s default.

### `ggui_ops_rename_app`

Update an existing app’s `displayName`. The target app MUST be owned by the calling user.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `appId` | `string` | Yes | Target `GguiApp.appId`. Discover via `ggui_ops_list_apps`. |
| `displayName` | `string` (1–120 chars) | Yes | New display name. Cap matches the cloud provisioning Lambda. |

**Returns:** the updated `AppRecord`.

**Errors:**

| Code | When |
| --- | --- |
| `app_not_found` | The id doesn’t exist OR exists under another tenant (uniform shape — no existence leak). |

### `ggui_ops_delete_app`

Hard-delete an app owned by the calling user. Idempotent — a second delete of the same id resolves cleanly. The cloud adapter additionally cascades per-app keys / blueprints / sessions (orchestrated below the seam).

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `appId` | `string` | Yes | Target `GguiApp.appId`. |

**Returns:** `{ deleted: true }`

**Tenancy:** cross-tenant probes return the success shape without touching the foreign row. Uniform with “row didn’t exist.”

### `ggui_ops_set_default_app`

Set the calling user’s `GguiUser.defaultAppId` — the universal MCP route resolves this on every request to scope the session. The handler first verifies the caller owns the target `appId` before writing `User.defaultAppId`.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `appId` | `string` | Yes | Target app — must be owned by the caller. |

**Returns:** `{ defaultAppId: string }`

**Errors:**

| Code | When |
| --- | --- |
| `app_not_found` | Target `appId` doesn’t exist OR is owned by another tenant. |

### `ggui_ops_update_app_system_prompt`

Set or clear the per-app system-prompt override. Empty-string input clears the field — the pod’s per-app system-prompt resolution then falls back to the universal default.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `appId` | `string` | Yes | Target `GguiApp.appId`. |
| `systemPrompt` | `string` (≤10,000 chars) | Yes | Replacement text. Pass `""` to clear the override. Cap bounds the response payload and matches a reasonable agent-authored prompt length. |

**Returns:** the updated `AppRecord` (with `systemPrompt` omitted when cleared).

**Errors:**

| Code | When |
| --- | --- |
| `app_not_found` | Target `appId` doesn’t exist OR is owned by another tenant. |

***

## Orgs (`ops-orgs`, 4 handlers)

Operator actions on `GguiOrg` + `GguiOrgMember` + `GguiOrgInvite` rows. Orgs are the unit of multi-user collaboration; each row carries `orgId` (ULID), `name`, `ownerUserId`, plus per-membership role on the join rows. Bound on the cloud pod via the `provisionGguiOrg` / `fetchMyOrgs` / `issueOrgInvite` / `revokeOrgInvite` AppSync mutations.

### `ggui_ops_list_orgs`

Enumerate every org the calling user belongs to — owner + admin + member memberships in a single list, each row carrying the caller’s role.

**Inputs:** none.

**Returns:** `{ orgs: OrgMembershipRecord[] }`

```typescript
interface OrgMembershipRecord {
  readonly orgId: string;
  readonly name: string;
  readonly ownerUserId: string;
  readonly role: "owner" | "admin" | "member";
  readonly joinedAt: string;
}
```

Mirrors the AppSync `fetchMyOrgs` custom resolver. Use to discover `orgId` before calling the invite tools.

### `ggui_ops_create_org`

Provision a fresh `GguiOrg` owned by the calling user. Wraps the cloud’s `provisionGguiOrg` mutation — ULID `orgId` minted server-side; an owner membership row and a zero-balance credit row are inserted atomically via TransactWrite.
| Field | Type | Required | Description | | ------ | ---------------------- | -------- | ------------------------------------------------------------------------------------ | | `name` | `string` (1–120 chars) | Yes | Human-friendly display name. Required (no default — orgs are intentional creations). | **Returns:** ```typescript interface CreateOrgOutput { readonly orgId: string; readonly name: string; readonly ownerUserId: string; readonly createdAt: string; readonly updatedAt: string; } ``` ### `ggui_ops_invite_to_org` [Section titled “ggui\_ops\_invite\_to\_org”](#ggui_ops_invite_to_org) Issue an `admin`- or `member`-role invite to a `GguiOrg` the caller can administer. The invite link in the recipient’s email points at the console (coming soon): `console.ggui.ai/invites/`. | Field | Type | Required | Description | | ------- | --------------------- | -------- | --------------------------------------------------------------------------------------------------------------------- | | `orgId` | `string` | Yes | Target org — caller must own or administer it. Discover via `ggui_ops_list_orgs`. | | `email` | `string` (RFC 5322) | Yes | Recipient email — the invite link is sent here. | | `role` | `'admin' \| 'member'` | Yes | Role the recipient holds once they accept. Owner can’t be granted via invite — ownership transfer is a separate flow. | **Returns:** ```typescript interface InviteToOrgOutput { readonly inviteId: string; readonly orgId: string; readonly email: string; readonly role: "admin" | "member"; readonly inviterUserId: string; readonly status: "pending" | "accepted" | "revoked" | "expired"; readonly expiresAt: string; readonly createdAt: string; readonly reused: boolean; } ``` **Anti-double-issue:** an existing pending invite for the same `(orgId, email)` is reused — no new row, no second email. `reused: true` flags the dedup. 
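The dedup rule above can be sketched with an in-memory store. This is a minimal illustration under stated assumptions, not the cloud adapter: `SketchInviteStore`, `SketchInvite`, and the `inv_` id format are hypothetical, emails are compared exactly as given, and a reused invite keeps its original role.

```typescript
// Hypothetical in-memory sketch of the anti-double-issue rule.
// None of these names come from the ggui packages.
type InviteRole = "admin" | "member";
type InviteStatus = "pending" | "accepted" | "revoked" | "expired";

interface SketchInvite {
  readonly inviteId: string;
  readonly orgId: string;
  readonly email: string;
  readonly role: InviteRole;
  readonly status: InviteStatus;
}

class SketchInviteStore {
  private readonly invites: SketchInvite[] = [];
  private nextId = 1;

  issue(orgId: string, email: string, role: InviteRole): SketchInvite & { reused: boolean } {
    // Reuse an existing pending invite for the same (orgId, email) pair.
    const existing = this.invites.find(
      (i) => i.orgId === orgId && i.email === email && i.status === "pending"
    );
    if (existing) {
      // Assumption: the existing row is returned unchanged, including its role.
      return { ...existing, reused: true };
    }
    const invite: SketchInvite = {
      inviteId: `inv_${this.nextId++}`, // placeholder id format
      orgId,
      email,
      role,
      status: "pending",
    };
    this.invites.push(invite);
    return { ...invite, reused: false };
  }
}
```

A caller can branch on `reused` to tell the user that an invite was already pending rather than announcing a second email.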
**Errors:** | Code | When | | -------------------------- | -------------------------------------------- | | `org_invite_access_denied` | Caller is not owner/admin of the target org. | ### `ggui_ops_revoke_invite` [Section titled “ggui\_ops\_revoke\_invite”](#ggui_ops_revoke_invite) Invalidate a pending org invite — the bearer-secret link in the recipient’s email stops working immediately. | Field | Type | Required | Description | | ---------- | -------- | -------- | ---------------------------------------------------------------- | | `inviteId` | `string` | Yes | Target invite — must belong to an org the caller can administer. | **Returns:** ```typescript interface RevokeInviteOutput { readonly inviteId: string; readonly status: "pending" | "accepted" | "revoked" | "expired"; readonly alreadyRevoked: boolean; } ``` **Concurrency:** the adapter flips `status` from `pending` → `revoked` via a CAS `ConditionExpression`. A racing accept surfaces a clear conflict instead of silently overwriting. Already-revoked invites return `alreadyRevoked: true`; already-accepted invites reject. **Errors:** | Code | When | | -------------------------- | ------------------------------------------------------ | | `org_invite_access_denied` | Caller is not owner/admin of the invite’s org. | | `org_invite_not_found` | The id doesn’t exist OR isn’t reachable by the caller. | *** ## Connector keys (`ops-connector-keys`, 3 handlers) [Section titled “Connector keys (ops-connector-keys, 3 handlers)”](#connector-keys-ops-connector-keys-3-handlers) Operator actions on `GguiUserApiKey` rows — the user-facing `ggui_user_*` API key strings that Claude Desktop (and other Connectors) present to call the MCP routes on the user’s behalf. Bound on the cloud pod via the `issueGguiUserApiKey` AppSync mutation + the `apiKeysByUserId` GSI + raw DDB `UpdateItem` for revoke. 
### `ggui_ops_list_connector_keys` [Section titled “ggui\_ops\_list\_connector\_keys”](#ggui_ops_list_connector_keys) Read the calling user’s `ggui_user_*` connector keys. **Metadata only — NEVER plaintext.** **Inputs:** none. **Returns:** `{ keys: ConnectorKeySummary[] }` ```typescript interface ConnectorKeySummary { readonly id: string; // stable id for revoke readonly apiKeyPrefix: string; // first ~8 chars of the secret (human re-identification) readonly name?: string; // user-supplied label readonly appId?: string; // optional FK — when set the key locks to that app readonly status: "active" | "revoked"; readonly createdAt: string; readonly lastUsedAt?: string; // from the last successful auth lookup readonly expiresAt?: string; // past timestamp ⇒ adapter rejects auth } ``` The hash itself is never returned on any tool. ### `ggui_ops_issue_connector_key` [Section titled “ggui\_ops\_issue\_connector\_key”](#ggui_ops_issue_connector_key) Mint a fresh `ggui_user_*` connector key. | Field | Type | Required | Description | | ----------- | ---------------------- | -------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `name` | `string` (1–120 chars) | No | Optional label, e.g. `'MacBook Claude Desktop'`. Surfaces on `ggui_ops_list_connector_keys`. | | `appId` | `string` | No | Lock the key to one app. When set, sessions opened with this key scope to the named app and meta-tools (`ggui_ops_open_app`, `ggui_ops_list_apps`) are NOT exposed. Absent ⇒ universal key (scopes to `User.defaultAppId` per request). | | `expiresAt` | `string` (ISO 8601) | No | Optional expiry. Past timestamps reject auth from the start. 
| **Returns:** ```typescript interface IssueConnectorKeyOutput { // metadata — same shape as a list row readonly id: string; readonly apiKeyPrefix: string; readonly name?: string; readonly appId?: string; readonly status: "active" | "revoked"; readonly createdAt: string; readonly lastUsedAt?: string; readonly expiresAt?: string; // ONE-TIME REVEAL — never returned again readonly plaintextKey: string; } ``` **One-time reveal:** `plaintextKey` is the `ggui_user_` secret. It appears on this response and never again — the adapter persists `sha256(plaintextKey)` hex plus `apiKeyPrefix`, and the plaintext is not stored. The caller MUST surface it to the user immediately. ### `ggui_ops_revoke_connector_key` [Section titled “ggui\_ops\_revoke\_connector\_key”](#ggui_ops_revoke_connector_key) Soft-revoke a `GguiUserApiKey` row. The adapter sets `status='revoked'`; the auth path rejects revoked keys regardless of hash match. Rows are kept for audit (age-based sweep handles cleanup). | Field | Type | Required | Description | | ------- | -------- | -------- | ------------------------------------------------------------------------------------------ | | `keyId` | `string` | Yes | Stable id of the row (NOT the secret string). Discover via `ggui_ops_list_connector_keys`. | **Returns:** ```typescript interface RevokeConnectorKeyOutput { readonly id: string; readonly status: "active" | "revoked"; readonly alreadyRevoked: boolean; } ``` **Errors:** | Code | When | | ----------------------------- | ------------------------------------ | | `connector_key_access_denied` | The key belongs to another user. | | `connector_key_not_found` | No such key reachable by the caller. | Idempotent — re-revoking returns `alreadyRevoked: true`. *** ## Coupons (`ops-coupon`, 1 handler) [Section titled “Coupons (ops-coupon, 1 handler)”](#coupons-ops-coupon-1-handler) Operator action on `GguiCoupon` rows — bearer-secret promo codes that credit user or org wallets.
Bound on the cloud pod via the `redeemCoupon` AppSync mutation. ### `ggui_ops_redeem_coupon` [Section titled “ggui\_ops\_redeem\_coupon”](#ggui_ops_redeem_coupon) Redeem a `cpn_*` coupon code, crediting the caller’s wallet (default) or a target org’s wallet. The adapter runs an atomic three-leg `TransactWrite`: 1. Flip `GguiCoupon.status` from `issued` → `activated`. 2. Credit the wallet (user or org). 3. Insert a ledger row. Failure of any leg rolls all back — no half-credit, no double-spend. | Field | Type | Required | Description | | ------------- | -------- | -------- | --------------------------------------------------------------------------------------------------------------------- | | `couponCode` | `string` | Yes | The bearer-secret code in format `cpn_<8 chars>`. One-time redemption. | | `targetOrgId` | `string` | No | When set, credits the named org’s wallet instead of the caller’s personal wallet. Caller MUST be a member of the org. | **Returns:** ```typescript interface RedeemCouponOutput { readonly couponCode: string; readonly creditCents: number; readonly redeemedByPrincipalType: "user" | "org"; readonly redeemedByPrincipalId: string; readonly activatedAt: string; } ``` **Errors:** | Code | When | | ------------------------- | ---------------------------------------------------------------------- | | `coupon_not_found` | The code doesn’t exist. | | `coupon_already_redeemed` | The code was previously activated (one-time semantics). | | `coupon_expired` | The code is past its expiry. | | `coupon_access_denied` | `targetOrgId` was provided but the caller is not a member of that org. | *** ## Blueprints (`ops-blueprint`, 5 handlers) [Section titled “Blueprints (ops-blueprint, 5 handlers)”](#blueprints-ops-blueprint-5-handlers) Operator blueprint authorship — generate, register, list, update, delete cached blueprints for the calling app. 
Unlike the four console-style domains, this family registers on the OSS server via the `opsBlueprint` dep bundle on `defaultHandlers` (registry + blueprint store + search; `generate` additionally requires the `resolveLlm` + `blueprints` deps the render generation path reads). ### `ggui_ops_generate_blueprint` [Section titled “ggui\_ops\_generate\_blueprint”](#ggui_ops_generate_blueprint) Author a blueprint via the bound generator (LLM generation + validation). | Field | Type | Required | Description | | ---------------------- | --------- | -------- | -------------------------------------------------------------- | | `contract` | `object` | Yes | The `DataContract` to generate against. | | `generator` | `string` | No | Generator slug. Unknown slug fails with `generator_not_found`. | | `persona` | `string` | No | Variance axis — normalized lowercase + trimmed. | | `aesthetic` | `string` | No | Variance axis. | | `context` | `string` | No | Variance axis. | | `seedPrompt` | `string` | No | Variance axis. | | `setAsOperatorDefault` | `boolean` | No | Promote the result to the operator default for its contract. | **Returns:** `{ blueprintId, codeHash?, validatorScore?, source }` — `validatorScore` (0–1) only on the advanced generator path; `source` is the stamped provenance `{ kind: 'llm', generator, model }` from the engine’s own metadata stamp. **Errors:** `generator_not_found`; `missing_credentials` (BYOK fix: `ggui_ops_set_provider_key`); generation failure. ### `ggui_ops_register_blueprint` [Section titled “ggui\_ops\_register\_blueprint”](#ggui_ops_register_blueprint) Register pre-built component code verbatim — no LLM, no validator. Operator entry point for fixture seeding and export/reimport round-trips. | Field | Type | Required | Description | | --------------- | -------- | -------- | ----------------------------------------------------- | | `contract` | `object` | Yes | The `DataContract` the code implements. 
| | `componentCode` | `string` | Yes | The component code to register verbatim (min 1 char). | Plus the same optional `generator` / `persona` / `aesthetic` / `context` / `seedPrompt` / `setAsOperatorDefault` fields as `ggui_ops_generate_blueprint`. **Returns:** `{ blueprintId, codeHash, source }` — `source` is always `{ kind: 'user' }`; hand-supplied bytes carry no engine claim, so none is recorded. ### `ggui_ops_list_blueprints` [Section titled “ggui\_ops\_list\_blueprints”](#ggui_ops_list_blueprints) | Field | Type | Required | Description | | ---------------- | ---------- | -------- | ----------------------------------------------------- | | `contractHash` | `string` | No | Filter by canonical contract hash. | | `generator` | `string` | No | Filter by generator slug. | | `persona` | `string` | No | Dispatches semantic search. | | `intentKeywords` | `string[]` | No | Dispatches semantic search. Filters are AND-composed. | **Returns:** `{ blueprints: Blueprint[] }` ### `ggui_ops_update_blueprint` [Section titled “ggui\_ops\_update\_blueprint”](#ggui_ops_update_blueprint) | Field | Type | Required | Description | | ------------------- | -------------- | -------- | ----------------------------------------------------------------- | | `blueprintId` | `string` | Yes | Target blueprint. | | `isOperatorDefault` | `literal true` | No | Promote to operator default. | | `variance` | `object` | No | Partial-merge of variance axes; `{persona: ""}` clears the field. | **Returns:** `{ blueprintId, updatedAt }` ### `ggui_ops_delete_blueprint` [Section titled “ggui\_ops\_delete\_blueprint”](#ggui_ops_delete_blueprint) | Field | Type | Required | Description | | ------------- | -------- | -------- | ----------------- | | `blueprintId` | `string` | Yes | Target blueprint. | **Returns:** `{ deleted: true }` — idempotent. 
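The partial-merge behavior of the `variance` field on `ggui_ops_update_blueprint` can be sketched as a small pure function. This is an illustration only: `mergeVariance` and the `Variance` type are hypothetical names, and the sketch assumes that an empty string clears any axis (the page states this for `persona`) and that only `persona` is lowercased and trimmed.

```typescript
// Hypothetical sketch of partial-merge semantics for variance axes.
type VarianceAxis = "persona" | "aesthetic" | "context" | "seedPrompt";
type Variance = Partial<Record<VarianceAxis, string>>;

function mergeVariance(current: Variance, patch: Variance): Variance {
  const next: Variance = { ...current };
  for (const key of Object.keys(patch) as VarianceAxis[]) {
    const raw = patch[key];
    if (raw === undefined) continue; // axis not mentioned in the patch: keep as-is
    // persona is documented as normalized (lowercase + trimmed); other axes pass through.
    const value = key === "persona" ? raw.trim().toLowerCase() : raw;
    if (value === "") {
      delete next[key]; // empty string clears the axis
    } else {
      next[key] = value;
    }
  }
  return next;
}
```

Axes absent from the patch are left untouched, so a caller can clear `persona` without resending `aesthetic` or `context`.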
*** ## Provider keys (`provider-keys`, 3 handlers) — BYOK [Section titled “Provider keys (provider-keys, 3 handlers) — BYOK”](#provider-keys-provider-keys-3-handlers--byok) Operator actions on the caller’s BYOK LLM provider keys. Provider enum: `'anthropic' | 'openai' | 'google' | 'openrouter'`. The handler factories ship in `@ggui-ai/mcp-server-handlers`; they are bound today by the hosted cloud pod (coming soon), which validates keys against the provider and encrypts at rest. ### `ggui_ops_set_provider_key` [Section titled “ggui\_ops\_set\_provider\_key”](#ggui_ops_set_provider_key) | Field | Type | Required | Description | | -------------- | -------- | -------- | -------------------------------------------------------------- | | `provider` | `enum` | Yes | One of `anthropic` / `openai` / `google` / `openrouter`. | | `plaintextKey` | `string` | Yes | The provider API key (min 1 char). Re-set replaces (rotation). | | `label` | `string` | No | Human label. | **Returns:** `{ provider, label?, lastFour, createdAt?, lastUsedAt? }` — never echoes the key. ### `ggui_ops_list_provider_keys` [Section titled “ggui\_ops\_list\_provider\_keys”](#ggui_ops_list_provider_keys) **Inputs:** none. **Returns:** `{ keys: [{ provider, label?, lastFour, createdAt?, lastUsedAt? }] }` ### `ggui_ops_remove_provider_key` [Section titled “ggui\_ops\_remove\_provider\_key”](#ggui_ops_remove_provider_key) | Field | Type | Required | Description | | ---------- | ------ | -------- | ------------------- | | `provider` | `enum` | Yes | Provider to remove. | **Returns:** `{ deleted, provider }` *** ## Credits (`credits`, 2 handlers) [Section titled “Credits (credits, 2 handlers)”](#credits-credits-2-handlers) Read-only views over the caller’s prepaid credit wallet. Bound by the hosted cloud pod (coming soon); self-hosted deployments have no credit plane. ### `ggui_ops_get_credit_balance` [Section titled “ggui\_ops\_get\_credit\_balance”](#ggui_ops_get_credit_balance) **Inputs:** none. 
**Returns:** `{ balanceCents, lifetimeGrantedCents, lifetimeSpentCents, updatedAt }` ### `ggui_ops_list_credit_transactions` [Section titled “ggui\_ops\_list\_credit\_transactions”](#ggui_ops_list_credit_transactions) | Field | Type | Required | Description | | -------- | -------- | -------- | ------------------ | | `limit` | `number` | No | 1–100, default 20. | | `cursor` | `string` | No | Pagination cursor. | **Returns:** `{ transactions: [{ transactionId, kind, deltaCents, balanceAfterCents, reason, createdAt, relatedSessionId? }], nextCursor? }` — `kind` is one of `free_credit` / `render_charge` / `topup` / `refund`. *** ## OSS vs hosted [Section titled “OSS vs hosted”](#oss-vs-hosted) The four console-style domains are wired through optional fields on `CreateGguiServerOptions` (ops-blueprint hangs off the `opsBlueprint` dep bundle on `defaultHandlers`; provider-keys + credits are cloud-pod-bound): ```typescript interface CreateGguiServerOptions { readonly opsApps?: { readonly apps: AppsSource; readonly userDefaultApp: UserDefaultAppSource; }; readonly opsOrgs?: { readonly orgs: OrgsSource; readonly invites: OrgInvitesSource; }; readonly opsConnectorKeys?: { readonly connectorKeys: ConnectorKeysSource; }; readonly opsCoupon?: { readonly coupons: CouponRedeemSource; }; } ``` * **Hosted (`mcp.ggui.ai`, coming soon):** the cloud pod binds all four — AppSync-backed adapters wrap the corresponding mutations — plus the provider-keys and credits families. The full ops surface is registered on `/ops`. * **OSS (`ggui serve`):** every field is `undefined` by default. The route still mounts but `tools/list` rejects with `Method not found` — no tools capability is advertised when zero handlers are registered. Operator tools only make sense alongside a data model to operate on; the ops-blueprint family is the one most self-hosters wire (via the `opsBlueprint` dep bundle). * **Partial wiring:** omit individual fields to drop their tools. 
A self-hosted deployment with its own `AppsSource` can register `ggui_ops_*_app` only and leave orgs / connector keys / coupons unwired. The seam interfaces (`AppsSource`, `OrgsSource`, `OrgInvitesSource`, `ConnectorKeysSource`, `CouponRedeemSource`) are exported from `@ggui-ai/mcp-server-handlers` — implementing them against your own backend is the integration path for downstream forks. ## Console parity [Section titled “Console parity”](#console-parity) The [console UI](/clients/console/) (coming soon) will mirror these tools 1:1 — every tool corresponds to one UI action: | Tool | Console surface | | ----------------------------------- | ----------------------------------------------------- | | `ggui_ops_list_apps` | Apps section — main list. | | `ggui_ops_create_app` | Apps section — “New app” button. | | `ggui_ops_rename_app` | Apps section — inline rename. | | `ggui_ops_delete_app` | Apps section — row menu → Delete. | | `ggui_ops_set_default_app` | Apps section — “Set as default” toggle. | | `ggui_ops_update_app_system_prompt` | Apps section → System Prompt editor. | | `ggui_ops_list_orgs` | Orgs section — main list. | | `ggui_ops_create_org` | Orgs section — “New org” button. | | `ggui_ops_invite_to_org` | Orgs section → Members → Invite. | | `ggui_ops_revoke_invite` | Orgs section → Members → pending invite row → Revoke. | | `ggui_ops_list_connector_keys` | Account → Connector Keys list. | | `ggui_ops_issue_connector_key` | Account → Connector Keys → “Issue new key”. | | `ggui_ops_revoke_connector_key` | Account → Connector Keys → row menu → Revoke. | | `ggui_ops_redeem_coupon` | Billing → Redeem coupon. | The MCP surface and the UI surface are siblings over the same seam — they call the same `AppsSource.create`, the same `OrgInvitesSource.issue`, etc. There’s no privileged path on either side. 
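For self-hosters wiring their own backend, the tenancy rules described for the app tools can be sketched in memory. This is a hedged illustration, not the real seam: the actual `AppsSource` signature is not shown on this page, so `InMemoryApps`, `SketchAppRecord`, `SketchAppNotFound`, and the counter-based `app_` ids are all hypothetical.

```typescript
// Hypothetical in-memory store illustrating the documented tenancy behavior.
interface SketchAppRecord {
  readonly appId: string;
  readonly displayName: string;
  readonly createdAt: string;
  readonly updatedAt: string;
}

class SketchAppNotFound extends Error {
  readonly code = "app_not_found";
}

class InMemoryApps {
  private readonly rows = new Map<string, { ownerSub: string; record: SketchAppRecord }>();
  private counter = 0;

  create(ownerSub: string, displayName: string = "My ggui app"): SketchAppRecord {
    if (displayName.length < 1 || displayName.length > 120) {
      throw new Error("displayName must be 1-120 chars");
    }
    const now = new Date().toISOString();
    // Placeholder id; the page describes an opaque base62 id minted server-side.
    const record: SketchAppRecord = {
      appId: `app_${++this.counter}`,
      displayName,
      createdAt: now,
      updatedAt: now,
    };
    this.rows.set(record.appId, { ownerSub, record });
    return record;
  }

  list(ownerSub: string): SketchAppRecord[] {
    return Array.from(this.rows.values())
      .filter((row) => row.ownerSub === ownerSub)
      .map((row) => row.record);
  }

  rename(ownerSub: string, appId: string, displayName: string): SketchAppRecord {
    const row = this.rows.get(appId);
    // Same error for a missing id and a foreign id: no existence leak.
    if (!row || row.ownerSub !== ownerSub) throw new SketchAppNotFound("app_not_found");
    const record = { ...row.record, displayName, updatedAt: new Date().toISOString() };
    this.rows.set(appId, { ownerSub, record });
    return record;
  }

  delete(ownerSub: string, appId: string): { deleted: true } {
    const row = this.rows.get(appId);
    // Only the owner's row is removed; foreign and missing ids get the same success shape.
    if (row && row.ownerSub === ownerSub) this.rows.delete(appId);
    return { deleted: true };
  }
}
```

Because both an MCP handler and a UI action would call the same methods, the ownership check lives in one place rather than in each caller.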
*** ## Example: curl [Section titled “Example: curl”](#example-curl) This walkthrough targets a self-hosted server with the ops seams wired (started with `--dev-allow-all` for the `Bearer dev` shortcut). On hosted ggui (coming soon), the same calls will go to `https://mcp.ggui.ai/ops` with an OAuth bearer. ```bash # 1. Initialize curl -X POST http://127.0.0.1:6781/ops \ -H "Authorization: Bearer dev" \ -H "Content-Type: application/json" \ -d '{"jsonrpc":"2.0","id":1,"method":"initialize","params":{"protocolVersion":"2025-06-18","clientInfo":{"name":"curl","version":"1.0"},"capabilities":{}}}' # 2. Enumerate the caller's apps curl -X POST http://127.0.0.1:6781/ops \ -H "Authorization: Bearer dev" \ -H "Content-Type: application/json" \ -d '{"jsonrpc":"2.0","id":2,"method":"tools/call","params":{"name":"ggui_ops_list_apps","arguments":{}}}' # 3. Create a fresh app curl -X POST http://127.0.0.1:6781/ops \ -H "Authorization: Bearer dev" \ -H "Content-Type: application/json" \ -d '{"jsonrpc":"2.0","id":3,"method":"tools/call","params":{"name":"ggui_ops_create_app","arguments":{"displayName":"Inbox Triage"}}}' # 4. Promote the new app to default (use the appId from step 3's response) curl -X POST http://127.0.0.1:6781/ops \ -H "Authorization: Bearer dev" \ -H "Content-Type: application/json" \ -d '{"jsonrpc":"2.0","id":4,"method":"tools/call","params":{"name":"ggui_ops_set_default_app","arguments":{"appId":""}}}' # 5. Issue a connector key locked to the new app # The response carries `plaintextKey` — surface it to the user immediately. 
curl -X POST http://127.0.0.1:6781/ops \ -H "Authorization: Bearer dev" \ -H "Content-Type: application/json" \ -d '{"jsonrpc":"2.0","id":5,"method":"tools/call","params":{"name":"ggui_ops_issue_connector_key","arguments":{"name":"MacBook Claude Desktop","appId":""}}}' ``` The same calls can be made through the [`@modelcontextprotocol/sdk`](https://github.com/modelcontextprotocol/typescript-sdk) client by pointing the transport at `/ops` instead of `/mcp` — the tool registration shapes are standard. *** ## See Also [Section titled “See Also”](#see-also) * [Console](/clients/console/) — the human-facing surface for the same actions (coming soon). * [Audience Routes](/architecture/audience-routes/) — the `agent` / `runtime` / `protocol` / `ops` tag model and how it projects to routes. * [MCP Protocol Reference](/api/mcp-protocol/) — the sibling agent-loop surface on `/mcp`. # Rate limits > How ggui surfaces rate limits — the in-band `rate_limited` tool error on `ggui_render`, the HTTP 429 + Retry-After contract, and how host SDKs handle backoff. ggui rate limiting is operator-configured: a `RateLimiter` seam the deployment wires (or doesn’t). This page covers the self-hosted defaults, the two enforcement layers and their wire shapes, and how to layer retry on top of whichever MCP host SDK you’re using. ## Self-hosted defaults [Section titled “Self-hosted defaults”](#self-hosted-defaults) * Default (strict) `ggui serve` wires **no** generation limiter — `ggui_render` is unlimited for paired callers. * `ggui serve --public-demo` binds a per-remote-IP fixed-window limiter to `ggui_render`: **30 generations / 10 minutes / IP** (operator-pays posture for public demos). * Library users wire their own `RateLimiter` into the render handler deps — the seam and the typed `RateLimitedError` live in `@ggui-ai/mcp-server-core`. 
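For library users wiring their own limiter, a fixed-window policy like the `--public-demo` default (30 per 10 minutes per key) can be sketched as follows. This is an illustration under assumptions: `FixedWindowLimiter` and `LimitDecision` are hypothetical names and do not claim to match the `RateLimiter` seam in `@ggui-ai/mcp-server-core`, and the sketch starts each window at the first request it sees for a key.

```typescript
// Hypothetical fixed-window limiter keyed by an arbitrary string (e.g. remote IP).
interface LimitDecision {
  readonly allowed: boolean;
  readonly retryAfterMs: number; // 0 when allowed
}

class FixedWindowLimiter {
  private readonly windows = new Map<string, { windowStart: number; count: number }>();
  private readonly limit: number;
  private readonly windowMs: number;
  private readonly now: () => number;

  constructor(limit = 30, windowMs = 10 * 60 * 1000, now: () => number = () => Date.now()) {
    this.limit = limit;
    this.windowMs = windowMs;
    this.now = now; // injectable clock keeps the sketch testable
  }

  check(key: string): LimitDecision {
    const t = this.now();
    const current = this.windows.get(key);
    // No window yet, or the previous window has fully elapsed: start a new one.
    if (!current || t - current.windowStart >= this.windowMs) {
      this.windows.set(key, { windowStart: t, count: 1 });
      return { allowed: true, retryAfterMs: 0 };
    }
    if (current.count < this.limit) {
      current.count += 1;
      return { allowed: true, retryAfterMs: 0 };
    }
    // Over the limit: report the time left until the window resets.
    return { allowed: false, retryAfterMs: current.windowStart + this.windowMs - t };
  }
}
```

A fixed window is simple and cheap per key, at the cost of allowing a burst at each window boundary; a sliding window or token bucket would smooth that out.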
## Two enforcement layers [Section titled “Two enforcement layers”](#two-enforcement-layers) ### Tool-level: `ggui_render` rejects in-band [Section titled “Tool-level: ggui\_render rejects in-band”](#tool-level-ggui_render-rejects-in-band) When a rate limiter is wired into the render handler, a limited `ggui_render` call rejects with an MCP **tool error** (an `isError` tool result), not an HTTP 429. The error carries the code `rate_limited` and the retry decision (`retryAfterMs`). Catch it in your agent loop like any other tool error and back off before re-calling. ### HTTP-level: 429 on auth/pairing endpoints [Section titled “HTTP-level: 429 on auth/pairing endpoints”](#http-level-429-on-authpairing-endpoints) The pairing/login routes enforce limits at the HTTP transport layer. Every limited request returns: | Field | Value | | -------------------- | ---------------------------------------------------------------------------------------------- | | HTTP status | `429` | | `Retry-After` header | Seconds before the next attempt is permitted. Optional — absent means use exponential backoff. | | Body | JSON `{ "error": { "code": "rate_limited", "message": "...", "retryAfter": <seconds> } }`. | `Retry-After` is the authoritative signal. When present, honor it verbatim — the server has already computed the appropriate wait. The `retryAfter` field in the body mirrors the header for convenience when only the body is observable (e.g. some transport wrappers). ## Retry is the host SDK’s job [Section titled “Retry is the host SDK’s job”](#retry-is-the-host-sdks-job) ggui has no first-party client SDK to wrap retries — your MCP host owns that loop. The pattern is the same regardless of host: catch the 429, read `Retry-After`, sleep, retry, cap attempts. For `ggui_render`, additionally check the tool result’s `isError` flag for the in-band `rate_limited` error and back off the same way.
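The in-band check for `ggui_render` can be sketched as a wrapper around any function that returns a tool result. This is a hedged illustration: `readRateLimit`, `callRenderWithBackoff`, and `SketchToolResult` are hypothetical names, and the sketch assumes the `rate_limited` code and `retryAfterMs` value appear somewhere in the result's text content, which this page does not specify.

```typescript
// Hypothetical helper: detect an in-band rate_limited tool error and back off.
interface SketchToolResult {
  isError?: boolean;
  content?: Array<{ type: string; text?: string }>;
}

function readRateLimit(result: SketchToolResult): { limited: boolean; retryAfterMs?: number } {
  if (!result.isError) return { limited: false };
  const text = (result.content ?? []).map((c) => c.text ?? "").join("\n");
  // Assumption: the error code is visible in the text content.
  if (!text.includes("rate_limited")) return { limited: false };
  // Assumption: retryAfterMs is serialized as `retryAfterMs: N` or `"retryAfterMs": N`.
  const match = /"?retryAfterMs"?\s*[:=]\s*(\d+)/.exec(text);
  return { limited: true, retryAfterMs: match ? Number(match[1]) : undefined };
}

async function callRenderWithBackoff(
  call: () => Promise<SketchToolResult>,
  opts: {
    maxRetries?: number;
    baseDelayMs?: number;
    maxDelayMs?: number;
    sleep?: (ms: number) => Promise<void>;
  } = {}
): Promise<SketchToolResult> {
  const maxRetries = opts.maxRetries ?? 3;
  const baseDelayMs = opts.baseDelayMs ?? 1000;
  const maxDelayMs = opts.maxDelayMs ?? 30000;
  const sleep = opts.sleep ?? ((ms: number) => new Promise<void>((r) => setTimeout(r, ms)));

  for (let attempt = 0; ; attempt++) {
    const result = await call();
    const { limited, retryAfterMs } = readRateLimit(result);
    // Return successes, non-rate-limit errors, and the final limited result as-is.
    if (!limited || attempt >= maxRetries) return result;
    // Prefer the server's retry decision; otherwise fall back to exponential backoff.
    await sleep(retryAfterMs ?? Math.min(baseDelayMs * 2 ** attempt, maxDelayMs));
  }
}
```

With the generic MCP client this would wrap a `ggui_render` call, for example `callRenderWithBackoff(() => client.callTool({ name: "ggui_render", arguments: args }))`, with a cast if the SDK's result type is stricter than `SketchToolResult`.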
### Claude Agent SDK [Section titled “Claude Agent SDK”](#claude-agent-sdk) The Claude Agent SDK’s `query()` already retries transient transport errors (including 429) using the standard Anthropic SDK retry config. You generally don’t need to do anything — bursts within the retry window never surface to your code. To tune, pass `maxRetries` through the SDK’s options. See [Examples → Claude Agent](/examples/claude-agent/) for a runnable scaffold. ### `@modelcontextprotocol/sdk` (generic MCP) [Section titled “@modelcontextprotocol/sdk (generic MCP)”](#modelcontextprotocolsdk-generic-mcp) The official MCP SDK throws on HTTP errors without retrying. Wrap `callTool` (or whichever method you invoke) yourself: ```typescript import { Client } from "@modelcontextprotocol/sdk/client/index.js"; import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js"; const client = new Client({ name: "my-agent", version: "1.0.0" }); await client.connect( new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), { requestInit: { headers: { Authorization: "Bearer dev" } }, }) ); async function callWithRetry<T>( fn: () => Promise<T>, { maxRetries = 3, baseDelayMs = 1000, maxDelayMs = 30000 } = {} ): Promise<T> { for (let attempt = 0; attempt <= maxRetries; attempt++) { try { return await fn(); } catch (err) { // The MCP SDK surfaces HTTP errors with status + headers attached. const status = (err as { status?: number }).status; if (status !== 429 || attempt === maxRetries) throw err; const retryAfter = Number((err as { headers?: Record<string, string> }).headers?.["retry-after"]) || undefined; const waitMs = retryAfter != null ? retryAfter * 1000 : Math.min(baseDelayMs * 2 ** attempt, maxDelayMs); await new Promise((r) => setTimeout(r, waitMs)); } } throw new Error("unreachable"); } const result = await callWithRetry(() => client.callTool({ name: "ggui_handshake", arguments: { /* ...
*/ }, }) ); ``` Tune `maxRetries` per workload: lower on interactive (user-blocking) paths so failures bubble up fast; raise on background batch paths where backoff is cheaper than re-queuing. Note that a rate-limited `ggui_render` on a `--public-demo` server does NOT throw an HTTP error — it resolves with `isError: true` and a `rate_limited` message; check the result before treating the call as a success. ## Raw HTTP [Section titled “Raw HTTP”](#raw-http) If you’re hitting the server directly without an MCP SDK, implement the same loop against `fetch`: 1. Read the `Retry-After` header on every 429. 2. If present, sleep that many seconds, then retry. 3. If absent, sleep `min(baseDelay * 2^attempt, maxDelay)`, then retry. 4. Cap retries (3–5 is reasonable for interactive workloads, more for batch). 5. Stop retrying on non-429 4xx (those won’t resolve with backoff). The [generic MCP example](/examples/generic-mcp/) walks through raw-HTTP usage end-to-end. ## See also [Section titled “See also”](#see-also) * [Examples → Claude Agent](/examples/claude-agent/) — runnable scaffold with the host SDK’s native retry. * [Cookbook → Error handling](/cookbook/error-handling/) — retry, surfacing, and dead-letter patterns. * [Troubleshooting](/troubleshooting/) — common error patterns. * [MCP Protocol](/api/mcp-protocol/) — full JSON-RPC method reference. # WebSocket Protocol > Live-channel wire format — the live session plane between a ggui client and your self-hosted ggui serve (hosted ggui coming soon). The live channel — the **live session plane** — runs over a WebSocket between a ggui server and a connected client. It carries agent `render` notifications, outbound `StreamEnvelope` deliveries, and canonical inbound `ActionEnvelope` user actions. 
| Deployment | URL | | -------------------------------- | ------------------------------------------------ | | Self-hosted (`ggui serve`) | `ws://127.0.0.1:6781/ws` (default; configurable) | | Hosted (`ggui.ai`) — coming soon | `wss://mcp.ggui.ai/ws` | ## Frame catalog [Section titled “Frame catalog”](#frame-catalog) | Direction | Type | Payload | Purpose | | --------------- | ----------------------- | ------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | Client → Server | `subscribe` | `SubscribePayload` | Bind the connection to a GguiSession. MUST be first. | | Client → Server | `action` | `ActionEnvelope` | Canonical inbound user action. | | Client → Server | `ping` | — | Heartbeat; server answers `pong`. | | Client → Server | `channel_subscribe` | channel name | Subscribe to a `streamSpec[*].source.tool` channel; server polls the tool. | | Client → Server | `channel_unsubscribe` | channel name | Cancel a `channel_subscribe` (idempotent). | | Client → Server | `host_context_observed` | host-context projection | Iframe echoes the MCP-Apps host context. | | Server → Client | `ack` | `AckPayload` | Acknowledges `subscribe`; seeds resume cursors. | | Server → Client | `pong` | — | Heartbeat response. | | Server → Client | `error` | `ErrorPayload` | Transport / auth / subscribe-level error. | | Server → Client | `render` | `RenderPayload {session, matchType?}` | The agent committed a new GguiSession. | | Server → Client | `props_update` | `{sessionId, props}` | `ggui_update` fan-out — full props replacement. | | Server → Client | `data` | `StreamEnvelope` | Outbound delivery on a declared `streamSpec` channel. Generation-pipeline progress also flows here as a `{type:'data'}` delivery on the reserved `_ggui:lifecycle` channel — there is no dedicated progress frame. 
| | Server → Client | `render_event` | `GguiSessionEvent` | Event-ledger replay when `subscribe.sinceSequence` is set. | | Server → Client | `drain_ack` | `DrainAckPayload` | `ggui_consume` drained an action; iframe cancels its claim timer. | | Server → Client | `channel_payload` | channel frame | `source.tool` result for a subscribed channel. | | Server → Client | `channel_error` | channel error frame | Channel subscribe rejected / poll failed / tool errored. | | Server → Client | `system` | system payload | System-level events (auth, credentials). | ## Client → Server [Section titled “Client → Server”](#client--server) ### `subscribe` [Section titled “subscribe”](#subscribe) Bind the connection to a GguiSession. MUST be the first message. ```json { "type": "subscribe", "payload": { "sessionId": "ses_abc123", "appId": "app_myapp", "wsToken": "btkn_…" } } ``` | Field | Required | Description | | ------------------- | -------- | --------------------------------------------------------------------------------------------------------------------------------------------- | | `sessionId` | Yes | The GguiSession to bind. | | `appId` | No | Tenancy scope — when present MUST match the session’s bound app; when omitted the server resolves the identity-default appId. | | `wsToken` | No | Short-TTL auth credential from the `_meta["ai.ggui/render"]` slice. Required unless the connection authenticated by bearer (see below). | | `fromSeq` | No | Per-channel stream cursor — replay outbound `StreamEnvelope`s with `seq > N` before the live tail begins (needs a `GguiSessionStreamBuffer`). | | `sinceSequence` | No | Event-ledger replay cursor — replays `GguiSessionEvent`s with `sequence > N` as `render_event` frames. Independent of `fromSeq`. | | `role` | No | `'user'` or `'agent'`. | | `supportedVersions` | No | Protocol-version handshake — see below. 
| `fromSeq` and `sinceSequence` are two independent replay cursors over two ledgers: per-channel stream replay vs the render-level event ledger. When both are set, `sinceSequence` events replay first. #### Authentication [Section titled “Authentication”](#authentication) The load-bearing credential is the `wsToken` minted at `ggui_render` and delivered on the `_meta["ai.ggui/render"]` slice. Clients thread it twice: as `?wsToken=` on the WebSocket upgrade URL AND inside `SubscribePayload.wsToken`. It is opaque, validated server-side against `sessionId` + `appId`, short-TTL, and reusable within its TTL for reconnects. On a successful wsToken-authed subscribe the server mints `AckPayload.sessionToken` — a longer-lived reconnect credential passed on the standard bearer path (`Authorization: Bearer ` or `?token=`) on later connections. ### `action` [Section titled “action”](#action) A canonical [`ActionEnvelope`](/protocol/envelopes/#actionenvelope) — flat, no nested blocks. ```json { "type": "action", "payload": { "sessionId": "ses_abc123", "type": "data:submit", "payload": { "action": "submit", "data": { "rating": 5 } }, "clientSeq": 1 } } ``` The `sessionId` identifies the render the action originated from; the server rejects envelopes whose `sessionId` doesn’t match the subscriber’s bound render. *** ## Server → Client [Section titled “Server → Client”](#server--client) ### `ack` [Section titled “ack”](#ack) Acknowledges the `subscribe` and seeds resume cursors. ```json { "type": "ack", "payload": { "sequence": 42, "timestamp": 1716130000000, "streamSeq": 12, "session": null } } ``` | Field | Description | | ----------------- | ------------------------------------------------------------------------------------------------------------------------ | | `sequence` | Inbound event-ledger position. | | `timestamp` | Epoch milliseconds. | | `session` | Current `GguiSession` snapshot when one is already committed; `null` / absent otherwise. 
| | `streamSeq` | Highest outbound `StreamEnvelope.seq` sent — seeds `fromSeq` on the next subscribe. | | `replayTruncated` | `true` when a requested `fromSeq` predates the server’s buffer window — the client got the live tail but missed history. | | `sessionToken` | Longer-lived reconnect credential, minted on the first wsToken-authed subscribe (see Authentication above). | | `serverVersion` | Server’s `PROTOCOL_SCHEMA_VERSION` stamp — see the protocol-version handshake below. | ### `render` [Section titled “render”](#render) The agent committed a new GguiSession for this `sessionId`. The payload is `{ session: GguiSession, matchType? }` — the frame discriminator `type: "render"` stays verb-named; the object inside is the `GguiSession`. ```json { "type": "render", "payload": { "session": { "id": "ses_xyz", "componentCode": "/* compiled JS */", "propsSpec": { /* … */ }, "actionSpec": { /* … */ }, "streamSpec": { /* … */ }, "contextSpec": { /* … */ } } } } ``` ### `props_update` [Section titled “props\_update”](#props_update) The `ggui_update` fan-out — the agent mutated props on this GguiSession. `props` is the FULL replacement state (post-merge for `kind: "merge"` updates), not a patch. Persistence is the source of truth; this frame is the latency optimization. ```json { "type": "props_update", "payload": { "sessionId": "ses_abc123", "props": { "rating": 5 } } } ``` ### `render_event` [Section titled “render\_event”](#render_event) One `GguiSessionEvent` from the per-session event ledger, replayed when `subscribe.payload.sinceSequence` is set. Same ledger as `GET /api/sessions/:sessionId/events` — two transports, one cursor. ### `data` [Section titled “data”](#data) An outbound [`StreamEnvelope`](/protocol/envelopes/#streamenvelope). `channel` names the `streamSpec[name]` it belongs to; `mode` is `append` or `replace`. 
```json { "type": "data", "payload": { "sessionId": "ses_abc123", "channel": "message", "mode": "append", "payload": { "text": "Found 3 flights.", "sender": "agent" }, "seq": 7 } } ``` ### `error` [Section titled “error”](#error) A server-side error — transport, auth, subscribe rejection, or an inbound action that failed contract validation. An inbound action that fails contract validation surfaces here as a typed `error` frame with code `CONTRACT_VIOLATION` (numeric `-32020`); nothing reaches the consume buffer. The earlier `_ggui:contract-error` reserved channel, its `ContractErrorPayload`, and the `ContractErrorCode` union were deleted in draft-2026-06-11 — contract failures now surface on the call that caused them, not on a side channel. Canonical `code` values exported from `@ggui-ai/protocol`: | Code | Meaning | | -------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------- | | `UPGRADE_REQUIRED` | Protocol-version handshake mismatch (see below). Servers default to `versionPolicy: 'reject'` and close the connection after emitting this frame. | | `CONTRACT_VIOLATION` | An inbound action failed validation against the render’s `actionSpec` (undeclared name or schema-rejected payload); nothing reaches the consume buffer. | Other codes are free-form strings. Server-emitted today: `SESSION_MISMATCH`, `BOOTSTRAP_SESSION_MISMATCH`, `SESSION_CREATE_FAILED`, `REPLAY_HORIZON_PASSED`; the `channel_error` frame uses `CHANNEL_UNKNOWN`, `CHANNEL_NOT_LOCAL`, `SESSION_NOT_FOUND`, `SUBSCRIBE_UNAUTHORIZED`, `POLL_FAILED`. See [Envelopes](/protocol/envelopes/). *** ## Protocol-version handshake [Section titled “Protocol-version handshake”](#protocol-version-handshake) Both peers advertise their schema version on the wire so version mismatch surfaces explicitly instead of silently corrupting state. 
* `subscribe.payload.supportedVersions?: string[]` — client declares the versions it accepts (first-party clients populate from `CLIENT_SUPPORTED_VERSIONS`).
* `ack.payload.serverVersion?: string` — server stamps its `PROTOCOL_SCHEMA_VERSION` on every ack.

Mismatch policy:

* **Server-side** — if `subscribe.supportedVersions` is present and the server’s `PROTOCOL_SCHEMA_VERSION` isn’t a member, the server replies with `error { code: 'UPGRADE_REQUIRED' }`. Default `versionPolicy: 'reject'` also closes the socket; `versionPolicy: 'advisory'` keeps it open (controlled-migration opt-out only).
* **Client-side** — if `ack.serverVersion` is present but not listed in the client’s `CLIENT_SUPPORTED_VERSIONS`, the client surfaces `UPGRADE_REQUIRED` on its error channel.

A declaration that is omitted on either side is treated as legacy pass-through (version-agnostic), which preserves pre-handshake behavior for older peers.

***

## Lifecycle

```plaintext
1. Connect to ws[s]:///ws?wsToken=
2. Send "subscribe" (sessionId + appId + wsToken, optional fromSeq / sinceSequence)
3. Receive "ack" (carries streamSeq + initial session)
4. Loop:
     receive "render" / "props_update" / "data" / "error"
     send "action"
   (generation-pipeline progress arrives as a {type:'data'} delivery
    on the reserved _ggui:lifecycle channel)
5. Close on session end or client disconnect
```

### Reconnection

`@ggui-ai/react` reconnects automatically with exponential backoff (1s → 30s, capped at 10 attempts). To resume the outbound stream without gaps, track the last observed `StreamEnvelope.seq` and pass it as `fromSeq` on the next `subscribe`. The server replays buffered envelopes (where supported) before re-entering the live tail.
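The reconnection behavior above can be sketched in a few lines of TypeScript. This is a minimal illustration, not the `@ggui-ai/react` implementation: the names `backoffDelayMs` and `StreamCursor` are hypothetical, the doubling factor is an assumption (the text states only the 1s floor, the 30s ceiling, and the 10-attempt cap), and `fromSeq` is treated here as a single number.

```typescript
// Hypothetical sketch; none of these names come from @ggui-ai/react.
const BASE_DELAY_MS = 1000;  // 1s floor (from the text)
const MAX_DELAY_MS = 30000;  // 30s ceiling (from the text)
const MAX_ATTEMPTS = 10;     // attempt cap (from the text)

/** Delay before reconnect attempt `attempt` (0-based), or null once the cap is reached. */
function backoffDelayMs(attempt: number): number | null {
  if (attempt >= MAX_ATTEMPTS) return null;
  // Doubling per attempt is an assumption; the docs only say "exponential".
  return Math.min(BASE_DELAY_MS * Math.pow(2, attempt), MAX_DELAY_MS);
}

/** Tracks the highest StreamEnvelope.seq seen so the next subscribe can resume from it. */
class StreamCursor {
  private lastSeq: number | undefined;

  observe(envelope: { seq: number }): void {
    if (this.lastSeq === undefined || envelope.seq > this.lastSeq) {
      this.lastSeq = envelope.seq;
    }
  }

  /** Builds a subscribe frame, adding fromSeq only when an envelope has been observed. */
  subscribeFrame(sessionId: string, wsToken: string) {
    const payload: { sessionId: string; wsToken: string; fromSeq?: number } = {
      sessionId,
      wsToken,
    };
    if (this.lastSeq !== undefined) payload.fromSeq = this.lastSeq;
    return { type: "subscribe" as const, payload };
  }
}
```

Keeping the cursor separate from the socket means it survives a dropped connection, so the first frame on the new socket can carry the last `seq` the old one delivered.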
### Status (SDK)

| Status         | Meaning                            |
| -------------- | ---------------------------------- |
| `connecting`   | Initial socket handshake in flight |
| `connected`    | Subscribed; `ack` received         |
| `disconnected` | Socket closed, no retry pending    |
| `reconnecting` | Backoff in progress after a drop   |

# The agent backend

> How a ggui-aware agent is hosted — @ggui-ai/agent-server (a brand-agnostic Hono backend) plus a thin per-SDK AgentAdapter. The three channels, the user-action doorbell, and what "Zero Agent Code" means.

A ggui-aware agent needs an HTTP backend the user’s chat client can talk to. You do **not** hand-roll that backend. The OSS reference implementation is **[`@ggui-ai/agent-server`](https://www.npmjs.com/package/@ggui-ai/agent-server)** — a brand-agnostic [Hono](https://hono.dev) server that owns every ggui-coupled host concern — plus a thin **`AgentAdapter`** that maps your LLM SDK’s event stream to a normalized message envelope.

The split is the point: the agent-server has **zero LLM-SDK knowledge** and the adapter has **zero ggui awareness**. The protocol lives entirely between the two.

## Three parties

```plaintext
┌─────────────────────────┐
│  host / chat client     │   MCP-Apps host: claude.ai, ChatGPT,
│  (owns the chat UI,     │   or the sample chat app. Forwards
│  forwards ui/message)   │   ui/message text to the model.
└───────────┬─────────────┘
       chat │ (HTTP POST /agent → SSE)
            ▼
┌─────────────────────────┐
│  agent backend          │   @ggui-ai/agent-server (Hono)
│  agent-server + a thin  │   + your AgentAdapter (per-SDK glue).
│  AgentAdapter           │
└───────────┬─────────────┘
        MCP │ (Streamable HTTP)
            ▼
┌─────────────────────────┐
│  ggui MCP server        │   ggui_handshake / ggui_render /
│  (GguiSessions + state  │   ggui_update / ggui_consume / ggui_emit,
│  + the iframe runtime)  │   plus the iframe runtime served per session.
└─────────────────────────┘
```

The **iframe** (running [`@ggui-ai/iframe-runtime`](https://www.npmjs.com/package/@ggui-ai/iframe-runtime)) is the rendered surface that lives inside the host. It is not a fourth party — it is the ggui MCP server’s UI, mounted in the host.

## Three channels

| Channel  | Between                         | Transport                | Status    | Carries                                                                               |
| -------- | ------------------------------- | ------------------------ | --------- | ------------------------------------------------------------------------------------- |
| **chat** | host ↔ agent backend            | HTTP `POST /agent` → SSE | Mandatory | `{kind: 'chat', prompt, chatId?}` in; a stream of `NormalizedMessage`s out            |
| **MCP**  | agent backend ↔ ggui MCP server | MCP Streamable HTTP      | Mandatory | `ggui_handshake` / `ggui_render` / `ggui_update` / `ggui_consume` / `ggui_emit` calls |
| **live** | ggui MCP server ↔ iframe        | WebSocket (+ fallback)   | Optional  | declared `streamSpec` deliveries (`ggui_emit` fan-out) and `props_update` frames      |

**chat** is how a turn starts. `POST /agent` is one kind-discriminated endpoint: `{kind: 'chat', prompt, chatId?}` opens the SSE stream — the first event is always `chat-allocated`, carrying the server-allocated `chatId`; subsequent frames are `message` events. The same endpoint also accepts `{kind: 'tool-call', name, arguments}` — the iframe-issued `tools/call` relay, answered as plain JSON rather than SSE. `GET /agent?chatId=X` replays the server-authoritative snapshot through the same handler for rehydration (each recorded tool result is re-inlined fresh so the snapshot reflects current server state), and `GET /` serves a small public manifest the frontend reads `sandboxProxyUrl` from.

**MCP** is the agent loop — the adapter’s LLM calls `ggui_render` / `ggui_consume` / etc. against the ggui MCP server using the URL + bearer the library threads on every call.
**live** is the first-party fast path for streaming UI updates into the iframe over WebSocket (gated by a `wsToken`). The spec-compliant cross-host fallback is **tool-result inlining**: agent-server’s interceptor reads `_meta.ui.resourceUri` on each tool result, issues a `resources/read`, and inlines the iframe HTML under `_meta.ui.resource` — so the iframe mounts on the first SSE frame with no extra round-trip.

## What agent-server owns

The library owns **every ggui-coupled host concern**, and nothing else:

* HTTP routing (Hono) + SSE streaming
* MCP discovery / routing + bearer threading (`bearer` defaults to `GGUI_MCP_BEARER`, then `dev` — pairing with `ggui serve --dev-allow-all`)
* **Tool-result resource inlining** (`interceptToolResult`) — mounting iframes from `_meta.ui.resourceUri`
* Server-allocated chat ids (`mintChatId`) — the frontend never mints ids client-side
* Guest + bearer **auth** with chat-ownership gating
* The second-origin **sandbox proxy** boot (per the MCP Apps spec; defaults to `port + 1000`)
* The snapshot / rehydration path
* **Cross-framework tool identity**: with `crossFramework` on (the `startAgentServer` default), the library declares each tool’s canonical `serverInfo` to ggui once per process via `ggui_runtime_declare_tool_catalog`, so blueprint reuse stays identity-stable across agent frameworks

Crucially, it is a **pure prompt forwarder**: the prompt is fed to the adapter verbatim, and the server synthesizes no directive and special-cases no key. The user-gesture directive that tells the model to call `ggui_consume` is authored in the iframe’s `ui/message` text and passes straight through (see [the user-action flow](#the-user-action-flow) below).
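Two of the defaults listed above are simple fallback chains. The sketch below shows one way they could resolve. It assumes a hypothetical `resolveDefaults` helper (not an export of `@ggui-ai/agent-server`), assumes an explicitly passed option wins over the environment, and takes the environment as a plain object so the logic stays easy to test.

```typescript
// Hypothetical helper: illustrates the documented fallbacks only.
interface ServerOptions {
  port: number;
  bearer?: string;
  sandboxProxyPort?: number;
}

type Env = Record<string, string | undefined>;

function resolveDefaults(options: ServerOptions, env: Env) {
  return {
    port: options.port,
    // Explicit option first (assumption), then GGUI_MCP_BEARER, then the literal "dev".
    bearer: options.bearer ?? env["GGUI_MCP_BEARER"] ?? "dev",
    // The sandbox proxy runs on a second origin; port + 1000 unless overridden.
    sandboxProxyPort: options.sandboxProxyPort ?? options.port + 1000,
  };
}
```

With `port: 6790` and nothing else set, this resolves to bearer `dev` and sandbox proxy port `7790`.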
## What the AgentAdapter implements

A thin per-SDK adapter implements **one** async-iterable method:

```ts
import { startAgentServer, type AgentAdapter } from "@ggui-ai/agent-server";

const adapter: AgentAdapter = {
  name: "my-sdk",
  async *run(input) {
    // input.prompt            — the string the LLM should see (verbatim)
    // input.chatId            — server-allocated stable id for this conversation
    // input.mcpServers        — { name → { url, bearer } } map (e.g. { ggui: {…} })
    // input.systemPrompt      — three-way: undefined = adapter default,
    //                           null = explicitly none, string = override
    // input.abortSignal       — fires on client disconnect; stop the LLM call
    // input.agentCapabilities — canonical tool catalog (from live MCP
    //                           initialize + tools/list) to stamp into the
    //                           handshake's blueprintDraft contract
    //
    // Drive your SDK's tool loop and yield NormalizedMessage values:
    //   assistant text · tool_use · tool_result (carrying the full MCP
    //   CallToolResult as `tool_use_result`) · result
  },
};

await startAgentServer({
  port: 6790,
  mcpServers: { ggui: { url: "http://localhost:6781/mcp" } }, // a `ggui` entry is required
  adapter,
  // optional: auth (default createGuestTokenAuth()), sandboxProxyPort
  // (default port + 1000), systemPrompt, bearer, chatStore, crossFramework
});
```

Adapters **must stay brand-agnostic**: no imports of `@ggui-ai/protocol/integrations/mcp-apps`, no awareness of `sessionId` / `host-session` / `_meta.ui` keys. The adapter maps its native SDK event stream onto the `NormalizedMessage` envelope; ggui mechanics stay in agent-server.

Reference adapters ship for the Claude Agent SDK (`claude-agent-sdk`), the OpenAI Agents SDK (`openai-agents-sdk`), and Google ADK (`google-adk`).
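The three-way `systemPrompt` convention described in the adapter comments is easy to get wrong, because `undefined` and `null` mean different things. The sketch below shows one way an adapter might resolve it. `resolveSystemPrompt` and its `adapterDefault` parameter are hypothetical names for illustration, not part of `@ggui-ai/agent-server`.

```typescript
// Hypothetical helper: maps the three-way input onto what the LLM call receives.
function resolveSystemPrompt(
  requested: string | null | undefined,
  adapterDefault: string,
): string | undefined {
  if (requested === undefined) return adapterDefault; // not specified: adapter default
  if (requested === null) return undefined;           // explicitly none: send no system prompt
  return requested;                                   // string: override
}
```

A loose check such as `requested == null` would collapse the first two cases into one, which is why the sketch compares strictly.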
## Frontend pairing — `@ggui-ai/react/chat-helpers`

On the browser side, agent-server pairs with the **`useMcpAppsChat`** hook from [`@ggui-ai/react/chat-helpers`](/sdk/react/). It:

* opens the SSE stream to `POST /agent` and parses `chat-allocated` then `message` frames into one wire;
* walks each tool result’s `tool_use_result` for `_meta.ui.resourceUri` (and any inlined `_meta.ui.resource`) and surfaces the result as `sessions` entries your app mounts with `` (imported directly from `@mcp-ui/client` — ggui doesn’t wrap or re-export it);
* replays `GET /agent?chatId` through the same pipeline for rehydration;
* runs the guest-token client flow (`POST /auth/guest` → store → `Bearer` on every request → retry once on `401`);
* forwards an iframe `ui/message`’s text as the next prompt via `handleAppMessage` and carries its `_meta` **opaquely** as `data.meta` — it never reads a key inside.

## The user-action flow

When a user interacts with a rendered UI, the gesture travels back to the agent through the **GguiSession’s pending-event pipe**, which is the single source of truth:

1. **Gesture in the iframe** → the iframe runtime calls `ggui_runtime_submit_action` via the host’s `tools/call` relay (postMessage per the MCP Apps spec; in the sample stack, the host relays it as `POST /agent {kind: 'tool-call'}`).
2. The ggui server **appends the gesture to the GguiSession’s pending-event pipe** and returns `{ok, consumerPresent}`.
3. `consumerPresent` is computed by an active-consumer registry: `ggui_consume` registers itself while long-polling, so `submit_action` knows whether a consume loop is currently listening.
4. **If a `ggui_consume` long-poll is listening**, it unblocks in-turn and returns the event `{intent, actionData, uiContext, actionId, firedAt}` to the agent.
5. **If nobody is listening** (`consumerPresent: false` — e.g. the user reloaded the page after the agent’s turn ended), the iframe emits a **userAction doorbell** on a `ui/message`. The host forwards the message text to the model, which wakes a fresh turn and calls `ggui_consume({sessionId})` to drain the already-enqueued gesture.

The agent retrieves the gesture **exclusively** via `ggui_consume`, so it fires **exactly once**. The doorbell is a **pure pointer** — `_meta["ai.ggui/userAction"]` (`GguiUserActionMeta`) carries only `{kind: 'user-action', description, sessionId, actionId, submittedAt, intent, nextStep: {tool: 'ggui_consume', args: {sessionId}}}`, never the action payload. Carrying the payload in the doorbell would risk a double-trigger.

## “Zero Agent Code”, redefined

[Zero Agent Code](/protocol/overview/) now means an agent builder writes only:

1. **MCP server config** naming the ggui MCP endpoint,
2. a **system prompt** (start from `GGUI_AGENT_SYSTEM_PROMPT`, exported by `@ggui-ai/protocol`), and
3. a few lines wiring a thin **`AgentAdapter`** into `startAgentServer()`.

No polling loops, no event handlers, no protocol parsing, no `sessionId` / `host-session` awareness. All of that lives inside `@ggui-ai/agent-server`. The adapter is intentionally brand-agnostic SDK-mapping glue — not ggui logic.

## Auth posture (Preview)

agent-server ships two `AuthAdapter` implementations:

* **`createGuestTokenAuth` (default)** — stateless signed bearer tokens (signing secret from `GUEST_TOKEN_SECRET`, ephemeral with a warning if omitted) that work across browser / React Native / CLI. Mounts `POST /auth/guest`, `GET /auth/me`, `POST /auth/logout`.
* **`createBearerTokenAuth`** — static operator-configured tokens for sample apps, CI, and small self-hosts. Mounts `GET /auth/me` only.
Every chat row is stamped with an `ownerId`; reads and appends are ownership-gated (`200` owner / `403` other / `404` unknown), overridable via `authorizeChat` for team / org semantics. Richer JWT / JWKS / OAuth + PKCE flows are deferred to a future `@ggui-ai/agent-server-auth-extras` (same `AuthAdapter` contract, no handler rewrites). For the Preview, the bundled guest-token + static-bearer paths are the supported surface. ## See also [Section titled “See also”](#see-also) * [How ggui works](/how-it-works/) — the handshake → render → interact → consume loop * [Architecture overview](/architecture/overview/) — the wire pipeline at a glance * [Event System](/architecture/event-system/) — the pending-event pipe + the consume model * [React SDK](/sdk/react/) — `useMcpAppsChat` and `` on the frontend * [MCP Protocol reference](/api/mcp-protocol/) — `ggui_render` / `ggui_consume` / `ggui_update` / `ggui_emit` # Audience routes > Every MCP tool carries an audience tag that determines which route it surfaces on — /mcp, /protocol, or /ops. This page explains the four audiences and the routing rules. A single ggui server exposes several distinct MCP surfaces, not one. Each tool the server registers carries an **audience tag** that decides which HTTP route the tool appears on. The agent runtime sees one slice of the tool set; design-time clients see another; operators see a third. The audience tag is the structural mechanism that keeps those slices honest. This page explains the four audiences, the three routes they map to, and the placement rules for every new handler. ## Why audiences exist [Section titled “Why audiences exist”](#why-audiences-exist) Different callers connect to a ggui server for different reasons: * An **LLM agent** in the middle of a chat turn needs `ggui_render`, `ggui_handshake`, blueprint search, and not much else. * The **view runtime** — the iframe-runtime, relayed through the host’s `tools/call` — and first-party backend libraries (e.g. 
`@ggui-ai/agent-server`) need callbacks like `ggui_runtime_sync_context` and `ggui_runtime_submit_action`. * A **design-time client** authoring blueprints needs static spec/discovery tools like `ggui_protocol_describe_blueprint_format` once, then never again. * An **operator** managing apps, keys, and orgs needs administrative tools that should never appear in an LLM’s `tools/list`. Putting all of those tools on one `/mcp` surface burns agent context on tools the agent will never call, and exposes operator surfaces to runtimes that shouldn’t see them. The audience tag splits the surface so each caller’s `tools/list` is exactly the tools that caller cares about. ## The four audiences [Section titled “The four audiences”](#the-four-audiences) | Audience | Surfaces on | Wire-name prefix | Who calls | | ---------- | ----------- | ----------------- | ----------------------------------------------------------------------------------------------------- | | `agent` | `/mcp` | `ggui_*` | The LLM agent itself during a chat turn (render, handshake, blueprint search) | | `runtime` | `/mcp` | `ggui_runtime_*` | The view runtime — iframe-runtime (via the host’s `tools/call` relay) + first-party backend libraries | | `protocol` | `/protocol` | `ggui_protocol_*` | Design-time spec/discovery clients (conformance suites, registry browsers) | | `ops` | `/ops` | `ggui_ops_*` | Operator agents — an LLM acting as a console operator, dashboards, CI | Each tag answers exactly one question: **who is calling this tool, and on what time-scale?** ### `agent` [Section titled “agent”](#agent) Surfaced on `/mcp`. The LLM agent calls these tools live, inside a chat turn. They typically mutate render state, render contracts, or look up blueprints by intent. Wire-name prefix: bare `ggui_*` (e.g. `ggui_render`, `ggui_handshake`, `ggui_update`). The bare prefix is reserved for the canonical agent route — these are the tools an agent calls most often, and they don’t need a route disambiguator. 
### `runtime`

Surfaced on `/mcp` alongside `agent` tools. These tools are not called by the LLM directly. Their callers are the view runtime (the iframe-runtime, relayed through the host’s MCP-Apps `tools/call`) and first-party backend libraries (e.g. `@ggui-ai/agent-server` declaring the per-app tool catalog via `ggui_runtime_declare_tool_catalog`). They handle things like syncing renderer state back to the server or submitting user actions.

Wire-name prefix: `ggui_runtime_*` (e.g. `ggui_runtime_sync_context`, `ggui_runtime_submit_action`).

### `protocol`

Surfaced on `/protocol`. Static design-time tools that describe the protocol itself: example blueprints, format references, schema validators. A client calls these **once** while authoring against the protocol, not during runtime.

Wire-name prefix: `ggui_protocol_*` (e.g. `ggui_protocol_describe_blueprint_format`, `ggui_protocol_validate_blueprint`, `ggui_protocol_get_example_blueprints`).

The litmus test: would the result change if the same caller invoked the tool again five minutes later? If no (the tool returns a static format reference or immutable example set), it belongs on `/protocol`. If yes, it’s a runtime lookup and belongs on `/mcp`.

### `ops`

Surfaced on `/ops`. Operator-facing tools an LLM operator (or dashboard, or CI script) uses to manage apps, register blueprints, issue connector keys, redeem coupons, and list orgs. They are never visible to the agent runtime and never invoked from inside a rendered UI.

Wire-name prefix: `ggui_ops_*` (e.g. `ggui_ops_create_app`, `ggui_ops_list_orgs`, `ggui_ops_issue_connector_key`).
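The prefix convention across the four audiences can be read mechanically. The sketch below infers an audience from a wire name and maps audience tags to the routes named above. `audienceFromWireName` and `routesFor` are hypothetical helpers for illustration, not ggui exports, and the sketch says nothing about how the real route mounter is implemented.

```typescript
type Audience = "agent" | "runtime" | "protocol" | "ops";

// Route each audience surfaces on, as listed in the audiences table.
const ROUTE_BY_AUDIENCE: Record<Audience, string> = {
  agent: "/mcp",
  runtime: "/mcp",
  protocol: "/protocol",
  ops: "/ops",
};

/** Infers the audience a wire name implies; undefined for non-ggui names. */
function audienceFromWireName(name: string): Audience | undefined {
  // Check the explicit prefixes first; bare ggui_* is the fallback.
  if (name.indexOf("ggui_runtime_") === 0) return "runtime";
  if (name.indexOf("ggui_protocol_") === 0) return "protocol";
  if (name.indexOf("ggui_ops_") === 0) return "ops";
  if (name.indexOf("ggui_") === 0) return "agent";
  return undefined;
}

/** Distinct routes a set of audience tags lands on, in first-seen order. */
function routesFor(audiences: readonly Audience[]): string[] {
  const routes: string[] = [];
  for (const audience of audiences) {
    const route = ROUTE_BY_AUDIENCE[audience];
    if (routes.indexOf(route) === -1) routes.push(route);
  }
  return routes;
}
```

The order of the prefix checks matters: `ggui_runtime_submit_action` also starts with `ggui_`, so testing the bare prefix first would misclassify it as `agent`.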
## Routes table [Section titled “Routes table”](#routes-table) The four audiences map onto three HTTP routes: | Route | Audiences mounted | Typical caller | Auth posture | | ----------- | ------------------- | --------------------------------------- | ---------------------------- | | `/mcp` | `agent` ∪ `runtime` | LLM agent + view runtime | Bearer token or session auth | | `/protocol` | `protocol` | Conformance suites, design-time clients | Same auth chain as `/mcp` | | `/ops` | `ops` | Operator agents, dashboards, CI | Same auth chain as `/mcp` | The mounting logic reads each handler’s `audience` array and includes it on every route whose audience set intersects the handler’s tags. A handler tagged `audience: ['agent']` lands on `/mcp` only. A handler tagged `audience: ['agent', 'runtime']` also lands on `/mcp` (the union doesn’t change membership). A handler tagged `audience: ['ops']` lands on `/ops` only. When per-app routing is configured, the same `agent` ∪ `runtime` surface also mounts at a per-app path (`/apps/` on hosted deployments); the audience model is identical — only the tenancy resolution differs. Caution A handler with **no** audience tag is mounted on `/mcp` by default. This is a backward-compatibility behavior — handlers added before audience tagging existed default to `agent`. New handlers should always declare an explicit `audience` array. ## Placement decision tree [Section titled “Placement decision tree”](#placement-decision-tree) When you add a new MCP handler, walk this tree before picking an audience: 1. **Does the LLM agent invoke this during a chat turn?** Yes → `audience: ['agent']`. 2. **Is this a tool declared with `_meta.ui.visibility: ['app']`** that the rendered view invokes through the host’s MCP-Apps `tools/call` relay? Yes → `audience: ['runtime']`. 3. **Is this a static spec or discovery tool a client reads once while authoring against the protocol?** Yes → `audience: ['protocol']`. 4. 
**Is this an administrative operation a human, dashboard, CI, or operator agent performs out-of-band?** Yes → `audience: ['ops']`. The placement test is exclusive — if more than one branch fires, pick the **most-frequent caller** and tag that audience. Multi-audience tags are rare; see below. ### The “is this a runtime lookup?” trap [Section titled “The “is this a runtime lookup?” trap”](#the-is-this-a-runtime-lookup-trap) Tools like `ggui_search_blueprints` *sound* like spec/discovery — they discover blueprints. But their results change per-app and per-session: the agent calls them at chat time to decide what to build, not to learn the protocol’s format. They are runtime lookups, tagged `agent`, surfaced on `/mcp`. The litmus test repeats: **would the result change if the same caller invoked this tool again five minutes later?** * Yes → runtime lookup → `agent` (or `runtime` if called from the iframe). * No → static spec/discovery → `protocol`. ## Wire-name prefix discipline [Section titled “Wire-name prefix discipline”](#wire-name-prefix-discipline) The prefix encodes the audience at the wire-name level so a tool reader can infer the route without consulting documentation. An LLM scanning `tools/list` on `/protocol` sees `ggui_protocol_describe_blueprint_format` and immediately understands the routing. | Prefix | Implied route | Implied audience | | ----------------- | ------------- | ---------------- | | `ggui_*` (bare) | `/mcp` | `agent` | | `ggui_runtime_*` | `/mcp` | `runtime` | | `ggui_protocol_*` | `/protocol` | `protocol` | | `ggui_ops_*` | `/ops` | `ops` | Bare `ggui_*` is the exception, not the rule. It is reserved for agent runtime essentials — the canonical chat-turn tools that don’t need a route disambiguator. Every other audience requires the explicit prefix. Caution The prefix and the `audience` tag must agree. 
A handler named `ggui_protocol_foo` with `audience: ['ops']` is a discipline violation — the prefix promises `/protocol`, the tag mounts it on `/ops`, and any agent scanning `/protocol` for `ggui_protocol_*` tools will miss it. Keep them in sync. ## How a handler declares its audience [Section titled “How a handler declares its audience”](#how-a-handler-declares-its-audience) The `audience` field lives on every `SharedHandler`. It is a `ReadonlyArray<'agent' | 'runtime' | 'protocol' | 'ops'>` — an array because multi-audience tagging is structurally permitted (see below). ```ts import type { SharedHandler } from '@ggui-ai/mcp-server-handlers'; export function createListOrgsHandler(): SharedHandler<…> { return { name: 'ggui_ops_list_orgs', title: 'List organizations', audience: ['ops'], description: 'Enumerate orgs visible to the calling operator.', inputSchema: { /* … */ }, outputSchema: { /* … */ }, async handler(input, ctx) { /* … */ }, }; } ``` That’s the entire contract. Once a handler is registered through the normal channel (the `handlers` array passed to `createGguiServer`), the route mounter reads the `audience` field at compose time and decides which routes the handler appears on. There is no separate route-registration step. ## Multi-audience handlers [Section titled “Multi-audience handlers”](#multi-audience-handlers) The `audience` field is an array, not a scalar, because a handler can legally surface on more than one route. In practice this is rare and intentional: ```ts export function createSomeBoundaryToolHandler(): SharedHandler<…> { return { name: 'ggui_some_boundary_tool', audience: ['agent', 'runtime'], // … }; } ``` Such a handler appears in both the agent’s `tools/list` and the iframe runtime’s view. The canonical example is a tool that has to land on the agent’s wire AND accept calls from inside the rendered UI — the runtime essentials that genuinely span both callers. Multi-audience is a tool that lives on multiple routes simultaneously. 
If you find yourself reaching for it, double-check the placement test first — most “multi-audience” tools are actually two tools wearing a trench coat, and splitting them sharpens both surfaces. ## Relation to MCP services [Section titled “Relation to MCP services”](#relation-to-mcp-services) The audience model governs **shared** routes — `/mcp`, `/protocol`, `/ops` — where handlers from different sources are aggregated under audience filtering. A separate concept, **MCP services**, lets a server expose isolated, complete MCP servers at their own HTTP paths (e.g. `https://your-server/docs`, `https://your-server/playground/todos`). | Concept | Routing mechanism | Tool isolation | When to use | | ------------ | ---------------------------------- | --------------------- | ------------------------------------------- | | **Audience** | Tag filters tool onto shared route | Tools share namespace | Tool belongs alongside ggui-native tools | | **Service** | Path mounts an isolated MCP server | Per-path namespace | Tool set is conceptually its own MCP server | A service handler **must not** set an `audience` tag — the path IS the audience. The compose-time validator rejects services with audience-tagged handlers loudly: services bypass audience filtering entirely, so a tag would be silently meaningless. → See [MCP services](/architecture/mcp-services/) for the full service model. ## Placement anti-patterns [Section titled “Placement anti-patterns”](#placement-anti-patterns) Audience tagging makes it cheap to add new tools, which means the *placement* discipline matters more than ever. Some patterns to avoid: * **`ops`-audience tools that mutate non-tenant data.** The `/ops` route is operator-bounded but still scoped to the calling operator’s tenant. A tool that lets one operator probe another tenant’s apps is a confused-deputy bug, not a feature. * **Cross-tenant probing.** `ggui_ops_list_orgs` should enumerate orgs the caller can see, not all orgs. 
If a tool needs admin-level visibility, gate it on an explicit role check at the handler boundary, not at the route boundary. * **`protocol`-audience tools that return runtime data.** If a result depends on which session is calling, it is a runtime lookup. Move it to `agent`. * **`runtime`-audience tools that the agent should also call.** If an LLM agent needs the data, tag it `agent` (or `['agent', 'runtime']` if the iframe legitimately calls it too). Tagging it `runtime`-only hides it from the agent’s `tools/list`. * **Untagged handlers.** The default-to-`agent` behavior is a backward-compatibility convenience, not a recommendation. Always declare `audience` explicitly on new handlers. ## See also [Section titled “See also”](#see-also) * [MCP services](/architecture/mcp-services/) — isolated per-path MCP servers * [MCP protocol](/api/mcp-protocol/) — the `/mcp` surface in detail * [Ops MCP](/api/ops-mcp/) — the `/ops` surface in detail * [Architecture overview](/architecture/overview/) — three channels and the capability model # Benchmark methodology > How benchmarks.ggui.ai measures UI-generation quality, latency, and cost — the model matrix, the five aesthetic dimensions, the three-provider judge panel, the corpus, and how to reproduce a run locally. [benchmarks.ggui.ai](https://benchmarks.ggui.ai) is the public dashboard for ggui’s generation quality. It runs nightly and publishes per-cell **quality**, **latency**, and **cost** across a three-tier model matrix. This page is the methodology behind those numbers — what is measured, how it is scored, and how to reproduce a run yourself. ## What it measures [Section titled “What it measures”](#what-it-measures) Every night the harness generates UI for a fixed corpus of prompts across a **three-tier model matrix** — `fast`, `balanced`, and `premium` capability tiers, each instantiated on the three providers ggui supports (`claude`, `openai`, `google`). 
Every matrix cell records three things: * **Quality** — the aesthetic score (below), 0–100. * **Latency** — wall-clock time to a compiled, contract-typed component. * **Cost** — provider spend for the generation, in USD. The dashboard publishes **per-cell** results. It is not a provider leaderboard — see [Judge panel](#judge-panel). ## Quality scoring [Section titled “Quality scoring”](#quality-scoring) Quality is the mean of **five aesthetic dimensions**, each weighted equally at 20% and scored 0–100: | Dimension | What it captures | | ----------------- | ------------------------------------------------------- | | Layout | Spacing, alignment, structure, responsive behavior | | Design tokens | Correct use of `@ggui-ai/design` tokens over ad-hoc CSS | | Hierarchy | Visual weighting — what reads first, second, third | | Polish | States, affordances, finish; the absence of rough edges | | Data presentation | How clearly the contract’s data is rendered | The **pass threshold is 70**. A cell at or above 70 is a pass; below is a fail. The five-dimension breakdown is published alongside the composite so a regression can be traced to the dimension that moved. ## Judge panel [Section titled “Judge panel”](#judge-panel) Quality is **not** scored by a single model. Each generation is judged by a **three-provider panel** — `claude`, `openai`, and `google` — all run at **temperature 0** for determinism. The published score is the **panel mean**, and the **per-cell spread** across the three judges is shown alongside it. The panel exists to neutralize single-model bias: **no model judges only its own output**, and a generous-to-self or harsh-to-rivals bias from any one judge is diluted by the other two. The visible spread is the honesty check — a wide spread on a cell is a signal that the judges disagree, not a number to trust blindly. This is why the dashboard publishes **per-cell scores, not a provider ranking**. 
The unit of truth is “this model, this tier, on this prompt” — rolling that up into a single “best provider” headline would discard exactly the per-cell, per-dimension detail the panel is designed to preserve. ## Corpus [Section titled “Corpus”](#corpus) The harness runs against a **fixed set of generation prompts** — representative UI shapes that exercise the contract surface: `weather-card`, `survey-form`, `kanban-board`, and others, **plus gadget commits** (renderer-side capability flows). The corpus is fixed so that night-over-night movement reflects model and triad changes, not a shifting set of prompts. ## Reproducibility [Section titled “Reproducibility”](#reproducibility) The benchmark is **source-available** — the entire harness ships in the public repo. To run it yourself: ```bash git clone https://github.com/ggui-ai/ggui cd ggui pnpm install pnpm --filter @ggui-ai/benchmark bench … ``` You need a provider API key (set the relevant provider environment variable; the harness reads it the same way the live dashboard does). The benchmark **dataset is licensed CC-BY-4.0** — reuse it, cite it, build on it. ## See also [Section titled “See also”](#see-also) * [UI Generator](/architecture/ui-generator/) — the harness the benchmark exercises. * [benchmarks.ggui.ai](https://benchmarks.ggui.ai) — the live dashboard. # Event System > How user gestures travel from the renderer back to the agent — EventType vocabulary, ActionEnvelope shape, subscription rules, and the two consumer paths. User actions originate in the renderer (on the live channel), land in the server, and reach the agent through one of two read paths. This page covers the closed vocabulary of events, the canonical envelope, how subscriptions gate what gets delivered, and how an agent or React SDK consumer actually reads them. For the full wire grammar of the envelope itself, see [Envelopes](/protocol/envelopes/). For the channel topology, see the [Architecture overview](/architecture/overview/). 
## Event vocabulary

The protocol recognizes exactly **one** event type. Every user gesture that drives a turn is a `data:submit`. The earlier `data:change` / `lifecycle:*` / `interaction:*` / `error:*` vocabulary was deleted (draft-2026-06-12) — those types never had a producer.

```typescript
type EventType = "data:submit"; // the only member
```

A `data:submit` is schema-validated against the render’s `actionSpec`.

## The envelope

Every user gesture arrives as a flat `ActionEnvelope`:

```typescript
interface ActionEnvelope<TPayload> {
  sessionId: string; // bound at subscribe time; server rejects mismatches
  type: EventType;
  payload?: TPayload; // for `data:submit`: { action, data?, tool? }
  clientSeq?: number; // client-monotonic dedup hint
  schemaVersion?: string; // producer's PROTOCOL_SCHEMA_VERSION (advisory)
}
```

The envelope is intentionally flat — no nested `event` / `context` / `meta` blocks. Render-level diagnostics (device info, interface context, user identity) are captured **once** at subscribe time on the server, not per-delivery. See [Envelopes — “Fields intentionally NOT on the envelope”](/protocol/envelopes/#fields-intentionally-not-on-the-envelope) for the rationale.

## Subscription gating

Delivery gating falls out of the contract’s `actionSpec`: every declared action emits a `data:submit` envelope the agent reads via `ggui_consume`. The old per-event `EventSubscription` filter (an allowlist of event types at render time) and the `DEFAULT_SUBSCRIPTION` constant were deleted from the protocol (no shims) — there is no wire-level subscribe object on render, and `data:submit` is now the only event type. The current `ggui_render` input is `{handshakeId, props, themeId?, infra?, override?}` — `props` is required (pass `{}` when the contract declares no propsSpec), and there is no `subscribe` field.
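
The `ggui_render` input described above can be written down as a type. This is a minimal sketch, not the protocol's published definition: only the field names and the required/optional split come from the text. The field types (`string`, `Record<string, unknown>`, `unknown`), the name `RenderInputSketch`, and the helper `buildRenderInput` are assumptions made for illustration.

```typescript
// Sketch of the `ggui_render` input. Field names and optionality follow the
// prose above; the field TYPES are assumptions, not the real schema.
interface RenderInputSketch {
  handshakeId: string;
  props: Record<string, unknown>; // required; `{}` when the contract declares no propsSpec
  themeId?: string;
  infra?: unknown;
  override?: unknown;
  // Note: there is deliberately no `subscribe` field.
}

// Hypothetical helper: defaults `props` to `{}` so the required field is
// always present, even for contracts without a propsSpec.
function buildRenderInput(
  handshakeId: string,
  props: Record<string, unknown> = {},
  extras: Omit<RenderInputSketch, "handshakeId" | "props"> = {},
): RenderInputSketch {
  return { handshakeId, props, ...extras };
}

// Usage: a contract with no propsSpec still sends an explicit empty object.
const exampleInput = buildRenderInput("hs_example");
```

Defaulting `props` in one place keeps call sites from forgetting the required field, which is the one mistake the prose above warns about.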
## Two reader paths, one envelope [Section titled “Two reader paths, one envelope”](#two-reader-paths-one-envelope) The same `ActionEnvelope` reaches consumers through two distinct seams. **Don’t conflate them.** | Consumer | Path | Shape returned | | -------------------------------------- | ----------------------------------------- | ------------------- | | **Agent** (server-side, LLM-driven) | Long-polls `ggui_consume` over MCP | `ConsumeEventEntry` | | **Renderer SDK** (browser, e.g. React) | Live tail on the WebSocket subscribe seam | `ActionEnvelope` | `ggui_consume` is the agent’s read path. It’s render-keyed, consume-once, and the row shape (`ConsumeEventEntry`) carries a tiny bit of extra context the LLM needs to route the gesture. The WebSocket subscribe seam is what the iframe-runtime uses to deliver interaction events into the rendered component. See [MCP Protocol — Events](/api/mcp-protocol/#events) for both shapes. ### Host relay + the `ai.ggui/userAction` doorbell [Section titled “Host relay + the ai.ggui/userAction doorbell”](#host-relay--the-aigguiuseraction-doorbell) In MCP-Apps hosts the gesture reaches the server via the host’s `ggui_runtime_submit_action` `tools/call` relay instead of the WS. If the response reports `consumerPresent: false` (no `ggui_consume` long-poll is draining — e.g. after a page reload), the iframe emits a `ui/message` whose text directs the agent to call `ggui_consume({sessionId})`, with an optional structured mirror on `content[0]._meta["ai.ggui/userAction"]` — a pure pointer; the gesture itself is only ever drained via `ggui_consume`. ## The MCP control surface [Section titled “The MCP control surface”](#the-mcp-control-surface) The agent never talks to the live channel directly — it renders UI, polls events, discovers gadgets, and browses the blueprint marketplace over MCP. 
The canonical agent-callable tool surface (lifecycle, capability discovery, stream emit, and blueprint marketplace) is enumerated in the [MCP Protocol](/api/mcp-protocol/) reference — that page is the single source of truth, with field-level shapes and return types. Linking out instead of duplicating here keeps this page from drifting as the tool surface evolves. The `ggui_consume` long-poll in particular is the agent’s event read path — its return shape (`ConsumeEventEntry`) is covered in [MCP Protocol — Events](/api/mcp-protocol/#events). ## Outbound traffic on the same channel [Section titled “Outbound traffic on the same channel”](#outbound-traffic-on-the-same-channel) `ActionEnvelope` is the inbound half of the live channel. Server-to-renderer traffic on the same WebSocket arrives as `StreamEnvelope` (one delivery per named `streamSpec` channel). Contract violations are not a separate channel: an invalid inbound action is answered with a typed `error` frame (`CONTRACT_VIOLATION`) on the live channel, and a `ggui_render` / `ggui_emit` validation failure rejects the agent’s own tool call. The former `_ggui:contract-error` channel and its `ContractErrorPayload` vocabulary were removed (draft-2026-06-11). Both surviving shapes are covered in [Envelopes](/protocol/envelopes/). ## Generation progress events [Section titled “Generation progress events”](#generation-progress-events) While the server is generating a fresh UI (no blueprint hit), it emits progress on the reserved `_ggui:lifecycle` channel (a `{type:'data'}` frame) that the renderer surfaces as a loading state. The canonical `GguiLifecyclePayload` vocabulary lives in `packages/protocol/src/types/lifecycle.ts`: ```plaintext handshake_started → handshake_completed → render_started → consume_polling ``` These feed the built-in progress UI inside the rendered iframe (the iframe-runtime) — your end-user sees real-time feedback instead of a frozen iframe. 
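
The lifecycle vocabulary above is ordered, which is what lets a loading state show forward progress. The sketch below is illustrative only: the built-in progress UI already does this inside the iframe, and the real `GguiLifecyclePayload` shape is not reproduced here. `LIFECYCLE_PHASES` and `progressFraction` are hypothetical names; only the four phase strings come from the text.

```typescript
// The four phases, in the order listed above.
const LIFECYCLE_PHASES = [
  "handshake_started",
  "handshake_completed",
  "render_started",
  "consume_polling",
] as const;

type LifecyclePhase = (typeof LIFECYCLE_PHASES)[number];

// Hypothetical helper: maps a phase name to a 0..1 fraction by its position
// in the ordered vocabulary.
function progressFraction(phase: LifecyclePhase): number {
  const index = LIFECYCLE_PHASES.indexOf(phase);
  return (index + 1) / LIFECYCLE_PHASES.length;
}
```

Deriving the fraction from the array position means a vocabulary change only has to be made in one place.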
## React SDK integration

In the web consumer path the live-channel WebSocket — and the progress events above — are owned by the **iframe-runtime inside the sandboxed `<iframe>`**, not by host code. The host doesn’t open the socket, set a `wsEndpoint`, or handle `ActionEnvelope`s directly: the `wsUrl` is server-stamped on the render’s `ai.ggui/render` slice, the iframe connects + subscribes + resumes on its own, and the progress UI animates inside the iframe.

```tsx
import { AppRenderer } from "@mcp-ui/client";
import { useMcpAppsChat } from "@ggui-ai/react/chat-helpers";

// Mount the render; the iframe-runtime drives the live channel + progress UI.
const { sessions, handleAppMessage } = useMcpAppsChat({ chatEndpoint });
//
```

Structured signals the host may want (dispatch telemetry, subscribe failures, version mismatches, auth-required) surface on the `ggui:observe` postMessage channel. See [React SDK](/sdk/react/) for the host surface and [Error Handling → renderer-side faults](/cookbook/error-handling/#renderer-side-faults-stay-inside-the-iframe) for the observability events.

## Where to next

* [Envelopes](/protocol/envelopes/) — canonical live-channel wire grammar
* [MCP Protocol](/api/mcp-protocol/) — agent-side control plane
* [WebSocket Protocol](/api/websocket-protocol/) — renderer-side live channel
* [Architecture overview](/architecture/overview/) — three-channel topology

# MCP services

> Multi-mount MCP servers — McpService primitive, path reservations, anonymous mode, and how it differs from McpServerMount.

A ggui server is not a single MCP server. One Node process composes:

* **The audience-filtered routes** (`/mcp`, `/protocol`, `/ops`) — ggui’s native control plane, optionally extended by `McpServerMount`s that aggregate external tools onto the same surface.
* **Zero or more `McpService`s** — fully isolated MCP servers, each mounted at its own HTTP path with its own tool namespace. This page covers the second half. For the audience model that drives the shared routes, see [Audience routes](/architecture/audience-routes/). ## What an `McpService` is [Section titled “What an McpService is”](#what-an-mcpservice-is) An `McpService` is a complete, self-contained MCP server reachable at a single HTTP path you pick (`/docs`, `/playground/todos`, `/internal/billing`, …). A client connecting to that path sees exactly the tools the service declares — no ggui-native tools, no other service’s tools, no audience filtering. The path **is** the namespace boundary. Contrast with `McpServerMount`. A mount is a named bundle of `SharedHandler`s aggregated onto the audience-filtered routes. Mounted tools compose alongside ggui’s native tools (`ggui_render`, `ggui_handshake`, `ggui_consume`, …) so a single MCP session over `/mcp` enumerates the union of everything. Services do not compose; they isolate. 
```plaintext ┌─────────────────────────────────┐ Agent (LLM-driven) ─────▶│ /mcp │ │ ggui-native tools │ │ + every mount's tools │ audience-filtered │ (audience=agent|runtime) │ └─────────────────────────────────┘ ┌─────────────────────────────────┐ Operator console ──────▶│ /ops │ audience-filtered │ ggui_ops_* │ (audience=ops) └─────────────────────────────────┘ ┌─────────────────────────────────┐ Docs client ──────▶│ /docs │ │ docs_search, docs_read, │ isolated service │ docs_list │ (own namespace) └─────────────────────────────────┘ ┌─────────────────────────────────┐ Playground user ──────▶│ /playground/todos │ │ todos_list, todos_add, │ isolated service │ todos_toggle, todos_delete │ (own namespace) └─────────────────────────────────┘ ``` ## When to use which [Section titled “When to use which”](#when-to-use-which) | You want | Reach for | | -------------------------------------------------------------------------------------------- | ---------------------------------- | | Tools that should appear alongside `ggui_render` / `ggui_consume` in a generative-UI session | `McpServerMount` | | Tools the LLM-driven agent should call as part of the same MCP connection that drives ggui | `McpServerMount` | | A complete, standalone MCP server at its own URL (`mcp.example.com/docs`, `…/billing`, …) | `McpService` | | First-touch public surface for unauthenticated clients (docs lookup, marketing demos) | `McpService` (+ `anonymous: true`) | | Conceptually distinct tool surfaces that should NOT collide on names with each other | `McpService` (one per surface) | Rubric: * **Composes with ggui’s core tools?** Mount. * **Replaces ggui’s core tools at its own URL?** Service. If you find yourself disabling ggui-native tools on a mount because they don’t make sense for the caller, you wanted a service. If you find yourself proxying ggui’s `tools/list` output through a service to bolt on extra tools, you wanted a mount. 
## The `McpService` shape

```ts
import type { SharedHandler } from "@ggui-ai/mcp-server-handlers";
import type { ZodRawShape } from "zod";

interface McpService {
  /**
   * Human-readable service identifier. Surfaced in validation
   * errors and telemetry. No uniqueness constraint across services
   * — only `path` must be unique.
   */
  readonly name: string;

  /**
   * HTTP path the service mounts at (e.g. `/docs`,
   * `/playground/todos`). Validated at compose time.
   */
  readonly path: string;

  /**
   * Tool handler bundle. Same `SharedHandler` shape ggui-native
   * handlers and mount handlers use.
   */
  readonly handlers: ReadonlyArray<SharedHandler<…>>;

  /**
   * Auth-optional: a valid bearer still resolves to the real
   * identity; missing/invalid credentials fall back to a
   * synthesized anonymous builder. Default `false` — same auth
   * posture as `/mcp`.
   */
  readonly anonymous?: boolean;
}
```

Field-by-field:

* **`name`** — diagnostic-only. Surfaces in `validateMcpServices` error messages and composition telemetry so an operator with several services can tell which one is misconfigured. Does NOT appear on the wire; tool names stay whatever each handler declares.
* **`path`** — the HTTP path Express mounts the service handler at. Branded `ServicePath` after passing `validateServicePath`. Must be unique across the `mcpServices` array.
* **`handlers`** — `SharedHandler[]`, the same canonical shape every ggui-native handler satisfies. The server registers each handler through `buildMcpServer`’s regular path; validation, logging, and output-schema parsing all happen uniformly.
* **`anonymous`** — makes auth optional. Default `false`. See [Anonymous mode](#anonymous-mode) below.

## Path reservations

Ten paths are reserved at validation time.
Declaring a service at any of them throws at server-construction: | Path | Why reserved | | -------------- | --------------------------------------------------------------------------------------------------------------------------------------------- | | `/` | Root — must not be swallowed by a service router. | | `/mcp` | The agent-facing audience-filtered MCP route. | | `/protocol` | The protocol-discovery audience-filtered route. | | `/ops` | The operator-facing audience-filtered route. | | `/ws` | The live-channel WebSocket upgrade path. | | `/health` | Reserved for health probing — the live endpoint is `/ggui/health`; `/health` is held back so a service can never shadow a future short alias. | | `/.well-known` | RFC-reserved discovery prefix (OAuth metadata, security.txt, …). | | `/oauth` | OAuth dance endpoints (authorize / token / register). | | `/_ggui` | Internal ggui control surfaces (admin console, pairing, debug routes). | | `/ggui` | Same — public-facing `/ggui/*` endpoints. | A typo in a host config (`path: '/mpc'` instead of `/mcp`) can no longer silently shadow OAuth discovery, health, or per-app traffic. Caution Reservations apply to **exact equality**, not to prefixes you build on top. A service at `/.well-known/example` is rejected because of the `/.well-known` reservation, but a service at `/wellknown-example` is fine. Pick a distinct first segment. ## Path validation rules [Section titled “Path validation rules”](#path-validation-rules) `validateServicePath` enforces: 1. **Regex** `^/[a-zA-Z0-9_/-]+$` — must start with `/` and contain only letters, digits, `-`, `_`, and `/`. No whitespace, no `.`, no path traversal. 2. **No trailing slash** — `/docs/` is rejected; use `/docs`. Prevents trailing-slash variant collisions where `/docs` and `/docs/` would resolve to different routes. 3. **Non-empty after the leading slash** — at least one character after `/`. 4. **Not a reserved path** — see the table above. 
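
The four rules above can be sketched as a small checker. This is a hypothetical re-implementation for illustration, not the source of `validateServicePath`: the names `checkServicePath` and `validateServicePathSketch`, the error wording, and the rule order are assumptions. The reserved list is copied from the table, and the sketch reads the reservation rule as exact equality, as the caution states; it does not settle how the real validator treats paths nested under a reserved prefix.

```typescript
// Reserved paths, copied from the table above.
const RESERVED_PATHS: ReadonlySet<string> = new Set([
  "/", "/mcp", "/protocol", "/ops", "/ws",
  "/health", "/.well-known", "/oauth", "/_ggui", "/ggui",
]);

// Rules 1 and 3: leading slash, allowed characters only, and at least one
// character after the slash (the `+` quantifier).
const SERVICE_PATH_PATTERN = new RegExp("^/[a-zA-Z0-9_/-]+$");

// Returns a human-readable problem, or null when the path is acceptable.
function checkServicePath(path: string): string | null {
  if (!SERVICE_PATH_PATTERN.test(path)) {
    return "must start with / and contain only letters, digits, -, _ and /";
  }
  // Rule 2: `/docs/` passes the pattern, so the trailing slash needs its own check.
  if (path.endsWith("/")) {
    return "must not end with a trailing slash";
  }
  // Rule 4: exact-match reservation (assumption: no prefix matching).
  if (RESERVED_PATHS.has(path)) {
    return "is a reserved path";
  }
  return null;
}

// Throwing wrapper, mirroring the "fail at server construction" behavior.
function validateServicePathSketch(path: string): string {
  const problem = checkServicePath(path);
  if (problem !== null) {
    throw new Error(`invalid service path "${path}": ${problem}`);
  }
  return path;
}
```

In this sketch `/.well-known/example` is rejected by the character rule (it contains `.`), while `/wellknown-example` and `/playground/todos` pass.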
Throwing happens at server construction, before any request is served — misconfiguration cannot become a runtime mystery. ## Compose-time invariants [Section titled “Compose-time invariants”](#compose-time-invariants) `validateMcpServices(services)` walks the whole array and enforces: * **Non-empty `name`** on every entry. The name appears in every other error message; empty names defeat the diagnostic. * **`path` passes `validateServicePath`** — all the rules above. * **Service paths are unique** across the `mcpServices` array. Two services cannot mount at the same path. * **Every handler declares a non-empty `outputSchema`.** An empty `ZodRawShape` (`{}`) silently strips `structuredContent` at the MCP SDK boundary — the handler can return `{ items: [...] }` and the wire answer is `{}`. Operators hitting this see success-looking responses with missing data and no diagnostic. Rejected at compose time so the failure arrives with the service + tool names attached. * **No `audience` tag on service handlers.** Services bypass audience filtering entirely — the path IS the audience. An explicit `audience: ['ops']` on a service handler is silently meaningless. Rejected loudly. * **Tool names are unique within a service.** Two handlers declaring the same `name` inside one service collide; one would shadow the other at registration. Rejected. * **Cross-service tool-name collisions ARE allowed.** Services are isolated namespaces. A client connects to one path and only ever sees that path’s tools, so `docs_search` on `/docs` and `docs_search` on `/internal/docs` coexist without ambiguity. This is by design — collapsing into a global tool namespace would defeat the isolation that makes services worth having. ## Anonymous mode [Section titled “Anonymous mode”](#anonymous-mode) `anonymous: true` makes auth **optional**, not skipped. 
The server always attempts to resolve a presented bearer — a valid credential resolves to the real identity (letting one service mix public reads with authenticated capabilities); only a missing or invalid credential makes the binding layer fall back to the synthesized result:

```ts
{
  identity: { kind: 'builder' },
  source: 'anonymous',
}
```

Two things to note about the shape:

* **`identity.kind` does NOT widen.** The `Identity` union still has its three variants (`'builder'` / `'user'` / `'app'`). Anonymous traffic collapses to `'builder'` so handlers that pattern-match on `kind` keep working without an extra arm. Adding an `'anonymous'` kind would force every consumer in the codebase to add a defensive case — the synthesized `'builder'` is the lighter touch.
* **The signal lives on `source`.** `AuthResult.source` carries a dedicated `'anonymous'` variant. Handlers that need to distinguish “this request was authenticated” from “this request was let through anonymously” read `source` directly — pattern-matching on `identity.kind` alone cannot answer the question.

### When to use it

* **Read-only public surfaces.** Documentation lookup, public catalog browse, marketing demos.
* **First-touch onboarding flows** where requiring a bearer token would block the use case (a brand-new visitor to `mcp.example.com/docs` has nothing to present).
* **Server-to-server probes** that verify protocol shape without claiming an identity.

### When NOT to use it

* **Any write path.** An anonymous caller has no accountability surface; you cannot audit who created the row.
* **Anything tenant-scoped.** `ctx.appId` for an anonymous request collapses to the single builder. Reads will return data the caller has no business seeing, or writes will land in a tenant they don’t own.
* **Anything sensitive enough that you’d want rate limits per-caller.** Anonymous mode pairs with per-IP / per-session rate limits (the `RateLimiter` seam composes the same way for authenticated and anonymous paths), but it cannot give you per-user attribution. ## Wiring it up [Section titled “Wiring it up”](#wiring-it-up) Pass the service array to `createGguiServer`: ```ts import { createGguiServer } from "@ggui-ai/mcp-server"; // monorepo-internal first-party services (not published): import { createDocsHandlers, loadDocsCorpus } from "@ggui-private/mcp-docs"; import { createPlaygroundTodosHandlers, createInMemoryTodoStore, } from "@ggui-private/mcp-playground-todos"; const corpus = await loadDocsCorpus("./docs"); const todoStore = createInMemoryTodoStore(); const server = await createGguiServer({ // ... ggui-native options (renderChannel, blueprintStore, …) mcpServices: [ { name: "docs", path: "/docs", handlers: createDocsHandlers({ corpus }), anonymous: true, // read-only doc lookup; no token required }, { name: "playground-todos", path: "/playground/todos", handlers: createPlaygroundTodosHandlers({ store: todoStore }), // no `anonymous: true` — handlers throw when ctx.userId is missing }, ], }); await server.listen(6781); ``` After `listen()`: * `POST /mcp` — ggui-native + every mount’s tools (audience-filtered). * `POST /docs` — `docs_search`, `docs_read`, `docs_list`. No auth required. * `POST /playground/todos` — `todos_list`, `todos_add`, `todos_toggle`, `todos_delete`. Auth-gated (handlers reject anonymous callers). The audience-filtered routes and the service routes coexist on the same Node process, the same Express app, the same WebSocket binding. There is no second server to run. ## Examples in the wild [Section titled “Examples in the wild”](#examples-in-the-wild) Three first-party services are built on this primitive for the hosted deployment (coming soon): * **`/docs`** — `@ggui-private/mcp-docs`. 
Three read-only tools (`docs_search`, `docs_read`, `docs_list`) over the ggui documentation corpus. Anonymous-mode; the canonical “public surface” example. See [MCP Docs Service](/api/mcp-docs/). * **`/playground/todos`** — `@ggui-private/mcp-playground-todos`. Four tools (`todos_list`, `todos_add`, `todos_toggle`, `todos_delete`) for the landing-page playground. Auth-gated per-user state (handlers reject anonymous callers). See [Playground · todos](/clients/playground-todos/). * **`/playground/mdh`** — `@ggui-private/mcp-playground-mdh`. Million-Dollar Homepage playground service. See [Playground · MDH](/clients/playground-mdh/). The hosted deployment’s unified `/dev` endpoint (coming soon) is the fullest example: one anonymous service co-hosting public docs + protocol tools with ops tools that re-impose auth per-tool — possible precisely because `anonymous` is auth-optional, so a presented connector key still resolves to the real identity. For the operator-facing audience-filtered route alongside these services, see [Ops MCP](/api/ops-mcp/). For running an analogous service stack under your own hostname, see [`ggui serve`](/cli/serve/). 
## Where to next [Section titled “Where to next”](#where-to-next) * [Audience routes](/architecture/audience-routes/) — the audience model behind `/mcp` / `/protocol` / `/ops` * [Architecture overview](/architecture/overview/) — the three-channel topology services live inside * [Event System](/architecture/event-system/) — live-channel traffic is shared by every service on the same host * [MCP Docs Service](/api/mcp-docs/) — the canonical anonymous-mode example * [Ops MCP](/api/ops-mcp/) — the operator-facing audience-filtered route * [Playground · todos](/clients/playground-todos/) and [Playground · MDH](/clients/playground-mdh/) — authenticated-service examples * [`ggui serve`](/cli/serve/) — self-host an analogous service stack # Architecture > How ggui is wired — three channels, a symmetric capability model, a generation pipeline, and a four-tier artifact registry. Protocol-level; implementation-agnostic. This page is the **protocol’s architecture** — what every ggui implementation must do, independent of how it’s deployed. For deployment shapes, see [Self-Hosted](/self-hosted/pairing/) and [Reference deploys](/self-hosted/reference-deploys/). ## Three actors [Section titled “Three actors”](#three-actors) ```plaintext Agent (LLM-driven) │ │ MCP ▼ Server ◀───── WebSocket (live) ──────▶ Renderer (iframe / standalone page) │ │ │ bootstrap (bundle fetch) │ └────────────────────────────────────────────┘ ``` * **Agent** — your code with an LLM in the loop. Speaks to the server over MCP. * **Server** — speaks MCP outward, orchestrates generation, routes events. Hosts the artifact registry the renderer pulls from. Runs at your URL via `ggui serve` (hosted `mcp.ggui.ai` coming soon). * **Renderer** — an iframe (or standalone page) hosting the generated component. Sends user actions back to the server through the host’s `tools/call` relay. ## Three channels [Section titled “Three channels”](#three-channels) ggui’s wire is split across three orthogonal channels. 
Each has one job.

| Channel | Direction | Purpose |
| --- | --- | --- |
| **Bootstrap** | Server → Renderer | One-shot fetch of the compiled component bundle when the iframe first loads. Any gadgets bound to the session load behind an SRI gate. |
| **MCP** | Agent ↔ Server | Control plane. `ggui_handshake`, `ggui_render`, `ggui_update`, `ggui_consume`, `ggui_emit`, `ggui_get_session`. |
| **Live** | Renderer ↔ Server | WebSocket at `ws://127.0.0.1:6781/ws` (self-hosted default; hosted `wss://mcp.ggui.ai/ws` coming soon). Server deliveries outbound (`StreamEnvelope`, props updates, drain acks); contract violations on an inbound action are answered with a typed `error` frame (`CONTRACT_VIOLATION`, code -32020) on this channel — nothing lands on the consume buffer. |

The rendered view’s user actions reach the server through the host’s MCP-Apps `tools/call` relay to `ggui_runtime_submit_action` — the spec-canonical dispatch path; the server appends the gesture to the pipe `ggui_consume` drains. The live WebSocket’s job is server → renderer delivery (stream emits, props updates, drain acks).

The channels are independent. The renderer can drop and reconnect the live channel without disturbing an agent’s MCP turn; the agent can `ggui_render` repeatedly without ever touching the live channel.

→ See [Protocol overview](/protocol/overview/) for the formal three-channel spec.

## Capability model

ggui has two **symmetric** capability surfaces — one for what the agent can do, one for what the renderer can render.
Both are operator-bounded and declared per-app. | | Renderer side | Agent side | | --------------- | --------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------- | | **Unit** | **gadget** | **tool** | | **Catalog** | `clientCapabilities.gadgets` | `agentCapabilities.tools` | | **Role** | Wraps a 3rd-party browser library (Leaflet, Mapbox, Chart.js) into an LLM-callable hook | Gives the agent a function to invoke (e.g. `searchContacts`, `createInvoice`) | | **Authored as** | `ggui.gadget.json` manifest | MCP tool code | | **Bound at** | iframe boot (SRI-verified) | session start | Plus a third primitive — **blueprints** — which aren’t capabilities but **cached recipes** (pre-composed UIs). A blueprint hit short-circuits fresh generation. → See [Gadgets SDK](/sdk/gadgets/), [MCP Protocol](/api/mcp-protocol/), [Marketplace](/sdk/marketplace/). ## Generation pipeline [Section titled “Generation pipeline”](#generation-pipeline) Rendering is a two-call flow — `ggui_handshake` negotiates, `ggui_render` commits: ```plaintext 0. NEGOTIATE ggui_handshake({intent, blueprintDraft}) searches the blueprint cache by contract shape (exact contractHash hit short-circuits; semantic similarity otherwise) and returns a suggestion 1. COMMIT ggui_render({handshakeId, props}) consumes the handshake; a cache hit reuses the stored blueprint (~100ms) 2. GENERATE Otherwise, run the server's UI generator (@ggui-ai/ui-gen): workflow → impl → check → derive 3. COMPILE TSX → JS via esbuild (~20-50ms) 4. DELIVER Renderer fetches the compiled bundle on the bootstrap channel ``` Published artifacts (gadgets, blueprints) on the marketplace registry pick up an author signature at *publish* time — Ed25519 (publisher keypair) for private artifacts, sigstore keyless (OIDC) for public ones — see [Marketplace](/sdk/marketplace/). Fresh generations are session-scoped and skip that step. 
The generator (step 2) is a bounded harness: pick a workflow (`single_pass`, `staged`, `staged_concurrent`), run the LLM-driven impl phase, run a check leg (typecheck, render-smoke, per-axis assertions), and on failure derive a revised harness and retry up to `maxIterations`. Output: a TypeScript-typed contract plus a compiled component module.

→ See [UI Generator](/architecture/ui-generator/) for the harness internals.

## Artifact registry

Gadgets and blueprints resolve through a four-tier waterfall on every push:

```plaintext
1. App-local        ggui.json#app.gadgets, ggui.json#blueprints.include
                    (plus installed artifacts under .ggui/installed-blueprints/)
2. Per-org private  operator's private registry (artifacts with visibility:"private")
3. Public           registry.ggui.ai (marketplace)
4. Fall back        fresh generation
```

A blueprint is keyed by a stable `blueprintId`; its `contractHash` — the RFC 8785 canonical-JSON hash of its `DataContract`, scoped per `(appId, contractHash)` — groups variants and is the cache lookup key. An exact `contractHash` hit short-circuits to score 1.0; otherwise a multi-axis semantic search (contract embedding, structural fingerprint, variance tags, intent) ranks candidates. Install the same `(scope, name, version)` on two different servers and the matcher returns the byte-identical UI on both.

→ See [Marketplace](/sdk/marketplace/), [Self-Hosted Registry](/sdk/self-hosted-registry/).
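The exact-match path described above can be sketched as follows. This is a minimal illustration, not the registry's implementation: the canonicalizer is a simplified stand-in for RFC 8785 (sorted keys, no insignificant whitespace), SHA-256 is an assumed hash algorithm, and `cacheKey`, the in-memory `Map`, and the sample contracts are hypothetical names introduced for the example.

```typescript
import { createHash } from "node:crypto";

type Json = null | boolean | number | string | Json[] | { [key: string]: Json };

// Simplified canonical JSON: object keys sorted by code unit, no extra whitespace.
// A full RFC 8785 implementation has more rules; this covers plain JSON values.
function canonicalize(value: Json): string {
  if (value === null || typeof value !== "object") return JSON.stringify(value);
  if (Array.isArray(value)) return `[${value.map(canonicalize).join(",")}]`;
  const entries = Object.entries(value).sort(([x], [y]) => (x < y ? -1 : x > y ? 1 : 0));
  return `{${entries.map(([k, v]) => `${JSON.stringify(k)}:${canonicalize(v)}`).join(",")}}`;
}

// Assumption: SHA-256, hex-encoded. The docs only say "canonical-JSON hash".
function contractHash(contract: Json): string {
  return createHash("sha256").update(canonicalize(contract), "utf8").digest("hex");
}

// Hypothetical cache key scoped per (appId, contractHash).
function cacheKey(appId: string, contract: Json): string {
  return `${appId}:${contractHash(contract)}`;
}

// Two contracts that differ only in key order hash to the same value.
const contractA: Json = { props: { title: "string" }, actions: ["submit"] };
const contractB: Json = { actions: ["submit"], props: { title: "string" } };

const blueprintCache = new Map<string, string>();
blueprintCache.set(cacheKey("demo-app", contractA), "blueprint-123");

// Looks up with the reordered contract and still finds the stored blueprintId.
console.log(blueprintCache.get(cacheKey("demo-app", contractB)));
```

Canonicalizing before hashing is what makes the lookup insensitive to key order, so two agents that draft the same contract shape land on the same cache entry.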
## Deployment shapes

The same protocol runs in two shapes:

| | Self-hosted | Hosted (coming soon) |
| --- | --- | --- |
| **MCP endpoint** | your URL via `ggui serve` | `mcp.ggui.ai` |
| **WS endpoint** | `ws://127.0.0.1:6781/ws` (or your URL) | `wss://mcp.ggui.ai/ws` |
| **Auth** | pluggable `AuthAdapter` + optional OAuth 2.1 (`ggui serve --oauth`) | OAuth |
| **Registry** | your registry (or none) | `registry.ggui.ai` |
| **Generation** | in-process | managed |
| **Pick if** | "I need data residency, custom auth, or air-gapped operation" | "I just want to ship an agent UI" |

Both speak the same wire. Switching between them is configuration, not code.

→ See [OSS Quick Start](/oss-quickstart/) to run it yourself. A managed hosted path at `mcp.ggui.ai` is coming soon.

## Where to next

* [Protocol overview](/protocol/overview/) — formal spec
* [How ggui works](/how-it-works/) — narrative walk-through of the four moments
* [Event System](/architecture/event-system/) — live-channel event flow in detail
* [UI Generator](/architecture/ui-generator/) — the generator harness

# UI Generator

> The bounded harness that turns a data contract into a compiled React component — workflow, check, derive, repeat.

The UI generator runs when a [blueprint match](/architecture/overview/#generation-pipeline) misses and ggui has to build a component from scratch. It takes a **data contract** (`PropsSpec` + `ActionSpec` + `StreamSpec` + `ContextSpec`) and returns a compiled, contract-typed React module — typically a handful of LLM turns; cache hits via blueprints are what deliver the ~100ms path.

## The harness

A **harness** is a per-contract execution plan derived from the contract’s risk and axes.
It bundles three things: * **Workflow** — the topology of LLM phases and tasks. Today the dispatcher always picks `single_pass`; `staged` and `staged_concurrent` are registered as reserved-future topologies for risk-tier routing that has not yet shipped. * **Prompt + boilerplate** — the system prompt and fragment set that tell the LLM *how* to author against `@ggui-ai/design`, the active [gadgets](/glossary/#gadget-renderer-side-capability), and the contract. * **Check leg** — the post-conditions the output must satisfy. Generation runs the workflow to produce source, runs the check leg, and — if checks fail — derives a revised harness and re-runs, bounded by `maxIterations`. ```plaintext workflow → source → check → pass? ─► return │ └─ fail → derive (swap fragments / upgrade workflow / adjust prompt) → loop ``` ## Workflows [Section titled “Workflows”](#workflows) | Workflow | Status | Shape | | ------------------- | ----------------------------------- | ------------------------------------------- | | `single_pass` | **Live** — only dispatched workflow | One impl turn. | | `staged` | Reserved (registered, not routed) | Plan → execute. | | `staged_concurrent` | Reserved (registered, not routed) | Plan → parallel skeleton tasks → integrate. | `pickWorkflow` is deliberately conservative — every classification today routes to `single_pass`. The staged topologies are wired up so a future risk-tier router can dispatch to them without a schema change, but changing the picker is a first-class experiment with its own bench gate. All three feed the same check + derive loop; the workflow only changes how source is produced, not how it’s validated. ## Plain-text impl loop (no tool-call ceremony) [Section titled “Plain-text impl loop (no tool-call ceremony)”](#plain-text-impl-loop-no-tool-call-ceremony) The impl phase is structured so the LLM doesn’t waste turns on deterministic ceremony: 1. 
The LLM receives **everything pre-injected** — primitives docs, design tokens, the data contract with examples, gadget capability cards. 2. The LLM writes the component as **plain text**. No tool calls required. 3. The system **auto-runs** `self_check` and `compile_component` on the emitted source. 4. Failures are fed back as a structured diff for the next turn to fix. This is what keeps healthy generations to 3–5 turns. Turns ≥ 6 is a triad-misalignment signal, not a turn-budget problem. ## Check leg [Section titled “Check leg”](#check-leg) Every workflow runs the same checks before returning: * **TypeScript** — the emitted source is compiled against the real `@ggui-ai/design` type definitions via the TypeScript compiler API on a virtual filesystem. Catches wrong prop types, missing required props, invalid imports, and `strictNullChecks` violations. * **Render smoke** — `ReactDOMServer.renderToString()` with contract-derived sample props. Catches `undefined.toLowerCase()`-class runtime errors before the component reaches the renderer. * **Per-axis assertions** — axis-specific checks derived from the contract (e.g., does an `interactive` axis include a focusable element? does a `submit` action have a matching form?). If any leg fails, the harness derives a revised configuration and loops. If the final iteration still fails, generation returns `ok: false` with `reason: "max-iterations"` and the last source, compiled output, and check result attached for diagnostics — the caller decides whether to surface a fallback, retry with a different harness, or report the failure. ## Multi-provider transport [Section titled “Multi-provider transport”](#multi-provider-transport) The harness is provider-agnostic. Only the LLM transport differs: * **Claude** (Anthropic) — raw API or Claude Agent SDK. * **OpenAI** — Responses API or OpenAI Agents SDK. * **Google** (Gemini) — GenAI API or Google ADK. Workflow, check, and derive are identical across providers. 
Switching providers does not change what gets generated, only the per-turn cost and latency profile. A deployment pins its model in `ggui.json#generation.model` (`provider:model`, e.g. `anthropic:claude-haiku-4-5-20251001`); the `GGUI_GENERATION_MODEL` env var overrides the manifest (precedence: `GGUI_GENERATION_MODEL` env > `ggui.json#generation.model` > per-provider default), and an agent may override per render via `ggui_render({infra: {model}})`. `generation.keySource: 'own' | 'managed'` declares whose provider key funds generation — self-hosted deployments always use their own key. ## Output shape [Section titled “Output shape”](#output-shape) A successful generation returns: * A **compiled component module** (TSX source compiled to JS). * A **typed data contract** — the same `PropsSpec` / `ActionSpec` / `StreamSpec` / `ContextSpec` that was the input, now frozen as the runtime wire shape. * A **blueprint candidate** — the contract (hashed via RFC 8785 to `contractHash`) plus its variance (`variantKey`), which the registry stores under a stable `blueprintId` for the next handshake’s cache match. The contract is enforced again at the MCP handler boundary so that what the agent renders matches what the component was generated to render. → See [Architecture overview](/architecture/overview/) for where the generator fits in the render pipeline, and the [Glossary](/glossary/) for `harness`, `blueprint`, `gadget`, `contract`. # ggui dev > Inner-loop dev hub — local blueprint registry, devtools console, optional agent supervision, optional managed tunnel. `ggui dev` is the developer-facing inner loop. It loads `ggui.ui.json` blueprint manifests, serves the local registry + dev hub at `127.0.0.1:6780`, and (with `--agent `) supervises a local agent runtime in the same shell. Distinct from [`ggui serve`](/cli/serve/), the production-shaped self-host counterpart. 
## Quick start

```bash
ggui dev
```

Binds `127.0.0.1:6780`, indexes any `ggui.ui.json` blueprints declared in `ggui.json#blueprints.include`, and auto-opens the dev hub in your browser.

To iterate against a local agent runtime in the same shell:

```bash
ggui dev --agent ./agent.ts
```

## What you get

Port 6780 hosts the local blueprint registry + dev hub. The dev server exposes:

| Path | What it serves |
| --- | --- |
| `/hub` | The dev dashboard (no auth — same-origin XHRs embed the bearer for everything else). |
| `/hub/preview?ui=` | Iframe-mountable preview shell for one indexed blueprint. |
| `/health` | Liveness probe (no auth). |
| `/uis`, `/uis/:id`, `/uis/:id/bundle` | Discovered `ggui.ui.json` blueprints (Bearer auth). |
| `/events` | Server-sent events for live reload (Bearer auth). |
| `/runtime/status`, `/runtime/events` | Mounted when `--agent ` is set (Bearer auth). |

All endpoints other than `/hub` and the `/health` probe require `Authorization: Bearer `. The token is taken from `GGUI_DEV_TOKEN` if set; otherwise a random one is generated and printed (with an `export` hint) on the boot banner.

`ggui dev` also sets `GGUI_MODE=dev` in the process env. The dev stack itself does not run an MCP server — that’s [`ggui serve`](/cli/serve/)’s job — but a supervised `--agent` that composes `@ggui-ai/mcp-server` inherits the env and mounts its own `/devtools/*` console namespace.
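As a rough sketch of the auth rules in the table above, the helper below decides whether a dev-server path needs the bearer token and builds the request accordingly. `buildDevRequest` and `NO_AUTH_PREFIXES` are hypothetical names for illustration; only the paths, the default bind address, and the `Authorization: Bearer` header come from this page.

```typescript
// Paths the table lists as unauthenticated: the hub (and its preview shell) and the liveness probe.
const NO_AUTH_PREFIXES = ["/hub", "/health"];

interface DevRequest {
  url: string;
  headers: Record<string, string>;
}

// Hypothetical helper: returns the URL and headers for a call to the ggui dev server.
function buildDevRequest(
  path: string,
  token: string | undefined,
  base = "http://127.0.0.1:6780",
): DevRequest {
  const needsAuth = !NO_AUTH_PREFIXES.some(
    (p) => path === p || path.startsWith(`${p}/`) || path.startsWith(`${p}?`),
  );
  if (needsAuth && !token) {
    throw new Error(`${path} requires a bearer token (set GGUI_DEV_TOKEN or copy it from the boot banner)`);
  }
  const headers: Record<string, string> = {};
  if (needsAuth) headers["Authorization"] = `Bearer ${token}`;
  return { url: new URL(path, base).toString(), headers };
}

// Listing indexed blueprints needs the token; the health probe does not.
const listUis = buildDevRequest("/uis", "example-token");
const health = buildDevRequest("/health", undefined);
console.log(listUis.url, Object.keys(listUis.headers));
console.log(health.url, Object.keys(health.headers));
```

In a real script the token would typically come from the `GGUI_DEV_TOKEN` environment variable, and the returned `url` and `headers` can be passed straight to `fetch`.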
## vs `ggui serve` [Section titled “vs ggui serve”](#vs-ggui-serve) | Concern | `ggui dev` | `ggui serve` | | ----------------- | ------------------------------------------------------ | -------------------------------------------------------------------------------- | | Audience | Developer iterating on UIs | Operator running a self-hosted instance | | Default port | `6780` | `6781` | | Default mode | `GGUI_MODE=dev` (mounts `/devtools/*`) | Production-shaped (no devtools) | | Agent supervision | Opt-in via `--agent ` | Default-on, sourced from `ggui.json#agent.entry` | | Tunnel | Opt-in via `--tunnel` (provider seam; none bundled) | Bring your own (`cloudflared` etc.) + set `--public-base-url` | | Auth posture | Loopback bind + single bearer token (`GGUI_DEV_TOKEN`) | Strict-auth pairing by default; opt-down via `--dev-allow-all` / `--public-demo` | Both can run side-by-side without colliding (different ports). ## Flags [Section titled “Flags”](#flags) ```text ggui dev [options] ``` | Flag | Default | Purpose | | ----------------- | ----------- | --------------------------------------------------------------------------------------------------------------------------------------- | | `--port ` | `6780` | Bind port. `0` = OS-assigned. | | `--host ` | `127.0.0.1` | Bind host. Loopback only by default. | | `--no-serve` | off | Load + discover and exit without binding. Useful for one-shot manifest validation / discovery dry-run. | | `--no-open` | off | Skip auto-opening the browser at the hub URL. Implied by non-TTY stdout, `BROWSER=none`, or `CI=1`. | | `--agent ` | none | Supervise a local agent runtime. See [Agent supervision](#agent-supervision) for extension routing. | | `--tunnel` | off | Opt into managed mode — open a managed tunnel above the local stack and print the remote URL. See [Managed mode](#managed-mode-tunnel). | ## Agent supervision [Section titled “Agent supervision”](#agent-supervision) `--agent ` points at the agent file you’re iterating on. 
Extension routing decides how it’s spawned: | Extension | Spawn | Notes | | --------------------- | --------------------------- | ------------------------------------------------------------ | | `.js`, `.mjs`, `.cjs` | `node ` | Plain Node | | `.ts`, `.tsx`, `.mts` | `node --import=tsx ` | `tsx` must be resolvable in your project (`pnpm add -D tsx`) | The dev-stack picks the agent’s port and forwards it via `PORT` env unless `--tunnel` is also set, in which case the CLI pre-allocates a free port and hands it down so the tunnel can forward inbound traffic to it. Bad `--agent` paths fail before the socket binds — the CLI validates the command mapping and exits 1 with a remediation hint. ## Managed mode tunnel [Section titled “Managed mode tunnel”](#managed-mode-tunnel) With `--tunnel`, once the local stack is listening `ggui dev` asks a `TunnelProvider` to open a managed tunnel above the host and prints the remote URL beside the local hub URL. The dev loop runs unchanged whether the tunnel resolves or not. Provider discovery reads `GGUI_TUNNEL_PROVIDER` — a module specifier exporting `createTunnelProvider()`. No provider is bundled; without the env var the banner prints `tunnel skipped: no tunnel provider configured` and local dev runs unchanged. Real providers (`cloudflared` bindings, `ngrok`) plug into this seam without changing the CLI surface. For a known-working public URL today, run `cloudflared tunnel --url http://localhost:6780` (or your provider of choice) in a sibling shell, then point claude.ai or your MCP client at the printed URL. For the production-shaped equivalent on `ggui serve`, see [`ggui serve` → Recommended setups](/cli/serve/#recommended-setups). 
## Common workflows [Section titled “Common workflows”](#common-workflows) **Iterate on a blueprint manifest:** ```bash ggui dev # edit packages//ggui.ui.json # refresh the hub — the registry re-indexes on every load ``` **Iterate on an agent + blueprints together:** ```bash ggui dev --agent ./agent.ts # edit agent.ts → the supervised runtime restarts on file change (when the runtime supports it) # edit ggui.ui.json → the registry re-indexes ``` **Run a second `ggui dev` alongside the first (6780 is taken):** ```bash ggui dev --port 0 # 0 = OS-assigned; the actual port prints in the boot banner. # Pass `--port 6790` (or any free integer) if you need a stable URL. ``` **Validate manifests without binding a socket:** ```bash ggui dev --no-serve # loads + discovers + exits non-zero on any malformed ggui.ui.json # good in CI for catching manifest regressions ``` ## See also [Section titled “See also”](#see-also) * [`ggui` CLI overview](/cli/) — the full command surface. * [`ggui serve`](/cli/serve/) — production-shaped self-host counterpart. * [OSS Quick Start](/oss-quickstart/) — the bootstrap walkthrough. * [Glossary](/glossary/) — `gadget` / `tool` / `blueprint` definitions. # ggui CLI > Open command-line tool for the ggui protocol — local dev, self-host, marketplace authoring, and (coming soon) ggui.ai cloud provisioning. `ggui` is the open CLI for the ggui protocol, shipped as [`@ggui-ai/cli`](https://www.npmjs.com/package/@ggui-ai/cli). It does three jobs: 1. **Local protocol dev + self-host** — `ggui dev` (iterate gadgets and blueprints against a local registry + dev hub), `ggui serve` (run a self-hosted personal-mode app), `ggui keys … --keys-file` (mint local bearers, no account), and `ggui export-pool` (share blueprints across deployments). Account-free; runs entirely on your machine. **Available now.** 2. **Marketplace authoring** — `ggui gadget` / `ggui blueprint` author and publish marketplace artifacts; `ggui theme validate` checks DTCG theme files. 3. 
**ggui.ai cloud provisioning** *(Preview — managed cloud, coming soon)* — `ggui login` / `keys` / `create` / `deploy` / `push` / `provider-key` provision apps and `ggui_user_*` connector keys against the managed ggui.ai cloud. ## Install [Section titled “Install”](#install) ```bash npm install -g @ggui-ai/cli # or pnpm add -g @ggui-ai/cli ``` ## Commands at a glance [Section titled “Commands at a glance”](#commands-at-a-glance) | Command | Purpose | | ------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `ggui dev` | Boot the local UI registry server, open the dev hub, optionally supervise an agent | | `ggui serve` | Run the open self-hosted personal-mode app (MCP server + supervised agent) | | `ggui keys` | List / create / revoke connector keys. With `--keys-file `, keys are minted into a local JSON store (no account) that `ggui serve --keys-file` reads — the self-host path. Without it, keys target the ggui.ai cloud *(Preview — coming soon)*. 
Also registers publisher Ed25519 public keys (see [`ggui keys register`](/cli/keys-register/)) | | `ggui theme` | Validate a DTCG theme file against the protocol’s theme schema (`theme validate `) | | `ggui export-pool` | Export this deployment’s reusable blueprints as a directory artifact, loadable elsewhere via `ggui serve --seed-pool ` | | `ggui gadget` | Author and consume marketplace gadgets (`create` / `publish` / `install` / `search`) | | `ggui blueprint` | Author UI blueprints for the marketplace (`create` / `publish` / `install` / `uninstall` / `search`) | | `ggui login` | *(Preview — coming soon)* Sign into `api.ggui.ai` via the OAuth Device Authorization Grant | | `ggui logout` | *(Preview — coming soon)* Discard the local `api.ggui.ai` session | | `ggui whoami` | *(Preview — coming soon)* Print the authenticated user | | `ggui create` | *(Preview — coming soon)* Provision a ggui.ai cloud app (`create app`) | | `ggui deploy` | *(Preview — coming soon)* Idempotent ggui.ai cloud provisioning; `--push-keys` also pushes provider keys | | `ggui push` | *(Preview — coming soon)* Compile and upload local-pool blueprints to a ggui.ai cloud app | | `ggui provider-key` | *(Preview — coming soon)* Push an LLM provider key to a ggui.ai cloud app (`provider-key set`) | | `ggui --version` | Print the installed `@ggui-ai/cli` version | Run `ggui --help` for the per-command flag list. `ggui --help` prints the full surface. ## Local dev & self-host [Section titled “Local dev & self-host”](#local-dev--self-host) `ggui dev` and `ggui serve` are the two ways to run the protocol locally: * **[`ggui dev`](/cli/dev/)** — inner-loop dev hub. Loads `ggui.ui.json` manifests, serves the local gadget + blueprint registry, and (with `--agent `) supervises a local agent runtime. Default `127.0.0.1:6780`. Use while iterating. * **[`ggui serve`](/cli/serve/)** — production-shaped self-host. 
Boots an MCP server with a supervised agent (`ggui.json#agent.entry`), ready to put behind your own auth on a public URL. Default `127.0.0.1:6781`; clients connect over WebSocket at `ws://127.0.0.1:6781/ws` (or `wss:///ws` once tunneled). See [Reference deploys](/self-hosted/reference-deploys/) for Docker / Fly / Render manifests and [OSS Quick Start](/oss-quickstart/) for the bootstrap. Run `ggui dev --help` or `ggui serve --help` for the full flag list. ## Bearer keys — local first [Section titled “Bearer keys — local first”](#bearer-keys--local-first) The available-now path is account-free: `ggui keys list / create / revoke --keys-file ` mints bearer keys into a local JSON store — the same file format `ggui serve --keys-file` reads — so a locally minted bearer authenticates on the next serve boot. No account involved. ### ggui.ai auth & keys (Preview — coming soon) [Section titled “ggui.ai auth & keys (Preview — coming soon)”](#gguiai-auth--keys-preview--coming-soon) The managed ggui.ai cloud is coming soon. Once live, agent runtimes pointed at the universal MCP at `mcp.ggui.ai` authenticate with a `ggui_user_*` connector key, and `ggui login` signs the CLI into `api.ggui.ai` so you can mint and revoke those keys from the terminal — see [`ggui login`](/cli/login/) for the device-flow walkthrough plus `whoami`, `keys list / create / revoke`, and `logout`. ## Pairing a client app [Section titled “Pairing a client app”](#pairing-a-client-app) `ggui` has no `pair` subcommand. To pair a client to a `ggui serve` instance, follow [Self-hosted pairing](/self-hosted/pairing/) — tunnel setup, QR handshake, and the AuthAdapter swap-in for production. 
## Configuration [Section titled “Configuration”](#configuration) Two environment variables control where the CLI talks and stores state: | Variable | Default | Purpose | | ----------------- | --------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------- | | `GGUI_API_URL` | `https://api.ggui.ai` | Override the auth + key-management endpoint. Useful for sandbox / dev testing. | | `GGUI_CONFIG_DIR` | `~/.ggui` | Override the `~/.ggui` root (`auth.json`, BYOK `credentials.json`, embedding-model cache, publisher keypairs). Useful for isolated dev shells. | ## Versioning [Section titled “Versioning”](#versioning) ```bash ggui --version ``` Versions track the `@ggui-ai/cli` npm package. Protocol semantics are pinned to the `@ggui-ai/protocol` major version it bundles — see [Protocol version policy](/protocol/version-policy/) for what changes between major bumps. # ggui keys register > Register a publisher's Ed25519 public key with the ggui.ai marketplace registry so signed gadget and blueprint publishes validate. Coming soon This page describes the **managed hosted path** (`mcp.ggui.ai` / `console.ggui.ai` / the `registry.ggui.ai` marketplace), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today — start with the [Quickstart](/oss-quickstart/). This page is kept as a preview of the managed path and goes live when hosted ggui ships. `ggui keys register` ships your **publisher** Ed25519 public key to the marketplace registry’s `POST /author-keys` endpoint. After this, every subsequent `ggui gadget publish` / `ggui blueprint publish` whose bundle is signed by the matching private key validates against the registry’s stored row. 
## When to use it [Section titled “When to use it”](#when-to-use-it) You run `ggui keys register` **after** `ggui gadget publish` (or `ggui blueprint publish`) has auto-generated a keypair on disk for a given scope. The publish flow does the keypair generation automatically on first run, then signs the artifact, then POSTs to `/publish`. The registry rejects a signed publish whose `publicKeyId` isn’t already registered for the caller’s identity — that’s the moment to run this command. The typical bootstrap sequence for a new publisher: ```bash # 1. First publish under @my-org auto-generates # ~/.ggui/keys/@my-org/{private,public}.key and signs the bundle. ggui gadget publish # → Error: unknown_key — public key not registered for this publisher. # 2. Register the public half with the registry. ggui keys register --scope @my-org # 3. Re-run the publish. The signature now verifies. ggui gadget publish ``` After step 2, every future publish under `@my-org` from this machine (and any other machine you copy `~/.ggui/keys/@my-org/private.key` to) validates without a second `register` call. ## Usage [Section titled “Usage”](#usage) ```text ggui keys register --scope <@scope> [--registry ] [--auth=bearer [--token ]] ``` | Flag | Required | Purpose | | --------------- | -------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `--scope` | yes | npm scope the key was generated under, prefixed with `@` (e.g. `--scope @my-org`). The CLI reads `~/.ggui/keys//public.key` — must already exist (see “When to use it” above). | | `--registry` | no | Override the registry URL. Same resolution chain as `ggui gadget publish`: `--registry` flag, then `GGUI_REGISTRY` env var, then `ggui.json#registry` walking up from CWD. 
No hard-coded default — pick a registry deliberately so a typo doesn’t accidentally register against prod. | | `--auth=bearer` | no | Send an explicit bearer token instead of the stored `ggui login` session — for self-hosted registries that authenticate with a static publish token. Pair with `--token ` or set `GGUI_REGISTRY_TOKEN`. Same flags the publish verbs take. | | `--token` | no | The bearer token for `--auth=bearer` (overrides `GGUI_REGISTRY_TOKEN`). | ## Auth [Section titled “Auth”](#auth) `ggui keys register` uses your stored **`ggui login` session** by default — the same credential `ggui gadget publish` sends to `/publish`, and the one the hosted registry’s `/author-keys` route authenticates (see [`Marketplace § Auth`](/sdk/marketplace/#auth)). The CLI reads `~/.ggui/auth.json`, refreshes the access token automatically when it has expired, and sends it as `Authorization: Bearer ` — the only identity surface; the request body carries only `publicKeyBase64`, never the caller’s user id. The server derives the publisher subject from the verified credential and the `keyId` from the raw public-key bytes. Self-hosted operators running their own [`@ggui-ai/registry-server`](/sdk/self-hosted-registry/) deployments pass `--auth=bearer --token ` (or set `GGUI_REGISTRY_TOKEN`) — the same escape hatch the publish flow takes (see [`ggui marketplace`](/sdk/marketplace/#auth) for the parallel). ## Output [Section titled “Output”](#output) Success — first-write (HTTP 201): ```bash $ ggui keys register --scope @my-org Registered publisher key for @my-org. registry: https://registry.ggui.ai subject: keyId: a1b2c3d4e5f60718 ``` Idempotent re-register (HTTP 200) — same public-key bytes for `(subject, keyId)` already on file: ```bash $ ggui keys register --scope @my-org Already registered publisher key for @my-org. registry: https://registry.ggui.ai subject: keyId: a1b2c3d4e5f60718 ``` Both exit `0`. Safe to run unconditionally in CI bootstrap scripts. 
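To make the request shape concrete, here is a hedged sketch of what the CLI is described as sending and of how a 16-hex-character `keyId` like the one in the output above could be derived. Several details are assumptions rather than documented behavior: the key is generated in memory instead of being read from `~/.ggui/keys/`, `publicKeyBase64` is assumed to encode the raw 32-byte key, and `deriveKeyId` assumes the `keyId` is the first 8 bytes of a SHA-256 digest over those bytes.

```typescript
import { createHash, generateKeyPairSync } from "node:crypto";

// Throwaway Ed25519 keypair for illustration; the real CLI reads an existing key from disk.
const { publicKey } = generateKeyPairSync("ed25519");

// An Ed25519 SPKI DER encoding is a 12-byte header followed by the raw 32-byte public key.
const spki = publicKey.export({ type: "spki", format: "der" });
const rawPublicKey = spki.subarray(spki.length - 32);

// Assumption: keyId = first 64 bits of SHA-256(raw public key), hex-encoded.
function deriveKeyId(raw: Uint8Array): string {
  return createHash("sha256").update(raw).digest("hex").slice(0, 16);
}

// The body carries only the public key; identity comes from the Authorization header.
const requestBody = JSON.stringify({ publicKeyBase64: rawPublicKey.toString("base64") });

console.log("POST /author-keys", requestBody);
console.log("keyId:", deriveKeyId(rawPublicKey));
```

Because the server derives both the subject and the `keyId` itself, re-sending the same bytes is naturally idempotent, which matches the HTTP 200 "already registered" case.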
## Errors and exit codes [Section titled “Errors and exit codes”](#errors-and-exit-codes) | Code | Exit | Meaning | | --------------------- | ---- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `no-registry` | 1 | No registry URL resolved. Pass `--registry`, set `GGUI_REGISTRY`, or add `registry` to `ggui.json`. | | `invalid-registry` | 1 | Resolved URL is malformed. | | `no-keypair` | 1 | No `~/.ggui/keys//public.key` on disk. Run `ggui gadget publish` or `ggui blueprint publish` under this scope first — first-publish generates the keypair as a side-effect. | | `auth_failed` | 1 | The stored login session expired (or its refresh was rejected). Run `ggui login` again. | | `auth_config_missing` | 1 | No stored login session (`~/.ggui/auth.json` missing or unreadable) — run `ggui login` first. Or `--auth=bearer` was passed without `--token` / `GGUI_REGISTRY_TOKEN`. | | `network-error` | 1 | `fetch` threw — DNS, TLS, or connection refused. | | `unauthorized` | 1 | Registry rejected the bearer credential (HTTP 401). Re-run `ggui login` (the session may have been revoked) — or, for self-hosted registries, check the `--auth=bearer` token. | | `invalid_request` | 1 | Registry refused the body (HTTP 400). The public-key file is corrupted or the wrong length — regenerate by deleting `~/.ggui/keys//` and re-running `ggui gadget publish`. | | `key_conflict` | 3 | Registry holds a different public key for the same `(subject, keyId)` tuple (HTTP 409). Vanishingly rare — a 64-bit `keyId` SHA-256 truncation collision OR a stale row from a previous owner. Exit `3` is distinct so scripts can detect it without parsing the message. | | `http-error` | 1 | Any other non-2xx response (typically 5xx). 
Check the message for the registry’s error string; retry transient failures. | | `bad-response` | 1 | Registry returned a 2xx with a malformed body, or any status with invalid JSON. Almost always a registry-side bug — re-run; if persistent, the registry is misconfigured. | The structured `error` field of the registry’s response body (closed enum: `unauthorized` / `invalid_request` / `key_conflict` / `server_error`) is preferred over status-code mapping when the response carries a well-formed body — `key_conflict` returned with a non-409 status still surfaces as `key_conflict` on the CLI side. ## Trust model [Section titled “Trust model”](#trust-model) The on-disk private key under `~/.ggui/keys//private.key` is mode `0o600` — treat it like a long-lived password. Copying it between machines lets you publish from CI without re-registering. Losing it means generating a new keypair under the same scope: when the new public key gets registered, both old + new keys are valid (publish flow pins the signing key onto each `ArtifactVersionRow` at publish time, so historical versions still verify under the previous key). To rotate out the old key entirely, register the new one + remove the old row server-side (operator-only). See [`Marketplace § Trust model`](/sdk/marketplace/#trust-model) for the full per-author-key + per-version pinning design and the install-time two-leg verification (SHA-384 + Ed25519). # ggui login > Sign the @ggui-ai/cli into ggui.ai via OAuth 2.0 Device Authorization Grant to manage hosted connector keys. Coming soon This page describes the **managed hosted path** (`api.ggui.ai` / `console.ggui.ai`), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today — start with the [Quickstart](/oss-quickstart/); for connector keys without an account, see [local keys](#no-account-local-keys) below. This page is kept as a preview of the managed path and goes live when hosted ggui ships. 
`ggui login` signs the open `@ggui-ai/cli` into [`api.ggui.ai`](https://api.ggui.ai) so you can manage `ggui_user_*` connector keys from the terminal — list, mint, and revoke — without leaving your shell. It uses the [OAuth 2.0 Device Authorization Grant](https://datatracker.ietf.org/doc/html/rfc8628): the CLI prints a URL and a short code, you approve in any browser (even on a different machine), and tokens land on disk once approval completes.

## Install

```bash
npm install -g @ggui-ai/cli
# or
pnpm add -g @ggui-ai/cli
```

## Sign in

```bash
$ ggui login
Endpoint: https://api.ggui.ai (default)

Open this URL in your browser to approve:
  https://console.ggui.ai/cli-confirm/ABCD-EFGH

Verification code: ABCD-EFGH
(Confirm this matches what the browser shows.)

Waiting for approval…
Signed in. Tokens saved to ~/.ggui/auth.json.
Try `ggui whoami` or `ggui keys list`.
```

Under the hood:

1. **Device code request** — the CLI POSTs `/v1/auth/device` and prints the user-readable code plus URL.
2. **Browser approval** — `console.ggui.ai/cli-confirm/` shows the same code, your account, and an `Approve` button. Confirm the codes match, then approve.
3. **CLI polls `/v1/auth/poll`** every few seconds until the server returns tokens.
4. **Tokens land on disk** — `~/.ggui/auth.json` (mode `0600`) holds the access bearer, the refresh bearer, and the API endpoint they were minted against.

The approval window is \~10 minutes. If it expires, just run `ggui login` again.

## Flags

```text
ggui login [--name ] [--no-open]
```

| Flag | Purpose |
| ----------- | ------- |
| `--name` | Device label shown in the console and `ggui whoami`. Defaults to `ggui CLI on `. Lets you distinguish `ggui CLI on laptop` from `ggui CLI on workstation`. |
| `--no-open` | Skip auto-opening the verification URL. Implied when stdout isn’t a TTY, when `BROWSER=none`, or when `CI=1` — so the flow stays sane in pipelines. |

## Configuration

Resolution order for the API endpoint:

1. `GGUI_API_URL` env var — operator override (sandbox / dev testing). Login prints `(env)`.
2. `~/.ggui/auth.json#endpoint` — captured at last login. Login prints `(auth.json)`.
3. `https://api.ggui.ai` — production default. Login prints `(default)`.

The suffix on the first `Endpoint:` line of `ggui login` tells you which tier won — useful when an unexpected `GGUI_API_URL` in your shell silently redirects the device flow at a sandbox.
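The three-tier endpoint resolution can be sketched as a small pure function. This is an illustration under stated assumptions, not the CLI's source: `resolveEndpoint`, `EndpointSource`, and the shape of the `auth` argument are hypothetical names, and treating an empty `GGUI_API_URL` as unset is a guess. Only the tier order and the printed suffixes come from the list above.

```typescript
// Hypothetical sketch of the endpoint resolution order described above.
// Names and shapes are assumptions, not the CLI's actual implementation.
type EndpointSource = "env" | "auth.json" | "default";

interface ResolvedEndpoint {
  url: string;
  source: EndpointSource; // matches the suffix `ggui login` prints, e.g. "(env)"
}

const DEFAULT_ENDPOINT = "https://api.ggui.ai";

function resolveEndpoint(
  env: Record<string, string | undefined>,
  auth: { endpoint?: string } | null,
): ResolvedEndpoint {
  // Tier 1: operator override via GGUI_API_URL (empty string treated as unset).
  const fromEnv = env["GGUI_API_URL"];
  if (fromEnv) return { url: fromEnv, source: "env" };

  // Tier 2: endpoint captured at last login (~/.ggui/auth.json#endpoint).
  if (auth?.endpoint) return { url: auth.endpoint, source: "auth.json" };

  // Tier 3: production default.
  return { url: DEFAULT_ENDPOINT, source: "default" };
}
```

A caller would pass `process.env` and the parsed contents of `auth.json` (or `null` when the file is absent), then print `source` in parentheses after the URL.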
For sandbox / non-prod testing: ```bash GGUI_API_URL=https://abc123.execute-api.us-east-1.amazonaws.com ggui login ``` For an isolated dev shell that shouldn’t touch your real session, set `GGUI_CONFIG_DIR`: ```bash GGUI_CONFIG_DIR=/tmp/ggui-test ggui login ``` ## Verify [Section titled “Verify”](#verify) ```bash $ ggui whoami User ID:  Session ID:  Client: ggui CLI on laptop Access expires: 2026-04-26T18:09:14.000Z Endpoint: https://api.ggui.ai ``` ## Manage keys [Section titled “Manage keys”](#manage-keys) Once signed in, you can mint connector keys from the terminal: ```bash $ ggui keys create --name "my agent runtime" API key: ggui_user_AbCdEf... ID: a1b2c3d4-... Prefix: AbCdEfGh Created: 2026-04-26T17:32:00.000Z IMPORTANT: copy the API key now — it will NEVER be shown again. Use it as the bearer token for the ggui-protocol-user MCP server. ``` The full secret is printed exactly once, GitHub-PAT-style. Lose it and you mint a new one. Pass `--expires-at ` to mint a key with a hard expiry. ```bash $ ggui keys list ID PREFIX NAME STATUS LAST USED a1b2c3d4-... ggui_user_AbCdEfGh… my agent runtime active — $ ggui keys revoke a1b2c3d4-... Revoked key a1b2c3d4-.... ``` `revoke` is a soft-revoke — the row stays in the audit table, but every subsequent request from the key returns `401 Unauthorized`. There’s no undo. ### No account? Local keys [Section titled “No account? Local keys”](#no-account-local-keys) `keys list / create / revoke` also work with **no account at all**: pass `--keys-file ` to flip them to a local JSON store. The file format is the same one `ggui serve --keys-file` reads, so a locally minted bearer authenticates against your own server on the next boot — no `ggui login` needed: ```bash ggui keys create --keys-file ./keys.json --name laptop ggui serve --keys-file ./keys.json ``` This is the available-today path for GGUI Preview self-hosting. 
See [`ggui keys register`](/cli/keys-register/) for publishing your **author** Ed25519 key to the marketplace (a different key, with a different purpose). ## Sign out [Section titled “Sign out”](#sign-out) ```bash $ ggui logout Signed out. ~/.ggui/auth.json removed. ``` `ggui logout` deletes the local session. Server-side tokens stay valid until their TTL expires (\~1h access, \~30d refresh). For sensitive scenarios, follow up with `ggui keys revoke ` for any keys minted during that session. ## Troubleshooting [Section titled “Troubleshooting”](#troubleshooting) **“Login window expired before approval”** — you took longer than \~10 minutes. Run `ggui login` again. **“Session expired. Run `ggui login` again.”** — both access and refresh tokens are dead (you’ve been away >30 days, or the server invalidated them). Re-authenticate. **“failed to start device flow”** — check `GGUI_API_URL`, your network, and the [troubleshooting page](/troubleshooting/). **Browser didn’t open** — fine. Copy the printed URL into any browser; the verification code in the URL matches the one printed beneath it.

# ggui serve

> Run the self-hosted ggui runtime — MCP server plus a supervised agent, configured by ggui.json.

`ggui serve` boots the self-hosted ggui runtime: an MCP server (`@ggui-ai/mcp-server`) plus, by default, a supervised agent process declared in `ggui.json`. It’s distinct from [`ggui dev`](/cli/dev/), the inner-loop development hub — `ggui serve` is the production-shaped self-host you’d put behind a tunnel or on a VM. ## Quick start [Section titled “Quick start”](#quick-start) ```bash ggui serve ``` Binds `127.0.0.1:6781`, mounts the first-run bundle, and starts the agent declared in `ggui.json#agent.entry` alongside MCP. The CLI auto-opens the landing page once the banner prints (skip with `--no-open`). **Point an agent at it** — the MCP endpoint is `http://127.0.0.1:6781/mcp`: ```bash GGUI_MCP_URL=http://127.0.0.1:6781/mcp GGUI_MCP_BEARER=dev # i.e. `Authorization: Bearer dev` — works with `ggui serve --dev-allow-all` ``` With the default strict auth, swap `dev` for a pair-minted bearer (or one minted via `ggui keys create --keys-file `). For the bootstrap walkthrough, see [OSS Quick Start](/oss-quickstart/). ## The first-run bundle [Section titled “The first-run bundle”](#the-first-run-bundle) A default `ggui serve` mounts these same-origin surfaces: | Path | What it serves | | --------------------------- | --------------------------------------------------------------------------------------------------- | | `/` | Landing page — server identity, pair-code card, links into the console | | `/mcp` | MCP HTTP endpoint — agents call `ggui_render`, `ggui_consume`, etc. here. | | `/ws` | Live-channel WebSocket — live session plane for MCP Apps iframes and the console. | | `/ggui/health` | Liveness probe — the path the [reference-deploy](/self-hosted/reference-deploys/) healthchecks hit. | | `/r/` | Signed render-viewer URL — resolves a shortCode to its session. | | `/settings` | LLM provider-key page — paste a key; takes effect without restart. | | `/pair`, `/admin/pair/init` | Pairing endpoints for paired viewer clients. 
| Hosts that need a different shape (no landing page, no pairing, programmatic control) should compose `createGguiServer()` directly rather than invoke this CLI. `createGguiServer({ mcpServices: [...] })` can also mount additional standalone MCP services at their own paths — see [MCP services](/architecture/mcp-services/). ## Flags [Section titled “Flags”](#flags) ```text ggui serve [options] ``` ### Bind & lifecycle [Section titled “Bind & lifecycle”](#bind--lifecycle) | Flag | Default | Purpose | | --------------- | ----------- | -------------------------------------------------------------------------------------------------------------------------------- | | `--port ` | `6781` | Bind port. `0` = OS-assigned (the actual port prints in the boot banner). | | `--host ` | `127.0.0.1` | Bind host. Loopback only by default. | | `--mcp-only` | off | Run just the MCP server; skip agent supervision even if `ggui.json` has `agent.entry`. Also implies `--no-open`. | | `--no-open` | off | Skip auto-opening the operator’s browser. Auto-open is also skipped whenever stdout is not a TTY (CI, supervised, piped output). | ### Auth posture (mutually exclusive) [Section titled “Auth posture (mutually exclusive)”](#auth-posture-mutually-exclusive) By default, `/mcp` rejects any bearer that wasn’t minted by the pairing flow. 
These flags relax that for specific scenarios: | Flag | Posture | | ----------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | `--dev-allow-all` | Accept any bearer — or none at all (the no-bearer probe MCP custom connectors send) — as `builder`. Local-dev / tunnel smoke ONLY. **Never expose to the open internet** — the banner prints an unmissable warning when this is set. | | `--public-demo` | Same any-bearer auth as `--dev-allow-all`, plus a per-IP `FixedWindowRateLimiter` on `ggui_render` (default: 30 `ggui_render` calls per 10 min per IP) and a “PUBLIC DEMO — operator pays” banner. Use case: a single shared LLM key for an audience demo (Show HN, blog, classroom). Mutually exclusive with `--dev-allow-all`. | | `--multi-tenant` | Strict-auth multi-tenant posture. The console `/settings` LLM-keys gate switches from admin-token to auth-adapter so each authenticated end-user manages their OWN provider keys (scope = `userId` for `kind:'user'`, `appId` for `kind:'app'`). `kind:'builder'` identities are rejected. Mutually exclusive with `--dev-allow-all` and `--public-demo`. Note: the admin-token `/keys` pairing plane is separate from the `/settings` LLM-keys plane that `--multi-tenant` rebinds. 
| ### Custom connector hosts [Section titled “Custom connector hosts”](#custom-connector-hosts) | Flag | Purpose | | ----------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `--oauth` | Mount OAuth 2.1 + PKCE + Dynamic Client Registration routes (`.well-known/oauth-*` + `/oauth/{authorize,token,register}`). Required for hosts whose Add Connector form has no field for a pre-shared bearer (claude.ai, ChatGPT). Pure-bearer clients (Claude Desktop with bearer in config) work without it. | | `--public-base-url ` | Override the public base URL used to compose the iframe-runtime + live-channel URLs written into each render’s `ai.ggui/render` slice. Set to a tunnel URL (`https://.trycloudflare.com`) when testing against a remote MCP host so those URLs resolve from the host’s perspective. Without this, they derive from `--host:--port` and only work from the same machine. | ### Operator config [Section titled “Operator config”](#operator-config) | Flag | Purpose | | ------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `--admin-token ` | Pin the admin bearer that gates the console `/keys` plane. Without this, the server mints a fresh `ggui_admin_*` per boot and prints it on the banner. Pin a stable value when you want the bearer to survive restarts. | | `--keys-file ` | JSON file backing the pairing service. When set, paired bearers survive restart. 
Stored in plaintext at `0600` perms — assume operator-controlled disk (e.g. `~/.ggui/keys.json`). | | `--ephemeral` | Opt out of the default cross-restart persistence bundle (`.ggui/persistent/`). With this flag, HMAC secrets, renders, vectors, short-codes, and paired bearers all reset on every restart. Use for tests, CI loops, or incident-response nuclear-revoke. | | `--seed-pool ` | Repeatable. Load a read-only shared blueprint pool (a [`ggui export-pool`](/cli/) artifact) for exact-contract reuse, consulted after the server’s own blueprints. | | `--mcp-instructions ` | Server-level MCP instructions preset (the string injected into the LLM’s system prompt above the tool catalog). Presets: `default`, `aggressive`, `always`, `minimal`, `off`. Also accepts `GGUI_MCP_INSTRUCTIONS` env (CLI flag wins). | ## Agent runtime supervision [Section titled “Agent runtime supervision”](#agent-runtime-supervision) By default, `ggui serve` boots your agent alongside MCP. The entry file comes from `ggui.json`: ```json { "agent": { "entry": "./agent.ts" } } ``` Supported extensions: * `.js` / `.mjs` / `.cjs` — runs as `node ` * `.ts` / `.tsx` / `.mts` — runs as `node --import=tsx ` (`tsx` must be resolvable in your project) Failure modes: * **No `ggui.json`** or **no `agent.entry`** — falls back to MCP-only with a warning. Useful when you want to point an external agent runtime at the server. * **Malformed `ggui.json`** or **unsupported entry extension** — hard error, exits 1 before binding. * **Agent crashes after startup** — logged, MCP keeps running. No auto-restart — compose that with your own supervisor (systemd, pm2, Docker restart policy). ## Generation (bring your own key) [Section titled “Generation (bring your own key)”](#generation-bring-your-own-key) Component-code generation on a self-hosted server uses **your** LLM provider key (BYOK). At boot, `ggui serve` resolves a key in this order: 1. 
**Provider env vars** — `ANTHROPIC_API_KEY`, `OPENAI_API_KEY`, `GOOGLE_API_KEY` (falling back to `GEMINI_API_KEY`), `OPENROUTER_API_KEY`. The env layer always wins. 2. **`~/.ggui/credentials.json`** — keys pasted at `/settings` land here (plaintext, `0600`). The model comes from `ggui.json#generation.model`, in either `provider:model` (canonical) or `provider/model` (LiteLLM) form: ```json { "generation": { "model": "anthropic:claude-haiku-4-5-20251001" } } ``` `GGUI_GENERATION_MODEL` env overrides the manifest model — the precedence is **`GGUI_GENERATION_MODEL` env > `ggui.json#generation.model` > per-provider default**. The env value accepts both `provider:model` (canonical) and `provider/model` (LiteLLM) forms; a malformed value is a hard boot error, not a silent fallback. It’s the ops escape hatch for pointing one `ggui serve` instance at a different model without editing the manifest. When neither env nor manifest sets a model, the per-provider default applies (anthropic `claude-haiku-4-5`, openai `gpt-5.5-2026-04-23`, google `gemini-3.1-flash-lite`). Rules: * **A key without a model is a hard error** — if a boot key resolves but `generation.model` is unset, `ggui serve` exits with an actionable message showing both accepted model-string forms. * **Bedrock routes are rejected** on the OSS path (IAM-only; supported via the hosted runtime only). * **No key at all is not fatal** — the banner prints `⚠ no LLM key configured`, and renders fall back to a Connect-a-key card pointing at `/settings`. Pasting a key there takes effect without a restart. ## Persistent storage [Section titled “Persistent storage”](#persistent-storage) By default, `ggui serve` writes a cross-restart bundle under `.ggui/persistent/` (project-local when a `ggui.json` was resolved, else `~/.ggui/persistent/`) so HMAC secrets, renders, vectors, short-codes, and paired bearers survive a restart. claude.ai chat-history revisits keep working without re-pairing. 
Bundle layout:

```text
.ggui/persistent/
├── ws-token-secret.hex       (HMAC, 0600)
├── render-signer-secret.hex  (HMAC, 0600)
├── short-codes.sqlite        (signed render-URL resolution — backs the /r/ viewer)
├── sessions.sqlite           (GguiSessionStore — renders + event history)
├── vectors.sqlite            (RAG corpus)
└── keys.json                 (paired bearers)
```

Override the directory with `GGUI_PERSISTENT_DIR`. Pass [`--ephemeral`](#operator-config) to skip the bundle entirely.

Explicit `ggui.json#storage` declarations always win. Declare them to override paths or swap a single surface back to memory while keeping the rest persistent:

```json
{
  "storage": {
    "renders": { "driver": "sqlite", "path": "./ggui-sessions.sqlite" },
    "vectors": { "driver": "sqlite", "path": "./ggui-vectors.sqlite" },
    "threads": { "driver": "sqlite", "path": "./ggui-threads.sqlite" }
  }
}
```

| Store | Default | sqlite driver requires |
| --------- | ------- | ------------------------- |
| `renders` | sqlite under `.ggui/persistent/sessions.sqlite` (opt out with `--ephemeral` or `{ "driver": "memory" }`) | `better-sqlite3` peer dep |
| `vectors` | sqlite under `.ggui/persistent/vectors.sqlite` (opt out with `--ephemeral` or `{ "driver": "memory" }`) | `better-sqlite3` peer dep |
| `threads` | **routes unmounted unless declared** (opt-in) | `better-sqlite3` peer dep |

Paths in `ggui.json#storage` resolve relative to the `ggui.json` directory, regardless of where `ggui serve` was invoked from. For threads specifically, `driver: "memory"` mounts the routes but data resets on restart (`/ggui/health` reports `threads.durability: "ephemeral"`); `driver: "sqlite"` is durable.
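The path rule for `ggui.json#storage` (relative to the manifest directory, not the working directory) can be illustrated with Node's `path` module. This is a minimal sketch: `resolveStoragePath` is a hypothetical helper rather than part of ggui, and passing absolute paths through unchanged is an assumption about behavior this page does not spell out.

```typescript
import * as path from "node:path";

// Hypothetical helper: resolve a `ggui.json#storage` path against the
// directory that contains ggui.json, ignoring the current working directory.
// Assumption: an absolute storage path is returned unchanged.
function resolveStoragePath(manifestPath: string, storagePath: string): string {
  const manifestDir = path.dirname(manifestPath);
  return path.resolve(manifestDir, storagePath);
}

// Example: a manifest at /srv/app/ggui.json declaring "./ggui-sessions.sqlite"
// points at /srv/app/ggui-sessions.sqlite on a POSIX system, wherever
// `ggui serve` was started from.
const sessionsDb = resolveStoragePath("/srv/app/ggui.json", "./ggui-sessions.sqlite");
```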
## Recommended setups

**Local-only smoke (no tunnel, no remote MCP host):**

```bash
ggui serve
```

**claude.ai custom connector (over a public tunnel):**

```bash
# Terminal 1: tunnel
cloudflared tunnel --url http://localhost:6781

# Terminal 2: serve
ggui serve --oauth \
  --public-base-url https://.trycloudflare.com
```

Note the `PAIR_CODE` from the boot banner. Visit the public URL, complete pairing, save the bearer. Then in claude.ai → Settings → Connectors → Add custom connector, point it at `https://.trycloudflare.com/mcp`, leave Client ID / Secret empty (the server uses Dynamic Client Registration), click Connect, and paste the bearer when prompted.

**Quick local-dev without auth (NEVER over a public tunnel):**

```bash
ggui serve --dev-allow-all
```

For the full pairing walkthrough, see [Self-hosted pairing](/self-hosted/pairing/).

## Production hardening

The default auth shape is dev-mode pairing. For production, swap in a real `AuthAdapter` by composing `createGguiServer()` directly in your agent entrypoint instead of running the CLI:

```typescript
import { createGguiServer } from "@ggui-ai/mcp-server";

const server = createGguiServer({
  auth: {
    /* your AuthAdapter — OIDC, Cognito, custom */
  },
});
```

The adapter gates both `/mcp` and the live-channel `/ws` upgrade. See [Reference deploys](/self-hosted/reference-deploys/) for Docker / Fly / Render manifests.

## Current limits

* Strict-auth only — `/mcp` rejects any bearer not pair-minted (or relaxed via `--dev-allow-all` / `--public-demo`).
* Single-tenant by default — every request scopes to one `builder` app ID. Use `--multi-tenant` to scope per authenticated user.
* No auto-restart on agent crash — compose with your own supervisor.
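The `--public-demo` posture rate-limits `ggui_render` per IP with a fixed window (default: 30 calls per 10 minutes). The general fixed-window technique can be sketched as follows. This is not the source of ggui's `FixedWindowRateLimiter`: the class name `FixedWindowLimiter`, its constructor arguments, and the `allow` method are hypothetical and chosen for illustration only.

```typescript
// Illustrative fixed-window limiter. NOT ggui's FixedWindowRateLimiter;
// the class shape and method names here are assumptions.
class FixedWindowLimiter {
  private readonly limit: number;
  private readonly windowMs: number;
  private readonly windows = new Map<string, { start: number; count: number }>();

  constructor(limit: number, windowMs: number) {
    this.limit = limit;
    this.windowMs = windowMs;
  }

  // Returns true when the call is allowed, false when `key` is over its limit.
  allow(key: string, now: number = Date.now()): boolean {
    const current = this.windows.get(key);
    // No window yet, or the previous window has elapsed: start a new one.
    if (!current || now - current.start >= this.windowMs) {
      this.windows.set(key, { start: now, count: 1 });
      return true;
    }
    if (current.count >= this.limit) return false;
    current.count += 1;
    return true;
  }
}

// 30 calls per 10 minutes per IP, matching the documented default.
const renderLimiter = new FixedWindowLimiter(30, 10 * 60 * 1000);
```

A fixed window is simple and cheap to keep in memory, at the cost of allowing a short burst across a window boundary; that trade-off is usually acceptable for a demo-grade guard.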
## See also [Section titled “See also”](#see-also) * [`ggui` CLI overview](/cli/) — the full command surface. * [`ggui login`](/cli/login/) — sign into `api.ggui.ai` for `ggui_user_*` connector keys *(Preview — managed cloud, coming soon; separate from `ggui serve`’s pairing flow)*. * [Self-hosted pairing](/self-hosted/pairing/) — pair a viewer client to a `ggui serve` instance. * [Reference deploys](/self-hosted/reference-deploys/) — Docker, Fly, Render manifests for `ggui serve`.
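As a companion to the model-string rules under "Generation (bring your own key)", here is a minimal sketch of parsing the two accepted forms and applying the env-over-manifest precedence. `parseModelString` and `pickModel` are hypothetical helpers, not exports of `@ggui-ai/mcp-server`. The sketch deliberately returns `undefined` when neither source sets a model and leaves what happens next to the server.

```typescript
interface ModelRef {
  provider: string;
  model: string;
}

// Accepts `provider:model` (canonical) or `provider/model` (LiteLLM form).
// Splits on the first `:` or `/`; throws on a malformed value, mirroring the
// documented "hard boot error, not a silent fallback" behavior.
function parseModelString(value: string): ModelRef {
  const trimmed = value.trim();
  const sep = trimmed.search(/[:/]/);
  if (sep <= 0 || sep === trimmed.length - 1) {
    throw new Error(`malformed model string: "${value}"`);
  }
  return { provider: trimmed.slice(0, sep), model: trimmed.slice(sep + 1) };
}

// Hypothetical precedence helper: GGUI_GENERATION_MODEL env wins over
// ggui.json#generation.model. Returns undefined when neither is set.
function pickModel(
  envValue: string | undefined,
  manifestValue: string | undefined,
): ModelRef | undefined {
  if (envValue) return parseModelString(envValue);
  if (manifestValue) return parseModelString(manifestValue);
  return undefined;
}
```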

# Connect Claude Desktop

> Connect Claude Desktop to a ggui server over MCP with OAuth and render generative UI inline.

`mcp.ggui.ai` speaks the [MCP Apps](/api/mcp-apps/) protocol, so any MCP-Apps-aware host renders ggui-generated UIs inline. Claude Desktop is the most polished client today; this page walks install and first use end to end. The same procedure works for **claude.ai**, **Goose**, and **VS Code Copilot** — anywhere you can add a remote MCP server and the host implements OAuth 2.1 + Dynamic Client Registration. See [Other clients](#other-clients). ## Available now: connect your own server [Section titled “Available now: connect your own server”](#available-now-connect-your-own-server) The hosted `mcp.ggui.ai` endpoint is part of the managed cloud (Preview — coming soon). The identical “Add custom connector” flow already works against a self-hosted server today: 1. Run your server with OAuth enabled, on a URL Claude’s browser can reach: ```bash ggui serve --oauth --public-base-url https:// ``` 2. In Claude Desktop, **Settings → Connectors → Add custom connector**, paste `https:///mcp`. 3. On first use the consent page asks you to paste a locally-minted `ggui_user_*` key — that key becomes the OAuth access token. See [CLI: serve](/cli/serve/) for the flags and [Pairing](/self-hosted/pairing/) for minting keys. ## Step 1: Add the server [Section titled “Step 1: Add the server”](#step-1-add-the-server) In Claude Desktop, open **Settings → Connectors → Add custom connector** and paste: ```plaintext https://mcp.ggui.ai ``` Leave authentication blank — the server drives Claude through the OAuth ceremony on first use. Save. ## Step 2: Approve the connection [Section titled “Step 2: Approve the connection”](#step-2-approve-the-connection) The first time Claude calls a ggui tool (or you click **Connect** on the connector card), Claude pops a browser window to `console.ggui.ai/oauth/consent`. 1. Sign in to ggui (email + password — same account that holds your connector keys). 2. Click **Approve** on the consent card. 
The console mints a fresh `ggui_user_*` API key labelled `Claude Desktop` and hands it to the client through the OAuth flow. 3. The browser closes. Claude shows the connector as connected. Under the hood, there is no manual `client_id` to copy: * Auth-server discovery via [RFC 9728](https://datatracker.ietf.org/doc/html/rfc9728) (`/.well-known/oauth-protected-resource`) and [RFC 8414](https://datatracker.ietf.org/doc/html/rfc8414). * Client registration via [RFC 7591](https://datatracker.ietf.org/doc/html/rfc7591) Dynamic Client Registration. * [OAuth 2.1](https://datatracker.ietf.org/doc/html/draft-ietf-oauth-v2-1) + [PKCE](https://datatracker.ietf.org/doc/html/rfc7636); the `ggui_user_*` key becomes the access token. Full protocol details: [OAuth on mcp.ggui.ai](/api/oauth/). ## Step 3: Generate a UI [Section titled “Step 3: Generate a UI”](#step-3-generate-a-ui) In a new chat, ask Claude to do something concrete enough to need a UI: > Help me triage 12 incoming product-support tickets. Show me a sortable table with severity, age, and a “draft reply” action per row. Claude calls the `ggui_render` tool with that intent. The MCP Apps capability advertised by `mcp.ggui.ai` tells Claude Desktop to render the result inline — the table appears directly in the chat, not as an external link. Submit a row and the data flows back to Claude as if you’d typed it. The first such call also seeds a blueprint cache: subsequent matching requests skip the generation step and return the cached blueprint. ## Tool-call budget and credits [Section titled “Tool-call budget and credits”](#tool-call-budget-and-credits) Each `ggui_render` burns a small amount of credit (LLM time + render storage). New accounts get **$5 free credit** out of the gate; top up at [`console.ggui.ai/credits`](https://console.ggui.ai/credits) when you run low. Prefer your own model account?
Paste a provider key into [`console.ggui.ai/keys/providers`](https://console.ggui.ai/keys/providers) (BYOK — Anthropic, OpenAI, Google, OpenRouter). When a BYOK key is on file for the model the request would have used, ggui calls the model on your tab and only charges for orchestration. ## Revoke the connection [Section titled “Revoke the connection”](#revoke-the-connection) Two surfaces, with different effects: * **Claude Desktop** — Settings → Connectors → ggui → **Disconnect**. Drops the cached token locally; the underlying API key on the ggui side is unaffected. * **console.ggui.ai** — in the connector keys table, find the key labelled `Claude Desktop` (or whatever the client called itself) and click **Revoke**. The next request from any client holding that token returns `401`. The keys table at [`console.ggui.ai/keys/connector`](https://console.ggui.ai/keys/connector) is the authoritative revocation surface — every MCP-Apps client lands as one row, labelled with its DCR `client_name`. (A dedicated **Connected apps** view at `/connections` is a placeholder today; the keys table is what to use.) ## Other clients [Section titled “Other clients”](#other-clients) The same `mcp.ggui.ai` URL works in any host that ships MCP Apps + OAuth-DCR: * **claude.ai (web)** — Settings → Connectors → Add custom connector. Same flow, browser-only. * **Goose** — `goose configure`, then add a remote MCP server pointing at `https://mcp.ggui.ai`. * **VS Code Copilot** — drop into `.vscode/mcp.json`: `{ "servers": { "ggui": { "type": "http", "url": "https://mcp.ggui.ai" } } }`. Copilot triggers OAuth on first tool call. * **Anything that speaks RFC 7591 + RFC 9728** — drop the URL in; the server’s discovery metadata bootstraps the rest. Hosts that don’t speak MCP Apps yet (most CLI agents at time of writing) can still hit the endpoint with a static `Authorization: Bearer ggui_user_*` header.
They lose inline rendering — `ggui_render` still works and returns the render as a `ui://ggui/render/` resource, but without MCP-Apps support there’s nothing to mount it (the agent receives no URL to open). ## Troubleshooting [Section titled “Troubleshooting”](#troubleshooting) **“Failed to connect” after approving** — stale token cache. Disconnect from Claude Desktop’s connector panel and reconnect; OAuth re-runs and mints a fresh key. **OAuth browser window never opens, or closes with an error** — most often a popup blocker on `console.ggui.ai`, or you were already signed into a different ggui account in that browser profile. Allow popups for `console.ggui.ai`, sign out of the wrong account, and click **Connect** on the connector card again to restart the ceremony. To inspect what was minted, open [`console.ggui.ai/keys/connector`](https://console.ggui.ai/keys/connector) — a successful ceremony shows up as a new row labelled with the client name. **“This MCP server has not been activated”** — your account has zero credits and no BYOK key on file. Top up at [`console.ggui.ai/credits`](https://console.ggui.ai/credits) or paste a provider key into [`console.ggui.ai/keys/providers`](https://console.ggui.ai/keys/providers). **No inline rendering, only a link** — the host doesn’t advertise MCP Apps capability. Upgrade to a current Claude Desktop build (MCP Apps support shipped late 2025) or open the link manually; the underlying session works either way. **Anything else** — see [Troubleshooting](/troubleshooting/).

# Connect other MCP hosts

> Wire Cursor, VS Code Copilot, claude.ai, Goose, Cline, Continue, or Windsurf into a ggui server over MCP HTTP.

A ggui server is a plain remote MCP server. Any host that can be pointed at an MCP HTTP URL can use it — Claude Desktop is just the most polished today. This page is the “everything else” companion to [Connect Claude Desktop](/clients/claude-desktop/), which stays the deep-dive for that one client. Each section below covers WHAT to paste, WHERE to paste it, the expected OAuth flow, and any inline-rendering caveat. If your host isn’t listed but speaks MCP over HTTP, jump to [Other / static-key fallback](#other--static-key-fallback). ## claude.ai (web) [Section titled “claude.ai (web)”](#claudeai-web) Anthropic’s web client gained MCP Apps support late 2025; the flow mirrors Claude Desktop exactly. 1. **Settings → Connectors → Add custom connector**. 2. Paste `https://mcp.ggui.ai`. Leave auth blank. 3. On first tool call, claude.ai opens an OAuth tab to `console.ggui.ai/oauth/consent`. Approve. Inline rendering works — generated UIs appear directly in the chat. ## Cursor [Section titled “Cursor”](#cursor) Cursor reads MCP server config from `~/.cursor/mcp.json` (global) or `.cursor/mcp.json` (per-project, takes precedence). ```json { "mcpServers": { "ggui": { "url": "https://mcp.ggui.ai" } } } ``` Restart Cursor. The first time the agent calls a ggui tool, Cursor opens the browser OAuth flow. Approve at `console.ggui.ai/oauth/consent` and the agent picks up the minted token automatically. Caution Cursor’s MCP Apps inline-rendering support is partial and moves quickly — verify against your installed build. If your build doesn’t render inline, the tool call still succeeds and returns a `ui://ggui/render/` resource, but there is no URL to open manually — the session round-trip works, the UI just can’t be mounted. ## VS Code Copilot (and Cline, Continue, Codeium) [Section titled “VS Code Copilot (and Cline, Continue, Codeium)”](#vs-code-copilot-and-cline-continue-codeium) VS Code’s native MCP support shipped in 2025. 
Drop into either `.vscode/mcp.json` (per-project) or your User Settings JSON: ```json { "servers": { "ggui": { "type": "http", "url": "https://mcp.ggui.ai" } } } ``` Reload the window. Copilot triggers OAuth on first tool call. **Cline**, **Continue**, and **Codeium** are VS Code extensions that ship their own MCP config surfaces. The JSON shape is the same `mcpServers`-style object with minor key naming differences per extension — drop the same URL into whichever `mcp.json` the extension reads: * **Cline** — extension settings → MCP Servers → Edit config. * **Continue** — `~/.continue/config.json` under `mcpServers`. * **Codeium** — extension settings → MCP integration. Consult each extension’s docs for the exact path; the URL itself is unchanged. ## Goose [Section titled “Goose”](#goose) Block’s open-source agent has full MCP Apps support, so inline rendering works in the Goose Desktop UI out of the box. ```plaintext goose configure ``` Choose **Add extension → Remote MCP Server**, give it a name (`ggui`), and paste: ```plaintext https://mcp.ggui.ai ``` Goose drives the OAuth ceremony in the terminal flow — it prints a URL, you sign in at `console.ggui.ai`, and Goose stores the resulting token. ## Windsurf [Section titled “Windsurf”](#windsurf) Codeium’s IDE supports remote MCP servers via Settings → MCP Servers → Add. Paste `https://mcp.ggui.ai` and save; OAuth runs on first tool call. The config-file location varies by OS and Windsurf build. Settings UI is the safest path. Caution Verify against your current Windsurf build — MCP Apps capability advertisement and inline-rendering behavior have changed across releases. ## Other / static-key fallback [Section titled “Other / static-key fallback”](#other--static-key-fallback) Any host that can be pointed at an MCP HTTP URL with a bearer token can hit `mcp.ggui.ai`. For hosts that don’t yet implement OAuth 2.1 + DCR — most CLI agents and home-grown clients — use a static API key instead: 1. 
Sign in at [`console.ggui.ai/keys/connector`](https://console.ggui.ai/keys/connector). 2. Click **Mint**. Label it after the client. 3. Configure the host to send the key as a bearer header: ```plaintext Authorization: Bearer ggui_user_xxxxxxxxxxxx ``` That’s the full setup. See [`ggui keys create`](/cli/login/#manage-keys) for minting a connector key from the terminal instead of the console. ## Things every host needs [Section titled “Things every host needs”](#things-every-host-needs) Everything on this page also works against a self-hosted `ggui serve` — substitute your server URL; add `--oauth` for hosts that drive the browser ceremony, or use a pair-minted bearer for the static-key path. For a working connection: * **MCP over HTTP** — `mcp.ggui.ai` is HTTP-only. Stdio-only hosts can’t connect directly. * **Bearer authentication** — either via the OAuth flow below, or a static `ggui_user_*` key in an `Authorization` header. For the seamless one-click flow: * **OAuth 2.1 + Dynamic Client Registration** ([RFC 7591](https://datatracker.ietf.org/doc/html/rfc7591), [RFC 8414](https://datatracker.ietf.org/doc/html/rfc8414), [RFC 9728](https://datatracker.ietf.org/doc/html/rfc9728), [PKCE](https://datatracker.ietf.org/doc/html/rfc7636)). The host discovers, registers, and exchanges tokens with zero manual `client_id` paste. Details: [OAuth on mcp.ggui.ai](/api/oauth/). For inline rendering: * **MCP Apps capability** — the host advertises that it can render `ui://` resources in-line. Without it, the render is still produced (as a `ui://ggui/render/` resource) but can’t be mounted; with it, generated UIs appear inside the chat or agent surface. Details: [MCP Apps capability](/api/mcp-apps/). Anything else: see [Troubleshooting](/troubleshooting/).
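For the static-key fallback, a home-grown client's first request can be sketched as below. The function builds an MCP `initialize` request over HTTP with a bearer header but does not send it. `buildInitializeRequest`, `HttpRequestSpec`, and the `clientInfo` values are hypothetical, and the protocol version shown is one published MCP revision that your server may or may not negotiate. The URL and `dev` bearer in the example assume a local `ggui serve --dev-allow-all`.

```typescript
// Plain description of an HTTP request, so the sketch needs no network access.
interface HttpRequestSpec {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Hypothetical helper: build (not send) an MCP `initialize` call that
// authenticates with a static bearer instead of the OAuth ceremony.
function buildInitializeRequest(mcpUrl: string, bearer: string): HttpRequestSpec {
  return {
    url: mcpUrl,
    method: "POST",
    headers: {
      Authorization: `Bearer ${bearer}`,
      "Content-Type": "application/json",
      // MCP's HTTP transport expects clients to accept both response types.
      Accept: "application/json, text/event-stream",
    },
    body: JSON.stringify({
      jsonrpc: "2.0",
      id: 1,
      method: "initialize",
      params: {
        protocolVersion: "2025-06-18", // assumption: adjust to what your server supports
        capabilities: {},
        clientInfo: { name: "example-host", version: "0.0.1" }, // placeholder identity
      },
    }),
  };
}

// Example against a local self-hosted server started with --dev-allow-all.
const initSpec = buildInitializeRequest("http://127.0.0.1:6781/mcp", "dev");
```

To send it, pass the fields to `fetch(initSpec.url, { method: initSpec.method, headers: initSpec.headers, body: initSpec.body })` and continue with the rest of the MCP handshake.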

# console.ggui.ai

> User dashboard for the hosted mcp.ggui.ai server — apps, orgs, connector keys, blueprints, credits, BYOK provider keys, and OAuth/CLI consent.

Coming soon

This page describes the **managed hosted path** (`mcp.ggui.ai` / `console.ggui.ai`), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today — start with the [Quickstart](/oss-quickstart/). This page is kept as a preview of the managed path and goes live when hosted ggui ships.

[`console.ggui.ai`](https://console.ggui.ai) is the user-facing dashboard for the [`mcp.ggui.ai`](/clients/claude-desktop/) hosted MCP server. Sign in with email + password (Cognito). The left-sidebar nav surfaces seven top-level screens; three more screens are deep-linked into by the OAuth ceremony, the CLI device flow, and org-invite emails.

| Screen | Path | What lives here |
| -------------- | --------------------- | ---------------------------------------------------------------------- |
| Apps | `/apps` | Per-app surfaces — keys, blueprints, theme, marketplace, settings. |
| Orgs | `/orgs` | Organizations you belong to; org-scoped wallets, invites, members. |
| Connector keys | `/keys/connector` | Mint, list, and revoke `ggui_user_*` API keys. |
| Connected apps | `/connections` | OAuth clients that have access to your account (placeholder today). |
| Credits | `/credits` | Balance, transaction log, coupon redeem. (Stripe top-up: coming soon.) |
| Provider keys | `/keys/providers` | BYOK — paste your Anthropic / OpenAI / Google / OpenRouter keys. |
| Account | `/account` | Email, default app, sign out. |
| OAuth consent | `/oauth/consent` | Approval landing for MCP-Apps-aware clients (Claude Desktop, etc.). |
| CLI sign-in | `/cli-confirm/[code]` | Approval landing for the [`ggui` CLI’s](/cli/login/) device flow. |
| Org invite | `/invites/[inviteId]` | Auto-accept landing for an org-invite email link. |

The home path `/` redirects to `/apps` — apps are the primary primitive in the current IA (post-2026-05 apps pivot).
You don’t visit `/oauth/consent` or `/cli-confirm/*` directly — the MCP host or the CLI navigates you there at the right moment. ## Apps (`/apps`) [Section titled “Apps (/apps)”](#apps-apps) Apps are the primary primitive. Each app is a scope that owns its own blueprints, theme, marketplace installs, and per-app API keys. New accounts land here with a single default app already provisioned; create more from this page. Per-app sub-routes (under `/apps/[appId]/`): * **Keys** (`/keys`) — per-app connector keys (`ggui_user_*` rows bound to this `appId`, so the pod locks the session to this app) **and** per-app BYOK provider-key overrides (precedence over the user-pool BYOK keys for renders bound to this app). * **Blueprints** (`/blueprints`) — the app’s blueprint library: view source, rename, edit metadata, delete. * **Marketplace** (`/marketplace`) — browse + install published blueprints. * **Theme** (`/theme`) — per-app theme overrides. * **Settings** (`/settings`) — app-level config. Blueprints are matched during `ggui_handshake`, before any generation. Curated app blueprints match deterministically when the agent’s declared tools cover the blueprint’s `dataTools` (instant, zero-LLM); other cached blueprints are reused by contract similarity. Either way, a hit renders with zero generation cost. See the [generation pipeline](/architecture/overview/#generation-pipeline) for the matching flow and [Marketplace](/sdk/marketplace/) for authoring + distribution. ## Orgs (`/orgs`) [Section titled “Orgs (/orgs)”](#orgs-orgs) Organizations you belong to. Each org has its own credit wallet, member list, and invite flow. Coupons can be redeemed into an org wallet from the Credits page (next section). Per-org detail lives under `/orgs/[orgId]`. Invite-email links land at `/invites/[inviteId]` and auto-accept once you’re signed in (lookup is by Cognito email, so a newly-signed-up invitee sees every pending invite at `/invites`). 
Org-scoped agent infrastructure (production hosting, shared dashboards) is the domain of the separate **guuey** platform — same protocol underneath. ## Connector keys (`/keys/connector`) [Section titled “Connector keys (/keys/connector)”](#connector-keys-keysconnector) The connector-keys table. One row per active key: prefix, name, status, last-used timestamp, **Revoke** action. **Mint** opens a dialog with one optional field (a friendly label). Submit, the key reveals exactly once — copy it before dismissing the dialog. Lose it and you mint a new one; there’s no recovery. Every authentication path lands here as one row: * The console UI mint dialog → row labelled whatever you typed. * An OAuth ceremony from Claude Desktop / claude.ai / Goose / VS Code → row labelled `MCP —  — ` by default (with an ` — `prefix when the host requested per-app scope via RFC 8707 `resource`); the consent screen’s optional **Key name** field overrides the default before approval. * The [`ggui keys create`](/cli/login/#manage-keys) CLI command → row labelled with whatever `--name` you passed. This unification is deliberate. The keys list IS the authoritative list of “things that can talk to ggui as you” — there’s no separate “connected clients” surface. Revoke a row and the corresponding client’s next request returns `401`. ## Connected apps (`/connections`) [Section titled “Connected apps (/connections)”](#connected-apps-connections) OAuth clients (Claude Desktop, claude.ai, Goose, …) that have registered against your account via Dynamic Client Registration. Currently a placeholder — the full client list + per-row revoke is planned. For now, run the in-Claude `ggui_status` tool to see connected apps from inside your MCP host. ## Credits (`/credits`) [Section titled “Credits (/credits)”](#credits-credits) Balance, transaction log, and coupon redemption. New accounts get **$5 of free credit** (Anthropic-pool) on first sign-in. 
Each `ggui_render` burns credit proportional to the LLM call (prompt tokens × model rate) plus a small render-storage allocation. Blueprint-matched renders skip the LLM and cost \~nothing. The transaction log shows every charge with model + token breakdown. **Top up:** the button is present but currently disabled with a “coming soon” badge — Stripe top-up is post-launch. Until then, top up via the **Redeem coupon** card (codes can be applied to your personal wallet or to any org wallet you’re a member of). To bypass the credit pool entirely, paste a **provider key** at `/keys/providers` (next section). When a BYOK key is on file for the model the request would have used, ggui calls the model on your tab and only charges credit for orchestration overhead. ## Provider keys / BYOK (`/keys/providers`) [Section titled “Provider keys / BYOK (/keys/providers)”](#provider-keys--byok-keysproviders) Paste your own provider API keys to bypass the credit pool for model calls. The four supported providers, with the get-a-key link the dialog surfaces: * **Anthropic** — Claude models. Get a key at `console.anthropic.com` → Settings → API Keys. * **OpenAI** — GPT models. Get a key at `platform.openai.com` → API Keys. * **Google** — Gemini models. Get a key at `aistudio.google.com` → Get API key. * **OpenRouter** — multi-provider routing. Get a key at `openrouter.ai` → Settings → Keys. Keys are KMS-encrypted at rest and only decrypted in-memory in the per-request Lambda. The console only ever shows the key prefix once it’s been saved — submit the form once, the plaintext is consumed. Resolution at request time is per-provider: Anthropic call → check Anthropic BYOK key → if present, use it; if not, fall back to credit pool. Mixing is fine — you can BYOK Anthropic and pay-as-you-go for everything else. Revoke at any time. The next request that would have used the key falls back to the credit pool. 
## Account (`/account`) [Section titled “Account (/account)”](#account-account) Email, default app selection (used by legacy redirects like `/blueprints` → `/apps/{defaultAppId}/blueprints`), and sign out. ## OAuth consent (`/oauth/consent`) [Section titled “OAuth consent (/oauth/consent)”](#oauth-consent-oauthconsent) The browser landing for MCP-Apps-aware clients running through the [OAuth ceremony](/api/oauth/). When Claude Desktop (or any compatible host) asks `mcp.ggui.ai` to authorize, the server 302s the browser here with the OAuth params. You see: * The requesting client’s name (from DCR `client_name`). * The scope being requested (`mcp`). * Two buttons: **Approve** / **Cancel**. **Approve** mints a fresh `ggui_user_*` key (default label `MCP —  — `, editable via the Key-name field; per-app flows that carried an RFC 8707 `resource` get the `appId` bound on the row so the session locks to that app) and posts it back to the MCP server through a cross-origin form POST. The server completes the OAuth flow and the client’s next `/mcp` request authenticates with the new key. **Cancel** redirects back to the client with `error=access_denied` per [RFC 6749 §4.1.2.1](https://datatracker.ietf.org/doc/html/rfc6749#section-4.1.2.1). The MCP server is never contacted; nothing is minted. You don’t visit this URL directly — the MCP host sends you here at OAuth time. ## CLI sign-in (`/cli-confirm/[user_code]`) [Section titled “CLI sign-in (/cli-confirm/\[user\_code\])”](#cli-sign-in-cli-confirmuser_code) The browser landing for the [`ggui login`](/cli/login/) device flow. The CLI prints a URL like: ```plaintext https://console.ggui.ai/cli-confirm/AB12-CD34 ``` Open it, sign in if you aren’t already, confirm the codes match, click **Approve**. The CLI’s polling on `/v1/auth/poll` flips to `approved` and tokens land at `~/.ggui/auth.json`. Same trust model as the OAuth consent screen — confirm the code matches what the CLI printed before approving. 
## Account model [Section titled “Account model”](#account-model) One Cognito user owns one ggui account. User-scoped data on the dashboard (your keys, your blueprints, your personal credit balance, your provider keys) is visible only to you via owner-auth on every model. Org-scoped data (org wallet, org members, org-scoped blueprints) is gated by org membership.
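The provider-key resolution described under Provider keys (per-provider lookup, per-app overrides first, credit pool as the fallback) can be pictured as a small pure function. This is a sketch under stated assumptions: the type names and `resolveProviderKey` are hypothetical and are not the service's actual code.

```typescript
type Provider = "anthropic" | "openai" | "google" | "openrouter";
type KeySet = Partial<Record<Provider, string>>;

type Resolution =
  | { source: "app-byok" | "user-byok"; key: string }
  | { source: "credit-pool" };

// Hypothetical resolver: the per-app override wins, then the user-pool key,
// otherwise the request falls back to the credit pool.
function resolveProviderKey(provider: Provider, appKeys: KeySet, userKeys: KeySet): Resolution {
  const appKey = appKeys[provider];
  if (appKey) return { source: "app-byok", key: appKey };
  const userKey = userKeys[provider];
  if (userKey) return { source: "user-byok", key: userKey };
  return { source: "credit-pool" };
}
```

Because the lookup is per provider, a user can hold a key for one provider and still pay from credit for the others, which matches the "mixing is fine" behavior described earlier on this page.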

# Million-Dollar Homepage playground

> Hosted MCP service at mcp.ggui.ai/playground/mdh — shared 1000×1000 pixel grid. Demo of generative UI rendering globally shared mutable state.

Coming soon This page describes the **managed hosted path** (`mcp.ggui.ai` / `console.ggui.ai`), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today — start with the [Quickstart](/oss-quickstart/). This page is kept as a preview of the managed path and goes live when hosted ggui ships. `mcp.ggui.ai/playground/mdh` is a first-party hosted MCP service that exposes a globally shared 1000×1000 pixel grid — a tribute to [Alex Tew’s 2005 Million-Dollar Homepage](https://en.wikipedia.org/wiki/The_Million_Dollar_Homepage). Every signed-in user reads and writes the same canvas. Where the [Todos playground](/clients/playground-todos/) demonstrates per-user persistent state, MDH demonstrates **globally shared mutable state** rendered as generative UI. Three tools, one canvas, last-write-wins semantics. The agent claims a pixel, the host renders the new region inline, the next user can claim right over the top. ## What it demonstrates [Section titled “What it demonstrates”](#what-it-demonstrates) * **Shared mutable state across users**: every claim is visible to every other signed-in caller immediately. * **Generative UI over a sparse data shape**: `mdh_read_region` returns only claimed cells, so the host renders a sparse grid — pure data, no markup decisions baked into the tool. * **Attribution without ownership**: each claimed cell records the writer’s `userId`, but anyone can overwrite. Identity is recorded; tenure is not. If you want the deep “why” of the service model behind this, see [MCP services](/architecture/mcp-services/) — `playground-mdh` is one of three reference services in `cloud/mcp-services/`. ## The three tools [Section titled “The three tools”](#the-three-tools) All three are Cognito-gated. Anonymous calls (no bearer token) return an explicit `authentication required` error. 
| Tool | Input | Output | Notes |
| ----------------- | ------------------------- | ------------------------------------------------------ | --------------------------------------------------------------------------------- |
| `mdh_read_region` | `{ x, y, width, height }` | `{ pixels: Pixel[] }` | Sparse — unclaimed cells omitted. Region capped at 100×100 cells per call. |
| `mdh_claim_pixel` | `{ x, y, color }` | `{ pixel }` | Single-cell write. `color` is `#RRGGBB` hex. Last-write-wins (see warning below). |
| `mdh_get_stats` | `{}` | `{ totalClaimed, uniqueUsers, gridWidth, gridHeight }` | Cheap aggregate read. `gridWidth` and `gridHeight` are always 1000. |

A `Pixel` row is `{ x, y, color, userId, claimedAt }`. Coordinates are integers, `0 <= x < 1000` and `0 <= y < 1000`. `claimedAt` is ISO-8601.

The region cap exists to bound payload size: a 100×100 fully-claimed read returns 10,000 entries. Agents wanting a wider view issue multiple reads — the sparse response means the cost scales with claimed density, not region size.

Last-write-wins, no permanent ownership

`mdh_claim_pixel` REPLACES any existing claim on that cell. The most recent writer’s `userId` overwrites the previous one. There is no “owned forever”, no allow-list, no first-claim precedence. Anyone signed in can paint over anything. The wire description on the tool itself spells this out so agents surface it before the user buys in.

## Auth model

Cognito-gated. Every handler resolves `ctx.userId` from the bearer key minted by [console.ggui.ai](/clients/console/) and rejects calls without one. On `mdh_claim_pixel`, that same `userId` is stamped on the resulting pixel row — that’s the attribution surface visible to subsequent `mdh_read_region` callers.

Reads are gated too. The canvas is end-user surface, not a public dataset; anonymous reads are deliberately not exposed at this slice.
## Try it

The wiring is identical to the [Todos playground](/clients/playground-todos/) — same auth header, same OAuth ceremony from MCP-Apps-aware hosts, different URL.

**Claude Desktop**: Settings → Connectors → Add custom connector, paste:

```plaintext
https://mcp.ggui.ai/playground/mdh
```

Approve the OAuth ceremony at `console.ggui.ai`; a new `ggui_user_*` row appears in [`/keys/connector`](https://console.ggui.ai/keys/connector). Then try prompts like:

> Show me a 50×50 region around the center of the canvas.

> Claim the pixel at (500, 500) in red.

> How many pixels have been claimed total, and by how many users?

**Claude Agent SDK** (or any host that takes a remote MCP URL):

```typescript
import { query } from "@anthropic-ai/claude-agent-sdk";

const apiKey = process.env.GGUI_USER_KEY!; // ggui_user_*

const mcpServers = {
  mdh: {
    type: "http" as const,
    url: "https://mcp.ggui.ai/playground/mdh",
    headers: { Authorization: `Bearer ${apiKey}` },
  },
};

for await (const msg of query({
  prompt: "Claim (500, 500) in #ff0044, then show me a 20×20 region around it.",
  options: {
    model: "claude-sonnet-4-6",
    mcpServers,
    allowedTools: [
      "mcp__mdh__mdh_read_region",
      "mcp__mdh__mdh_claim_pixel",
      "mcp__mdh__mdh_get_stats",
    ],
  },
})) {
  // consume the SDK message stream
}
```

The Claude Agent SDK namespaces every MCP tool as `mcp__<serverName>__<toolName>` — with `serverName: "mdh"` here, the tool ids land as `mcp__mdh__mdh_read_region`, `mcp__mdh__mdh_claim_pixel`, and `mcp__mdh__mdh_get_stats`.

For the full agent-side wiring (system prompt, error handling, streaming), see the [Claude Agent example](/examples/claude-agent/) and swap the URL + tool whitelist.
## How it’s built [Section titled “How it’s built”](#how-its-built) The service is closed-source (`@ggui-private/mcp-playground-mdh` — not published or mirrored), but its shape is simple enough to describe: * a package entry exporting a `createPlaygroundMdhHandlers({ grid })` convenience factory, * one file per tool (read-region / claim-pixel / get-stats), * a `PixelGrid` interface with an in-memory implementation. The grid is the single source of truth for its own dimensions and surfaces them on the wire via `mdh_get_stats`. To build your own globally shared state surface in this shape, use the OSS **`McpService` mount primitive** in [`@ggui-ai/mcp-server`](/architecture/mcp-services/) — the same seam this service mounts through. The [MCP services architecture](/architecture/mcp-services/) page covers mount points, the `AuthAdapter` contract, and how `ctx.userId` flows from the bearer header to the handler.
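The "issue multiple reads" pattern from the tools section (regions capped at 100×100 on a 1000×1000 grid) comes down to tiling a wide rectangle. The sketch below is illustrative only: `tileRegion` and the `Region` type are hypothetical names, and the constants simply restate the limits documented on this page.

```typescript
interface Region { x: number; y: number; width: number; height: number }

const GRID_SIZE = 1000; // documented grid width and height
const REGION_CAP = 100; // documented per-call cap for mdh_read_region

// Split a wide view into tiles that each respect the per-call cap,
// clamping the requested rectangle to the grid bounds first.
function tileRegion(region: Region, cap: number = REGION_CAP): Region[] {
  const x0 = Math.max(0, region.x);
  const y0 = Math.max(0, region.y);
  const x1 = Math.min(GRID_SIZE, region.x + region.width);
  const y1 = Math.min(GRID_SIZE, region.y + region.height);
  const tiles: Region[] = [];
  for (let y = y0; y < y1; y += cap) {
    for (let x = x0; x < x1; x += cap) {
      tiles.push({ x, y, width: Math.min(cap, x1 - x), height: Math.min(cap, y1 - y) });
    }
  }
  return tiles;
}
```

An agent would issue one `mdh_read_region` call per tile and concatenate the returned `pixels` arrays; since responses are sparse, mostly empty tiles stay cheap.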

# Todos playground

> Hosted MCP service at mcp.ggui.ai/playground/todos — a per-user persistent todo list demo. Try generative UI rendering shared state.

Coming soon This page describes the **managed hosted path** (`mcp.ggui.ai` / `console.ggui.ai`), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today — start with the [Quickstart](/oss-quickstart/). This page is kept as a preview of the managed path and goes live when hosted ggui ships. `mcp.ggui.ai/playground/todos` is a first-party hosted MCP service that exposes a per-user persistent todo list. Sign in to [console.ggui.ai](/clients/console/), point your agent at the URL, and watch the agent call a tool while ggui renders the list inline as generative UI. State survives refresh, re-login, and host swaps — todos are pinned to your Cognito identity, not the session. This is the smallest “real” MCP-driven surface we ship. No code to write, no schema to learn — the four tools are described by the server’s `tools/list` response, and the host (Claude Desktop, Goose, your own agent) drives them. ## What it demonstrates [Section titled “What it demonstrates”](#what-it-demonstrates) * **Agent → tool → generative UI**: the agent calls a tool, the runtime feeds the result into a blueprint, the host renders the resulting list inline. * **Per-user persistence**: log out, come back tomorrow from a different host — same todos. * **Cognito identity at the MCP edge**: tool handlers read `ctx.userId` directly. No second auth layer; the `Authorization: Bearer ggui_user_*` header that authenticated the MCP session also scopes the data. If you want the deep “why” of the service model that makes this work, see [MCP services](/architecture/mcp-services/) — `playground-todos` is one of three reference services in `cloud/mcp-services/`. ## The four tools [Section titled “The four tools”](#the-four-tools) All four are scoped to the caller’s `ctx.userId`. Anonymous calls (no bearer token) return an explicit `authentication required` error. 
| Tool | Input | Output | Notes |
| -------------- | ---------- | -------------------------------------- | ---------------------------------------------------------------------------------- |
| `todos_list` | `{}` | `{ todos: Todo[] }` | Oldest-first by `createdAt`. Empty array when the user has none — never `null`. |
| `todos_add` | `{ text }` | `{ todo }` | `text` trimmed; empty or 500+ char bodies rejected at the schema boundary. |
| `todos_toggle` | `{ id }` | `{ found: boolean, todo: Todo\|null }` | Flips `done`. Missing id and cross-user id collapse to one shape (see Auth model). |
| `todos_delete` | `{ id }` | `{ ok: boolean }` | `ok: true` only when the caller owned a todo with that id. |

A `Todo` row is `{ id, text, done, createdAt }`. `createdAt` is an ISO-8601 string; `id` is opaque (treat it as a black-box handle returned by `todos_add` or `todos_list`).

## Auth model

Cognito-gated. Every handler resolves `ctx.userId` from the bearer key minted by [console.ggui.ai](/clients/console/) and rejects calls without one. The console is the only place those keys come from — either through an OAuth ceremony from an MCP-Apps-aware host or by minting one manually at `/keys/connector`.

Cross-user probing is structurally prevented. `todos_toggle` and `todos_delete` collapse two failure modes into one wire shape:

* The id doesn’t exist.
* The id exists but belongs to a different user.

Both return the same negative response (`found: false` / `ok: false`). A caller can’t enumerate other users’ id space.

## Try it from Claude Desktop

If you’ve already connected `mcp.ggui.ai` from the [Claude Desktop walkthrough](/clients/claude-desktop/), Claude Desktop will pick up `playground-todos` once you add the second connector.
In Settings → Connectors → Add custom connector, paste:

```plaintext
https://mcp.ggui.ai/playground/todos
```

The OAuth ceremony runs again against `console.ggui.ai`; approve it the same way you did for the main `mcp.ggui.ai` connector. A new `ggui_user_*` row appears in [`/keys/connector`](https://console.ggui.ai/keys/connector) labelled with the connector name.

Once connected, try prompts like:

> What’s on my todo list?

> Add “ship the launch post” to my todos, then show me everything pending.

> Mark the first todo as done.

Claude calls `todos_list` / `todos_add` / `todos_toggle`, and the table updates in the chat. Refresh Claude Desktop, ask again — same list comes back.

## Try it from your own agent

The Todos playground is a vanilla remote MCP server. Anything that speaks `streamable-http` MCP + bearer auth works. For Claude Agent SDK:

```typescript
import { query } from "@anthropic-ai/claude-agent-sdk";

const apiKey = process.env.GGUI_USER_KEY!; // ggui_user_*

const mcpServers = {
  todos: {
    type: "http" as const,
    url: "https://mcp.ggui.ai/playground/todos",
    headers: { Authorization: `Bearer ${apiKey}` },
  },
};

for await (const msg of query({
  prompt: "Add 'try the playground' to my todos, then list everything.",
  options: {
    model: "claude-sonnet-4-6",
    mcpServers,
    allowedTools: [
      "mcp__todos__todos_list",
      "mcp__todos__todos_add",
      "mcp__todos__todos_toggle",
      "mcp__todos__todos_delete",
    ],
  },
})) {
  // consume the SDK message stream
}
```

The Claude Agent SDK namespaces every MCP tool as `mcp__<serverName>__<toolName>` — with `serverName: "todos"` here, the tool ids land as `mcp__todos__todos_list`, `mcp__todos__todos_add`, and so on. The pattern is the same for any other host that takes an MCP-server URL.

For the full agent-side wiring (system prompt, error handling, streaming), see the [Claude Agent example](/examples/claude-agent/).
The same pattern applies — swap `mcp.ggui.ai` for `mcp.ggui.ai/playground/todos`, narrow the tool whitelist to the four todos tools. ## How it’s built [Section titled “How it’s built”](#how-its-built) The service is closed-source (`@ggui-private/mcp-playground-todos` — not published or mirrored), but its shape is simple enough to describe: * a package entry exporting a `createPlaygroundTodosHandlers({ store })` convenience factory, * one file per tool (list / add / toggle / delete), each \~50 lines, * a `TodoStore` interface with an in-memory implementation for tests; production swaps in a DynamoDB-backed store against the same interface. To build your own service in this shape, use the OSS **`McpService` mount primitive** in [`@ggui-ai/mcp-server`](/architecture/mcp-services/) — the same seam this service mounts through. The [MCP services architecture](/architecture/mcp-services/) page covers mount points, the `AuthAdapter` contract, and how `ctx.userId` is resolved end-to-end.
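The collapsed failure shape from the Auth model section (a missing id and another user's id are indistinguishable on the wire) is easy to see in a toy store. This is a minimal sketch, assuming a hypothetical `SketchTodoStore` class; it is not the closed-source `TodoStore` implementation, and the id scheme is invented for illustration.

```typescript
interface Todo { id: string; text: string; done: boolean; createdAt: string }
type ToggleResult = { found: boolean; todo: Todo | null };

// Hypothetical in-memory store keyed by todo id, with the owner kept server-side only.
class SketchTodoStore {
  private rows = new Map<string, { ownerId: string; todo: Todo }>();
  private nextId = 1;

  add(userId: string, text: string): Todo {
    const todo: Todo = {
      id: `t${this.nextId++}`,
      text: text.trim(),
      done: false,
      createdAt: new Date().toISOString(),
    };
    this.rows.set(todo.id, { ownerId: userId, todo });
    return { ...todo };
  }

  toggle(userId: string, id: string): ToggleResult {
    const row = this.rows.get(id);
    // Missing id and cross-user id take the same branch, so callers cannot tell them apart.
    if (!row || row.ownerId !== userId) return { found: false, todo: null };
    row.todo.done = !row.todo.done;
    return { found: true, todo: { ...row.todo } };
  }
}
```

The design point is that the ownership check and the existence check share one return path, so the response carries no signal about other users' id space.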

# Auth-Gated UI

> Gate ggui renders behind your app's auth with a bearer token on useMcpAppsChat — and understand why OAuth tool-consent is the client's job, not ggui's.

ggui is a renderer, not an identity provider. Your app already knows who the user is. There are **two separate auth concerns**, and only the first is yours to wire:

1. **Authenticating your app → agent backend.** Every request `useMcpAppsChat` makes (prompt POST, snapshot GET, the iframe → MCP tool-call relay) carries a bearer token your app supplies. The backend gates chat ownership on the principal that token identifies. **This is the recipe below.**
2. **OAuth consent for tools the *agent* calls.** When a tool the agent calls needs the end-user to authorize a third-party service (Google, Slack), that consent flow is **not** ggui’s responsibility — it belongs to MCP’s OAuth standard, the hosting agent, and your client app. ggui only surfaces an optional signal. See [OAuth tool-consent is the client’s job](#oauth-tool-consent-is-the-clients-job).

## Gate the chat behind a bearer token

`useMcpAppsChat` takes two auth hooks:

* **`getAuthToken()`** — called before every request; return the bearer token to send as `Authorization: Bearer <token>` (or `undefined` for none).
* **`onUnauthenticated()`** — called on a `401`; refresh/re-mint your token, return `true` to retry the request once, `false` to surface the error.

The [`ggui-basic-web`](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) sample uses these for a **guest-token** flow: it mints an anonymous token from the backend’s `POST /auth/guest` mount, caches it, and re-mints on `401`.
Swap that mint for your own user-session token and the same wiring gates renders behind a signed-in user:

```tsx
import { useCallback, useRef } from "react";
import { useMcpAppsChat } from "@ggui-ai/react/chat-helpers";

function Chat({ agentEndpoint }: { agentEndpoint: string }) {
  const { user, getAccessToken, refreshSession } = useAuth(); // your auth provider

  // Read the current token per-request (kept in a ref so the callback
  // always sees the latest after a refresh).
  const tokenRef = useRef(getAccessToken());
  const getAuthToken = useCallback(() => tokenRef.current, []);

  // 401 → refresh once, then retry. Return false to give up.
  const onUnauthenticated = useCallback(async () => {
    const fresh = await refreshSession();
    if (!fresh) return false;
    tokenRef.current = fresh;
    return true;
  }, [refreshSession]);

  const { entries, sessions, send, handleAppMessage } = useMcpAppsChat({
    chatEndpoint: `${agentEndpoint}/agent`,
    snapshotEndpoint: `${agentEndpoint}/agent`,
    getAuthToken,
    onUnauthenticated,
  });

  // render `entries` + mount `sessions` as usual
}
```

The token rides every hook request automatically. The backend authenticates it and scopes the conversation (its `chatId` snapshot + resume state) to that principal — clear the session and you’re a different principal with different chats.

### Mount-time gate

Don’t open the chat until your provider says the user is signed in — a plain conditional render, exactly how the sample guards on `guestTokenReady`:

```tsx
function App({ agentEndpoint }: { agentEndpoint: string }) {
  const { isAuthenticated, isLoading } = useAuth();
  if (isLoading) return <p>Loading…</p>;
  // SignIn is a placeholder for your app's own sign-in screen.
  if (!isAuthenticated) return <SignIn />;
  return <Chat agentEndpoint={agentEndpoint} />;
}
```

### The tool-call relay carries the token too

When the iframe dispatches a tool call, your host relays it to the agent backend’s `POST /agent` (`kind: "tool-call"`). Send the **same** bearer token on that relay so the MCP call runs as the signed-in user — the sample’s `onCallTool` reads `getAuthToken()` and sets `Authorization: Bearer <token>` on the relay fetch. Don’t trust anything the iframe puts in the payload as identity; the token on the relay is the credential.

## OAuth tool-consent is the client’s job

A different situation: the agent wants to call a tool that needs the **end-user** to authorize a third-party service first (read their Google Calendar, post to Slack). That consent flow is deliberately **outside** the ggui protocol. It is owned by:

* **MCP’s OAuth standard** — the spec defines how an MCP server advertises that a tool needs authorization and how the consent/token exchange happens.
* **The hosting agent** (e.g. `@anthropic-ai/claude-agent-sdk`) — follows that spec to handle tool authorization.
* **Your end-user client app** — the surface actually talking to the agent backend pops its own consent UI and drives the redirect. Consent is a client decision, not something ggui’s runtime or protocol renders.
* **Guuey’s MCP proxy** — captures an OAuth credential once so multiple agents can reuse it (coming soon — part of the hosted Guuey platform, not the open ggui protocol).

So ggui itself renders no consent screen. What it *does* provide is an **optional helper signal**: when the server emits a `system` frame with `action: "auth_required"`, the iframe-runtime projects it to a typed `AuthRequiredEvent` on the `ggui:observe` postMessage channel (`{ type: "ggui:observe", event }`).
A host that wants to surface a consent prompt can listen for it and open `authUrl` in a popup — but rendering that prompt, and the OAuth exchange itself, are your client’s job: ```ts import type { AuthRequiredEvent } from "@ggui-ai/react"; // AuthRequiredEvent shape (kind: "auth-required"): // provider — canonical service id ("google", "slack") // authUrl — URL to open to start the OAuth consent flow // displayName? — human-readable service name // scopes? — requested OAuth scopes // message? — why access is needed ``` It’s a projection of the protocol’s `SystemPayload`, not a new ggui surface — surfacing it is opt-in, and ggui draws nothing itself. See [Error Handling → renderer-side faults](/cookbook/error-handling/#renderer-side-faults-stay-inside-the-iframe) for the full observability catalog. ## See also [Section titled “See also”](#see-also) * [React SDK](/sdk/react/) — `useMcpAppsChat` (`getAuthToken` / `onUnauthenticated`) + `` * [`ggui-basic-web` sample](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) — the runnable guest-token reference (`Chat.tsx`) * [Error Handling](/cookbook/error-handling/) — HTTP `401`, JSON-RPC auth codes, and the `auth-required` observability event * [Getting Started](/getting-started/) — the agent-side handshake → render → consume loop
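A host that opts in to the `auth-required` signal first has to recognize it among other `ggui:observe` messages. The sketch below is a hedged, standalone illustration: it declares a local `AuthRequiredLike` interface mirroring the fields listed on this page instead of importing from `@ggui-ai/react`, the `scopes` element type is assumed to be `string`, and `asAuthRequired` is a hypothetical helper.

```typescript
// Local mirror of the documented AuthRequiredEvent fields (assumption: scopes is string[]).
interface AuthRequiredLike {
  kind: "auth-required";
  provider: string;
  authUrl: string;
  displayName?: string;
  scopes?: string[];
  message?: string;
}

// Returns the event when `data` is a ggui:observe envelope carrying an
// auth-required event with the two required fields, otherwise null.
function asAuthRequired(data: unknown): AuthRequiredLike | null {
  if (typeof data !== "object" || data === null) return null;
  const envelope = data as { type?: unknown; event?: unknown };
  if (envelope.type !== "ggui:observe") return null;
  if (typeof envelope.event !== "object" || envelope.event === null) return null;
  const ev = envelope.event as Partial<AuthRequiredLike>;
  if (ev.kind !== "auth-required") return null;
  if (typeof ev.provider !== "string" || typeof ev.authUrl !== "string") return null;
  return ev as AuthRequiredLike;
}
```

In a browser host this would run inside a `message` event listener, after checking the message's origin and source, and before deciding whether to open `authUrl` in a popup.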

# Chat with Your Own Storage

> Build a ggui-powered chat UI on top of your existing persistence layer using @ggui-ai/react and the pure helpers in @ggui-ai/react/chat-helpers.

You own the UI, you own the storage. ggui gives you the streaming protocol (`useInvoke`) plus pure helpers for persistence shape (`@ggui-ai/react/chat-helpers`). Where messages live, how threads are indexed, what your composer looks like — that’s yours. ## When to use this pattern [Section titled “When to use this pattern”](#when-to-use-this-pattern) Pick this pattern when **at least one** holds: * You already have a persistence layer (Postgres, Firestore, IndexedDB, your Redux store) and don’t want a second one. * You need full control of the message schema — custom attachments, per-message ACLs, server-side fan-out. * You’re integrating ggui into an existing chat surface, not building a new one. If none apply and you just want “a chat UI that works”, reach for `useChatThread` in `@ggui-ai/react/chat-thread` — same flow behind a single hook with a pluggable `MessageStorageAdapter` and a `ChatThreadProvider`. ## 60-line example [Section titled “60-line example”](#60-line-example) The file below is the complete working integration — streaming, persistence, card rendering, send — in \~60 lines. 
```tsx
import { useEffect } from "react";
import { GguiProvider, useInvoke } from "@ggui-ai/react";
import {
  useRafThrottled,
  invokeMessageToContentGroups,
  extractRenderFromToolResult,
  type ContentGroup,
} from "@ggui-ai/react/chat-helpers";

// Replace with any storage: localStorage, fetch, IndexedDB, Firestore, …
const store: Array<{ threadId: string; group: ContentGroup }> = [];

function persist(threadId: string, messages: ReturnType<typeof useInvoke>["messages"]) {
  const seen = new Set(store.filter((e) => e.threadId === threadId).map((e) => e.group.key));
  for (const msg of messages) {
    for (const group of invokeMessageToContentGroups(msg)) {
      if (!seen.has(group.key)) store.push({ threadId, group });
    }
  }
}

function Chat({ threadId, endpointUrl }: { threadId: string; endpointUrl: string }) {
  const { messages, send, isStreaming } = useInvoke({ endpointUrl });
  const throttled = useRafThrottled(messages);

  useEffect(() => {
    persist(threadId, messages);
  }, [threadId, messages]);

  // Placeholder markup: swap in your own message and composer components.
  return (
    <div>
      {throttled.map((m) => (
        <div key={m.id}>
          <strong>{m.role}:</strong>
          {m.content.map((b, i) => {
            if (b.type === "text") return <p key={i}>{b.text}</p>;
            if (b.type === "tool_result") {
              const render = extractRenderFromToolResult(b);
              return <pre key={i}>{JSON.stringify(render, null, 2)}</pre>;
            }
            return null;
          })}
        </div>
      ))}
      <button disabled={isStreaming} onClick={() => send("hello")}>
        Send
      </button>
    </div>
  );
}

export default function App() {
  // Placeholder values; pass whatever props your GguiProvider setup requires.
  return (
    <GguiProvider>
      <Chat threadId="demo" endpointUrl="/agent" />
    </GguiProvider>
  );
}
```

## The `ContentGroup` contract

`invokeMessageToContentGroups(message)` splits a finalized invoke message into `ContentGroup`s — the durable unit you persist:

```ts
interface ContentGroup {
  key: string; // `${message.id}-${startBlockIdx}` — see invariant below
  kind: "text" | "card" | "other";
  authorRole: "user" | "agent";
  blocks: ContentBlock[]; // a contiguous run of text, or a tool_use + tool_result pair
  cardSnapshot: unknown | null; // frozen GguiSession for kind="card"
  textPreview: string; // ~160-char preview for chat-list tiles
}
```

**The `key` invariant.** `key` is deterministic from `message.id` plus the block index where the group starts. Two consequences:

1. **Idempotency.** Re-persisting the same group with the same key is a no-op. If your storage uses `key` as primary key (recommended), the `persist()` loop above can run after every token delta without producing duplicates.
2. **Streaming messages are excluded.** A message whose `isStreaming` is `true` returns `[]` — groups appear only after the turn finalizes. That’s why `persist()` doesn’t need a separate “on end\_turn” callback.

## Reloading a thread

On thread reopen, rebuild the `ConversationMessage[]` your store remembers and seed the hook via `initialMessages`. `contentGroupsToConversationMessages` collapses groups sharing the same `message.id` prefix back into one message:

```tsx
import { contentGroupsToConversationMessages } from "@ggui-ai/react/chat-helpers";

function useSeededInvoke(threadId: string, endpointUrl: string) {
  // Resolve before render — useInvoke captures initialMessages on mount.
  const groups = store.filter((e) => e.threadId === threadId).map((e) => e.group);
  const seed = contentGroupsToConversationMessages(groups);
  return useInvoke({ endpointUrl, initialMessages: seed });
}
```

`initialMessages` is a **seed on mount**. Changing it on re-render does *not* reset hook state — intentional; the hook owns the conversation after mount. To switch threads, unmount the `<Chat>` subtree (set `key={threadId}`) and let the new instance seed from that thread’s store.

## Why `send({ clientMessageId })` matters

`send` (returned by `useInvoke`) accepts an optional `clientMessageId` the caller controls:

```tsx
send("hello", { clientMessageId: crypto.randomUUID() });
```

The rendered user message’s `id` becomes that value, which buys:

* **Retry without duplicates.** Retrying the same send after a network failure, reusing the id generated for the first attempt, produces the same `clientMessageId` → same `ContentGroup.key` → the outbox is idempotent by construction.
* **Cross-device continuity.** Persist user messages optimistically on the sending device; when the agent turn later replays on another device, the same `clientMessageId` collapses both into one thread entry.

Without `clientMessageId`, `useInvoke` falls back to a random `user_` id — fine for ephemeral chats, wrong for durable storage.

## What you still have to do

The helpers stop at *shape*. You still own:

* **Transport to storage.** `store.push(...)` above is an in-memory array for brevity. Replace it with `fetch('/persist')`, an IndexedDB write, a Firestore batch — whatever fits your stack.
* **Thread indexing.** `ContentGroup.textPreview` is the building block for chat-list tiles; wiring it into a sidebar is yours.
* **Reconnect + resume.** `useInvoke` does not replay an interrupted stream. If the page reloads mid-turn, the in-flight assistant message is lost — only finalized groups persist. `useChatThread` (next segment up) closes this gap.

When those boundaries start to hurt, reach for `useChatThread`: same primitives, plus a `MessageStorageAdapter` interface, a `ChatThreadProvider`, the outbox, seed-on-reopen wiring, and the optimistic-send UX.

## See also

* [React SDK reference](/sdk/react/) — `useInvoke`, `useChatThread`, and the `<AppRenderer>` render host
* [Glossary](/glossary/) — gadget, tool, blueprint, render
* [Self-hosted registry](/sdk/self-hosted-registry/) — point `endpointUrl` at your own server

# Custom Theming

> Match ggui-generated UIs to your brand with two-layer theming — design-token CSS variables for your host chrome, plus the operator's ggui.json theme preset for the generated iframe content.

ggui theming has **two independent layers**, and they live on opposite sides of the iframe boundary. Keep them straight:

1. **Your host chrome** — the React app *around* the render (the chat panel, the surrounding page). You theme this with `@ggui-ai/design` tokens: a `<ThemeProvider>` injects `--ggui-*` CSS variables on `:root` and your CSS references them.
2. **The generated UI inside the iframe** — the component the agent rendered. You never style this from the host (it’s sandboxed on a different origin). Its palette comes from the operator’s `ggui.json` `theme` preset, resolved per render and applied as a runtime CSS-variable overlay.

The two are matched by convention: the [`ggui-basic-web`](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) sample picks the **same** preset for its host chrome that the demo `ggui.json` picks for the iframe, so the chat shell and the generated card share one palette.

For the full token reference (palettes, scales, semantic colors), see [Design Tokens](/design/tokens/).

## Layer 1 — host chrome via `<ThemeProvider>`

`<ThemeProvider>` (from `@ggui-ai/design/themes`) takes a raw [DTCG](https://design-tokens.github.io/community-group/format/) token tree and a color mode, injects every token as a `--ggui-*` CSS variable on `:root`, and wires the base `html, body` font + surface colors. Pull a registered preset with `getRawTheme(id, mode)`:

```tsx
import { ThemeProvider, getRawTheme } from "@ggui-ai/design/themes";

// Resolve the raw DtcgTheme token tree for a registered preset.
const INDIGO_DARK = getRawTheme("indigo", "dark");

function App() {
  // Prop names here (theme, mode) are illustrative; check the ThemeProvider
  // reference for the exact shape.
  return (
    <ThemeProvider theme={INDIGO_DARK} mode="dark">
      {/* Your chat panel / page chrome. Every CSS rule that references a
          --ggui-* variable now resolves against the indigo/dark tokens. */}
    </ThemeProvider>
  );
}
```

Your stylesheet then references the injected variables — exactly how the sample’s `globals.css` maps semantic chat-shell roles onto ggui tokens:

```css
:root {
  --bg-app: var(--ggui-color-background);
  --bg-chat: var(--ggui-color-surface);
  --accent: var(--ggui-color-primary-500);
  --fg: var(--ggui-color-onSurface);
  --fg-muted: var(--ggui-color-onSurfaceVariant);
}

.chat {
  background: var(--bg-chat);
  color: var(--fg);
  font-family: var(--ggui-font-family-sans);
}
```

### Discovering presets

`@ggui-ai/design/themes` ships a registry — list what’s available at runtime (e.g. to build a theme picker):

```tsx
import { getThemeIds, listThemes, getDefaultThemeId } from "@ggui-ai/design/themes";

getThemeIds(); // ['ggui', 'indigo', 'claudic', …] — every registered id
listThemes(); // [{ id, name, description, modes }, …] — metadata for a picker
getDefaultThemeId(); // 'ggui'
```

## Layer 2 — generated iframe content via `ggui.json`

You can’t reach into the sandboxed iframe with CSS. Instead, the operator’s `ggui.json` `theme` block sets the app’s default preset.
At each render the server resolves the theme (per-render `themeId` override → App default → server fallback), snapshots it onto the GguiSession, and the iframe-runtime applies it as a `--ggui-*` CSS-variable overlay on the iframe’s `:root` — generated components reference the same token variables your host chrome does, so the palette is a runtime overlay, not generated code:

```json
{
  "schema": "1",
  "protocol": "draft-2026-06-12",
  "app": { "slug": "my-app", "name": "My App" },
  "theme": { "preset": "indigo", "mode": "dark" }
}
```

`theme` also accepts a bare string preset shorthand (`"theme": "indigo"`), an `overrides` map of flat dot-path token tweaks on top of a preset (`{ "preset": "indigo", "mode": "dark", "overrides": { "color.primary.500": "#8b5cf6" } }` — no custom theme file needed), and a `{ "file": "./theme.json", "mode": "dark" }` form pointing at a DTCG theme file. Lint a theme file before serving with `ggui theme validate `.

Match `theme.preset` / `theme.mode` here to the `getRawTheme(...)` you pass to `<ThemeProvider>` in your host, and the chat shell and the rendered card render in one cohesive palette — the sample does exactly this.

### Per-render theme override

The preset isn’t fixed per app. Agents can list the available presets with the `ggui_list_themes` tool (each entry is `{ id, name, description, modes }`) and pick one for a single render via the `themeId` input on `ggui_render` — the override wins over the app default for that render only. `App.availableThemeIds` allowlists which presets agents may pick.

## Ad-hoc CSS token overrides

The token layer is just CSS custom properties with hardcoded fallbacks, so you can also override individual variables without a registered preset — useful for a quick rebrand on top of a chosen theme, or for scoping one subtree.

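The `overrides` map in `ggui.json` names tokens by dot path (`color.primary.500`), while ad-hoc CSS overrides name them by `--ggui-*` variable. The sketch below shows how the two namings appear to line up, assuming each dot-path segment is joined with `-` behind the `--ggui-` prefix. That mapping is inferred from the examples on this page, and `tokenPathToCssVar` / `overridesToCss` are hypothetical helpers, not part of any ggui package.

```typescript
// Hypothetical helpers. The dot-path -> CSS-variable mapping is an assumption
// inferred from this page's examples, not a documented ggui API.
function tokenPathToCssVar(path: string): string {
  return `--ggui-${path.split(".").join("-")}`;
}

// Turn a flat overrides map into a CSS rule you could paste into a stylesheet.
function overridesToCss(overrides: Record<string, string>, selector = ":root"): string {
  const lines = Object.entries(overrides).map(
    ([path, value]) => `  ${tokenPathToCssVar(path)}: ${value};`,
  );
  return `${selector} {\n${lines.join("\n")}\n}`;
}

console.log(overridesToCss({ "color.primary.500": "#8b5cf6" }));
```

This can help keep a host-chrome stylesheet and a `ggui.json` `overrides` block in sync from one source map, if the naming assumption holds for the tokens you use.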
### Global override

Set tokens on `:root` to restyle every host-chrome surface in the page:

```css
:root {
  /* Primary brand color (50 → 900 scale) */
  --ggui-color-primary-50: #eff6ff;
  --ggui-color-primary-500: #3b82f6;
  --ggui-color-primary-600: #2563eb;
  --ggui-color-primary-900: #1e3a8a;

  /* Typography */
  --ggui-font-family-sans: "Inter", system-ui, sans-serif;
  --ggui-font-family-mono: "JetBrains Mono", ui-monospace, monospace;

  /* Shape + density */
  --ggui-spacing-md: 16px;
  --ggui-shape-radius-md: 8px;
  --ggui-shape-shadow-md: 0 8px 16px -4px rgba(15, 23, 42, 0.1);
}
```

These run on top of whatever `<ThemeProvider>` injected — the provider writes the `:root` block first, your stylesheet’s later `:root` rules win on equal specificity.

### Scoped theming

CSS custom properties cascade — wrap any subtree to give it its own accent without leaking to siblings:

```tsx
import type { CSSProperties } from "react";

// Illustrative values: any --ggui-* variable can be set on the wrapper.
const purpleScope = {
  "--ggui-color-primary-500": "#8b5cf6",
  "--ggui-shape-radius-md": "12px",
} as CSSProperties;

<div style={purpleScope}>
  {/* Everything in here picks up the purple primary + softer corners */}
</div>
```

## Token reference

| Category | Pattern | Example |
| ------------------- | --------------------------------------------------------------------------------------------------------------- | -------------------------------------- |
| Colors (primary) | `--ggui-color-primary-{50-900}` | `--ggui-color-primary-600` |
| Colors (neutral) | `--ggui-color-neutral-{50-900}` | `--ggui-color-neutral-200` |
| Colors (status) | `--ggui-color-{success,warning,error,info}-{50,100,200,500,600,700,800}` | `--ggui-color-success-500` |
| Surface + content | `--ggui-color-{surface,onSurface,surfaceVariant,onSurfaceVariant,container,onContainer,outline,outlineVariant}` | `--ggui-color-onSurface` |
| Typography | `--ggui-font-{family,size,weight,lineHeight}-*` | `--ggui-font-size-lg` |
| Spacing | `--ggui-spacing-{xs,sm,md,lg,xl,2xl,3xl}` | `--ggui-spacing-md` |
| Shape — radius | `--ggui-shape-radius-{none,sm,md,lg,xl,2xl,full}` | `--ggui-shape-radius-lg` |
| Shape — shadow | `--ggui-shape-shadow-{none,xs,sm,md,lg,xl,2xl}` | `--ggui-shape-shadow-md` |
| Motion — duration | `--ggui-motion-duration-{instant,fast,normal,slow,slower}` | `--ggui-motion-duration-normal` |
| Motion — transition | `--ggui-motion-transition-{colors,opacity,transform,all}` | `--ggui-motion-transition-colors` |
| Accessibility | `--ggui-accessibility-focusRing-{color,width,offset}` (plus `reducedMotion`, `highContrast`) | `--ggui-accessibility-focusRing-color` |
| Z-Index | `--ggui-zIndex-{hide,base,docked,dropdown,sticky,banner,overlay,modal,popover,skipLink,toast,tooltip}` | `--ggui-zIndex-modal` |

**Status colors are scales** — `success`, `warning`, `error`, and `info` each ship the full 50/100/200/500/600/700/800 stops (no singletons).

**Material role pairs** (`surface`/`onSurface`, `surfaceVariant`/`onSurfaceVariant`, `container`/`onContainer`, `outline`/`outlineVariant`) keep contrast right across nested surfaces — override the pair together, never one half.

See [Design Tokens](/design/tokens/) for live swatches and the complete scale.

## See also

* [Design Tokens](/design/tokens/) — full palette, scales, and live swatches
* [React SDK](/sdk/react/) — `<AppRenderer>` + `useMcpAppsChat`, the web render-hosting surface
* [`ggui-basic-web` sample](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) — the runnable reference that matches host chrome to `ggui.json`

# Error Handling

> Classify ggui failures across HTTP, JSON-RPC, and the renderer's ggui:observe channel — and recover from each at its layer.

ggui surfaces failures at three layers. Handle each at its layer:

1. **HTTP** — the MCP transport returns `401` / `403` / `429` / `5xx` before the JSON-RPC body is even parsed.
2. **JSON-RPC** — a `tools/call` reaches the server but the server returns a `-32xxx` error code (or a tool-level failure with `isError: true`).
3. **Live channel** — a failure *after* a successful render. Action-validation rejections ride the live-channel WebSocket as typed `error` frames carrying `code: 'CONTRACT_VIOLATION'` (numeric `-32020`) and surface in the renderer, not the agent; nothing lands on the consume buffer.

## Layer 1 — HTTP errors from the MCP transport

These come back as `IsHttpError` / response status from whichever HTTP client your MCP SDK uses. Treat them as transport failures — the server hasn’t even looked at your JSON-RPC payload yet.

| Status | Meaning | Retry? |
| ------------- | ----------------------------------- | -------------------------------- |
| `401` | Bad / expired API key | No — fix config |
| `403` | Key valid but app not authorized | No — fix config |
| `429` | Rate-limited (`Retry-After` header) | Yes, after `Retry-After` seconds |
| `5xx` | Transient server failure | Yes, with exponential backoff |
| network error | DNS / connection / TLS failure | Yes, with exponential backoff |

The `Retry-After` header (when present) is authoritative — honor it verbatim. See [`/api/rate-limits/`](/api/rate-limits/) for the 429 response shape (body + header) and a raw-HTTP backoff recipe.

## Layer 2 — JSON-RPC errors from the server

A 200 HTTP response can still carry a JSON-RPC error. The MCP SDKs surface these as thrown errors with a numeric `code`; raw HTTP callers see `{ "error": { "code": -32xxx, "message": "..." } }` in the response body.

| Code | Name | When | Retry? |
| -------- | ------------------- | -------------------------------- | ------------------------ |
| `-32700` | Parse Error | Invalid JSON in request | No — fix the call |
| `-32600` | Invalid Request | Not a valid JSON-RPC object | No — fix the call |
| `-32601` | Method Not Found | Unknown tool name | No — fix the call |
| `-32602` | Invalid Params | Missing / invalid tool arguments | No — fix the call |
| `-32603` | Internal Error | Server-side failure | Yes, with backoff |
| `-32001` | Unauthorized | Invalid token or app ID | No — fix config |
| `-32002` | Session Not Found | Session expired or reaped | Re-handshake + render |
| `-32003` | App Not Found | App ID does not exist | No — fix config |
| `-32004` | Production Failed | UI generation failed | Yes — try simpler intent |
| `-32005` | Capability Denied | Requested capability not allowed | No — fix config |
| `-32013` | Rate Limit Exceeded | Platform rate limit hit | Yes, after backoff |

Platform deployments also reserve the `-32010` range (`-32010` generation quota, `-32011` app limit, `-32012` concurrent-session limit, `-32020` contract violation). Full table with descriptions: [`/api/mcp-protocol/#error-codes`](/api/mcp-protocol/#error-codes).

Tool-level failures (the tool ran but returned an error result) come back as a successful JSON-RPC response with `isError: true` on the `tools/call` result content. Inspect `result.content` for the failure detail. On the OSS server the `ggui_*` tools surface their domain failures this way — `isError` results whose typed error classes carry string codes like `handshake_not_found` and `session_not_found` — while the numeric `-32xxx` table above is the protocol-level canonical set.

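The Retry? column of the table above can be encoded as data, so call sites ask one question instead of repeating code comparisons. This is a minimal sketch: the `RetryPolicy` labels and the `classifyJsonRpcError` / `isRetryable` helpers are illustrative names rather than part of ggui or the MCP SDK, and only the numeric codes come from the table.

```typescript
// Illustrative policy labels; only the numeric codes come from the table above.
type RetryPolicy =
  | "fix-call"
  | "fix-config"
  | "backoff"
  | "rehandshake"
  | "simplify-intent"
  | "unknown";

const POLICY_BY_CODE = new Map<number, RetryPolicy>([
  [-32700, "fix-call"],
  [-32600, "fix-call"],
  [-32601, "fix-call"],
  [-32602, "fix-call"],
  [-32603, "backoff"],
  [-32001, "fix-config"],
  [-32002, "rehandshake"],
  [-32003, "fix-config"],
  [-32004, "simplify-intent"],
  [-32005, "fix-config"],
  [-32013, "backoff"],
]);

// Accepts any thrown value; anything without a known numeric `code` is "unknown".
function classifyJsonRpcError(error: unknown): RetryPolicy {
  const code = (error as { code?: unknown } | null | undefined)?.code;
  if (typeof code !== "number") return "unknown";
  return POLICY_BY_CODE.get(code) ?? "unknown";
}

// Only these policies are worth re-sending the same (or a simplified) call for.
function isRetryable(policy: RetryPolicy): boolean {
  return policy === "backoff" || policy === "simplify-intent";
}

console.log(classifyJsonRpcError({ code: -32002 }), isRetryable("backoff"));
```

Keeping the table as a `Map` means a new code is a one-line change, and unrecognized codes fall through to `"unknown"` instead of being silently retried.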
### Retry with exponential backoff (raw `@modelcontextprotocol/sdk`)

The SDK throws on HTTP failures and on JSON-RPC errors alike; you classify by `error.code` (JSON-RPC) or by reading the HTTP status off the underlying response. The pattern below works for both layers.

```typescript
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js";

const client = new Client({ name: "my-agent", version: "1.0.0" });
const transport = new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), {
  requestInit: { headers: { Authorization: "Bearer dev" } },
});
await client.connect(transport);

async function callWithRetry<T>(
  name: string,
  args: Record<string, unknown>,
  maxRetries = 3
): Promise<T> {
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    try {
      const result = await client.callTool({ name, arguments: args });
      if (result.isError) {
        // Tool-level failure — payload is in result.content.
        throw new Error(`Tool ${name} returned isError: ${JSON.stringify(result.content)}`);
      }
      return result.structuredContent as T;
    } catch (error) {
      if (attempt === maxRetries) throw error;

      // JSON-RPC error: error.code is a -32xxx number.
      const code = (error as { code?: number }).code;

      // Permanent — surface immediately.
      if (code === -32001 || code === -32003) throw error; // auth / app config
      if (code === -32600 || code === -32601 || code === -32602) throw error; // bad request

      // Render expired — caller replays at the render layer (see below).
      if (code === -32002) throw error;

      // Rate-limited (HTTP 429) — honor Retry-After if the SDK surfaces it.
      const retryAfter = (error as { retryAfter?: number }).retryAfter;
      if (retryAfter != null) {
        await sleep(retryAfter * 1000);
        continue;
      }

      // Transient (5xx, network, -32603, -32004) — exponential backoff.
      await sleep(Math.min(1000 * 2 ** attempt, 10_000));
    }
  }
  throw new Error("unreachable");
}

const sleep = (ms: number) => new Promise((r) => setTimeout(r, ms));
```

### Recover from an expired render (`-32002`)

`-32002 Session Not Found` is the protocol-level canonical code for “the session is gone”. On the OSS server, though, expiry never surfaces as a numeric JSON-RPC error from the `ggui_*` tools — the call succeeds at the JSON-RPC layer and returns `isError: true` with the failure detail in the result content:

* **`handshake_not_found`** (from `ggui_render`) — handshake records are single-use and TTL’d (10 minutes); the supplied `handshakeId` was unknown, already consumed, or expired. (A render *consumes* its handshake — handshakes aren’t bound to renders.)
* **`session_not_found`** (from `ggui_consume` / `ggui_update` / `ggui_emit` / `ggui_get_session`) — the `sessionId` was never minted, expired via TTL, or belongs to another app.

Recovery is the same in every case: re-run `ggui_handshake` → `ggui_render`, which mints a fresh `sessionId`. (Other `isError` results from `ggui_render` — contract violations, schema mismatches — leave the handshake **alive**: fix the arguments and retry on the *same* `handshakeId`.)

```typescript
function errorText(result: { isError?: boolean; content?: unknown }): string {
  const blocks = (result.content ?? []) as Array<{ type: string; text?: string }>;
  return blocks
    .filter((b) => b.type === "text")
    .map((b) => b.text)
    .join(" ");
}

async function resilientRender(intent: string, contract: object, props: Record<string, unknown>) {
  const mintHandshake = async () => {
    const hs = await client.callTool({
      name: "ggui_handshake",
      arguments: { intent, blueprintDraft: { contract } },
    });
    return (hs.structuredContent as { handshakeId: string }).handshakeId;
  };

  // Negotiate, then render. `ggui_render` takes { handshakeId, props } —
  // accept the suggestion as-is (no `override`).
  let result = await client.callTool({
    name: "ggui_render",
    arguments: { handshakeId: await mintHandshake(), props },
  });

  // Expired / already-consumed handshake → mint a fresh
  // handshake → render pair (a fresh sessionId comes with it).
  if (result.isError && /handshakeId .* not found/i.test(errorText(result))) {
    result = await client.callTool({
      name: "ggui_render",
      arguments: { handshakeId: await mintHandshake(), props },
    });
  }
  return result;
}
```

## Layer 3 — typed failures on the live channel

The transport- and JSON-RPC-level errors above fire when the **request** fails. A separate layer surfaces **after** a successful render: when the renderer dispatches an action that fails validation (undeclared action name, payload rejected by the declared `actionSpec[name].schema`), the server answers with a typed `error` frame carrying `code: 'CONTRACT_VIOLATION'` — and nothing lands on the consume buffer.

These ride the live-channel WebSocket alongside the renderer — not the agent-side MCP poll — so there is no JSON-RPC error to catch on the agent. The renderer observes them and surfaces an error activity row.

Earlier protocol drafts reserved a `_ggui:contract-error` channel carrying a `ContractErrorPayload` envelope, but it never gained a first-party emitter and was removed in `draft-2026-06-11` — the channel, payload shape, validator, and code union are all deleted. Contract failures now surface on the call that caused them: inbound action violations answer with a `CONTRACT_VIOLATION` (`-32020`) `error` frame on the live channel, and nothing reaches the consume buffer; `ggui_render` / `ggui_emit` validation failures reject the agent’s own tool call; push-time schema mismatches reject with `SCHEMA_MISMATCH_ERROR`.
The reserved `_ggui:` namespace itself survives — the only first-party reserved channels today are `_ggui:lifecycle` and `_ggui:preview` — and your `streamSpec` MUST NOT declare reserved names.

### Translate errors to user-facing messages

Keep user-visible copy at one layer; never leak stack traces or JSON-RPC codes to end users.

```typescript
function getUserMessage(error: unknown): string {
  const code = (error as { code?: number }).code;
  const status = (error as { status?: number }).status;

  if (status === 401 || code === -32001) return "Authentication failed — contact support.";
  if (status === 429) return "Too many requests — please slow down.";
  if (code === -32002) return "Your session expired.";
  if (code === -32004) return "We couldn't generate that UI — try a simpler request.";
  if (status && status >= 500) return "The server is temporarily unavailable.";
  return "Something went wrong.";
}
```

***

## Renderer-side: faults stay inside the iframe

The canonical web host mounts each render in a **sandboxed iframe** via `<AppRenderer>` (from `@mcp-ui/client`, driven by `useMcpAppsChat` — see [React SDK](/sdk/react/)). That sandbox is the fault boundary: an error thrown by LLM-generated component code is contained to its own iframe and **cannot crash your host React tree**. You don’t wrap renders in a host-side error boundary — the origin isolation does that structurally.

Two host-observable failure surfaces matter.

### Transport faults — `<AppRenderer onError>`

`<AppRenderer>`’s own `onError` fires for iframe/transport-level failures (the sandbox bundle failed to load, the runtime failed to boot). It receives a plain `Error`. Log it or show a placeholder in the frame:

```tsx
<AppRenderer
  // …your other AppRenderer props…
  onError={(err) => console.warn("[render] AppRenderer error", err)}
/>
```

(This is exactly what the [`ggui-basic-web`](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) sample does.)

### Structured failures — the `ggui:observe` channel

After a successful mount, ggui’s iframe-runtime emits a typed `ObservabilityEvent` to the parent on a dedicated postMessage channel — `{ type: "ggui:observe", event }`. This is where runtime health signals surface, so a host can show an activity row without parsing wire frames. The union is **extensibly-closed** — match the kinds you know, treat the unknown tail as generic:

| `event.kind` | When |
| ------------------------- | --------------------------------------------------------------------------------------------------------------------------- |
| `subscribe-failed` | A non-fatal subscribe failure the reconnect ladder is handling. |
| `schema-version-mismatch` | The protocol-version handshake rejected the connection. |
| `auth-required` | A tool the agent calls needs OAuth consent — carries `provider` + `authUrl`. See [Auth-Gated UI](/cookbook/auth-gated-ui/). |

The `ObservabilityEvent` union + its member types are re-exported from `@ggui-ai/react` (`import type { ObservabilityEvent } from "@ggui-ai/react"`). The `{ type: "ggui:observe", event }` envelope is specified in [Bootstrap handshake](/protocol/bootstrap-handshake/).

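One way a host could consume that envelope is sketched below: match the known kinds and fall back to a generic row for anything else. The `ObserveEventLike` type is a local stand-in so the snippet stays self-contained (in an app you would import `ObservabilityEvent` from `@ggui-ai/react`), and `readObserveEvent`, `activityRowFor`, and the message strings are hypothetical. Only `kind`, `provider`, and `authUrl` are taken from the table above; no other event fields are assumed.

```typescript
// Local stand-in for the documented envelope; not the library's own type.
interface ObserveEventLike {
  kind: string;
  [field: string]: unknown;
}

// Returns the event if `data` is a `{ type: "ggui:observe", event }` envelope, else null.
function readObserveEvent(data: unknown): ObserveEventLike | null {
  if (typeof data !== "object" || data === null) return null;
  const msg = data as { type?: unknown; event?: unknown };
  if (msg.type !== "ggui:observe") return null;
  const event = msg.event;
  if (typeof event !== "object" || event === null) return null;
  const kind = (event as { kind?: unknown }).kind;
  if (typeof kind !== "string") return null;
  return { ...(event as Record<string, unknown>), kind };
}

// Known kinds get specific copy; the unknown tail is treated as generic.
function activityRowFor(event: ObserveEventLike): string {
  switch (event.kind) {
    case "subscribe-failed":
      return "Connection hiccup, retrying";
    case "schema-version-mismatch":
      return "Protocol version mismatch";
    case "auth-required":
      return `Sign-in required (${String(event.provider)}): ${String(event.authUrl)}`;
    default:
      return `Renderer event: ${event.kind}`;
  }
}

// Browser wiring (not executed here): check `e.source` against your iframe first.
// window.addEventListener("message", (e) => {
//   const event = readObserveEvent(e.data);
//   if (event) showActivityRow(activityRowFor(event)); // showActivityRow is your own UI hook
// });

const sample = readObserveEvent({ type: "ggui:observe", event: { kind: "subscribe-failed" } });
console.log(sample ? activityRowFor(sample) : "ignored");
```

The `default` branch is what keeps the host working when a newer runtime emits a kind this code has never seen.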
***

## See also

* [`/api/mcp-protocol/#error-codes`](/api/mcp-protocol/#error-codes) — full JSON-RPC error-code table, plus `CONTRACT_VIOLATION` (`-32020`)
* [`/api/rate-limits/`](/api/rate-limits/) — HTTP `429` response shape, `Retry-After` semantics, raw-HTTP backoff recipe
* [`@ggui-ai/react` SDK reference](/sdk/react/) — `<AppRenderer>` + `useMcpAppsChat`, the web render host
* [Bootstrap handshake](/protocol/bootstrap-handshake/) — the `ggui:observe` channel + `ObservabilityEvent` catalog
* [Troubleshooting](/troubleshooting/) — common errors and their root causes

# Feedback Form

> Configure an agent to render a feedback form via ggui's MCP server, then read the typed result — the smallest end-to-end ggui pattern.

The smallest useful ggui interaction: an agent renders a form, the user submits, the agent reads the typed data. Following the **Zero Agent Code** principle, you don’t hand-call `handshake` / `render` / `consume` — you configure the agent host with ggui’s MCP server and the LLM autonomously drives the loop from a user prompt.

Two flavors below — the canonical LLM-driven shape via the Claude Agent SDK, and a manual orchestration variant using `@modelcontextprotocol/sdk` when you need imperative control.

## Canonical: Claude Agent SDK + ggui MCP

The developer’s job: define the typed contract, configure the host, write a user-facing prompt. The LLM autonomously calls `ggui_handshake` → `ggui_render` → `ggui_consume` and surfaces the typed payload.

```typescript
import { query } from "@anthropic-ai/claude-agent-sdk";
import { defineContract } from "@ggui-ai/protocol";

// Typed contract — `actionData` for `submit` narrows to {rating, comments}.
// The contract still lives in code: it's how you document the shape
// the agent should negotiate during `ggui_handshake`.
// NOTE: the intent string is NOT a contract field — it travels separately
// as the flat `intent` argument of `ggui_handshake` (see the manual
// orchestration variant below).
const feedbackContract = defineContract({
  propsSpec: {
    properties: {
      userName: { schema: { type: "string" } },
      product: { schema: { type: "string" } },
    },
  },
  actionSpec: {
    submit: {
      label: "Submit feedback",
      schema: {
        type: "object",
        properties: {
          rating: { type: "number", minimum: 1, maximum: 5 },
          comments: { type: "string" },
        },
        required: ["rating", "comments"],
      },
    },
  },
} as const);

async function collectFeedback(userName: string, product: string) {
  const result = query({
    prompt: `Collect product feedback from ${userName} about ${product}.
Render a feedback form with a 1–5 star rating, a comments text area, and a submit button.
Greet the user by name. Wait for their submission, then report the rating and
comments back to me as JSON: {"rating": number, "comments": string}.`,
    options: {
      mcpServers: {
        ggui: {
          type: "http",
          url: "http://127.0.0.1:6781/mcp", // ggui serve --dev-allow-all
          headers: {
            Authorization: "Bearer dev",
          },
        },
      },
      allowedTools: [
        "mcp__ggui__ggui_handshake",
        "mcp__ggui__ggui_render",
        "mcp__ggui__ggui_consume",
      ],
    },
  });

  // The LLM drives ggui_handshake → ggui_render → ggui_consume on its own.
  // The render surfaces as an MCP-Apps resource the host mounts;
  // the typed payload comes back in the final message.
  for await (const message of result) {
    if (message.type === "assistant") {
      console.log(message.message.content);
    }
  }
}
```

For a complete runnable example (including streaming partial events, multi-turn refinement, and the host’s MCP wiring), see [Examples → Claude Agent](/examples/claude-agent/).

## Manual orchestration: `@modelcontextprotocol/sdk`

Use this shape when you need imperative control — e.g. you’re building a non-LLM workflow, testing the protocol directly, or reacting to every action the user fires (not just the terminal submit). This calls ggui’s MCP tools directly via the official MCP SDK; no LLM in the loop.

```typescript
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js";

const transport = new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), {
  requestInit: {
    headers: { Authorization: "Bearer dev" },
  },
});
const client = new Client({ name: "feedback-script", version: "1.0.0" });
await client.connect(transport);

async function collectFeedbackLive(userName: string, product: string) {
  // 1. Handshake — negotiate the contract before rendering.
  //    Pre-launch, `ggui_render` is handshake-first only.
  const hsResp = await client.callTool({
    name: "ggui_handshake",
    arguments: {
      intent: "Collect product feedback (live)",
      blueprintDraft: {
        contract: feedbackContract, // defined in the canonical example above
        variance: {
          seedPrompt: `Show ${userName} a product feedback form for ${product}.`,
        },
      },
    },
  });
  const { handshakeId } = JSON.parse((hsResp.content[0] as { type: "text"; text: string }).text);

  // 2. Render — accept the handshake's suggestion (omit `override`).
  //    `ggui_render` mints the `sessionId`.
  const renderResp = await client.callTool({
    name: "ggui_render",
    arguments: {
      handshakeId,
      props: { userName, product },
    },
  });
  const { sessionId, resourceUri } = JSON.parse(
    (renderResp.content[0] as { type: "text"; text: string }).text
  );
  console.log(`Render ${sessionId} ready (${resourceUri}) — a host mounts it.`);

  // 3. Consume — long-poll until the user submits.
  while (true) {
    const consumeResp = await client.callTool({
      name: "ggui_consume",
      arguments: { sessionId, timeout: 25 },
    });
    const { events, status } = JSON.parse(
      (consumeResp.content[0] as { type: "text"; text: string }).text
    );
    for (const entry of events) {
      // Every consume entry has `type: 'action'`; the contract's
      // `actionSpec` key fires on `entry.intent`.
      if (entry.intent === "submit") {
        console.log("Submitted:", entry.actionData);
        return entry.actionData;
      }
      console.log(`Action ${entry.intent}:`, entry.actionData);
    }
    if (status !== "active") break; // 'expired' — render TTL elapsed
  }
}
```

**No `sleep()` between polls.** Pass a `timeout` to `ggui_consume` (recommended: `15` or `25` seconds for chat agents — `25` is the maximum; the server rejects anything above 25 with `INVALID_PARAMS`) so the server long-polls. A busy `while` loop without a timeout — or with a hand-rolled `setTimeout` between calls — burns budget and adds latency.

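In the manual loop, `entry.actionData` arrives through `JSON.parse`, so TypeScript treats it as `any`. The sketch below narrows it back to the shape the `submit` schema declares (a `rating` from 1 to 5 and a `comments` string, both required). `FeedbackSubmission`, `isFeedbackSubmission`, and `firstSubmission` are hypothetical names for illustration; the entry fields `intent` and `actionData` are the ones used in the loop above.

```typescript
// Mirrors the `submit` schema: rating is a number in [1, 5], comments is a string.
interface FeedbackSubmission {
  rating: number;
  comments: string;
}

// Runtime type guard for a parsed `actionData` payload.
function isFeedbackSubmission(value: unknown): value is FeedbackSubmission {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Record<string, unknown>;
  return (
    typeof v.rating === "number" &&
    v.rating >= 1 &&
    v.rating <= 5 &&
    typeof v.comments === "string"
  );
}

// Scans one batch of consume entries for the first well-formed `submit` payload.
function firstSubmission(
  events: Array<{ intent?: unknown; actionData?: unknown }>,
): FeedbackSubmission | null {
  for (const entry of events) {
    if (entry.intent === "submit" && isFeedbackSubmission(entry.actionData)) {
      return entry.actionData;
    }
  }
  return null;
}

const batch = [
  { intent: "focus", actionData: {} },
  { intent: "submit", actionData: { rating: 4, comments: "Great product" } },
];
console.log(firstSubmission(batch));
```

The guard is a client-side convenience for typing; it does not replace the server's own validation of actions against the contract.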
## What the user sees

The agent renders the form into whatever MCP-Apps host the user is in: inline in claude.ai / Claude Desktop, or in your own app via a render component such as `AppRenderer` (see the [React SDK](/sdk/react/)). The user fills it in and submits; the agent reads the typed result off `ggui_consume`:

```plaintext
Rating: 4/5 — Great product, would love dark-mode support!
```

There is no URL to hand the user: the render is an MCP-Apps resource the host mounts. (For local development without an MCP-Apps host, the operator console bundled with `ggui serve` can display the render for debugging.)

## Related

* [Examples → Claude Agent](/examples/claude-agent/): full runnable Claude Agent SDK + ggui MCP example
* [MCP protocol reference](/api/mcp-protocol/): wire-level `ggui_handshake` / `ggui_render` / `ggui_consume` shapes
* [Event system](/architecture/event-system/): `actionSpec`-driven flow and the `ConsumeEventEntry` shape
* [Multi-step wizard](/cookbook/multi-step-wizard/): chain forms across successive renders
* [Glossary](/glossary/): `render`, `contract`, `envelope`, `blueprint`

# Multi-Step Wizard

> Let the LLM sequence a back-navigable wizard by minting a fresh render per step and prefilling on back-navigation.

Wizards in ggui mint a **fresh GguiSession per step** (each keyed by its own `sessionId`). Each step is its own `handshake → render → consume` round-trip; back-navigation re-renders the prior step with the prior `actionData` plumbed in as `props.prefill`.

## Pattern

1. The host opens an MCP session against the ggui server and exposes the `ggui_*` tools to the LLM. No wrapper SDK in the agent process.
2. You describe the wizard once in the user prompt and define one typed contract per step with `defineContract`.
3. The LLM autonomously sequences: `ggui_handshake` → `ggui_render` → `ggui_consume` → react on `intent` → next step’s `ggui_handshake` → `ggui_render` → … .
4. If a consume entry’s `intent` is `"back"`, the LLM re-handshakes the prior step with `props.prefill` carrying the prior `actionData` so values are restored.

## LLM-driven flow (canonical — Claude Agent SDK)

Configure a Claude Agent SDK host with the ggui MCP server, allow the `ggui_*` tools, and ship a wizard-shaped prompt. The model does the rest.

```typescript
import { query } from "@anthropic-ai/claude-agent-sdk";
import { GGUI_AGENT_SYSTEM_PROMPT, defineContract } from "@ggui-ai/protocol";

// One contract per step. `defineContract({...} as const)` infers TS payload
// types from the JSON Schemas, so there are no parallel interface definitions.
// A DataContract declares data flow only (propsSpec / actionSpec /
// streamSpec / contextSpec); there are no `name` or `layout` fields.
// The step header ("Step 1 of 3 — Personal Information") travels as the
// handshake `intent` (and, for finer aim, `blueprintDraft.variance.seedPrompt`).
const personalInfoContract = defineContract({ actionSpec: { submit: { label: "Next", schema: { type: "object", required: ["fullName", "email", "phone"], properties: { fullName: { type: "string" }, email: { type: "string", format: "email" }, phone: { type: "string" }, }, }, }, }, } as const); // ... companyInfoContract + reviewContract defined similarly (also expose a // "back" actionSpec entry on step 2 and 3). for await (const event of query({ prompt: `Walk me through a 3-step onboarding wizard. Step 1 (Personal Info): use this contract verbatim ${JSON.stringify(personalInfoContract)} Step 2 (Company Info, with Back): use this contract verbatim ${JSON.stringify(companyInfoContract)} Step 3 (Review & Confirm): use this contract verbatim ${JSON.stringify(reviewContract)} Render step 3 with props.summary = {personal, company} so the summary can render. Navigation rules: - After each ggui_render, call ggui_consume and inspect events[].intent. - intent === "submit": advance to the next step's handshake + render, threading prior data via the render's props. - intent === "back": re-handshake the prior step's contract with props.prefill = priorActionData so values are restored. - After step 3 submit, simply stop — renders decay implicitly via TTL.`, options: { model: "claude-haiku-4-5", systemPrompt: GGUI_AGENT_SYSTEM_PROMPT, mcpServers: { ggui: { type: "http", url: process.env.GGUI_MCP_URL!, // e.g. http://127.0.0.1:6781/mcp headers: { Authorization: `Bearer ${process.env.GGUI_API_KEY!}` }, }, }, allowedTools: [ "mcp__ggui__ggui_handshake", "mcp__ggui__ggui_render", "mcp__ggui__ggui_consume", ], tools: [], strictMcpConfig: true, }, })) { // Inspect events for logging; the LLM drives each render autonomously. 
  if (event.type === "assistant") console.log(event.message);
}
```

## Manual orchestration (raw MCP, no Claude in the loop)

When you need explicit control over each step (testing, deterministic playback, a non-LLM driver), open a raw MCP session and call the `ggui_*` tools yourself.

```typescript
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js";
import { defineContract } from "@ggui-ai/protocol";

const transport = new StreamableHTTPClientTransport(new URL(process.env.GGUI_MCP_URL!), {
  requestInit: { headers: { Authorization: `Bearer ${process.env.GGUI_API_KEY!}` } },
});
const mcp = new Client({ name: "wizard-driver", version: "1.0.0" });
await mcp.connect(transport);

// Call a ggui tool and parse the JSON text of its first content entry.
async function call<T>(name: string, args: unknown): Promise<T> {
  const res = await mcp.callTool({ name, arguments: args as Record<string, unknown> });
  return JSON.parse((res.content[0] as { text: string }).text) as T;
}

// Run one wizard step: handshake, render, then long-poll for a terminal action.
async function step<TIntent extends string>(
  contract: unknown,
  intent: string,
  prefill?: Record<string, unknown>
): Promise<{ intent: TIntent; actionData: unknown; sessionId: string }> {
  // ggui_handshake({ intent, blueprintDraft }) → { handshakeId, action, suggestion }
  const hs = await call<{ handshakeId: string }>("ggui_handshake", {
    intent,
    blueprintDraft: { contract },
  });
  // ggui_render({ handshakeId, props }): props is REQUIRED (pass {} when none).
  const { sessionId } = await call<{ sessionId: string }>("ggui_render", {
    handshakeId: hs.handshakeId,
    props: prefill ? { prefill } : {},
  });
  // Long-poll ggui_consume until the user submits or goes back.
  for (;;) {
    const { events, status } = await call<{
      events: Array<{ intent: string; actionData: unknown }>;
      status: "active" | "expired";
    }>("ggui_consume", { sessionId, timeout: 25 });
    // `intent` is the actionSpec key the iframe dispatched against; "back"
    // is its own entry, not a sub-field of "submit".
    const entry = events.find((e) => e.intent === "back" || e.intent === "submit");
    if (entry) return { ...(entry as { intent: TIntent; actionData: unknown }), sessionId };
    if (status === "expired") throw new Error("render expired before a terminal action");
  }
}

// Step 1 (`let`, because a revisit replaces it with the resubmitted values)
let personal = await step<"submit">(
  personalInfoContract,
  "Onboarding step 1 of 3 — personal info"
);

// Step 2 with back handling
let company: { intent: "submit"; actionData: unknown; sessionId: string } | null = null;
while (!company) {
  const r = await step<"submit" | "back">(
    companyInfoContract,
    "Onboarding step 2 of 3 — company info"
  );
  if (r.intent === "back") {
    // Re-render step 1 with the prior actionData as prefill. A fresh sessionId
    // is minted; the prior render decays via TTL on its own. Keep the
    // resubmitted values so later steps see the user's edits.
    personal = await step<"submit">(
      personalInfoContract,
      "Onboarding step 1 of 3 — personal info (revisit)",
      personal.actionData as Record<string, unknown>
    );
    continue;
  }
  company = r as { intent: "submit"; actionData: unknown; sessionId: string };
}
// No explicit close: the render decays implicitly via TTL after step 3 submit.
```

## Step lineage

```plaintext
Step 1 render:     sessionId=r_p1  (personal info)
Step 2 render:     sessionId=r_c1  (company info)   ← user sees Company
User clicks Back:  -- no pop --    (r_c1 decays via TTL)
Step 1 revisit:    sessionId=r_p2  (prefilled with prior personal data)
Step 2 revisit:    sessionId=r_c2
Step 3:            sessionId=r_r1  (review & confirm)
```

Each step is an independent render with its own `sessionId`. There is no stack: renders are flat. Prior renders decay via TTL after the agent moves on.
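The flat, stack-free lineage generalizes to any number of steps. The following is a minimal sketch under stated assumptions, not ggui API: `runWizard`, `RunStep`, and `StepOutcome` are hypothetical names, and `runStep` stands in for one handshake → render → consume round-trip, so the sequencing logic can be exercised without a server.

```typescript
// Hypothetical types for this sketch; `intent` and `actionData` mirror the
// consume entry fields used above.
type StepOutcome = { intent: "submit" | "back"; actionData: Record<string, unknown> };
type RunStep = (index: number, prefill?: Record<string, unknown>) => Promise<StepOutcome>;

// Drive `stepCount` steps in order. Each call to `runStep` stands for one
// round-trip that mints a fresh render. On "back", move to the previous step
// and prefill it with the answers it last submitted.
async function runWizard(
  stepCount: number,
  runStep: RunStep
): Promise<Array<Record<string, unknown>>> {
  const answers: Array<Record<string, unknown> | undefined> = new Array(stepCount).fill(undefined);
  let i = 0;
  while (i < stepCount) {
    const outcome = await runStep(i, answers[i]);
    if (outcome.intent === "back") {
      i = Math.max(0, i - 1); // no stack to pop; just re-render the prior step
      continue;
    }
    answers[i] = outcome.actionData; // keep the latest submission for this step
    i += 1;
  }
  return answers as Array<Record<string, unknown>>;
}
```

Keeping the answers in a plain array indexed by step is what makes back-navigation cheap: the prior render is never reused, only its last `actionData` is.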
## Inspecting a session’s state

`ggui_get_session` returns the full `GguiSession` (id, appId, status, eventSequence, timestamps, plus the spec fields: propsSpec/actionSpec/contextSpec/streamSpec). It is useful for debugging, or for a driver that needs to know whether a session is still active.

```typescript
const state = await call<{ id: string; status: string }>("ggui_get_session", {
  sessionId: company!.sessionId,
});
console.log(`Render status: ${state.status}`);
```

## See also

* [Claude Agent example](/examples/claude-agent/): end-to-end LLM-driven loop wired to a ggui MCP server
* [MCP protocol reference](/api/mcp-protocol/): full `ggui_*` tool signatures
* [GguiSession](/glossary/#gguisession-a-render): the unit minted by each step
* [Feedback Form](/cookbook/feedback-form/): single-step variant of the same pattern

# Real-Time Dashboard

> Stream live data into a generated UI by declaring a streamSpec channel on the contract and pushing updates with ggui_emit. The iframe owns the live channel; your host app just mounts the render.

Live data in ggui is a **contract concern**, not a host-app concern. You don’t open a WebSocket in your React app and feed events into the render. Instead:

1. The agent declares a **`streamSpec`** channel on the contract: a typed, named live channel.
2. The agent renders the contract once, then pushes updates with **`ggui_emit`**.
3. The **generated component inside the iframe** subscribes to that channel and repaints as deliveries arrive.

The iframe-runtime owns the channel transport (it picks WebSocket or polling per channel). Your host app does nothing dashboard-specific: it mounts the render with `<AppRenderer>` (via `useMcpAppsChat`) like any other ggui UI. The live updates flow **inside** the sandboxed iframe and never pass through your host code.

## 1. Declare a `streamSpec` channel

A `streamSpec` is a flat record on the contract, keyed by channel name. Each channel declares a JSON Schema its payloads must satisfy, plus optional accumulation + replay behavior:

```typescript
const contract = {
  // What the component shows initially (validated like any props).
  propsSpec: {
    properties: {
      title: { schema: { type: "string" }, required: true },
    },
  },
  // Live channels. Payloads on `metrics` must match its schema.
streamSpec: { metrics: { description: "Live throughput + error-rate samples for the dashboard.", schema: { type: "object", properties: { ts: { type: "string" }, rps: { type: "number" }, errorRate: { type: "number" }, }, required: ["ts", "rps"], }, mode: "append", // accumulate each delivery into a series replay: "all", // a late-subscribing iframe gets the full backlog complete: true, // allow a terminal ggui_emit({ complete: true }) }, }, }; ``` | Channel field | Meaning | | ------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------- | | `schema` | **Required.** JSON Schema every payload on this channel must satisfy. `ggui_emit` validates against it. | | `mode` | `"append"` accumulates deliveries into a series (a feed / chart); `"replace"` keeps only the latest (a single live value). Optional (default `"append"`). | | `replay` | What a freshly-mounted iframe receives: `"all"` (full backlog), `"latest"` (last delivery only), `"none"`. Optional (default `"none"`). | | `complete` | Declare `complete: true` to allow a terminal `ggui_emit({ complete: true })` that closes the channel; undeclared channels reject it. Optional. | | `description` | Natural-language hint that steers how the generator wires the component to the channel. Optional but recommended. | | `source` | `{ tool, args? }` — declare a **pull** channel the runtime polls instead of you pushing. See [Pull channels](#pull-channels-source). | The example’s `replay: "all"` and `complete: true` are both explicit opt-ins on top of the defaults. ## 2. Render, then emit [Section titled “2. Render, then emit”](#2-render-then-emit) Negotiate + materialize the contract through the normal `ggui_handshake` → `ggui_render` flow, then push deliveries with `ggui_emit`. 
Payloads validate against `streamSpec[channel].schema`; a final `complete: true` closes the channel (allowed because the declaration above opted in with `complete: true`). ```typescript import { Client } from "@modelcontextprotocol/sdk/client/index.js"; import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js"; const client = new Client({ name: "dashboard-agent", version: "1.0.0" }); await client.connect( new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), { requestInit: { headers: { Authorization: "Bearer dev" } }, }) ); // Negotiate a blueprint for the dashboard contract. const handshake = await client.callTool({ name: "ggui_handshake", arguments: { intent: "Live ops dashboard — throughput + error rate", blueprintDraft: { contract }, }, }); const { handshakeId } = handshake.structuredContent as { handshakeId: string }; // Materialize it. `props` is REQUIRED — pass the initial values. const render = await client.callTool({ name: "ggui_render", arguments: { handshakeId, props: { title: "Ops — us-east-1" } }, }); const { sessionId } = render.structuredContent as { sessionId: string }; // Stubs — swap in your real metric source. const sleep = (ms: number) => new Promise((r) => setTimeout(r, ms)); const sample = () => 100 + Math.random() * 50; const sampleErr = () => Math.random() * 0.05; // Push live deliveries on the declared channel. for (let i = 0; i < 100; i++) { await client.callTool({ name: "ggui_emit", arguments: { sessionId, channel: "metrics", payload: { ts: new Date().toISOString(), rps: sample(), errorRate: sampleErr() }, }, }); await sleep(1000); } // Close the channel — subsequent emits on it reject. await client.callTool({ name: "ggui_emit", arguments: { sessionId, channel: "metrics", payload: {}, complete: true }, }); ``` ## 3. Host side: just mount the render [Section titled “3. Host side: just mount the render”](#3-host-side-just-mount-the-render) There is no dashboard-specific host code. 
The render arrives as an MCP-Apps resource on the agent’s tool result; your web app drives the conversation with `useMcpAppsChat` and mounts each render with `<AppRenderer>`. The generated component subscribes to the `metrics` channel itself, and the iframe-runtime delivers every `ggui_emit` into it, so your host never sees the channel frames.

```tsx
import { AppRenderer } from "@mcp-ui/client";
import { useMcpAppsChat } from "@ggui-ai/react/chat-helpers";

function Dashboard({ agentUrl }: { agentUrl: string }) {
  const { sessions, handleAppMessage } = useMcpAppsChat({
    chatEndpoint: `${agentUrl}/agent`,
  });
  // Mount the latest render with <AppRenderer>; live `metrics` deliveries
  // repaint the chart INSIDE the iframe. See the React SDK page for the
  // full sandbox + relay wiring.
}
```

See [React SDK](/sdk/react/) for the complete `<AppRenderer>` host contract (sandbox-proxy origin + `onReadResource` / `onCallTool` relay + `onMessage`), and the [`ggui-basic-web`](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) sample for a runnable host.

## Pull channels (`source`)

The example above is a **push** channel: the agent actively emits. For data the runtime should fetch on its own cadence, declare a `source` instead and skip `ggui_emit` entirely:

```typescript
streamSpec: {
  orders: {
    description: "Open orders, refreshed from the orders tool.",
    schema: { type: "object", properties: { id: { type: "string" }, total: { type: "number" } } },
    source: { tool: "list_open_orders", args: { region: "us-east-1" } },
  },
}
```

`source.tool` MUST resolve to a tool the contract declares in its `agentCapabilities.tools` (a cross-reference the contract linter enforces). The iframe-runtime polls that tool and feeds results into the channel; the agent declares the channel and renders, but never pushes.
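The cross-reference rule for pull channels can be illustrated with a small standalone check. This is a sketch only, not ggui’s contract linter: `findUndeclaredSources`, `ChannelSpec`, and `ContractLike` are hypothetical names, and the shape of `agentCapabilities.tools` as a plain list of tool names is an assumption.

```typescript
// Hypothetical, simplified contract shape for this sketch only.
type ChannelSpec = {
  schema: object;
  source?: { tool: string; args?: Record<string, unknown> };
};
type ContractLike = {
  streamSpec?: Record<string, ChannelSpec>;
  // Assumption: tools are listed by name; the real shape may differ.
  agentCapabilities?: { tools?: string[] };
};

// Return one message per pull channel whose `source.tool` is not declared.
function findUndeclaredSources(contract: ContractLike): string[] {
  const declared = new Set(contract.agentCapabilities?.tools ?? []);
  const problems: string[] = [];
  for (const [channel, spec] of Object.entries(contract.streamSpec ?? {})) {
    const tool = spec.source?.tool;
    if (tool !== undefined && !declared.has(tool)) {
      problems.push(`streamSpec.${channel}: source.tool "${tool}" is not in agentCapabilities.tools`);
    }
  }
  return problems;
}

const issues = findUndeclaredSources({
  streamSpec: {
    orders: { schema: { type: "object" }, source: { tool: "list_open_orders" } },
    metrics: { schema: { type: "object" } }, // push channel: nothing to check
  },
  agentCapabilities: { tools: ["get_customer"] },
});
console.log(issues);
```

Running a check like this in a unit test catches a misspelled or undeclared tool name before the contract reaches a handshake.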
## See also

* [React SDK](/sdk/react/): `<AppRenderer>` + `useMcpAppsChat`, the host render-hosting surface
* [MCP protocol reference](/api/mcp-protocol/): `ggui_handshake`, `ggui_render`, `ggui_emit` request/response shapes
* [Data vs Behavior](https://github.com/ggui-ai/ggui/blob/main/docs/principles/data-vs-behavior.md): why live data is a contract field and rendering behavior is component code
* [`ggui-basic-web` sample](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web): a runnable MCP-Apps host

# Testing ggui Integrations

> Mock the MCP transport for agent-side unit tests and assert the tool calls your agent makes, without a real LLM round-trip.

ggui’s agent surface has a dedicated mock layer so you never need a live `mcp.ggui.ai` (or self-hosted `ggui serve`) connection in unit tests:

* **Agent side**: mock the MCP transport (not a wrapper SDK) and assert the tool calls your agent makes.

Integration tests against a live endpoint belong in a separate tier. See [Self-Hosted Reference Deploys](/self-hosted/reference-deploys/) for spinning up a throwaway stack.

## Agent side: mock the MCP transport

ggui is consumed over standard MCP, so the right place to draw the test boundary is at the **MCP transport**, not at a wrapper SDK. Stub the tool-call responses your agent expects and assert against the call log; your agent code stays unmodified between test and production.

### With `@modelcontextprotocol/sdk` — stub `Client.callTool`

If your agent talks to ggui through the canonical MCP `Client`, stub `callTool` and seed the responses keyed by tool name. A unit-test fixture only needs the fields your agent actually reads, so each builder returns a `Pick<>` of the real protocol type (`GguiHandshakeOutput`, `GguiRenderOutput`, `GguiConsumeOutput` from `@ggui-ai/protocol`). That keeps the field names and shapes wire-faithful without hand-rolling every required field of the live envelope.
```typescript
import { describe, it, expect, beforeEach, vi } from "vitest";
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import type {
  GguiHandshakeOutput,
  GguiRenderOutput,
  GguiConsumeOutput,
  ConsumeEventEntry,
} from "@ggui-ai/protocol";

function makeMockMcpClient() {
  const callLog: Array<{ name: string; arguments: unknown }> = [];
  // Handlers keyed by tool name; each receives the call's `arguments`.
  const responses = new Map<string, (args: any) => unknown>();
  let lastSessionId: string | null = null;
  const renderEvents = new Map<string, ConsumeEventEntry[]>();
  const renderStatus = new Map<string, "active" | "expired">();

  responses.set(
    "ggui_handshake",
    (): Pick<GguiHandshakeOutput, "handshakeId" | "action"> => ({
      handshakeId: `h_${Date.now()}`,
      action: "create",
    })
  );

  responses.set(
    "ggui_render",
    (): Pick<GguiRenderOutput, "sessionId" | "resourceUri" | "action"> => {
      const sessionId = `rnd_${Date.now()}`;
      lastSessionId = sessionId;
      renderEvents.set(sessionId, []);
      renderStatus.set(sessionId, "active");
      return {
        sessionId,
        // Spec-canonical MCP-Apps entry point. There is NO clickable
        // `url` on the wire; the host mounts the `ui://ggui/render/{id}`
        // iframe resource. (A dead `url` had the model hallucinating
        // links that resolve nowhere, so it was removed.)
        resourceUri: `ui://ggui/render/${sessionId}`,
        action: "create",
      };
    }
  );

  responses.set("ggui_consume", (args: { sessionId: string }): GguiConsumeOutput => {
    // Drain the queued events for this session, like the live consume pipe.
    const events = renderEvents.get(args.sessionId) ?? [];
    renderEvents.set(args.sessionId, []);
    return {
      events,
      status: renderStatus.get(args.sessionId) ?? "active",
    };
  });

  const client = {
    callTool: vi.fn(async ({ name, arguments: args }: { name: string; arguments?: unknown }) => {
      callLog.push({ name, arguments: args });
      const handler = responses.get(name);
      if (!handler) {
        // JSON-RPC -32601: Method not found, matching the live server.
        throw new Error(`MCP tool not found: ${name}`);
      }
      const result = typeof handler === "function" ? handler(args) : handler;
      // MCP wraps tool output in { content: [...], structuredContent: ... }.
      return {
        content: [{ type: "text", text: JSON.stringify(result) }],
        structuredContent: result,
      };
    }),
  } as unknown as Client;

  return {
    client,
    callLog,
    // Simulate a user gesture appearing on the consume pipe.
    simulateSubmit(sessionId: string, data: ConsumeEventEntry["actionData"]) {
      const events = renderEvents.get(sessionId) ?? [];
      events.push({
        type: "action",
        sessionId,
        intent: "submit",
        actionData: data,
        uiContext: {},
        actionId: "mockactn",
        firedAt: new Date().toISOString(),
      });
      renderEvents.set(sessionId, events);
    },
    // Status semantics match the canonical protocol: `active` = more events
    // may arrive; `expired` = TTL elapsed. Flip terminal state explicitly.
    simulateExpire(sessionId: string) {
      renderStatus.set(sessionId, "expired");
    },
  };
}
```

### Driving the mock from a test

```typescript
describe("feedback agent", () => {
  let mock: ReturnType<typeof makeMockMcpClient>;

  beforeEach(() => {
    mock = makeMockMcpClient();
  });

  it("collects user feedback end-to-end", async () => {
    const handshake = await mock.client.callTool({
      name: "ggui_handshake",
      arguments: { intent: "Collect feedback" },
    });
    const { handshakeId } = handshake.structuredContent as GguiHandshakeOutput;

    const render = await mock.client.callTool({
      name: "ggui_render",
      // `props` is REQUIRED on ggui_render; pass `{}` when the agreed
      // contract declares no propsSpec.
      arguments: { handshakeId, props: {} },
    });
    const { sessionId, resourceUri } = render.structuredContent as GguiRenderOutput;

    // The render's entry point is the spec-canonical MCP-Apps resource
    // URI the host mounts, not a clickable link.
    expect(resourceUri).toMatch(/^ui:\/\/ggui\/render\//);

    // Pretend the user filled in the form.
    mock.simulateSubmit(sessionId, { rating: 5, comments: "Great!"
}); const consume = await mock.client.callTool({ name: "ggui_consume", arguments: { sessionId }, }); const { events, status } = consume.structuredContent as GguiConsumeOutput; expect(status).toBe("active"); expect(events[0].actionData).toEqual({ rating: 5, comments: "Great!" }); // Assert the agent issued exactly the expected tool-call sequence. expect(mock.callLog.map((c) => c.name)).toEqual([ "ggui_handshake", "ggui_render", "ggui_consume", ]); }); }); ``` `actionData`, not `payload` `ConsumeEventEntry` carries gesture data on `actionData` (the consume pipe is the agent-facing view of an `ActionEnvelope` from the live channel). `payload` is the live-channel wire field on the raw envelope. Asserting against `events[0].payload` will silently return `undefined`. ### With Claude Agent SDK — stub `mcpServers` [Section titled “With Claude Agent SDK — stub mcpServers”](#with-claude-agent-sdk--stub-mcpservers) Consumers using `@anthropic-ai/claude-agent-sdk` typically pass an `mcpServers` config to `query()`. For unit tests, supply an in-memory server entry that returns canned tool responses instead of dialling `mcp.ggui.ai`. Use the SDK’s `createSdkMcpServer` (or the SDK-specific test helper) and register tools that produce the same structured payloads shown above. The principle is identical: stub at the MCP transport surface so your agent prompt, tool-loop logic, and consume-pipe handling exercise unchanged. ### With raw HTTP — stub `fetch` [Section titled “With raw HTTP — stub fetch”](#with-raw-http--stub-fetch) If your agent talks to a self-hosted `ggui serve` over plain HTTP (`http://127.0.0.1:6781/mcp`), `vi.spyOn(global, "fetch")` (or `msw`) is the right boundary. Assert request URL + JSON-RPC method, and return the matching `structuredContent` payload. ### Asserting errors [Section titled “Asserting errors”](#asserting-errors) ggui surfaces failures at protocol level — there are no SDK-specific error classes to import. 
Assert against **JSON-RPC error codes** (`-32601` method-not-found, `-32602` invalid-params, etc.) or **HTTP status codes** (`401 Unauthorized`, `408 Request Timeout`, `429 Too Many Requests`), depending on your transport. ```typescript it("surfaces auth errors", async () => { const failingClient = { callTool: async () => { // Mirror the wire shape: an HTTP-401 surface from the gateway becomes a // JSON-RPC error on the MCP client. const err = new Error("Unauthorized") as Error & { code?: number }; err.code = -32000; // JSON-RPC server-error range; httpStatus 401 upstream. throw err; }, }; await expect( failingClient.callTool({ name: "ggui_handshake", arguments: {} }) ).rejects.toMatchObject({ code: -32000 }); }); ``` See [Error Handling](/cookbook/error-handling/) for retry, fallback, and graceful-degradation patterns built on these protocol-level signals. *** ## What not to test [Section titled “What not to test”](#what-not-to-test) LLM-generated component code is non-deterministic — assert **behavior**, not DOM structure. Pin contracts (via `defineContract` + `useContract`) and test your agent’s tool-call sequence. Leave the visual layer to live snapshots or a separate generation-quality tier. See also: [@ggui-ai/react](/sdk/react/) · [MCP Protocol](/api/mcp-protocol/) · [Troubleshooting](/troubleshooting/).

# Design Tokens

> Visual reference for the DTCG design tokens that ggui primitives consume — colors, spacing, typography, radius, and shadows.

Every `@ggui-ai/design` primitive — and by extension every gadget that ships with the OSS renderer — reads its visual state from [DTCG](https://design-tokens.github.io/community-group/format/) tokens exposed as CSS custom properties. Tokens themselves need no runtime: override the variables on any ancestor and the cascade does the rest on the next paint. Operator theming builds on these same variables: `ggui.json#theme` selects one of the shipped presets (or a DTCG file), and per-app overlays are validated `--ggui-*` maps injected after the base token block, so a partial overlay keeps token defaults. Agents can enumerate presets with `ggui_list_themes` and pick one per render via `ggui_render({themeId})` — see [Custom Theming](/cookbook/custom-theming/). ```css /* Every primitive uses var(--ggui-…) with a hardcoded fallback */ color: var(--ggui-color-primary-600, #0284c7); padding: var(--ggui-spacing-md, 16px); border-radius: var(--ggui-shape-radius-md, 8px); ``` Caution The swatch/scale tables below are rendered from a hand-maintained snapshot at `apps/docs/src/data/tokens.ts` (Astro can’t import the React design package directly). If a value here disagrees with `packages/design/src/themes/defaults/light.ts`, the design package is authoritative. 
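To make the overlay idea concrete, here is a minimal sketch of turning a partial `--ggui-*` map into a CSS rule that can be injected after the base token block. `overlayToCss` is a hypothetical helper, and its validation rules (a `--ggui-` prefix check and rejecting `;`, `{`, `}` in values) are assumptions for illustration, not ggui’s actual validator.

```typescript
// Hypothetical helper: serialize a partial token overlay into one CSS rule.
// Assumption: "validated" means every key starts with `--ggui-` and values
// cannot break out of the declaration; the real validator may be stricter.
function overlayToCss(overlay: Record<string, string>, selector = ":root"): string {
  const declarations: string[] = [];
  for (const [name, value] of Object.entries(overlay)) {
    if (!/^--ggui-[A-Za-z0-9-]+$/.test(name)) {
      throw new Error(`not a ggui token: ${name}`);
    }
    if (/[;{}]/.test(value)) {
      throw new Error(`unsafe value for ${name}`);
    }
    declarations.push(`  ${name}: ${value};`);
  }
  return `${selector} {\n${declarations.join("\n")}\n}`;
}

// A partial overlay: only the listed variables change. Everything else keeps
// the base token value (or the primitive's hardcoded fallback).
const css = overlayToCss({
  "--ggui-color-primary-600": "#7c3aed",
  "--ggui-shape-radius-md": "12px",
});
console.log(css);
```

Because primitives read `var(--ggui-…)` with a fallback, a two-variable overlay like this one changes only the primary color and the medium radius.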
*** ## Colors [Section titled “Colors”](#colors) ### Color Palettes [Section titled “Color Palettes”](#color-palettes) #### Primary (Sky Blue) 50 `#f0f9ff` `--ggui-color-primary-50` 100 `#e0f2fe` `--ggui-color-primary-100` 200 `#bae6fd` `--ggui-color-primary-200` 300 `#7dd3fc` `--ggui-color-primary-300` 400 `#38bdf8` `--ggui-color-primary-400` 500 `#0ea5e9` `--ggui-color-primary-500` 600 `#0284c7` `--ggui-color-primary-600` 700 `#0369a1` `--ggui-color-primary-700` 800 `#075985` `--ggui-color-primary-800` 900 `#0c4a6e` `--ggui-color-primary-900` #### Gray 50 `#f9fafb` `--ggui-color-neutral-50` 100 `#f3f4f6` `--ggui-color-neutral-100` 200 `#e5e7eb` `--ggui-color-neutral-200` 300 `#d1d5db` `--ggui-color-neutral-300` 400 `#9ca3af` `--ggui-color-neutral-400` 500 `#6b7280` `--ggui-color-neutral-500` 600 `#4b5563` `--ggui-color-neutral-600` 700 `#374151` `--ggui-color-neutral-700` 800 `#1f2937` `--ggui-color-neutral-800` 900 `#111827` `--ggui-color-neutral-900` #### Success 50 `#f0fdf4` `--ggui-color-success-50` 100 `#dcfce7` `--ggui-color-success-100` 200 `#bbf7d0` `--ggui-color-success-200` 500 `#22c55e` `--ggui-color-success-500` 600 `#16a34a` `--ggui-color-success-600` 700 `#15803d` `--ggui-color-success-700` 800 `#166534` `--ggui-color-success-800` #### Warning 50 `#fffbeb` `--ggui-color-warning-50` 100 `#fef3c7` `--ggui-color-warning-100` 200 `#fde68a` `--ggui-color-warning-200` 500 `#f59e0b` `--ggui-color-warning-500` 600 `#d97706` `--ggui-color-warning-600` 700 `#b45309` `--ggui-color-warning-700` 800 `#92400e` `--ggui-color-warning-800` #### Error 50 `#fef2f2` `--ggui-color-error-50` 100 `#fee2e2` `--ggui-color-error-100` 200 `#fecaca` `--ggui-color-error-200` 500 `#ef4444` `--ggui-color-error-500` 600 `#dc2626` `--ggui-color-error-600` 700 `#b91c1c` `--ggui-color-error-700` 800 `#991b1b` `--ggui-color-error-800` #### Info 50 `#ecfeff` `--ggui-color-info-50` 100 `#cffafe` `--ggui-color-info-100` 200 `#a5f3fc` `--ggui-color-info-200` 500 `#06b6d4` 
`--ggui-color-info-500` 600 `#0891b2` `--ggui-color-info-600` 700 `#0e7490` `--ggui-color-info-700` 800 `#155e75` `--ggui-color-info-800` ### Semantic Colors [Section titled “Semantic Colors”](#semantic-colors) These tokens map to specific UI roles and adapt between light and dark themes. **Surface**`#ffffff``--ggui-color-surface` **Surface Variant**`#f3f4f6``--ggui-color-surfaceVariant` **On Surface**`#111827``--ggui-color-onSurface` **On Surface Variant**`#6b7280``--ggui-color-onSurfaceVariant` **Outline**`#9ca3af``--ggui-color-outline` **Outline Variant**`#d1d5db``--ggui-color-outlineVariant` **Container**`#f9fafb``--ggui-color-container` **On Container**`#111827``--ggui-color-onContainer` ### Material Role Pairs [Section titled “Material Role Pairs”](#material-role-pairs) The canonical theme adds eight Material 3-inspired role pairs for surface/content layering. They sit alongside (and extend) the legacy text tokens — primitives use these to keep contrast right across nested surfaces. | CSS variable | Role | | ------------------------------- | ----------------------------------------------- | | `--ggui-color-surface` | Default page / sheet background | | `--ggui-color-onSurface` | Primary text and icons on `surface` | | `--ggui-color-surfaceVariant` | Subtle alternate background (cards, rails) | | `--ggui-color-onSurfaceVariant` | Secondary text/icons on `surfaceVariant` | | `--ggui-color-container` | Filled container (chips, banners, soft buttons) | | `--ggui-color-onContainer` | Text/icons on `container` | | `--ggui-color-outline` | Standard borders + dividers | | `--ggui-color-outlineVariant` | Faint borders, disabled outlines | *** ## Spacing [Section titled “Spacing”](#spacing) A consistent spacing scale ensures visual rhythm across all components. 
`xs` 4px `--ggui-spacing-xs` `sm` 8px `--ggui-spacing-sm` `md` 16px `--ggui-spacing-md` `lg` 24px `--ggui-spacing-lg` `xl` 32px `--ggui-spacing-xl` `2xl` 48px `--ggui-spacing-2xl` `3xl` 64px `--ggui-spacing-3xl` *** ## Typography [Section titled “Typography”](#typography) Canonical theme path: `font.{family,size,weight,lineHeight}.*`. Emitted as `--ggui-font-family-*`, `--ggui-font-size-*`, `--ggui-font-weight-*`, `--ggui-font-lineHeight-*`. #### Font Families The quick brown fox jumps over the lazy dog `Sans` `--ggui-typography-fontFamily-sans` The quick brown fox jumps over the lazy dog `Mono` `--ggui-typography-fontFamily-mono` #### Font Sizes `xs` 12px The quick brown fox `--ggui-typography-fontSize-xs` `sm` 14px The quick brown fox `--ggui-typography-fontSize-sm` `base` 16px The quick brown fox `--ggui-typography-fontSize-base` `lg` 18px The quick brown fox `--ggui-typography-fontSize-lg` `xl` 20px The quick brown fox `--ggui-typography-fontSize-xl` `2xl` 24px The quick brown fox `--ggui-typography-fontSize-2xl` `3xl` 30px The quick brown fox `--ggui-typography-fontSize-3xl` `4xl` 36px The quick brown fox `--ggui-typography-fontSize-4xl` #### Font Weights `Normal` The quick brown fox jumps over the lazy dog 400 `Medium` The quick brown fox jumps over the lazy dog 500 `Semibold` The quick brown fox jumps over the lazy dog 600 `Bold` The quick brown fox jumps over the lazy dog 700 #### Line Heights `Tight` Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. 1.25 `Normal` Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. 1.5 `Relaxed` Lorem ipsum dolor sit amet, consectetur adipiscing elit. Sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. 1.75 *** ## Border Radius [Section titled “Border Radius”](#border-radius) Canonical theme path: `shape.radius.*`. 
Emitted as `--ggui-shape-radius-{none,sm,md,lg,xl,2xl,full}`. `none`0 `--ggui-shape-radius-none` `sm`4px `--ggui-shape-radius-sm` `md`8px `--ggui-shape-radius-md` `lg`12px `--ggui-shape-radius-lg` `xl`16px `--ggui-shape-radius-xl` `2xl`24px `--ggui-shape-radius-2xl` `full`9999px `--ggui-shape-radius-full` *** ## Shadows [Section titled “Shadows”](#shadows) Canonical theme path: `shape.shadow.*`. Emitted as `--ggui-shape-shadow-{none,xs,sm,md,lg,xl,2xl}`. `none` `0 0 0 0 transparent` `--ggui-shape-shadow-none` `xs` `0 1px 2px 0 rgba(15, 23, 42, 0.04)` `--ggui-shape-shadow-xs` `sm` `0 1px 3px 0 rgba(15, 23, 42, 0.06)` `--ggui-shape-shadow-sm` `md` `0 8px 16px -4px rgba(15, 23, 42, 0.10)` `--ggui-shape-shadow-md` `lg` `0 16px 32px -8px rgba(15, 23, 42, 0.14)` `--ggui-shape-shadow-lg` `xl` `0 24px 48px -12px rgba(15, 23, 42, 0.18)` `--ggui-shape-shadow-xl` `2xl` `0 25px 50px -12px rgba(0, 0, 0, 0.25)` `--ggui-shape-shadow-2xl` *** ## Motion [Section titled “Motion”](#motion) The canonical `motion` group covers durations, easings, keyframes, and ready-made `transition` shorthands. Durations + transitions both surface as CSS custom properties; transitions are pre-composed so primitives can drop in a single `var(--ggui-motion-transition-…)` without rewriting the curve. 
| Variable | Typical value | When to use | | ------------------------------------ | ------------------------------------------- | -------------------------------------- | | `--ggui-motion-duration-instant` | `0ms` | Immediate state flips (no animation) | | `--ggui-motion-duration-fast` | `100ms` | Hover, focus, small icon swaps | | `--ggui-motion-duration-normal` | `200ms` | Default for color/opacity transitions | | `--ggui-motion-duration-slow` | `300ms` | Layout shifts, large fades | | `--ggui-motion-transition-colors` | `color/background-color/border-color 200ms` | Theme-aware color changes | | `--ggui-motion-transition-opacity` | `opacity 200ms` | Fades, show/hide | | `--ggui-motion-transition-transform` | `transform 200ms` | Translate/scale interactions | | `--ggui-motion-transition-all` | `all 200ms` | Catch-all when several properties move | Respect `accessibility.reducedMotion` — set durations to `0ms` (or override the transitions to `none`) when it’s `'reduce'`. *** ## Accessibility [Section titled “Accessibility”](#accessibility) Top-level `accessibility` tokens make a11y intent explicit instead of leaving it implicit in component styles. They emit as `--ggui-accessibility-*` and pair with the standard media queries. | Variable | Type | Notes | | ------------------------------------ | ----------------------------- | ------------------------------------------------- | | `--ggui-accessibility-focusRing` | shadow / outline shorthand | The focus indicator used by every primitive | | `--ggui-accessibility-reducedMotion` | `'no-preference' \| 'reduce'` | Mirror of `prefers-reduced-motion`; drives motion | | `--ggui-accessibility-highContrast` | `'standard' \| 'increased'` | Mirror of `prefers-contrast`; thickens borders | Override `focusRing` at the theme level to brand the focus indicator across every primitive at once. *** ## Z-Index [Section titled “Z-Index”](#z-index) A canonical layering scale prevents floating UIs from fighting each other. 
Values are unitless integers and increase with elevation. | Variable | Layer | Typical occupant | | ------------------------ | ------ | -------------------------------------- | | `--ggui-zIndex-hide` | `-1` | Off-screen / underlay | | `--ggui-zIndex-base` | `0` | Page content | | `--ggui-zIndex-docked` | `10` | Docked sidebars, sticky toolbars | | `--ggui-zIndex-dropdown` | `1000` | Menus, select popovers | | `--ggui-zIndex-sticky` | `1100` | Sticky table headers | | `--ggui-zIndex-banner` | `1200` | Announcement / cookie banners | | `--ggui-zIndex-overlay` | `1300` | Backdrop scrims | | `--ggui-zIndex-modal` | `1400` | Modal dialogs | | `--ggui-zIndex-popover` | `1500` | Floating popovers anchored to content | | `--ggui-zIndex-skipLink` | `1600` | Keyboard skip-link (must beat modals) | | `--ggui-zIndex-toast` | `1700` | Toast notifications | | `--ggui-zIndex-tooltip` | `1800` | Tooltips (topmost interactive element) | *** ## Using Tokens [Section titled “Using Tokens”](#using-tokens) ### In primitives [Section titled “In primitives”](#in-primitives) Nothing to wire — every primitive already consumes the tokens: ```tsx import { Button, Card, Input } from '@ggui-ai/design';    ; ``` ### In your own components [Section titled “In your own components”](#in-your-own-components) Reference tokens by their CSS variable, with a hardcoded fallback for environments where the theme provider hasn’t loaded yet: ```css .my-component { background: var(--ggui-color-surface, #ffffff); padding: var(--ggui-spacing-md, 16px); border-radius: var(--ggui-shape-radius-lg, 12px); box-shadow: var(--ggui-shape-shadow-md, 0 8px 16px -4px rgba(15, 23, 42, 0.10)); font-family: var(--ggui-font-family-sans, system-ui, -apple-system, sans-serif); font-size: var(--ggui-font-size-sm, 14px); color: var(--ggui-color-onSurface, #111827); transition: var(--ggui-motion-transition-colors, color 200ms ease); } ``` ### Theming [Section titled “Theming”](#theming) Override on any element — `:root` for 
global, a wrapper for per-subtree:

```css
:root {
  --ggui-color-primary-600: #7c3aed;     /* Purple instead of sky blue */
  --ggui-color-surface: #fefce8;         /* Warm paper background */
  --ggui-color-onSurface: #1f1410;       /* Ink for the new surface */
  --ggui-shape-radius-md: 16px;          /* Pillier corners */
  --ggui-motion-duration-normal: 240ms;  /* Slightly more deliberate */
}
```

See [Custom Theming](/cookbook/custom-theming/) for the full recipe: global overrides, dark-mode pairs, and scoped subtrees.
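When overrides are computed at runtime (for example from a tenant's brand settings), the same `:root` / wrapper pattern can be generated from a plain object. The following is a minimal sketch, assuming a hypothetical `buildThemeOverrides` helper that is not part of ggui; only the token names come from this page.

```typescript
// Hypothetical helper (not a ggui export): turns a token map into a CSS rule.
// Token names such as `--ggui-color-surface` are the ones documented above.
type TokenOverrides = Record<string, string>;

function buildThemeOverrides(selector: string, overrides: TokenOverrides): string {
  const declarations = Object.entries(overrides).map(([name, value]) => {
    // Guard against typos: every ggui token starts with `--ggui-`.
    if (!name.startsWith("--ggui-")) {
      throw new Error(`Not a ggui token: ${name}`);
    }
    return `  ${name}: ${value};`;
  });
  return `${selector} {\n${declarations.join("\n")}\n}`;
}

// Scope the override to one subtree instead of `:root`.
const css = buildThemeOverrides(".checkout-panel", {
  "--ggui-color-surface": "#fefce8",
  "--ggui-shape-radius-md": "16px",
});
console.log(css);
```

The resulting string can be injected through a `<style>` element or written to a stylesheet at build time; which is appropriate depends on how your host loads the theme.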

# Claude Agent

> Drive ggui from Claude Agent SDK's tool-use loop — Claude calls ggui MCP tools directly, no client SDK in your code.

This is the canonical Claude pattern for ggui. You wire ggui as an MCP server in Claude Agent SDK’s `mcpServers` config, hand Claude the ggui posture prompt (`GGUI_AGENT_SYSTEM_PROMPT`), and Claude decides when to call `ggui_handshake` / `ggui_render` / `ggui_consume` based on the tool descriptions it discovers via `tools/list`. **Zero ggui SDK code in your app**: Claude drives the protocol directly.

A runnable version of this example lives at [`samples/agents/claude-agent-sdk/`](https://github.com/ggui-ai/ggui/tree/main/samples/agents/claude-agent-sdk).

## Install

```bash
npm install @anthropic-ai/claude-agent-sdk @ggui-ai/protocol
```

```bash
export ANTHROPIC_API_KEY="sk-ant-..."
```

## Code

claude-agent.ts

```typescript
import { query } from "@anthropic-ai/claude-agent-sdk";
import { GGUI_AGENT_SYSTEM_PROMPT } from "@ggui-ai/protocol";

// 1. Wire ggui as an MCP server. Claude Agent SDK will issue `tools/list`
// against this URL on session start and discover every `ggui_*` tool.
// `Bearer dev` authenticates because `ggui serve --dev-allow-all` accepts
// any bearer; local dev only.
const mcpServers = {
  ggui: {
    type: "http" as const,
    url: "http://127.0.0.1:6781/mcp",
    headers: { Authorization: "Bearer dev" },
  },
};

// 2. Whitelist the ggui tools Claude is allowed to call. The SDK auto-namespaces
// every tool as `mcp__<server>__<tool>`, so the prefix is `mcp__ggui__`.
const GGUI_ALLOWED_TOOLS = [
  "mcp__ggui__ggui_handshake",
  "mcp__ggui__ggui_render",
  "mcp__ggui__ggui_update",
  "mcp__ggui__ggui_emit",
  "mcp__ggui__ggui_consume",
  "mcp__ggui__ggui_get_session",
];

async function chat(userPrompt: string) {
  console.log(`\nUser: ${userPrompt}\n`);

  // 3. `query()` returns an AsyncGenerator of SDKMessage events. The SDK runs
  // the full tool-use loop internally; you just consume the stream.
  for await (const message of query({
    prompt: userPrompt,
    options: {
      model: "claude-sonnet-4-6",
      mcpServers,
      allowedTools: GGUI_ALLOWED_TOOLS,
      // The posture-only canonical ggui prompt. Tells Claude *when* to reach
      // for a UI; tool descriptions tell it *how*.
      systemPrompt: GGUI_AGENT_SYSTEM_PROMPT,
    },
  })) {
    if (message.type === "assistant") {
      for (const block of message.message.content) {
        if (block.type === "text") {
          process.stdout.write(block.text);
        } else if (block.type === "tool_use") {
          console.log(`\n[tool_use] ${block.name}`);
        }
      }
    } else if (message.type === "user") {
      // Tool results flow back as user-role messages from the SDK's perspective.
      for (const block of message.message.content) {
        if (block.type === "tool_result") {
          console.log(`[tool_result] ${block.tool_use_id}`);
        }
      }
    } else if (message.type === "result") {
      console.log(`\n\n[done] subtype=${message.subtype}`);
    }
  }
}

chat("I want to book a restaurant reservation for this weekend").catch(console.error);
```

## Run

```bash
npx tsx claude-agent.ts
```

## What you’ll see

`query()` yields a stream of `SDKMessage` events. The shapes you’ll observe in a typical ggui turn:

* **`system` (init)**: opening event listing the model, allowed tools, and MCP servers Claude discovered.
* **`assistant`**: Claude’s response chunks. `content` is an array of blocks:
  * `text`: natural-language reply (streamed across multiple events).
  * `tool_use`: Claude invoking a ggui tool (e.g. `mcp__ggui__ggui_handshake`). The `input` field contains the tool arguments Claude generated.
  * `thinking`: extended-thinking blocks when reasoning is on.
* **`user`**: tool results flowing back into the loop. `content[].type === "tool_result"` with `tool_use_id` pointing at the matching `tool_use`.
* **`result`**: terminal event with `subtype` (`success`, `error_max_turns`, `error_during_execution`, …), `usage` totals, and `total_cost_usd`.
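To make the event shapes above concrete, here is a small sketch that collapses each event into a one-line trace label. The types are local stand-ins that model only the fields the sketch reads (an assumption; the SDK's real `SDKMessage` union carries more), and `describeMessage` is a hypothetical helper, not an SDK export.

```typescript
// Local stand-ins for the message shapes listed above (assumption: simplified).
type ContentBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; name: string; input: unknown }
  | { type: "tool_result"; tool_use_id: string }
  | { type: "thinking" };

type StreamMessage =
  | { type: "system" }
  | { type: "assistant"; message: { content: ContentBlock[] } }
  | { type: "user"; message: { content: ContentBlock[] } }
  | { type: "result"; subtype: string };

// Hypothetical helper: one short label per event, for logging or tests.
function describeMessage(message: StreamMessage): string {
  switch (message.type) {
    case "system":
      return "system";
    case "result":
      return `result(${message.subtype})`;
    case "assistant":
    case "user": {
      const parts = message.message.content.map((block) =>
        block.type === "tool_use" ? `tool_use:${block.name}` : block.type
      );
      return `${message.type}(${parts.join(" + ")})`;
    }
  }
}

// Hand-written sample events, not captured SDK output.
const sampleStream: StreamMessage[] = [
  { type: "system" },
  {
    type: "assistant",
    message: {
      content: [
        { type: "text", text: "Let me set up a form." },
        { type: "tool_use", name: "mcp__ggui__ggui_handshake", input: {} },
      ],
    },
  },
  { type: "user", message: { content: [{ type: "tool_result", tool_use_id: "toolu_1" }] } },
  { type: "result", subtype: "success" },
];

const traceLine = sampleStream.map(describeMessage).join(" → ");
console.log(traceLine);
```

A trace like this is handy in integration tests, where asserting on the order of tool calls is usually more stable than asserting on model text.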
A reservation booking typically streams: `system` → `assistant`(text + `tool_use:ggui_handshake`) → `user`(tool\_result) → `assistant`(`tool_use:ggui_render`) → `user`(tool\_result with `sessionId` + `resourceUri`; the host mounts the UI from the MCP-Apps resource) → \[user submits the form] → `assistant`(`tool_use:ggui_consume`) → `user`(tool\_result with submission data) → `assistant`(confirmation text) → `result`.

## Patterns worth stealing

* **Zero Agent Code.** Your file imports `query` and `GGUI_AGENT_SYSTEM_PROMPT` and lists tool names, nothing else. The protocol lives behind MCP; Claude reads tool descriptions and drives it directly.
* **`GGUI_AGENT_SYSTEM_PROMPT` is posture, not instructions.** It tells Claude *when* to reach for a UI (structured input, choices, visual presentation). Per-tool semantics ship from the server in `tools/list`, so you don’t restate them.
* **Tool namespace is structural.** `mcp__<server>__<tool>` is non-negotiable; it’s how the SDK routes calls. Match the prefix exactly in `allowedTools`.
* **Whitelist explicitly.** Omit `allowedTools` and Claude can call every tool the server exposes (including ops/protocol tools). The list above is the minimal agent-facing surface.
* **Prompt caching is automatic** when the SDK detects a stable `systemPrompt` + `mcpServers` prefix across turns. No `cache_control` plumbing needed in your code.

## Related

* [`samples/agents/claude-agent-sdk/`](https://github.com/ggui-ai/ggui/tree/main/samples/agents/claude-agent-sdk): runnable reference for this file
* [OpenAI agent example](/examples/openai-agent/): same shape, function-calling transport
* [`GGUI_AGENT_SYSTEM_PROMPT` reference](/api/mcp-protocol/): what’s in the posture prompt and why
* [How it works](/how-it-works/): what happens between `ggui_render` and the rendered UI
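Because the tool namespace follows a fixed pattern, the `allowedTools` list can be derived instead of typed by hand. A minimal sketch, assuming the `mcp__<server>__<tool>` convention described on this page, where the server segment is the key used in `mcpServers`; `toAllowedTools` is a hypothetical helper, not an SDK export.

```typescript
// Hypothetical helper: prefixes bare MCP tool names with the SDK namespace.
// `serverName` must match the key used in the `mcpServers` config ("ggui" here).
function toAllowedTools(serverName: string, toolNames: string[]): string[] {
  return toolNames.map((tool) => `mcp__${serverName}__${tool}`);
}

const allowedTools = toAllowedTools("ggui", [
  "ggui_handshake",
  "ggui_render",
  "ggui_consume",
]);
console.log(allowedTools);
```

Deriving the list keeps the prefix and the `mcpServers` key from drifting apart if the server entry is ever renamed.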

# Gemini Agent

> Wire Google Gemini's function-calling API to ggui via MCP — bridge ggui's MCP tools into Gemini function declarations and continue the conversation once the user submits.

Wire ggui to Gemini’s function-calling API so the model can mint a UI mid-conversation, wait for the user to submit, and continue with the structured data. The pattern is a **thin bridge**: connect to ggui’s MCP server with the official `@modelcontextprotocol/sdk` client, then surface every MCP tool as a Gemini `FunctionDeclaration`.

## Setup

```bash
npm install @google/genai @modelcontextprotocol/sdk
```

```bash
export GEMINI_API_KEY="AIza..."
```

## Code

gemini-agent.ts

```typescript
import { GoogleGenAI } from "@google/genai";
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js";

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY! });

// 1. Connect to ggui over MCP (Streamable HTTP transport). `Bearer dev`
// authenticates because `ggui serve --dev-allow-all` accepts any bearer;
// local dev only.
const mcpClient = new Client({ name: "gemini-ggui-agent", version: "0.1.0" }, {});
await mcpClient.connect(
  new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), {
    requestInit: {
      headers: { Authorization: "Bearer dev" },
    },
  })
);

// 2. Bridge MCP tools → Gemini function declarations.
const { tools: mcpTools } = await mcpClient.listTools();
const geminiTools = [
  {
    functionDeclarations: mcpTools.map((t) => ({
      name: t.name,
      description: t.description,
      // Gemini takes JSON Schema as-is via parametersJsonSchema (NOT `parameters`).
      parametersJsonSchema: t.inputSchema,
    })),
  },
];

// 3. Open a chat. `chats.create` keeps multi-turn state, so reuse this handle.
const chat = ai.chats.create({
  model: "gemini-3.5-flash",
  config: {
    tools: geminiTools,
    systemInstruction:
      "You drive ggui MCP tools to render interactive UIs. Call the appropriate tool when you need to collect structured data from the user, then continue with their response.",
  },
});

async function run(userPrompt: string) {
  console.log(`\nUser: ${userPrompt}`);
  let response = await chat.sendMessage({ message: userPrompt });

  // 4. Drain function calls until the model returns plain text.
  while (response.functionCalls?.length) {
    const functionResponses = await Promise.all(
      response.functionCalls.map(async (fc) => {
        const result = await mcpClient.callTool({
          name: fc.name,
          arguments: fc.args,
        });
        return { name: fc.name, response: { content: result.content } };
      })
    );
    response = await chat.sendMessage({
      message: functionResponses.map((fr) => ({ functionResponse: fr })),
    });
  }

  console.log(`\nAssistant: ${response.text ?? "(no response)"}`);
}

await run("I need to schedule a meeting with my team for next week");
await mcpClient.close();
```

## Run

```bash
npx tsx gemini-agent.ts
```

## What Happens

1. You ask Gemini to schedule a team meeting.
2. Gemini calls `ggui_handshake({intent, blueprintDraft})` and gets a `handshakeId` + a `suggestion` (the server matches a [blueprint](/glossary/) or synthesizes one).
3. Gemini calls `ggui_render({handshakeId, props})`; the result carries `{sessionId, resourceUri}`, and your host mounts the UI from that MCP-Apps resource.
4. Gemini calls `ggui_consume({sessionId, timeout})`; when the user submits, the `events` array delivers `{intent, actionData, uiContext}`.
5. Gemini replies with plain text.

## Gemini-specific notes

* Function schemas use **`parametersJsonSchema`** (JSON Schema passthrough), not the older `parameters` (subset-of-OpenAPI) field. MCP tools expose `inputSchema` as JSON Schema, so pass it through unchanged.
* Tools are grouped under `functionDeclarations`; tool replies are `functionResponse` parts inside the `message: PartListUnion`.
* Multi-turn state lives on the `ai.chats.create()` handle — reuse `chat` across turns. Don’t call `models.generateContent` for chat flows or you lose history. * Tool-loop latency tip: set `generateContentConfig.thinkingConfig.thinkingLevel` to `MINIMAL` — high thinking levels add tens of seconds per tool turn. * The bridge is generic: every tool ggui exposes over MCP becomes a Gemini function declaration automatically. No per-tool wrapper code. ## Next [Section titled “Next”](#next) * [Claude Agent example](/examples/claude-agent/) — same MCP bridge pattern with Anthropic’s SDK (native MCP server support — no manual bridge needed). * [OpenAI Agent example](/examples/openai-agent/) — same flow with OpenAI function calling. * [How it works](/how-it-works/) — the three channels (bootstrap, MCP, WebSocket) and envelope shapes.
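The bridge step on this page can be isolated as a pure function, which makes it easy to unit-test without a running server or an API key. A minimal sketch with local stand-in types (assumptions that model only the fields used here, not the full SDK types); `toGeminiTools` is a hypothetical name.

```typescript
// Local stand-ins (assumption): only the fields the bridge reads or writes.
interface McpToolInfo {
  name: string;
  description?: string;
  inputSchema: Record<string, unknown>;
}

interface FunctionDeclarationSketch {
  name: string;
  description?: string;
  parametersJsonSchema: Record<string, unknown>;
}

// Hypothetical helper: one Gemini tool group holding every MCP tool.
function toGeminiTools(
  mcpTools: McpToolInfo[]
): { functionDeclarations: FunctionDeclarationSketch[] }[] {
  return [
    {
      functionDeclarations: mcpTools.map((tool) => ({
        name: tool.name,
        description: tool.description,
        // JSON Schema is passed through unchanged.
        parametersJsonSchema: tool.inputSchema,
      })),
    },
  ];
}

// Hand-written sample tool entry, not real `tools/list` output.
const bridged = toGeminiTools([
  {
    name: "ggui_consume",
    description: "Poll a session for events",
    inputSchema: { type: "object", properties: { sessionId: { type: "string" } } },
  },
]);
console.log(JSON.stringify(bridged, null, 2));
```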

# Generic MCP / Raw HTTP

> Drive ggui from any MCP-compatible agent or via raw JSON-RPC over HTTP — official MCP client library, and language-agnostic curl/Python recipes.

Use ggui from any language or framework — through the official MCP TypeScript client, or raw HTTP with JSON-RPC. There is no ggui-specific wrapper SDK: every endpoint is a vanilla MCP server, so any spec-compliant client works. > Building on top of the Claude Agent SDK instead? See [Claude Agent SDK example](/examples/claude-agent/) — it wires ggui in as a stock MCP server with no extra glue. ## Using the MCP client library (TypeScript) [Section titled “Using the MCP client library (TypeScript)”](#using-the-mcp-client-library-typescript) The official `@modelcontextprotocol/sdk` package speaks the MCP wire end-to-end. Point its `StreamableHTTPClientTransport` at `http://127.0.0.1:6781/mcp` and call ggui’s tools by name. ```typescript import { Client } from "@modelcontextprotocol/sdk/client/index.js"; import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js"; const client = new Client({ name: "my-agent", version: "0.1.0" }, {}); // `Bearer dev` authenticates because `ggui serve --dev-allow-all` accepts // any bearer — local dev only. await client.connect( new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), { requestInit: { headers: { Authorization: "Bearer dev", }, }, }) ); // Discover the tool surface. const { tools } = await client.listTools(); console.log( "Available tools:", tools.map((t) => t.name) ); // → ['ggui_handshake', 'ggui_render', 'ggui_consume', // 'ggui_update', 'ggui_get_session', 'ggui_list_sessions', // 'ggui_list_gadgets', 'ggui_list_themes', 'ggui_emit', // 'ggui_list_featured_blueprints', 'ggui_search_blueprints', // 'ggui_render_blueprint'] // handshake → render at the raw MCP layer. 
const handshakeResult = await client.callTool({ name: "ggui_handshake", arguments: { intent: "feedback form", blueprintDraft: { contract: { /* DataContract — propsSpec, actionSpec, contextSpec, streamSpec */ }, }, }, }); const handshake = JSON.parse(handshakeResult.content[0].text) as { handshakeId: string; suggestion: { origin: "cache" | "agent" | "synth"; blueprintMeta: { blueprintId?: string; contractHash: string }; }; }; const renderResult = await client.callTool({ name: "ggui_render", arguments: { handshakeId: handshake.handshakeId, props: {}, }, }); const { sessionId, resourceUri } = JSON.parse(renderResult.content[0].text) as { sessionId: string; resourceUri: string; }; console.log("Render:", sessionId, "→", resourceUri); ``` The three-noun model: a **tool** is what the agent calls (the MCP methods above); a **gadget** is a renderer-side capability the LLM may opt into when generating the UI; a **blueprint** is a cached recipe the handshake returns as a `suggestion` (with `origin: 'cache' | 'agent' | 'synth'`) — accepting it on render reuses the provisional `blueprintId` and skips regeneration. Need a higher-level Claude-flavored shortcut? The [Claude Agent SDK example](/examples/claude-agent/) registers ggui as an MCP server and lets the agent loop drive `ggui_handshake` / `ggui_render` / `ggui_consume` on its own. *** ## Using raw HTTP (no SDK) [Section titled “Using raw HTTP (no SDK)”](#using-raw-http-no-sdk) Call MCP over JSON-RPC from any language. The flow is **`initialize` → `ggui_handshake` → `ggui_render` → `ggui_consume`** (poll, keyed by `sessionId`). Renders decay implicitly via TTL — no explicit close. ### Hosted vs self-hosted — what to swap [Section titled “Hosted vs self-hosted — what to swap”](#hosted-vs-self-hosted--what-to-swap) Every `curl` below targets a local `ggui serve --dev-allow-all`. 
To drive the hosted endpoint (coming soon) instead, swap two values: | What | Self-hosted (`ggui serve`) — default | Hosted (`mcp.ggui.ai`) — coming soon | | --------------- | -------------------------------------------------------------------- | ------------------------------------ | | Endpoint URL | `http://127.0.0.1:6781/mcp` | `https://mcp.ggui.ai/apps/` | | `Authorization` | `Bearer dev` (requires `ggui serve --dev-allow-all`; local dev only) | `Bearer ggui_user_...` | `--dev-allow-all` is for local dev only `--dev-allow-all` accepts any bearer — keep the default `127.0.0.1` bind and never expose it publicly. For real bearers use pair-minted ones (the strict default) — see [Pairing](/self-hosted/pairing/), and `--keys-file` to persist them across restarts. ### Step 1 — Initialize the MCP session [Section titled “Step 1 — Initialize the MCP session”](#step-1--initialize-the-mcp-session) ```bash curl -X POST http://127.0.0.1:6781/mcp \ -H "Authorization: Bearer dev" \ -H "Accept: application/json, text/event-stream" \ -H "Content-Type: application/json" \ -d '{ "jsonrpc": "2.0", "id": 1, "method": "initialize", "params": { "protocolVersion": "2025-11-25", "clientInfo": { "name": "my-agent", "version": "1.0" }, "capabilities": {} } }' ``` `protocolVersion` here is the MCP transport spec date, not the ggui protocol draft. The `Accept` header is mandatory — the server rejects requests that don’t accept both `application/json` and `text/event-stream`. Responses come back as a single SSE event (`event: message` + `data: {...}`); parse the `data:` line as JSON — the `# → {...}` comments below show that parsed payload. 
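Since every response arrives as a single SSE event, a raw-HTTP client needs a small unwrapping step before it can read the JSON-RPC payload. A minimal TypeScript sketch of that step follows; `parseSseJsonRpc` and the `JsonRpcResponse` type are illustrative names, and the sample body is hand-written rather than captured from a server.

```typescript
// Illustrative JSON-RPC response shape (assumption: only common fields modeled).
interface JsonRpcResponse {
  jsonrpc: "2.0";
  id?: number | string | null;
  result?: unknown;
  error?: { code: number; message: string };
}

// Returns the JSON payload of the first `data:` line in an SSE body.
function parseSseJsonRpc(body: string): JsonRpcResponse {
  for (const line of body.split(/\r?\n/)) {
    if (line.startsWith("data:")) {
      return JSON.parse(line.slice("data:".length).trim()) as JsonRpcResponse;
    }
  }
  throw new Error(`no SSE data line in response: ${body.slice(0, 200)}`);
}

// Hand-written sample body in the `event: message` + `data: {...}` layout.
const sampleBody =
  'event: message\ndata: {"jsonrpc":"2.0","id":1,"result":{"protocolVersion":"2025-11-25"}}\n\n';
const parsedResponse = parseSseJsonRpc(sampleBody);
console.log(parsedResponse.id, parsedResponse.result);
```

This sketch assumes the payload fits on one `data:` line, as described above; a general-purpose SSE parser would also need to join multi-line `data:` fields.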
### Step 2 — Negotiate a handshake [Section titled “Step 2 — Negotiate a handshake”](#step-2--negotiate-a-handshake) ```bash curl -X POST http://127.0.0.1:6781/mcp \ -H "Authorization: Bearer dev" \ -H "Accept: application/json, text/event-stream" \ -H "Content-Type: application/json" \ -d '{ "jsonrpc": "2.0", "id": 2, "method": "tools/call", "params": { "name": "ggui_handshake", "arguments": { "intent": "Contact form", "blueprintDraft": { "contract": { "propsSpec": { "properties": { "fields": { "schema": { "type": "array", "items": {} } } } }, "actionSpec": { "submit": { "label": "Submit", "schema": { "type": "object" } } } } } } } }' # → { handshakeId, action, suggestion, nextStep? } ``` The returned `suggestion.origin` is the routing discriminator: `'cache'` (a cached blueprint matched — render with `{handshakeId, props}` and omit `override` for a cheap cache delivery), `'agent'` (gen against the draft on render), or `'synth'` (gen against a server-amended contract). `suggestion.blueprintMeta` is always present; it carries a `blueprintId` when the server matched or pre-minted one. See [`ggui_handshake`](/api/mcp-protocol/#ggui_handshake) for the full input + output schemas. ### Step 3 — Render the UI [Section titled “Step 3 — Render the UI”](#step-3--render-the-ui) ```bash curl -X POST http://127.0.0.1:6781/mcp \ -H "Authorization: Bearer dev" \ -H "Accept: application/json, text/event-stream" \ -H "Content-Type: application/json" \ -d '{ "jsonrpc": "2.0", "id": 3, "method": "tools/call", "params": { "name": "ggui_render", "arguments": { "handshakeId": "h_...", "props": {} } } }' # → { sessionId, resourceUri, action, contractHash, blueprintId, variantKey, cache, nextStep? } ``` `props` is REQUIRED (pass `{}` when the contract declares no `propsSpec`); omit `override` to accept the suggestion as-is, or pass `override: {contract?, variance?}` to re-aim. 
The render comes back as an MCP-Apps resource (`resourceUri`) a host mounts — there is no URL to hand the user; you poll for their actions with `sessionId`. ### Step 4 — Poll for events [Section titled “Step 4 — Poll for events”](#step-4--poll-for-events) ```bash curl -X POST http://127.0.0.1:6781/mcp \ -H "Authorization: Bearer dev" \ -H "Accept: application/json, text/event-stream" \ -H "Content-Type: application/json" \ -d '{ "jsonrpc": "2.0", "id": 4, "method": "tools/call", "params": { "name": "ggui_consume", "arguments": { "sessionId": "4f6b2c0e-…", "timeout": 20 } } }' # → { events: ConsumeEventEntry[], status: "active" | "expired" } ``` Consume is keyed by the `sessionId` from step 3. `timeout` (seconds): an integer in `[0, 25]`; `0` = immediate. Values outside that range reject with `INVALID_PARAMS` (-32602) — the cap dodges infrastructure kill windows (API-gateway 30s HTTP limits). Pick 5–15s typically, 25 max; to wait longer, re-call `ggui_consume` in a loop — a longer wait is your loop, not a bigger timeout. For push-style delivery, prefer the [WebSocket Protocol](/api/websocket-protocol/) over polling; raw HTTP callers loop `ggui_consume` per render. Each entry has `{type: 'action', sessionId, intent, actionData, uiContext, actionId, firedAt}`. See [Envelopes](/protocol/envelopes/) for the wire shape. Renders decay implicitly via TTL — there is no explicit close ceremony. *** ## Python example [Section titled “Python example”](#python-example) A complete end-to-end run with the standard library and `requests`: ```python import json import time import requests API_URL = "http://127.0.0.1:6781/mcp" HEADERS = { "Authorization": "Bearer dev", # ggui serve --dev-allow-all; local dev only "Accept": "application/json, text/event-stream", "Content-Type": "application/json", } request_id = 0 def parse_sse(body: str) -> dict: # Responses come back as a single SSE event; the JSON-RPC payload is # the `data:` line of the `message` event. 
for line in body.splitlines(): if line.startswith("data:"): return json.loads(line[len("data:"):].strip()) raise RuntimeError(f"no SSE data line in response: {body[:200]}") def mcp_request(method: str, params: dict | None = None) -> dict: global request_id request_id += 1 payload = {"jsonrpc": "2.0", "id": request_id, "method": method} if params: payload["params"] = params return parse_sse(requests.post(API_URL, headers=HEADERS, json=payload).text) def call_tool(name: str, arguments: dict) -> dict: result = mcp_request("tools/call", {"name": name, "arguments": arguments}) if "error" in result: raise RuntimeError(f"MCP error: {result['error']}") return json.loads(result["result"]["content"][0]["text"]) # 1. Initialize the MCP session. mcp_request("initialize", { "protocolVersion": "2025-11-25", "clientInfo": {"name": "python-agent", "version": "1.0"}, "capabilities": {}, }) # Notifications carry no `id` (JSON-RPC), so post this one directly. requests.post(API_URL, headers=HEADERS, json={ "jsonrpc": "2.0", "method": "notifications/initialized", }) # 2. Negotiate the contract. handshake = call_tool("ggui_handshake", { "intent": "Product feedback form", "blueprintDraft": { "contract": { "propsSpec": {"properties": { "rating": {"schema": {"type": "number"}}, "comments": {"schema": {"type": "string"}}, }}, "actionSpec": {"submit": {"label": "Submit feedback", "schema": {"type": "object"}}}, }, }, }) # 3. Render — accept the handshake's suggestion as-is (omit `override`). result = call_tool("ggui_render", { "handshakeId": handshake["handshakeId"], "props": {}, }) session_id = result["sessionId"] print(f"resource: {result['resourceUri']}") # 4. Poll for events — keyed by sessionId. 
while True: consume = call_tool("ggui_consume", {"sessionId": session_id, "timeout": 20}) for entry in consume["events"]: # ConsumeEventEntry: {type: 'action', intent, actionData, uiContext, ...} print(f"intent={entry['intent']} data={entry['actionData']}") if consume["status"] == "expired": break if consume["events"]: # Got the gesture we needed — exit on first non-empty payload. break time.sleep(2) # Render decays implicitly via TTL — no explicit close. ``` For long-lived UIs, prefer the WebSocket channel (`ws://127.0.0.1:6781/ws` self-hosted; `wss://mcp.ggui.ai/ws` hosted, coming soon) over polling — see [WebSocket Protocol](/api/websocket-protocol/). *** ## See also [Section titled “See also”](#see-also) * [MCP Protocol Reference](/api/mcp-protocol/) — every method, every argument * [Claude Agent SDK example](/examples/claude-agent/) — higher-level loop that drives these same tools * [WebSocket Protocol](/api/websocket-protocol/) — push events instead of polling * [Envelopes](/protocol/envelopes/) — `ActionEnvelope`, `StreamEnvelope` * [OSS Quick Start](/oss-quickstart/) — run your own `ggui serve` in minutes
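The rule that a longer wait is a loop, not a bigger timeout, can be turned into a small planning function: split the total time you are willing to wait into per-call `timeout` values that respect the 25-second cap. A minimal sketch; `planConsumeTimeouts` is a hypothetical helper and the 15-second default is just one value from the typical range mentioned on this page.

```typescript
// Upper bound for `ggui_consume`'s `timeout`, as stated in Step 4.
const MAX_CONSUME_TIMEOUT_S = 25;

// Hypothetical helper: per-call timeouts that add up to `totalWaitSeconds`.
function planConsumeTimeouts(totalWaitSeconds: number, perCallSeconds = 15): number[] {
  if (!Number.isInteger(totalWaitSeconds) || totalWaitSeconds < 0) {
    throw new RangeError("totalWaitSeconds must be a non-negative integer");
  }
  if (
    !Number.isInteger(perCallSeconds) ||
    perCallSeconds < 1 ||
    perCallSeconds > MAX_CONSUME_TIMEOUT_S
  ) {
    throw new RangeError(`perCallSeconds must be an integer in [1, ${MAX_CONSUME_TIMEOUT_S}]`);
  }
  const timeouts: number[] = [];
  let remaining = totalWaitSeconds;
  while (remaining > 0) {
    const next = Math.min(perCallSeconds, remaining);
    timeouts.push(next);
    remaining -= next;
  }
  return timeouts;
}

console.log(planConsumeTimeouts(60, 25)); // three calls: 25, 25, 10
console.log(planConsumeTimeouts(40)); // three calls: 15, 15, 10
```

A caller would issue one `ggui_consume` per entry and stop early as soon as a call returns events or reports `status: "expired"`.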

# OpenAI Agent

> Bridge OpenAI function calling to ggui's MCP server so GPT can show interactive UIs and read back structured user input.

OpenAI’s chat API doesn’t speak MCP, so you bridge ggui’s MCP tools into OpenAI’s function-calling shape yourself. Two packages do the work:

* **`@modelcontextprotocol/sdk`**: connects to your ggui MCP endpoint (a local `ggui serve` below), enumerates ggui’s tools, executes tool calls.
* **`openai`**: drives the GPT loop and decides when to call those tools.

The bridge is mechanical: list MCP tools once, map each to an OpenAI function definition, then forward every `tool_call` GPT emits straight through to `mcpClient.callTool`.

## Setup

```bash
npm install openai @modelcontextprotocol/sdk
```

```bash
export OPENAI_API_KEY="sk-..."
```

## Code

openai-agent.ts

```typescript
import OpenAI from "openai";
import { Client } from "@modelcontextprotocol/sdk/client/index.js";
import { StreamableHTTPClientTransport } from "@modelcontextprotocol/sdk/client/streamableHttp.js";

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

// 1. Connect to ggui's MCP server (`ggui serve --dev-allow-all` accepts any
// bearer; local dev only).
const mcpClient = new Client({ name: "openai-ggui-agent", version: "0.1.0" }, {});
await mcpClient.connect(
  new StreamableHTTPClientTransport(new URL("http://127.0.0.1:6781/mcp"), {
    requestInit: {
      headers: { Authorization: "Bearer dev" },
    },
  })
);

// 2. Enumerate ggui's MCP tools and bridge them into OpenAI's
// function-calling tool shape. MCP `inputSchema` is already JSON Schema,
// which is exactly what OpenAI's `parameters` expects.
const { tools: mcpTools } = await mcpClient.listTools();
const openaiTools: OpenAI.ChatCompletionTool[] = mcpTools.map((t) => ({
  type: "function",
  function: {
    name: t.name,
    description: t.description ?? "",
    parameters: t.inputSchema as Record<string, unknown>,
  },
}));

async function chat(userPrompt: string) {
  console.log(`\nUser: ${userPrompt}`);
  const messages: OpenAI.ChatCompletionMessageParam[] = [
    {
      role: "system",
      content:
        "You are a helpful assistant. Use the available ggui tools to render interactive UIs whenever you need structured input from the user.",
    },
    { role: "user", content: userPrompt },
  ];

  // 3. Drive the tool-call loop until GPT stops asking for tools.
  while (true) {
    const completion = await openai.chat.completions.create({
      model: "gpt-5.5",
      messages,
      tools: openaiTools,
    });
    const message = completion.choices[0].message;
    messages.push(message);

    if (!message.tool_calls || message.tool_calls.length === 0) {
      console.log(`\nAssistant: ${message.content}`);
      return;
    }

    // 4. Forward every tool call through MCP, feed results back to GPT.
    for (const call of message.tool_calls) {
      const result = await mcpClient.callTool({
        name: call.function.name,
        arguments: JSON.parse(call.function.arguments),
      });
      messages.push({
        role: "tool",
        tool_call_id: call.id,
        content: JSON.stringify(result.content),
      });
    }
  }
}

await chat("Help me plan a team dinner for 8 people");
await mcpClient.close();
```

## Run

```bash
npx tsx openai-agent.ts
```

GPT calls `ggui_handshake` then `ggui_render`; the tool result carries `{sessionId, resourceUri}`, an MCP-Apps resource your host mounts. GPT then calls `ggui_consume({sessionId})`, and the loop delivers the submitted payload on the next turn.

## How the bridge works

* **MCP enumerates the toolset.** `listTools()` returns whatever ggui exposes (`ggui_handshake`, `ggui_render`, `ggui_consume`, …); you don’t hard-code a tool schema in your agent.
* **MCP schemas are already OpenAI-compatible.** `tool.inputSchema` is JSON Schema; OpenAI’s `function.parameters` is JSON Schema. The map is one-to-one.
* **Tool dispatch is dumb forwarding.** Every `tool_call` GPT emits is passed verbatim to `mcpClient.callTool` — the MCP server (not your agent) owns render lifecycle, handshake, render, and consume. * **Identity vs grouping.** Identity comes from the `Authorization` header; conversation grouping (resume via `ggui_list_sessions`) comes from the optional `_meta["ai.ggui/host-session"]` request slice. Reusing the same `Client` keeps the transport alive but does not itself group renders. ## OpenAI-specific notes [Section titled “OpenAI-specific notes”](#openai-specific-notes) * Function arguments arrive as a JSON **string** — always `JSON.parse(toolCall.function.arguments)` before forwarding. * Tool results are returned as `{ role: "tool", tool_call_id, content }` messages (not Anthropic’s `tool_result` block shape). * `result.content` from MCP is an array of content blocks; `JSON.stringify`-ing it gives GPT a faithful view of structured payloads. For text-only tools you can pull `result.content[0].text` instead. ## Next [Section titled “Next”](#next) * [Claude Agent example](/examples/claude-agent/) — same outcome via the Claude Agent SDK’s native MCP support (no manual bridging). * [Gemini Agent example](/examples/gemini-agent/) — the same MCP-bridge pattern adapted to Google’s SDK. * [How it works](/how-it-works/) — the three channels (bootstrap, MCP, WebSocket) and envelope shapes.
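The OpenAI-specific notes on this page (parse the argument string, then choose between plain text and stringified content blocks) can be folded into two small helpers. This is a minimal sketch and not part of ggui or either SDK: `McpContentBlock`, `parseToolArguments`, and `toToolMessageContent` are hypothetical names, and the content-block interface is a simplified stand-in for the MCP SDK's richer types.

```typescript
// Hypothetical, simplified stand-in for an MCP content block.
interface McpContentBlock {
  type: string;
  text?: string;
  [key: string]: unknown;
}

// OpenAI delivers `function.arguments` as a JSON string. Parse it before
// forwarding and insist on an object, since MCP tool arguments are keyed.
function parseToolArguments(raw: string): Record<string, unknown> {
  const parsed: unknown = raw.trim() === "" ? {} : JSON.parse(raw);
  if (typeof parsed !== "object" || parsed === null || Array.isArray(parsed)) {
    throw new Error("tool arguments must decode to a JSON object");
  }
  return parsed as Record<string, unknown>;
}

// Text-only results are joined into plain text; anything else is
// stringified whole so structured payloads reach the model intact.
function toToolMessageContent(content: McpContentBlock[]): string {
  const textOnly =
    content.length > 0 &&
    content.every((b) => b.type === "text" && typeof b.text === "string");
  return textOnly
    ? content.map((b) => b.text as string).join("\n")
    : JSON.stringify(content);
}
```

In the loop above, these would slot in where the sample calls `JSON.parse(call.function.arguments)` and `JSON.stringify(result.content)`. Treating an empty argument string as `{}` is a defensive choice of this sketch, not documented OpenAI behaviour.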

# OpenClaw Agent

> Install the ggui skill in OpenClaw and let any agent push interactive UIs on demand.

Placeholder integration **OpenClaw** and **ClawHub** are placeholder brand names for an upcoming launch partner — the `clawhub` CLI and the `mcporter.json` skill manifest referenced below do **not** exist as a shipping product yet. The rest of the page (tool catalogue, handshake-first flow, auth headers, MCP config JSON) is canonical and works against any MCP-compatible host today; once OpenClaw ships, only the install command on this page changes. If you’re integrating right now, follow [Generic MCP / Raw HTTP](/examples/generic-mcp/) instead. Install the ggui skill from ClawHub and any OpenClaw agent can generate interactive UIs from a natural-language prompt — forms, dashboards, wizards, anything the model can describe. ## Setup [Section titled “Setup”](#setup) ### 1. Install the skill [Section titled “1. Install the skill”](#1-install-the-skill) ```bash clawhub install ggui ``` This drops the ggui MCP tool catalogue and skill prompt into your agent’s context. ### 2. Pick an endpoint [Section titled “2. Pick an endpoint”](#2-pick-an-endpoint) **OSS / self-hosted** — no signup. Run `ggui serve --dev-allow-all` locally (defaults to `http://127.0.0.1:6781/mcp`; see the [OSS Quick Start](/oss-quickstart/)); any bearer — use `Bearer dev` — authenticates in this mode. Local development only; default `ggui serve` requires a paired bearer. ### 3. Configure MCP [Section titled “3. Configure MCP”](#3-configure-mcp) The skill ships an `mcporter.json` that wires this up automatically. 
For manual setup: **OSS `ggui serve`** ```json { "mcpServers": { "ggui": { "type": "http", "url": "http://127.0.0.1:6781/mcp", "headers": { "Authorization": "Bearer dev" } } } } ``` **Hosted ggui (coming soon)** — the cloud endpoint is per-app, no `/mcp` suffix: ```json { "mcpServers": { "ggui": { "type": "http", "url": "https://mcp.ggui.ai/apps/", "headers": { "Authorization": "Bearer ${GGUI_API_KEY}" } } } } ``` ## Tool catalogue [Section titled “Tool catalogue”](#tool-catalogue) The skill exposes the ggui MCP tools — the agent never picks a blueprint by hand, the server’s matcher does that under the hood during handshake. **Core render loop** | Tool | Purpose | | ---------------- | -------------------------------------------------------------------------- | | `ggui_handshake` | Negotiate the wire surface for the next UI (returns a `handshakeId`). | | `ggui_render` | Materialize the UI (mints `sessionId`; accept the suggestion or override). | | `ggui_consume` | Long-poll for user gestures (form submits, button clicks). | | `ggui_update` | Patch a delivered render’s props without re-rendering. | | `ggui_emit` | Push frames onto a declared `streamSpec` channel (optional). | **Inspection** | Tool | Purpose | | -------------------- | -------------------------------------------------------------------------- | | `ggui_get_session` | Inspect render state. | | `ggui_list_sessions` | Enumerate this conversation’s sessions for resume (keyed by host-session). | | `ggui_list_gadgets` | Enumerate renderer-side capabilities (map tiles, charts, editors). | | `ggui_list_themes` | List theme presets the agent may apply via `ggui_render({themeId})`. | **Blueprint registry (optional)** | Tool | Purpose | | ------------------------------- | ---------------------------------------------------- | | `ggui_list_featured_blueprints` | Browse curated blueprints the provider has featured. | | `ggui_search_blueprints` | Search the blueprint registry by keyword / intent. 
| | `ggui_render_blueprint` | Render a manifest-declared blueprint directly by id. | ## Usage [Section titled “Usage”](#usage) The canonical flow is **handshake → render → consume**. Ask your OpenClaw agent: > “Collect feedback about the user’s recent purchase. Ask for a star rating and free-form comments.” The agent will: 1. `ggui_handshake({ intent, blueprintDraft })` — get a `handshakeId` + a routed `suggestion` (origin: cache / agent / synth). 2. `ggui_render({ handshakeId, props })` — accept the suggestion as-is (or pass `override: { contract }` / `{ variance }` to re-aim). Returns `{ sessionId, resourceUri }` — the render is an MCP-Apps resource a host mounts, not a URL. 3. `ggui_consume({ sessionId, timeout: 20 })` — long-poll until the user submits; events arrive keyed by `intent` (your actionSpec key). Renders decay implicitly via TTL — there is no explicit close ceremony. ### Multi-step wizard [Section titled “Multi-step wizard”](#multi-step-wizard) > “Walk the user through a 3-step onboarding: personal info, preferences, confirmation.” Each step is its own fresh `ggui_handshake` + `ggui_render` pair (each minting a new `sessionId`). Carry earlier answers forward by passing them as `props` on the next step’s `ggui_render` (declare the matching propsSpec entries in that step’s `blueprintDraft`) so the next prompt can prefill. See the [Multi-step wizard cookbook](/cookbook/multi-step-wizard/) for back-navigation patterns. ### Live patches without a re-render [Section titled “Live patches without a re-render”](#live-patches-without-a-re-render) > “While the form is open, refresh the available time-slot list every 10s.” Loop `ggui_update({ sessionId, kind: "merge", patch: { slots } })` to mutate props in place (RFC 7396 JSON Merge Patch — `null` deletes a key). Use `{ kind: "replace", props }` for a full props swap. No new render, no flicker. 
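To make the `kind: "merge"` semantics concrete, here is a minimal local sketch of RFC 7396 JSON Merge Patch. It only illustrates how a patch folds into props (objects merge recursively, `null` deletes a key, arrays and scalars are replaced whole); it is not ggui's server code, and `mergePatch` and the sample props are hypothetical.

```typescript
type Json = null | boolean | number | string | Json[] | { [key: string]: Json };
type JsonObject = { [key: string]: Json };

function isJsonObject(value: Json | undefined): value is JsonObject {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// RFC 7396: a non-object patch replaces the target outright; an object
// patch is applied key by key, with `null` meaning "remove this key".
function mergePatch(target: Json | undefined, patch: Json): Json {
  if (!isJsonObject(patch)) return patch;
  const result: JsonObject = isJsonObject(target) ? { ...target } : {};
  for (const [key, value] of Object.entries(patch)) {
    if (value === null) {
      delete result[key];
    } else {
      result[key] = mergePatch(result[key], value);
    }
  }
  return result;
}

// Illustrative props for a time-slot form (made-up values).
const currentProps: Json = {
  slots: ["09:00", "10:00"],
  note: "pick one",
  meta: { tz: "UTC", stale: true },
};
const nextProps = mergePatch(currentProps, {
  slots: ["11:00"], // arrays are replaced, not concatenated
  note: null, // null deletes the key
  meta: { stale: null }, // nested objects merge recursively
});
console.log(JSON.stringify(nextProps));
// prints {"slots":["11:00"],"meta":{"tz":"UTC"}}
```

One consequence worth noting: because `null` means delete, a merge patch cannot set a prop to `null`. That is a case where `{ kind: "replace", props }` would be the tool to reach for.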
## How it works [Section titled “How it works”](#how-it-works) ```plaintext OpenClaw Agent ggui server User browser | | | |-- ggui_handshake ------->| | | (matcher picks cached | | | blueprint OR fires | | | synth — invisible) | | |<-- { handshakeId, -------| | | suggestion } | | |-- ggui_render({ -------->| | | handshakeId, props }) | | |<-- { sessionId, ---------| | | resourceUri } | | | |-- mounts resource ---->| | |---- render UI -------->| |-- ggui_consume --------->| | | |<--- submit form -------| |<-- { events } -----------| | | (render decays via TTL) | | ``` Blueprints are an internal cache: the matcher checks them on every handshake. A cache hit returns near-instantly; a miss generates fresh React (expect seconds to \~a minute depending on model). The agent doesn’t pick — it just describes intent. ## Patterns [Section titled “Patterns”](#patterns) * **Describe intent, not markup.** “A 3-question NPS survey” beats “a form with a slider, a textarea, and a submit button.” * **Pass data as `props`, declared in the contract.** Lists of options, prefilled values, the current user — declare them in `blueprintDraft`’s propsSpec and pass the values as `props` on `ggui_render`, not in the intent string. * **Fresh render per step.** Each new screen is a fresh `handshake → render` pair with its own `sessionId`. Prior renders decay via TTL. * **Drain `ggui_consume` in a loop.** Events are cleared after consumption. Match on each event’s `intent` (your actionSpec key, e.g. `submit`); react, then re-call. Exit when `status: "expired"`. ## Troubleshooting [Section titled “Troubleshooting”](#troubleshooting) | Symptom | Fix | | -------------------- | ----------------------------------------------------------------------------------------------------- | | `Unauthorized` | Local: run `ggui serve --dev-allow-all` (or pair a bearer). Hosted (coming soon): set `GGUI_API_KEY`. | | `Session not found` | Session expired (TTL elapsed) — re-handshake to mint a fresh one. 
| | `Handshake required` | You called `ggui_render` without a `handshakeId` — handshake first. | | Empty `consume` | User hasn’t interacted yet. Keep polling (long-poll up to \~20s). | | Generation failed | Simplify the prompt. (Hosted only, coming soon: check your account has credits.) | ## Next steps [Section titled “Next steps”](#next-steps) * **[MCP Protocol Reference](/api/mcp-protocol/)** — wire-level tool catalogue * **[Claude Agent](/examples/claude-agent/)** — same flow, Claude SDK in TypeScript * **[Generic MCP / Raw HTTP](/examples/generic-mcp/)** — language-agnostic version * **[Feedback Form cookbook](/cookbook/feedback-form/)** — end-to-end recipe with code * **[Glossary](/glossary/)** — gadget vs tool vs blueprint
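The "drain `ggui_consume` in a loop" pattern from this page can be sketched as follows. This is a hedged illustration rather than ggui's client code: the `consume` function is injected (in a real agent it would wrap the MCP call to `ggui_consume`), and `ConsumeEvent`, `ConsumeResult`, `drainSession`, and `maxPolls` are hypothetical. Only `intent`, `actionData`, the `timeout: 20` argument, and `status: "expired"` come from the docs.

```typescript
// Assumed shapes: only `intent`, `actionData`, and `status: "expired"` are
// named in the docs; everything else here is hypothetical.
interface ConsumeEvent {
  intent: string;
  actionData?: unknown;
}
interface ConsumeResult {
  status?: string;
  events?: ConsumeEvent[];
}
type ConsumeFn = (args: { sessionId: string; timeout: number }) => Promise<ConsumeResult>;
// A handler returns true when the agent is done with this render.
type Handlers = Record<string, (event: ConsumeEvent) => boolean | Promise<boolean>>;

async function drainSession(
  consume: ConsumeFn,
  sessionId: string,
  handlers: Handlers,
  maxPolls = 30, // safety cap so a silent user cannot keep the loop alive forever
): Promise<ConsumeEvent[]> {
  const seen: ConsumeEvent[] = [];
  for (let poll = 0; poll < maxPolls; poll++) {
    const result = await consume({ sessionId, timeout: 20 });
    // Events are cleared once consumed, so handle each one exactly once here.
    for (const event of result.events ?? []) {
      seen.push(event);
      const handler = handlers[event.intent];
      if (handler && (await handler(event))) return seen;
    }
    if (result.status === "expired") return seen; // the render's TTL elapsed
  }
  return seen;
}
```

Keying handlers by `intent` mirrors the advice to match on the actionSpec key, and an empty poll simply falls through to the next call.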

# Hosted Quickstart

> Ship your first agent UI against hosted ggui (mcp.ggui.ai) in under 5 minutes — no infrastructure required.

Coming soon This page describes the **managed hosted path** (`mcp.ggui.ai` / `console.ggui.ai`), which is **not yet live** — it is not part of GGUI Preview 0.1.0. The self-hosted path is available today: start with the [Quickstart](/oss-quickstart/). This page is kept as a preview of the managed path and goes live when hosted ggui ships. In 5 minutes, your agent will negotiate a UI contract, push a generated form to a user, and receive their submission as structured data — no React code, no front-end build, no infrastructure. ```plaintext Your Agent → mcp.ggui.ai → MCP-Apps render (ui://ggui/render/) → User submits → Agent gets typed data ``` ## Prerequisites [Section titled “Prerequisites”](#prerequisites) * **Node.js** 20+ * **A free `console.ggui.ai` account.** Sign in at [console.ggui.ai](https://console.ggui.ai) with email + password. (`console.ggui.ai` is the end-user dashboard for hosted ggui — see the [glossary](/glossary/) if the `ggui` / `guuey` split confuses you.) ## Step 1: Pick an app and mint an SDK API key [Section titled “Step 1: Pick an app and mint an SDK API key”](#step-1-pick-an-app-and-mint-an-sdk-api-key) 1. Sign in at [console.ggui.ai](https://console.ggui.ai). You land on `/apps` — new accounts come pre-provisioned with one default app; create another with **New App** if you want to scope this quickstart to its own surface (e.g. `feedback-demo`). 2. Open the app and go to **Keys** (`/apps/[appId]/keys`). Click **Mint key**, label it (e.g. `feedback-demo-sdk`), and submit. 3. The key reveals exactly once — **copy both the key (`ggui_user_…`) and the App ID (`app_…`) immediately.** Lose the key and you mint a new one; there’s no recovery. Caution Never commit `ggui_user_*` keys. Use `.env` files and a secret manager in production. 
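One way to act on the caution above is to read the key from the environment and fail fast when it is missing or malformed. A minimal sketch, assuming the key is exported as `GGUI_API_KEY` and carries the `ggui_user_` prefix shown in Step 1; `requireGguiKey` is a hypothetical helper, not a ggui API.

```typescript
// Hypothetical helper: refuse to start without a plausible key, and never
// echo the key itself in the error message.
function requireGguiKey(env: Record<string, string | undefined>): string {
  const key = env["GGUI_API_KEY"];
  if (!key) {
    throw new Error("GGUI_API_KEY is not set");
  }
  if (!key.startsWith("ggui_user_")) {
    throw new Error("GGUI_API_KEY does not look like a ggui_user_* key");
  }
  return key;
}
```

At startup an agent could call `requireGguiKey(process.env)` once and pass the result into its MCP config, so a misconfigured deployment stops before any request is sent.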
## Step 2: Install the dependencies [Section titled “Step 2: Install the dependencies”](#step-2-install-the-dependencies) ```bash npm install @anthropic-ai/claude-agent-sdk @ggui-ai/protocol # or: pnpm add @anthropic-ai/claude-agent-sdk @ggui-ai/protocol # or: yarn add @anthropic-ai/claude-agent-sdk @ggui-ai/protocol ``` * **`@anthropic-ai/claude-agent-sdk`** runs Claude as a tool-using agent and speaks MCP natively — no wrapper needed. * **`@ggui-ai/protocol`** exports `GGUI_AGENT_SYSTEM_PROMPT`, the canonical system prompt that teaches Claude the handshake → render → consume loop. ## Step 3: Write your agent [Section titled “Step 3: Write your agent”](#step-3-write-your-agent) Create `agent.ts`. The Claude Agent SDK connects to `mcp.ggui.ai` directly via its `mcpServers` config — your code just streams Claude’s messages and lets the model drive the ggui tools. ```typescript import { query } from "@anthropic-ai/claude-agent-sdk"; import { GGUI_AGENT_SYSTEM_PROMPT } from "@ggui-ai/protocol"; // 1. Point Claude's MCP client at your app's hosted endpoint. Bearer-auth // with the `ggui_user_*` key you minted in Step 1. Note: the per-app // cloud endpoint is the bare `/apps/` path — NO `/mcp` suffix // (that suffix is local-`ggui serve`-only). const mcpServers = { ggui: { type: "http" as const, url: "https://mcp.ggui.ai/apps/", headers: { Authorization: `Bearer ${process.env.GGUI_API_KEY!}` }, }, }; // 2. Allow Claude to call every ggui tool. The `mcp__<server>__<tool>` // naming is the SDK's convention — `<server>` = `ggui` (the key above). const allowedTools = [ "mcp__ggui__ggui_handshake", "mcp__ggui__ggui_render", "mcp__ggui__ggui_update", "mcp__ggui__ggui_emit", "mcp__ggui__ggui_consume", "mcp__ggui__ggui_get_session", ]; async function main() { const prompt = "Collect product feedback from the user. Show a feedback form with a " + "1-5 star rating and a comments text area, wait for them to submit, " + "then summarize what they said."; // 3. Stream the conversation. 
Claude reads GGUI_AGENT_SYSTEM_PROMPT, // decides when to call ggui_handshake / ggui_render, // polls ggui_consume until the user submits, and reports back — // all without any wrapper SDK on your side. for await (const msg of query({ prompt, options: { mcpServers, allowedTools, systemPrompt: GGUI_AGENT_SYSTEM_PROMPT, }, })) { if (msg.type === "assistant") { for (const block of msg.message.content) { if (block.type === "text") console.log(block.text); } } else if (msg.type === "result") { console.log("Done:", msg.subtype); } } } main().catch(console.error); ``` ## Step 4: Run it [Section titled “Step 4: Run it”](#step-4-run-it) ```bash export ANTHROPIC_API_KEY="sk-ant-..." # for the Claude Agent SDK export GGUI_API_KEY="ggui_user_..." # for mcp.ggui.ai npx tsx agent.ts ``` You’ll see Claude narrate the handshake, call `ggui_render` (which returns `{ sessionId, resourceUri }` — the render surfaces as an MCP-Apps resource at `ui://ggui/render/`, not a clickable link), and then **block on `ggui_consume`**, polling for the user’s submission. That block is the point to notice: run programmatically like this, there is no UI surface for a human to submit through yet — the agent is waiting on a render nobody can see. To actually mount the render and let a user interact, you need an MCP-Apps host. Two ways to get one: * **Embed it in your own app** — Step 5 below mounts the render with the React SDK. * **Run the agent inside an MCP-Apps host** — claude.ai or Claude Desktop renders ggui resources inline (see [Clients](/clients/claude-desktop/)). ## Step 5 (optional): Embed in React [Section titled “Step 5 (optional): Embed in React”](#step-5-optional-embed-in-react) Want the UI inside your own app? Install the React SDK and the MCP-Apps host: ```bash npm install @ggui-ai/react @mcp-ui/client ``` A ggui render is an **MCP-Apps resource**. 
You drive the conversation with the `useMcpAppsChat` hook and mount each render’s sandboxed iframe with `<AppRenderer>` (imported directly from `@mcp-ui/client` — ggui doesn’t re-export it): ```tsx import { AppRenderer } from "@mcp-ui/client"; import { useMcpAppsChat } from "@ggui-ai/react/chat-helpers"; function Chat({ agentUrl }: { agentUrl: string }) { const { entries, sessions, send, handleAppMessage } = useMcpAppsChat({ chatEndpoint: `${agentUrl}/agent`, }); // - render `entries` as chat bubbles; call `send(prompt)` to talk to the agent // - mount each `sessions` entry with <AppRenderer> — it needs a sandbox-proxy // origin + onReadResource / onCallTool relay + onMessage={handleAppMessage} } ``` `useMcpAppsChat` talks to your **agent backend** (the process running the Step-3 `query()` loop, exposed over HTTP — `@ggui-ai/agent-server` gives you a brand-neutral `POST /agent` endpoint for exactly this). `<AppRenderer>`’s sandbox + resource-read + tool-call relay wiring is non-trivial; the complete runnable reference is the [`ggui-basic-web`](https://github.com/ggui-ai/ggui/tree/main/samples/apps/ggui-basic-web) sample. **Start there.** ## What just happened [Section titled “What just happened”](#what-just-happened) Under the hood, Claude drove these MCP tool calls against `mcp.ggui.ai`: 1. **`ggui_handshake`** negotiated a **contract** from a natural-language intent + draft, returning a `handshakeId` + a server suggestion (cache / agent / synth). 2. **`ggui_render`** with `{ handshakeId, props }` materialized the contract: ggui matched a cached **blueprint** (or synthesized a fresh React component), minted a `sessionId`, and returned `{ sessionId, resourceUri }` — the render is an MCP-Apps resource at `ui://ggui/render/`, surfaced on the tool result’s `_meta.ui.resourceUri`. (There is no clickable URL on the wire.) 3. A host mounted that resource — your app via `<AppRenderer>` (Step 5), or an MCP-Apps host like claude.ai inline — and the user submitted the form. 4. 
**`ggui_consume`** delivered the user’s submit gesture as a `ConsumeEventEntry` (`{ intent, actionData, uiContext, ... }`). Renders decay implicitly via TTL — there is no terminal `close` ceremony. Everything above the wire is `GGUI_AGENT_SYSTEM_PROMPT` + the Claude Agent SDK’s tool loop — no ggui-specific client code on your side. ```plaintext Agent mcp.ggui.ai MCP-Apps host │ │ (your app / claude.ai) │── handshake ───────────→ │ │ │← { handshakeId, ─ │ │ │ suggestion } │ │ │── render(handshakeId, ─ │ │ │ props) ────────────→ │ │ │ │── match/synth blueprint │ │← { sessionId, ─ │ │ │ resourceUri } │ │ │ │── ui://ggui/render/ ──→│ (host mounts iframe) │ │ │ │── consume(sessionId) ───→ │ │ │ │←── submit gesture ────────│ │← { events } ──────────── │ │ │ │ │ │ (render decays via TTL — no explicit close) │ ``` ## Next steps [Section titled “Next steps”](#next-steps) * **[Claude agent example](/examples/claude-agent/)** — canonical reference implementation for the snippet above * **[MCP protocol reference](/api/mcp-protocol/)** — every `ggui_*` tool, request/response, and error code * **[React SDK](/sdk/react/)** — embed ggui renders directly in your own React app with `useMcpAppsChat` + `<AppRenderer>` * **[Other LLMs](/examples/generic-mcp/)** — raw `@modelcontextprotocol/sdk` recipe; also [OpenAI](/examples/openai-agent/), [Gemini](/examples/gemini-agent/), [OpenClaw](/examples/openclaw-agent/) * **[Feedback-form cookbook](/cookbook/feedback-form/)** — the recipe above, with variations * **[Troubleshooting](/troubleshooting/)** — common issues and fixes * **[Glossary](/glossary/)** — gadget vs tool vs blueprint, ggui vs guuey, and the rest * **[Agentic App Builders](/agentic-app-builders/)** — if your goal is to make an existing app agent-drivable rather than building a fresh agent.
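Since the render reaches the host as a resource URI on the tool result's `_meta.ui.resourceUri` rather than as a link, host-side code needs a small, defensive read of that field. The following is a minimal sketch under stated assumptions: `ToolResultLike` is a hypothetical, pared-down view of an MCP tool result, `readResourceUri` is not a ggui or MCP SDK function, and the `ui://` prefix check is an extra guard added by this sketch.

```typescript
// Hypothetical minimal view of a tool result; real MCP results carry more.
interface ToolResultLike {
  _meta?: { ui?: { resourceUri?: unknown } };
}

// Returns the MCP-Apps resource URI if the result carries one, else undefined.
function readResourceUri(result: ToolResultLike): string | undefined {
  const uri = result._meta?.ui?.resourceUri;
  return typeof uri === "string" && uri.startsWith("ui://") ? uri : undefined;
}
```

A host would typically call this on each `ggui_render` result and hand any URI it finds to its MCP-Apps renderer; results without the field (for example `ggui_consume` responses) simply return `undefined`.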

# Glossary

> Lookup reference for ggui terminology — GguiSession, contract, blueprint, channel, connector, gadget, tool, and the wire envelopes that tie them together.

Lookup reference for the terms that run through these docs. Each entry is short and links to where the term is documented in depth. For a narrative walkthrough, see [**How ggui works**](/how-it-works/). ## Identity and lifecycle [Section titled “Identity and lifecycle”](#identity-and-lifecycle) ### GguiSession (a “render”) [Section titled “GguiSession (a “render”)”](#gguisession-a-render) The atomic unit of a ggui exchange — the canonical protocol shape for a single rendered UI. A GguiSession is minted server-side inside `ggui_render` (one per UI emission), lives until its TTL elapses, and is keyed by a stable `sessionId` (opaque string). It is a union of Component / System / McpApps variants and carries a single component mount plus a stream of live-channel deliveries. The server is the authority on its state, not the agent. `render` survives as the verb and in wire constants (`ggui_render`, `ui://ggui/render`, the `ai.ggui/render` slice). Conversation-scoped grouping (sibling GguiSessions inside the same host chat) flows through the unchanged `_meta["ai.ggui/host-session"]` slice — captured ONCE at creation, never lifted onto every GguiSession. The agent does not thread a conversation id; instead each new `ggui_handshake` → `ggui_render` pair mints a fresh `sessionId`. → See [MCP Protocol → ggui\_handshake / ggui\_render](/api/mcp-protocol/) for the lifecycle methods. 
### GguiSession contents [Section titled “GguiSession contents”](#gguisession-contents) A single GguiSession packages a compiled component plus its declared contract: * `sessionId` — the stable identifier * `componentCode` — the compiled JavaScript module returned by generation * `props` — the data payload the agent rendered * `propsSpec` / `actionSpec` / `streamSpec` / `contextSpec` — the four contract specs that pin the wire surface * `clientCapabilities` — the package-keyed gadget catalog projected from the contract at render time In wire envelopes you’ll see `sessionId` consistently — there is no separate conversation-session id layer. → See [Envelopes → ActionEnvelope](/protocol/envelopes/#actionenvelope) for the wire shape. ### App (`appId`) [Section titled “App (appId)”](#app-appid) The agent-builder’s tenant scope. An `appId` (`app_…`) groups GguiSessions, blueprints, connectors, and provider keys under one boundary. On self-hosted `ggui serve`, the default app is whatever your `ggui.json` declares; on the hosted ggui cloud (coming soon), the `appId` is minted by the platform. Every `ggui_handshake` / `ggui_render` call is scoped to one `appId`. ### `shortCode` [Section titled “shortCode”](#shortcode) An internal 16-character token minted per render. Not on the agent wire — `ggui_render` returns `sessionId` + `resourceUri` (`ui://ggui/render/`), and hosts mount the GguiSession from the `resourceUri`; the agent never receives a render URL — but `ggui serve` still uses shortCodes for its local render-viewer route (`/r/`). *** ## Wire [Section titled “Wire”](#wire) ggui’s wire is split across orthogonal transports. Two share every framing: **MCP** (agent ↔ server request/response) and the **live channel** (server ↔ renderer WebSocket). 
A third depends on viewpoint — the [protocol overview](/protocol/overview/) names the **chat channel** (user ↔ wrapped agent, HTTP SSE) as the conversational-stack third; the [architecture overview](/architecture/overview/) names the **bootstrap channel** (server → renderer, one-shot bundle fetch) as the wire-pipeline third. The terms below all live on the live channel unless noted. ### Audience [Section titled “Audience”](#audience) A tag on every MCP handler — one of `agent`, `runtime`, `protocol`, or `ops` — that determines which route the tool surfaces on. `/mcp` serves the union of `agent` + `runtime` handlers (what the model and its runtime see); `/protocol` serves `protocol` (design-time spec/discovery); `/ops` serves `ops` (operator surface). The audience tag is set on every handler factory and encoded in the wire-name prefix (`ggui_protocol_*`, `ggui_ops_*`, `ggui_runtime_*`, or bare `ggui_*`). → See [Audience routes](/architecture/audience-routes/). ### Channel (live channel) [Section titled “Channel (live channel)”](#channel-live-channel) A named outbound stream on the live-channel WebSocket (`ws://127.0.0.1:6781/ws` on `ggui serve`; `wss://mcp.ggui.ai/ws` on the hosted cloud, coming soon). Agents declare channels in their `streamSpec`; the server validates each `StreamEnvelope` against the declared channel’s schema before delivery. A channel has a name (e.g. `message`, `tasks`, `progress`), a payload schema, a state-folding `mode` (`'append'` or `'replace'`), and an optional `replay` policy. Channels prefixed with `_ggui:` are reserved for the server (e.g. `_ggui:preview`, `_ggui:lifecycle`). Agent-authored `streamSpec` cannot declare them. → See [Envelopes → Reserved channels](/protocol/envelopes/#reserved-channels). 
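Two of the channel rules above lend themselves to a short illustration: agent-authored `streamSpec` entries may not use the `_ggui:` prefix, and each channel folds deliveries by `mode`. The sketch below is an assumption-laden reading, not the server's or renderer's implementation: `reservedChannels` and `foldDelivery` are hypothetical names, and interpreting `'append'` as "accumulate" and `'replace'` as "keep only the latest" is this sketch's guess at the folding semantics.

```typescript
type ChannelMode = "append" | "replace";

const RESERVED_PREFIX = "_ggui:";

// Lists the channel names in an agent-authored streamSpec that collide
// with the server-reserved namespace and would therefore be rejected.
function reservedChannels(streamSpec: Record<string, unknown>): string[] {
  return Object.keys(streamSpec).filter((name) => name.startsWith(RESERVED_PREFIX));
}

// Assumed folding: "append" accumulates deliveries in order,
// "replace" discards earlier state and keeps the newest payload only.
function foldDelivery<T>(state: T[], mode: ChannelMode, payload: T): T[] {
  return mode === "append" ? [...state, payload] : [payload];
}
```

Under this reading, a chat-style `message` channel would naturally use `'append'`, while a `progress` channel that only cares about the latest value would use `'replace'`.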
### Contract [Section titled “Contract”](#contract) The typed agreement between agent and renderer for a given render: what the props look like, what actions the user can dispatch, what each action’s payload schema is, what live channels the renderer subscribes to. Contracts are authored with `defineContract({…} as const)` so the protocol derives TypeScript types from a single source. A contract has four orthogonal specs: * **`propsSpec`** — initial render data (server → UI, one-time at render) * **`actionSpec`** — discrete events that drive the agent’s next turn (UI → agent, via `ggui_consume`) * **`contextSpec`** — observable state mirrored from UI to server, last-write-wins (UI → server) * **`streamSpec`** — outbound deliveries from agent to UI (agent → UI, via `ggui_emit`) The placement test for the two inbound specs: **does this thing need the agent’s next-turn reasoning?** Yes → `actionSpec`. No → `contextSpec`. There is no third category — `actionSpec` carries events that drive turns; `contextSpec` carries state the agent observes when it next does work. When a delivery violates the contract, the server rejects it with a typed, named failure (a `CONTRACT_VIOLATION` error frame on the live channel, code `-32020`; a typed rejection on the agent’s own tool call) — that’s the contract’s failure-mode surface. Nothing lands on the consume buffer. (The earlier `_ggui:contract-error` channel and `ContractErrorPayload` shape were removed in draft-2026-06-11.) ### `ActionEnvelope` [Section titled “ActionEnvelope”](#actionenvelope) The flat, narrow inbound envelope on the live channel. Carries a single canonical user action — `type` is always `'data:submit'` — plus a payload. The server validates `ActionEventValue` payloads against the render’s `actionSpec`. → See [Envelopes → ActionEnvelope](/protocol/envelopes/#actionenvelope). 
### `StreamEnvelope` [Section titled “StreamEnvelope”](#streamenvelope) The outbound delivery on the live channel — one envelope per delivery on a named stream. Carries `channel`, `mode`, `payload`, and (when populated) a server-assigned `seq` for replay correctness. → See [Envelopes → StreamEnvelope](/protocol/envelopes/#streamenvelope). ### `nextStep` tool hint [Section titled “nextStep tool hint”](#nextstep-tool-hint) An author-declared hint on a contract action: `actionSpec[name].nextStep` names an MCP tool — “when the user dispatches this action, that tool is what should run next”. Every action is agent-routed: the server appends the action event to the GguiSession’s consume buffer and the agent reacts on its next turn via `ggui_consume`. The hint rides along on `ActionEventValue.tool` (derived server-side at event-build time) so the agent sees the contract author’s recommendation; the agent owns the call decision — the protocol never binds it. In agent-less deployments (a bare `ggui serve` with no agent process attached), actions take the SAME path: each action event queues on the GguiSession’s consume buffer until an agent attaches and drains it via `ggui_consume({sessionId})` — the server NEVER invokes a tool on the user’s behalf. There is no second routing model. Actions without a `nextStep` are pure event signals — the agent receives `{action, data}` and decides what to do unconstrained. (The retired `dispatch.kind === 'tool' | 'agent'` discriminated union from earlier drafts is gone; outside material still using that vocabulary is stale.) ### `ggui_emit` [Section titled “ggui\_emit”](#ggui_emit) The MCP tool the agent calls to emit a delivery onto a declared `streamSpec` channel of an existing render. Input is `{sessionId, channel, payload}`; the server validates `payload` against `streamSpec[channel].schema` and fans it out as a `StreamEnvelope` on the live channel. The outbound counterpart to `ggui_consume`’s inbound drain. 
→ See [MCP Protocol → `ggui_emit`](/api/mcp-protocol/#ggui_emit). ### `ggui_consume` [Section titled “ggui\_consume”](#ggui_consume) The MCP tool the agent long-polls to drain buffered user events off one render. Consume-once semantics — events drain on read. Call this right after every `ggui_render` whose response carries `nextStep.tool === 'ggui_consume'` (i.e. the rendered contract has a non-empty `actionSpec`). Drained events surface with optional `tool` metadata mirrored from `actionSpec[action].nextStep`. → See [MCP Protocol → `ggui_consume`](/api/mcp-protocol/#ggui_consume). ### `nextStep` (advisory recovery hint) [Section titled “nextStep (advisory recovery hint)”](#nextstep-advisory-recovery-hint) A small wire-shape object the server returns on `ggui_handshake` and `ggui_render`. The handshake form is `{tool: 'ggui_render', example}`; the render form is `{tool: 'ggui_consume', description, example, args: {sessionId}}` — `args` carries the literal value to pass to `ggui_consume`. The `example` is a literal worked call prefilled with current ids, so the agent can recover the next step without re-reading the docs. On `ggui_render`, `nextStep` is emitted ONLY when the rendered contract has a non-empty `actionSpec` and points at `ggui_consume`; pure-display renders get no `nextStep`. Distinct from `actionSpec[name].nextStep`, which is an author-declared **tool hint** on a contract action (see [`nextStep` tool hint](#nextstep-tool-hint)). ### MCP service [Section titled “MCP service”](#mcp-service) A self-contained MCP server mounted at its own HTTP path (e.g. `/docs`, `/playground/todos`) with its own tool catalog, auth adapter, and lifecycle. Distinct from an `McpServerMount`, which aggregates tools onto the audience-filtered shared routes (`/mcp`, `/protocol`, `/ops`); a service stands alone at its mount path. Used when a tool group needs route-level isolation — a different auth posture, a tighter tool catalog, or a public read-only surface. 
→ See [MCP services](/architecture/mcp-services/). *** ## Capabilities [Section titled “Capabilities”](#capabilities) ggui has two symmetric **capability surfaces** — one for what the *agent* can do, one for what the *renderer* can render. Both are declared per-app and bounded by the operator, not the agent. They mirror each other across the wire: the agent uses its tools to drive a render; the renderer uses its gadgets to render the result. | | Renderer (renders UI) | Agent (drives render) | | --------- | ----------------------------------- | ------------------------- | | Unit | **gadget** | **tool** | | Catalog | `clientCapabilities.gadgets` | `agentCapabilities.tools` | | Lifecycle | Loaded at iframe boot, SRI-verified | Bound at render start | | Authoring | `ggui.gadget.json` manifest | MCP tool code | ### Gadget (renderer-side capability) [Section titled “Gadget (renderer-side capability)”](#gadget-renderer-side-capability) A self-contained bundle the UI generator picks up when assembling a UI. Concrete examples: a Leaflet gadget exports a `LeafletMap` component the generated UI renders; a Stripe gadget exposes a checkout flow; a Calendar gadget exposes date-picker behaviour. Gadgets bridge third-party JS into the ggui iframe runtime via the `createGguiGadget` factory; a contract references them through the package-keyed `clientCapabilities.gadgets` map (outer key = npm package, inner key = export name), and they load SRI-verified at boot. Authored as `ggui.gadget.json` manifests and published to a marketplace registry (resolved per command from `--registry` / `ggui.json#registry` / `GGUI_REGISTRY`; installs fall back to `registry.ggui.ai`). The term replaced the older `clientLibraries` / “Client Libraries” name; old external references may still use it. 
### Tool (agent-side capability) [Section titled “Tool (agent-side capability)”](#tool-agent-side-capability) An MCP tool the agent has access to when orchestrating renders — `ggui_handshake`, `ggui_render`, `ggui_consume`, plus any custom tools the operator wires in. Tools are the agent-side counterpart of gadgets: gadgets give the *renderer* something to render with; tools give the *agent* something to act with. Declared per-app in `agentCapabilities.tools` and surfaced to the model via the MCP server. Blueprints sit in a different category — they’re not capabilities, they’re *cached responses* (pre-composed UIs) that short-circuit fresh generation. See [Blueprint](#blueprint) below. ### Ops tool [Section titled “Ops tool”](#ops-tool) An MCP tool tagged with `audience: ['ops']`, surfaced on the server’s `/ops` route (e.g. `http://127.0.0.1:6781/ops`). Every action the console UI can perform is also available as an ops tool, so an LLM operator agent can drive the same workflows programmatically — list apps, rotate keys, inspect renders, yank blueprints. Wire-name prefix is `ggui_ops_*`, distinguishing them from agent-facing `ggui_*` tools at the route level. → See [Ops MCP](/api/ops-mcp/). ### Theme [Section titled “Theme”](#theme) A per-app visual overlay. `AppTheme = {mode: 'light' | 'dark', cssVariables, name?}` — where `cssVariables` is a map of `--ggui-*` custom properties — is snapshotted onto each GguiSession at render-commit and applied at the iframe `:root` after the base design tokens, so operator values win the cascade. Presets ship in `@ggui-ai/design` (7 today); agents discover them via `ggui_list_themes` and override per render with `ggui_render({themeId})`. Resolution chain: `GguiSession.themeId` → `App.defaultThemeId` → server fallback. *** ## Generation [Section titled “Generation”](#generation) ### Blueprint [Section titled “Blueprint”](#blueprint) A cached UI primitive — a stable, reusable card whose code, contract, and prompt are pre-computed. 
Blueprints are matched during `ggui_handshake` by intent + contract similarity (semantic search + LLM rerank); the paired `ggui_render` then serves the cached component — a hit renders in \~100 ms versus \~3 s for fresh generation. Project-local blueprints are authored as `ggui.ui.json` manifests and declared via `ggui.json#blueprints.include` globs; for marketplace publishing, a blueprint repo carries a `ggui.blueprint.json` artifact manifest (`ggui blueprint publish`). Where **gadgets** are ingredients the LLM composes with and **tools** are actions the LLM invokes, **blueprints** are recipes the LLM can return directly — a cache short-circuit for already-solved screens. ### Primitive [Section titled “Primitive”](#primitive) A leaf-level UI component declared in a primitive catalog (e.g. Button, Input, SearchField). Primitives are catalog-declared, not protocol-declared — the agent doesn’t ship a fixed component set; the operator declares which primitives are available via `ggui.json#primitives`. Component levels nest: **primitive** (Button) → **component** (SearchField) → **composite** (LoginForm, Modal) → **template** (ListDetail, Dashboard page). ### `shellType` [Section titled “shellType”](#shelltype) The visual shell the renderer wraps a render in. Three values today: `chat` (conversation pane), `fullscreen` (whole-window app), `spatial` (XR / immersive surface). Carried on `interfaceContext.shellType` and read by the runtime when mounting a render. *** ## Auth [Section titled “Auth”](#auth) ### Anonymous service [Section titled “Anonymous service”](#anonymous-service) An `McpService` mounted with `anonymous: true`. Auth becomes OPTIONAL, not skipped: a presented valid bearer still resolves to the caller’s real identity; only a missing or invalid credential falls back to a synthesized `{identity: {kind: 'builder'}, source: 'anonymous'}` principal. Safe for read-only public surfaces (e.g. 
the docs MCP service that serves how-to lookups); tools that mutate or read tenant-scoped data must re-impose auth at the handler level. → See [MCP services](/architecture/mcp-services/). ### Connector / connector key [Section titled “Connector / connector key”](#connector--connector-key) A `ggui_user_*` API key that authenticates an agent runtime against a ggui MCP server. Locally, `ggui keys create --keys-file ` mints bearers that `ggui serve --keys-file` accepts — no account needed. On the hosted cloud’s universal endpoint (coming soon), keys are minted via [`ggui keys create`](/cli/login/#manage-keys) after [`ggui login`](/cli/login/) (Preview — managed cloud, coming soon). Connector keys are scoped to one user; revoking via `ggui keys revoke ` is a soft-revoke (the audit row stays, but every subsequent request from the key returns `401 invalid_grant`). Distinct from the **bearer** minted by `ggui serve`’s pairing flow — that bearer is per-session and per-pairing, not user-keyed. ### `wsToken` [Section titled “wsToken”](#wstoken) A short-TTL opaque credential minted at `ggui_render` and delivered to the iframe on `_meta["ai.ggui/render"]`, paired with `wsUrl`. The iframe presents it on the WebSocket upgrade (`?wsToken=`) and in `SubscribePayload.wsToken`; the server validates it against the subscribing `sessionId` + `appId`. Refreshed without a re-handshake via the runtime-audience tool `ggui_runtime_refresh_ws_token`. ### `AuthAdapter` [Section titled “AuthAdapter”](#authadapter) The pluggable seam in `@ggui-ai/mcp-server` that decides who’s authenticated on `/mcp` and the live-channel `/ws` upgrade. The default for `ggui serve` is dev-mode pairing; production composes `createGguiServer({ auth })` with a custom adapter (OIDC, Cognito, custom). → See [`ggui serve` → Production hardening](/cli/serve/#production-hardening). 
***

## See also

* [Protocol overview](/protocol/overview/) — the three-channel topology these terms hang off.
* [Envelopes](/protocol/envelopes/) — wire reference for `ActionEnvelope` and `StreamEnvelope`.
* [MCP Protocol](/api/mcp-protocol/) — MCP method reference (`ggui_handshake`, `ggui_render`, `ggui_update`, `ggui_consume`).
* [WebSocket Protocol](/api/websocket-protocol/) — live-channel message-type reference.
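
The theme resolution chain from the Theme entry (`GguiSession.themeId` → `App.defaultThemeId` → server fallback) can be sketched as a first-defined-wins lookup. This is an illustration only: `resolveThemeId` and the two trimmed-down interfaces are hypothetical names, not exports of any ggui package, and the real resolver may validate ids against the preset list.

```ts
// Hypothetical, trimmed-down shapes: only the fields the chain reads.
interface SessionThemeRef {
  themeId?: string;
}
interface AppThemeRef {
  defaultThemeId?: string;
}

// First defined id wins: session, then app default, then server fallback.
function resolveThemeId(
  session: SessionThemeRef,
  app: AppThemeRef,
  serverFallback: string,
): string {
  return session.themeId ?? app.defaultThemeId ?? serverFallback;
}
```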

# How ggui works

> The four moments of a ggui exchange — handshake, render, interact, consume — explained end-to-end in five minutes.

A walk-through for agent developers. You’ll come out of this with a working mental model of what happens between the moment your agent calls `ggui_handshake` and the moment the user submits the form. Five minutes. No setup required — this is conceptual.

## The four moments

Every ggui exchange is the same four moments, in order:

```plaintext
1. HANDSHAKE   Post a draft contract; the server routes a suggestion
2. RENDER      Accept or override; the server mints an MCP-Apps resource
3. INTERACT    The host mounts it; the user fills the UI and submits
4. CONSUME     Drain the user's gestures off a render-scoped pipe
```

The rest of this page expands those four moments into a story.

## 1. Handshake — the wire surface is negotiated

Your agent’s first call is `ggui_handshake` — the server runs blueprint-search + contract-validation in parallel and returns a routed suggestion. (These are MCP tool calls the LLM emits; there is no client SDK — the shapes below are the tool input → output.)

```ts
// ggui_handshake tool — input:
ggui_handshake({
  intent: "collect feedback after a support chat",
  blueprintDraft: {
    contract: { /* propsSpec, actionSpec, ... */ },
  },
});
// → { handshakeId, action, suggestion }
```

The returned `suggestion.origin` is `cache` (existing blueprint matched), `agent` (gen against the draft), or `synth` (gen against an amended draft). No UI is generated yet — the agent commits next, on render.

Each render is independent: each `handshake → render` pair mints a fresh **GguiSession** — the protocol’s unit for one rendered UI — keyed by `sessionId`. There is no conversation-level session object; conversation-scoped grouping (sibling renders inside the same chat) flows through the `_meta["ai.ggui/host-session"]` slice — captured ONCE at creation.

→ See [`ggui_handshake`](/api/mcp-protocol/) for the wire shape.
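
The three `suggestion.origin` values listed above can be modeled as a string union. The sketch below is illustrative: `SuggestionOrigin`, `HandshakeResultSketch`, `describeOrigin`, and `needsGeneration` are hypothetical names, and the result shape keeps only the fields this page mentions.

```ts
// Assumed union, taken from the three origin values described above.
type SuggestionOrigin = "cache" | "agent" | "synth";

// Hypothetical subset of the handshake result; the real payload has more fields.
interface HandshakeResultSketch {
  handshakeId: string;
  suggestion: { origin: SuggestionOrigin };
}

// Human-readable meaning of each origin, as stated in the text.
function describeOrigin(origin: SuggestionOrigin): string {
  switch (origin) {
    case "cache":
      return "existing blueprint matched";
    case "agent":
      return "generation against the draft";
    case "synth":
      return "generation against an amended draft";
  }
}

// Only a cache hit skips generation when the agent later commits on render.
function needsGeneration(result: HandshakeResultSketch): boolean {
  return result.suggestion.origin !== "cache";
}
```

An agent could use a check like `needsGeneration` to set expectations about latency before it commits with `ggui_render`.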
## 2. Render — the UI gets generated (or matched)

Now the agent commits against the prior handshake’s suggestion — `props` is required; omit `override` to accept the suggestion as-is:

```ts
// ggui_render tool — props required; omitting `override` accepts
// the handshake suggestion:
ggui_render({
  handshakeId,
  props: { question: "How did the session go?" },
  // or re-aim: override: { contract: {...} } / { variance: {...} }
});
// → { sessionId, resourceUri, action, ... }
```

Server-side, materialisation runs one of two paths — the path was already chosen at handshake time, render just executes it:

1. **Cache delivery** (`suggestion.origin === 'cache'`). A matching blueprint was found during handshake; render serves the cached component. ~100 ms.
2. **Fresh generation** (`origin === 'agent'` or `'synth'`). The server runs the LLM-driven UI generator (`@ggui-ai/ui-gen`) — plan → impl → check → derive. The output is a TSX component compiled to JS, plus a typed **contract** describing the actions the user can take and the data they can submit. ~3 s.

Either way, any gadgets the component imports (Leaflet, Stripe, Calendar, …) resolve from the app’s declared gadget set (stdlib floor + `ggui.json#app.gadgets`) and load SRI-verified at iframe boot.

The agent gets back a `sessionId` (globally unique UUID for the delivered render) and a `resourceUri` (`ui://ggui/render/`). The render is an MCP-Apps resource — there is no clickable URL the agent forwards; a host mounts the resource.

→ See [`ggui_render`](/api/mcp-protocol/) for the wire shape.

## 3. Interact — the user fills the UI

A host mounts the render — your app via ``, or an MCP-Apps host like claude.ai inline. The renderer:

1. Hits the **bootstrap channel** — fetches the compiled component bundle (SRI-verified)
2. Mounts the component in an iframe with the props the agent rendered
3. Connects the **live channel** — a WebSocket subscription scoped to this render
4. When the user submits, the component dispatches an `ActionEnvelope` like `{ type: "data:submit", payload: {...} }`. The server validates the payload against the contract’s `actionSpec`.

The renderer is **stateless** between page loads — props come from the server, state comes from the user, and the server is the source of truth for the render’s state.

→ See [Envelopes](/protocol/envelopes/) for the live-channel wire reference.

## 4. Consume — the action lands back with the agent

Actions are agent-routed. The server queues every gesture on a render-scoped pipe; the agent drains it by calling `ggui_consume` (long-poll, keyed by `sessionId`):

```ts
// ggui_consume tool (long-poll) — returns { events, status }:
const { events, status } = ggui_consume({ sessionId, timeout: 25 });
for (const event of events) {
  if (event.intent === "submit_feedback") {
    await processFeedback(event.actionData);
  }
}
```

Each row is a `ConsumeEventEntry`: `{ type: 'action', sessionId, intent, actionData, uiContext, actionId, firedAt }`. `intent` is the action key from the contract’s `actionSpec`; `actionData` is the typed payload (validated against `actionSpec[intent].schema`). `status` is `'active'` until the render’s TTL elapses (`'expired'`) — exit the loop once you have the events you need, or when `status` is `'expired'`.

An `actionSpec` entry may carry a `nextStep: ''` hint naming one of the contract’s `agentCapabilities.tools` — an **advisory** hint for the agent’s planner. Implementations MUST treat it as advisory; the agent owns the call decision.
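
The `ConsumeEventEntry` row shape above lends itself to a small intent router. The sketch below is illustrative, not part of any ggui package: the field types (for example `firedAt` as a string) are assumptions, since the text gives only the field names, and `routeEvents` is a hypothetical helper. It returns the `actionId` of every event that had no handler, so the caller can log or ignore them.

```ts
// Field names come from the row shape above; the field TYPES are assumptions.
interface ConsumeEventEntry {
  type: "action";
  sessionId: string;
  intent: string; // action key from the contract's actionSpec
  actionData: unknown; // typed payload, already validated server-side
  uiContext: unknown;
  actionId: string;
  firedAt: string; // assumed to be a timestamp string
}

type IntentHandler = (entry: ConsumeEventEntry) => void;

// Dispatch each event to the handler registered for its intent.
// Returns the actionIds of events that no handler claimed.
function routeEvents(
  events: ConsumeEventEntry[],
  handlers: Map<string, IntentHandler>,
): string[] {
  const unhandled: string[] = [];
  for (const entry of events) {
    const handler = handlers.get(entry.intent);
    if (handler) {
      handler(entry);
    } else {
      unhandled.push(entry.actionId);
    }
  }
  return unhandled;
}
```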

Agent-less `ggui serve` deployments take the same path: events queue on the consume buffer until an agent attaches and drains them — the server never invokes a tool on the user’s behalf. There is no second routing model.

When the agent wants to refresh the visible card in response to an event (e.g. show a confirmation, splice in new data), it calls `ggui_update` (keyed by `sessionId`, `kind: 'replace' | 'merge'`) — the iframe receives the new props on the live channel without a fresh `ggui_render`. Then loop back to `ggui_consume`. Rule of thumb: if your reaction ran a domain tool that changed what the card displays, call `ggui_update` *before* re-calling `ggui_consume` — skipping it is the most common wire-compliance bug.

→ See [`ggui_consume`](/api/mcp-protocol/) and the `ConsumeEventEntry` row shape on the same page.

## What you didn’t have to do

Notice what your agent code did *not* have to handle:

* **No UI authoring.** The component code was generated or matched from cache.
* **No WebSocket plumbing.** The renderer connects to the live channel on its own; you didn’t open a socket.
* **No state management.** The server holds render state. You called `ggui_consume` and got events.
* **No SDK lock-in.** Everything above is plain MCP tool calls — works from any MCP client.

That’s the protocol. The OSS `ggui serve` running locally (`ws://127.0.0.1:6781/ws`) is the reference implementation; a hosted endpoint at `mcp.ggui.ai` (`wss://mcp.ggui.ai/ws`) is coming soon — both speak the same wire.
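
The consume loop from section 4, including the rule of thumb about calling `ggui_update` before re-calling `ggui_consume`, can be sketched against an injected client. Everything named here is hypothetical: `ToolClient` stands in for however your MCP client issues the `ggui_consume` and `ggui_update` tool calls, and `drainSession` is one possible loop shape that runs until the render reports `'expired'`.

```ts
type ConsumeStatus = "active" | "expired";

interface ConsumeResult {
  events: { intent: string; actionData: unknown }[];
  status: ConsumeStatus;
}

// Hypothetical wrapper over the two MCP tool calls; not a ggui SDK.
interface ToolClient {
  consume(sessionId: string): Promise<ConsumeResult>;
  update(
    sessionId: string,
    kind: "replace" | "merge",
    props: Record<string, unknown>,
  ): Promise<void>;
}

// A reaction returns new props when it changed what the card shows, else null.
type Reaction = (event: {
  intent: string;
  actionData: unknown;
}) => Promise<Record<string, unknown> | null>;

async function drainSession(
  client: ToolClient,
  sessionId: string,
  react: Reaction,
): Promise<number> {
  let handled = 0;
  for (;;) {
    const { events, status } = await client.consume(sessionId);
    for (const event of events) {
      const nextProps = await react(event);
      if (nextProps !== null) {
        // Push the new props BEFORE the next consume call.
        await client.update(sessionId, "merge", nextProps);
      }
      handled += 1;
    }
    if (status === "expired") return handled;
  }
}
```

A real agent would usually leave the loop earlier, as soon as it has the events it needs; running to expiry keeps the sketch short.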
## Next

* **Build something** — [OSS Quick Start](/oss-quickstart/) (local); the [Hosted Quick Start](/getting-started/) with `mcp.ggui.ai` is coming soon
* **See the wire** — [MCP Protocol](/api/mcp-protocol/), [Envelopes](/protocol/envelopes/), [WebSocket](/api/websocket-protocol/)
* **Look up a term** — [Glossary](/glossary/)
* **Look at example agents** — [Claude](/examples/claude-agent/), [OpenAI](/examples/openai-agent/), [Gemini](/examples/gemini-agent/), [raw MCP](/examples/generic-mcp/)
* **Already shipped a SaaS?** — [Agentic App Builders](/agentic-app-builders/) covers the (in-design) path to make an existing app agent-drivable.

# OSS Quick Start

> Scaffold a ggui agentic app with `@ggui-ai/create-agentic-app` and run it locally — no account, no cloud, no ggui API key.

Scaffold a complete ggui agentic app and run it locally — your agent, the ggui MCP server, a sample MCP server, and a web client — in one `npx`.

```plaintext
agent ── MCP ──→ ggui serve ──→ MCP-Apps resource ui://ggui/render/
```

## Prerequisites

* **Node.js** 20+
* **pnpm**, **npm**, or **bun** — any recent package manager.
* An MCP-capable agent runtime (Claude Desktop, Claude Code, Cursor, or anything that reads `.mcp.json`).
* No account, no API key. Everything is local.

## Step 1: Scaffold a project

```bash
npx @ggui-ai/create-agentic-app my-app
cd my-app
pnpm install # or: npm install / bun install
```

`@ggui-ai/create-agentic-app` scaffolds a small monorepo from an official template (`claude-agent-sdk` / `openai-agents-sdk` / `google-adk`): `servers/ggui/` (the ggui MCP server config), `servers/agent/` (your agent), `servers/mcps/todo/` (a sample MCP server), and `apps/web/` (a web client). See [`@ggui-ai/create-agentic-app`](https://www.npmjs.com/package/@ggui-ai/create-agentic-app) for the template list.

## Step 2: Boot the server

Set an LLM key in `.env.local` at the project root (any one — the boot probe walks `anthropic` → `openai` → `google` → `openrouter`):

```bash
ANTHROPIC_API_KEY=sk-ant-...
```

```bash
pnpm dev
```

`pnpm dev` boots the whole project together — the ggui MCP server (`ggui serve` from [`@ggui-ai/cli`](https://www.npmjs.com/package/@ggui-ai/cli), embedding `@ggui-ai/mcp-server`), your agent, the sample MCP server, and the web client.
The ggui server (`servers/ggui`) defaults to `http://127.0.0.1:6781`:

* **MCP endpoint:** `http://127.0.0.1:6781/mcp`
* **WebSocket (live channel):** `ws://127.0.0.1:6781/ws`
* **Operator console:** `http://127.0.0.1:6781/`

Granular scripts (`dev:ggui`, `dev:agent`, `dev:mcps`, `dev:web`, `dev:stop`) run the pieces individually — see the scaffold’s `README`.

## Step 3: Point an agent runtime at the local MCP

Add this entry to the scaffold’s existing `.mcp.json` (it already contains a `ggui-dev` helper server):

```json
{
  "mcpServers": {
    "ggui": {
      "url": "http://127.0.0.1:6781/mcp",
      "headers": { "Authorization": "Bearer dev" }
    }
  }
}
```

Claude Desktop, Claude Code, Cursor, and any runtime that reads `.mcp.json` will discover the agent-facing tool catalogue — `ggui_handshake` / `ggui_render` / `ggui_update` / `ggui_emit` / `ggui_consume`, plus discovery tools (`ggui_get_session`, `ggui_list_sessions`, `ggui_list_gadgets`, `ggui_list_themes`, blueprint search/render). The flow is **`handshake` → `render`**, with `ggui_update` to mutate props on a delivered render.

Write a system prompt that tells the LLM when to render UIs — the [Examples](/examples/claude-agent/) section has working recipes per framework.

## What works today

* ✅ **Local server + wsToken-gated WebSocket subscribe → ack** work end-to-end — the iframe receives `wsUrl`/`wsToken` on the `ai.ggui/render` slice. Render plumbing is production-shaped, not a mock.
* ✅ **`handshake` → `render`** delivers an MCP-Apps resource (`ui://ggui/render/`) that an MCP-Apps host mounts.
* ✅ **Component generation is wired via BYOK.** Export `ANTHROPIC_API_KEY` (or `OPENAI_API_KEY` / `GOOGLE_API_KEY` / `OPENROUTER_API_KEY`) before `ggui serve` and `ggui_render` runs the real `@ggui-ai/ui-gen` pipeline (blueprint match → synth).
  Without any key, `ggui_render` returns a “Connect Claude” card pointing at the local `/settings` page rather than failing.
* ✅ **Per-app theming.** The scaffold’s `servers/ggui/ggui.json` sets `theme: {preset: 'indigo', mode: 'dark'}`; agents can list presets via `ggui_list_themes` and override per render with `ggui_render({themeId})`.
* 🔒 **Default auth is pair-minted.** `/mcp` rejects bearers that weren’t issued by the pairing flow. Pass `--dev-allow-all` to accept any non-empty bearer as `builder` (safe only on `127.0.0.1`), or swap in a real `AuthAdapter` via `createGguiServer({ auth })` before exposing the server beyond localhost.

## What this is not

* **Not a hosted service.** No ggui cloud account, no billing, no managed dashboards. For those, see the [Hosted Quick Start](/getting-started/) (coming soon).
* **Not a managed billing surface.** Generation runs on YOUR provider key (BYOK) — every `ggui_render` bills the provider directly. The hosted ggui cloud (coming soon) will bundle credit-metered billing and managed keys; this OSS path runs on your own provider key today.

## Vocabulary

* **Tool** — an agent-side action exposed over MCP (`ggui_render`, `ggui_handshake`, …).
* **Gadget** — a renderer-side capability surfaced inside the viewer (formerly `clientLibraries`).
* **Blueprint** — a cached component recipe matched before generation runs.

Full definitions: [Glossary](/glossary/).

## Just the bare MCP server

The scaffold orchestrates everything through `pnpm dev`.
If you only want the OSS MCP server — no agent or web supervision — run [`ggui serve`](/cli/serve/) against a `ggui.json` directly (the same `ggui serve` the scaffold’s `servers/ggui` runs):

```bash
# = ggui serve --mcp-only --dev-allow-all --port 6781
pnpm --filter ./servers/ggui start
```

Working from a monorepo clone rather than the published packages? Build the workspace copies first:

```bash
pnpm --filter @ggui-ai/cli build
node packages/ggui-cli/dist/cli.js serve --mcp-only
```

## What’s next

* **[MCP Protocol Reference](/api/mcp-protocol/)** — wire format and tool catalogue (same on OSS and hosted).
* **[WebSocket Protocol](/api/websocket-protocol/)** — live-channel envelopes (`ActionEnvelope`, `StreamEnvelope`).
* **[Examples](/examples/claude-agent/)** — MCP-config-only integrations with system-prompt recipes (Claude, OpenAI, Gemini, OpenClaw, generic MCP).
* **[Hosted Quick Start](/getting-started/)** — managed generation and dashboards on ggui cloud (coming soon).
* **[GitHub repo](https://github.com/ggui-ai/ggui)** — source, issue tracker, full monorepo walkthrough.
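
The boot probe order from Step 2 (`anthropic` → `openai` → `google` → `openrouter`) can be sketched as a first-match scan over the four environment variables named on this page. This is an illustration, not the probe's actual source: `pickProvider` and `PROBE_ORDER` are hypothetical names, and skipping blank values is an assumption about how empty keys are treated.

```ts
// Provider order as stated in Step 2; env var names as listed under "What works today".
const PROBE_ORDER = [
  { provider: "anthropic", envVar: "ANTHROPIC_API_KEY" },
  { provider: "openai", envVar: "OPENAI_API_KEY" },
  { provider: "google", envVar: "GOOGLE_API_KEY" },
  { provider: "openrouter", envVar: "OPENROUTER_API_KEY" },
] as const;

type Provider = (typeof PROBE_ORDER)[number]["provider"];

// Returns the first provider whose key is set, or null when none is.
// Treating whitespace-only values as unset is an assumption of this sketch.
function pickProvider(env: Record<string, string | undefined>): Provider | null {
  for (const { provider, envVar } of PROBE_ORDER) {
    const value = env[envVar];
    if (value !== undefined && value.trim() !== "") {
      return provider;
    }
  }
  return null;
}
```

In Node you would pass `process.env`; a `null` result corresponds to the no-key case, where `ggui_render` returns the “Connect Claude” card instead of generating.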

# Bootstrap handshake

> How a host mounts the ggui renderer iframe, the postMessage contract that crosses that boundary, and the canonical bootstrap failure modes.

A ggui render ships as an [MCP App](https://modelcontextprotocol.io/) — a `ResourceContents` blob whose `text` is a thin-shell HTML document. When a host mounts that blob in an iframe, the shell loads the `@ggui-ai/iframe-runtime` bundle, which opens a live-channel WebSocket and starts rendering. This page is the handshake spec across the iframe boundary. > **Audience:** protocol implementers and third-party MCP host builders. If you’re already on `@ggui-ai/react`, the web consumer surface ([`useMcpAppsChat`](/sdk/react/) from `@ggui-ai/react/chat-helpers` + `` from `@mcp-ui/client`) wraps everything below — React Native’s equivalent host is ``. The protocol underneath those wrappers is what’s documented here. The `ProtocolError` union and the full `ObservabilityEvent` catalog ship as exported types in [`@ggui-ai/iframe-runtime`](https://github.com/ggui-ai/ggui/tree/main/packages/iframe-runtime) (re-exported through `@ggui-ai/react` so host apps need no direct renderer import); version-handshake details live in the [WebSocket Protocol reference](/api/websocket-protocol/). This page is the focused handshake spec. *** ## The boot flow [Section titled “The boot flow”](#the-boot-flow) ```plaintext host renderer (in iframe) │ │ │ 1. fetch ResourceContents from MCP server │ │ (`{contents: [{uri, mimeType:'text/html', text}]}`) │ │ │ │ 2. mount  │ │ (or <iframe src={uri} /> for http(s) URIs) │ │ ──────────────────────────────────────────────────────────▶ │ │ │ 3. <script src={runtimeUrl}> loads │ │ 4. runtime evaluates bundle │ 5. iframe → host: postMessage 'ggui:renderer-ready' │ │ ◀────────────────────────────────────────────────────────── │ │ 6. iframe → host: jsonrpc 'ui/initialize' │ │ ◀────────────────────────────────────────────────────────── │ │ 7. host → iframe: result w/ hostContext (capabilities only) │ │ ──────────────────────────────────────────────────────────▶ │ │ 8. 
host → iframe: 'ui/notifications/tool-result' carrying │ │ params._meta["ai.ggui/render"] (the bootstrap slice) │ │ ──────────────────────────────────────────────────────────▶ │ │ │ 9. parse _meta["ai.ggui/render"] │ │ 10. WS handshake (live channel) │ │ 11. subscribe + ack │ │ │ ── steady state: tools/call JSON-RPC, ggui:observe stream ── │ ``` Self-contained shells — per-render HTML documents that can inline JSON before the bundle loads — write the same slice synchronously on `globalThis.__GGUI_META__` instead; the runtime reads it directly and skips waiting for step 8. Any step from 3–11 can fail terminally. Failures surface as `postMessage({type: 'ggui:bootstrap-failed', reason, message})` from the iframe; the renderer does not recover. After step 11 (the subscribe ack), failures shift to live-channel `error` frames (e.g. a typed `CONTRACT_VIOLATION` error frame) — see [Envelopes](/protocol/envelopes/). *** ## postMessage contract [Section titled “postMessage contract”](#postmessage-contract) Four event types flow from iframe → host. Every host MUST handle the first three; `ggui:lifecycle` handling MAY be a no-op, but hosts MUST tolerate it. ### `ggui:renderer-ready` [Section titled “ggui:renderer-ready”](#gguirenderer-ready) Emitted once per render, immediately after the runtime bundle evaluates and its status DOM mounts — before `ui/initialize`. It means the bundle loaded; it does NOT mean the live channel is up. Steady-state liveness is signaled by the WS subscribe ack (and lifecycle state `code-ready`). ```ts { type: 'ggui:renderer-ready', version: string } ``` `version` is the iframe-runtime bundle version, not the protocol version. Hosts typically log it for support and diagnostics. ### `ggui:bootstrap-failed` [Section titled “ggui:bootstrap-failed”](#gguibootstrap-failed) Emitted at most once per render, when any boot step from 3–11 fails terminally. The renderer does NOT recover. 
The host MUST surface this as a user-visible error — naming the `reason` verbatim is the recommended UX (it gives operators a searchable string). ```ts { type: 'ggui:bootstrap-failed', reason: BootstrapFailureReason, // extensibly-closed union — see below message: string, // operator-readable detail } ``` ### `ggui:observe` [Section titled “ggui:observe”](#gguiobserve) Emitted multiple times per render, on happy paths and failures alike. Carries telemetry the host MAY render in a RenderInspector-style view. Hosts SHOULD forward these to their own telemetry pipeline; dropping them is allowed but blinds your operators. ```ts { type: 'ggui:observe', event: ObservabilityEvent } ``` `ObservabilityEvent` is an extensibly-closed union including `schema-version-mismatch`, `subscribe-failed`, `auth-required`. See [the implementer guide](https://github.com/ggui-ai/ggui/blob/main/docs/guides/implementing-ggui-protocol.md#observability) for the full event catalog. *** ## JSON-RPC methods the host responds to [Section titled “JSON-RPC methods the host responds to”](#json-rpc-methods-the-host-responds-to) The iframe-runtime makes JSON-RPC calls on the host over postMessage. Every implementer MUST handle these methods: | Method | When | Host responds with | | ------------------------------- | ----------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | `ui/initialize` | Once, before bootstrap (step 6 above). Params: `{ appInfo, appCapabilities, protocolVersion }` per MCP Apps spec. | `{ toolOutput: { _meta: { "ai.ggui/render": RenderMeta } }, hostContext? }`. The `hostContext` MAY carry `containerDimensions`, `availableDisplayModes`, `platform`, `deviceCapabilities`. 
| | `tools/call` | When generated component code issues a direct `tools/call` (e.g. `ggui_runtime_submit_action`). | Forward to the MCP server’s tool registry; return its response verbatim. | | `ui/open-link` | When generated component code calls a navigation primitive. | `{}` after performing host-appropriate navigation (open in new tab, deep-link, etc.). | | `ui/notifications/size-changed` | Iframe content resized; carries the new height. **Notification** (no response required). | n/a — host SHOULD adjust iframe dimensions. | | `ui/message` | Component code surfaces a chat-bound natural-language message. | `{}` after delivering to the chat surface. | | `ui/update-model-context` | Component code mutated a context slot; host forwards to the MCP server (fire-and-forget). | `{}`. | | `ui/request-display-mode` | Component code requests an enum change (`inline` / `fullscreen` / `pip`). | `{}` after honoring (or rejecting) the request. | **Bootstrap delivery.** After answering `ui/initialize`, the host sends a `ui/notifications/tool-result` notification (host → iframe, no response) whose `params._meta["ai.ggui/render"]` carries the bootstrap slice — `params` is a `CallToolResult` per the MCP Apps spec; `params.toolOutput._meta` is accepted as a back-compat alias. Self-contained shells inline the same slice synchronously on `globalThis.__GGUI_META__` and skip this round-trip. Hosts MUST NOT intercept `tools/call` traffic — verbatim forwarding to the MCP server’s tool registry is the only correct behavior. Modifying or filtering tool calls breaks the typed-channel contract enforced server-side. Liveness on the live-channel WebSocket (post-bootstrap) is a separate `type: 'ping'` WebSocket frame, not a postMessage method — see [WebSocket Protocol reference](/api/websocket-protocol/). *** ## `BootstrapFailureReason` [Section titled “BootstrapFailureReason”](#bootstrapfailurereason) Extensibly-closed union. 
Hosts MUST handle unknown values gracefully (render the raw string, don’t switch-case-throw). The canonical first-party set, grouped by source: ### Parse-time (slice-meta extractor failed) [Section titled “Parse-time (slice-meta extractor failed)”](#parse-time-slice-meta-extractor-failed) Failures observed when the slice-meta extractor runs — after the runtime bundle evaluates, before any live-channel attempt. | Reason | Cause | | -------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | | `MISSING_TOOL_OUTPUT` | The `ui/notifications/tool-result` notification’s `params` was missing or not an object (no usable `CallToolResult` payload). A well-formed `params` that lacks both `_meta` and the back-compat `toolOutput._meta` fails as `MISSING_META_GGUI_BOOTSTRAP` instead | | `MISSING_META_GGUI_BOOTSTRAP` / `BOOTSTRAP_META_MISSING` | `_meta["ai.ggui/render"]` slice absent (synonyms; the error names are legacy labels — the wire key is `ai.ggui/render`) | | `MALFORMED_BOOTSTRAP` | Bootstrap token failed structural parse | | `EXPIRED_BOOTSTRAP` | Bootstrap token’s expiry is in the past | ### Post-parse orchestration [Section titled “Post-parse orchestration”](#post-parse-orchestration) Failures observed after parse but before renderer steady state. 
| Reason | Cause | | ---------------------- | ------------------------------------------------------------------------------------------------------------------ | | `UI_INITIALIZE_FAILED` | `ui/initialize` round-trip failed before bootstrap was readable | | `WS_HANDSHAKE_FAILED` | WebSocket rejected the bootstrap credential | | `UPGRADE_REQUIRED` | Server-version not in client’s supported set (also surfaces as `kind: 'version'` ProtocolError for finer handling) | ### Transport-observable (pre-WebSocket) [Section titled “Transport-observable (pre-WebSocket)”](#transport-observable-pre-websocket) Failures the host can sometimes diagnose from outside the iframe. | Reason | Cause | | --------------------- | ------------------------------------------------------------------- | | `BUNDLE_FETCH_FAILED` | `<script src={runtimeUrl}>` failed to load | | `CSP_VIOLATION` | Host’s Content-Security-Policy blocked something the renderer needs | | `SESSION_NOT_FOUND` | Server rejected pre-handshake — render expired or never existed | | `AUTH_REJECTED` | Server rejected pre-handshake — auth context invalid | | `(string & {})` | First-party renderers MAY mint new reasons without a major bump. | For `CSP_VIOLATION` specifically, the host’s own CSP is the usual root cause. Recommended UX: ask the user to check their browser console for the blocked directive. *** ## Bundle integrity (out of band) [Section titled “Bundle integrity (out of band)”](#bundle-integrity-out-of-band) The handshake itself carries no SRI hash for `runtimeUrl` — the runtime bundle’s integrity is the server’s responsibility (immutable cache-control + same-origin or trusted CDN). Integrity hashes DO appear on the bootstrap payload, but on adjacent fields, not on the handshake: * **`_meta["ai.ggui/render"].codeHash`** — hex-encoded SHA-256 of the static-component bytes served at `codeUrl`. Paired with `codeUrl` (present together or absent together). 
Lets consumers verify content addressing without re-parsing the URL.
* **`_meta["ai.ggui/render"].gadgets[].bundleSri`**: SHA-384 SRI hash (`sha384-<base64>`) of each operator-registered gadget bundle. When present alongside `bundleUrl`, the iframe-runtime injects the gadget via `<script type="module" integrity>` so the browser refuses execution on mismatch. Absent → integrity-less dynamic `import()` (back-compat for in-tree wrappers).

A bootstrap MAY arrive without either field. The handshake’s failure modes (`BUNDLE_FETCH_FAILED`, `CSP_VIOLATION`) do NOT include a hash-mismatch class: when SRI fires, the browser blocks the script and the runtime surfaces it as a downstream `BUNDLE_FETCH_FAILED`.

***

## Recovery posture

| Failure | Recoverable? |
| --- | --- |
| Pre-render bootstrap-failed (`reason`-bearing) | Renderer will NOT auto-recover. Host MAY re-mount the iframe after fixing the root cause (refresh auth, clear CSP, etc.). |
| Post-render live-channel errors | Surfaced as typed live-channel `error` frames (e.g. `CONTRACT_VIOLATION`); renderer continues running. See [Envelopes](/protocol/envelopes/). |
| `UPGRADE_REQUIRED` (version mismatch) | Terminal under the default `versionPolicy: 'reject'`: the server closes the connection and the failure surfaces as `ggui:bootstrap-failed` with reason `UPGRADE_REQUIRED`. Servers running the legacy `'advisory'` opt-out keep the connection open; the mismatch surfaces as a `schema-version-mismatch` `ggui:observe` event instead, and the host MAY render an inline “update the client” prompt. |

***

## Vanilla quickstart

For non-React hosts, the protocol is plain `<iframe>` + manual postMessage. The React wrapper is \~180 LOC of convenience; everything below is the wire contract.

```html
<!doctype html>
<iframe id="ggui" style="width:100%;height:100vh;border:0"></iframe>
```

The `event.source` check is non-optional: without it, any window can spoof renderer envelopes. Production hosts SHOULD also validate `msg` against [the lifecycle envelope schema](https://github.com/ggui-ai/ggui/blob/main/packages/protocol/src/integrations/mcp-apps.ts) before mirroring state into trusted UI.

***

## Host obligations summary

A host that implements the JSON-RPC methods above plus the four postMessage event handlers honors the protocol’s bootstrap contract. The MUST / SHOULD breakdown:

1. **MUST** honor `_meta["ai.ggui/render"].runtimeUrl` from the resource (the iframe-runtime bundle URL).
2. **MUST** classify `ggui:bootstrap-failed` onto `BootstrapFailureReason` (or render the raw string for unknown values).
3. **MUST** surface `ggui:bootstrap-failed` as a user-visible error.
4. **SHOULD** surface `ggui:observe` events to host telemetry.
5. **MUST NOT** intercept `tools/call` JSON-RPC traffic; forward it to the MCP server’s tool registry verbatim.
6. **MUST** narrow `event.source` to the iframe’s `contentWindow` before reading any postMessage data.

These obligations are encoded in the [Conformance kit](/protocol/conformance/): a third-party host that satisfies them passes the host-implementer fixtures.
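
Obligations 2 and 6 can be sketched in a few lines. This is a minimal illustration, not the reference host: it assumes the lifecycle envelope is a plain object with a `type` string and, on failure, a `reason` string (the linked lifecycle envelope schema is the authority), and it only knows the reasons listed in the tables on this page. `KNOWN_REASONS`, `isKnownReason`, `classifyReason`, and `readEnvelope` are hypothetical names, not exports of any ggui package.

```typescript
// Reasons named in the failure tables on this page. The real union is
// extensibly-closed, so unknown strings must still be handled.
const KNOWN_REASONS = [
  "UI_INITIALIZE_FAILED",
  "WS_HANDSHAKE_FAILED",
  "UPGRADE_REQUIRED",
  "BUNDLE_FETCH_FAILED",
  "CSP_VIOLATION",
  "SESSION_NOT_FOUND",
  "AUTH_REJECTED",
] as const;

type KnownReason = (typeof KNOWN_REASONS)[number];

type Classified =
  | { known: true; reason: KnownReason }
  | { known: false; raw: string };

function isKnownReason(reason: string): reason is KnownReason {
  return (KNOWN_REASONS as readonly string[]).indexOf(reason) !== -1;
}

// Obligation 2: map onto the known set, or keep the raw string for display.
function classifyReason(reason: string): Classified {
  return isKnownReason(reason)
    ? { known: true, reason }
    : { known: false, raw: reason };
}

// Minimal structural stand-in for a postMessage event (assumed shape).
interface IncomingMessage {
  source: unknown;
  data: unknown;
}

// Obligation 6: return the envelope only when it came from the mounted iframe.
function readEnvelope(
  event: IncomingMessage,
  iframeWindow: unknown,
): { type: string; reason?: string } | null {
  if (event.source !== iframeWindow) return null; // spoofed or unrelated window
  const msg = event.data;
  if (typeof msg !== "object" || msg === null) return null;
  const { type, reason } = msg as { type?: unknown; reason?: unknown };
  if (typeof type !== "string") return null;
  return typeof reason === "string" ? { type, reason } : { type };
}
```

In a browser host, `readEnvelope` would be called from a `message` listener with the iframe’s `contentWindow` as the second argument; a `ggui:bootstrap-failed` envelope would then go through `classifyReason` before being shown to the user.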
***

## See also

* [Protocol overview](/protocol/overview/): three-channel topology; bootstrap is the path into the live channel.
* [Envelopes](/protocol/envelopes/): live-channel wire shapes (post-bootstrap traffic).
* [Conformance](/protocol/conformance/): the bar a host implementation must pass.
* [`@ggui-ai/iframe-runtime` on GitHub](https://github.com/ggui-ai/ggui/tree/main/packages/iframe-runtime): `ProtocolError` union, full `ObservabilityEvent` catalog, boot-sequence source.
* [`useMcpAppsChat`](/sdk/react/) (from `@ggui-ai/react/chat-helpers`) plus the renderer component from `@mcp-ui/client`: the web React surface that wraps everything above (React Native has its own host component).
* [`packages/console/src/routes/GguiSessions.tsx`](https://github.com/ggui-ai/ggui/blob/main/packages/console/src/routes/GguiSessions.tsx): production reference implementation.

# Conformance

> What it means to be ggui-conformant — the 4-criterion contract bar, 6-criterion protocol bar, and the conformance kit as arbiter.

> An implementation is ggui-conformant iff it passes the [conformance kit](https://github.com/ggui-ai/ggui/tree/main/packages/protocol-conformance) — a fixture-based test suite a candidate runs against. Opinion, intent, and prose-level reading of this site are NOT the arbiter; the kit is. This page describes what the ggui protocol promises in exchange for the name “protocol”, what each contract inside it must satisfy, and how the kit makes those promises observable. ## What conformance means [Section titled “What conformance means”](#what-conformance-means) Conformance for ggui means three concrete things: 1. **A third-party implementation can replace the first-party one** — agents, hosts, and ops tools that work against the OSS `ggui serve` work against the third-party server too, without source-level changes. 2. **Every named failure mode produces an observable signal** — operators see violations without running a debugger. 3. **Every breaking change to the protocol is reproducible** — “is this PR breaking?” resolves to a kit run, not a debate. If a layer that calls itself a “protocol” can’t honor these three promises, the layer is something else (a shape, an SDK feature, an author convention) and should be named accordingly. ## The Contract Bar — 4 criteria [Section titled “The Contract Bar — 4 criteria”](#the-contract-bar--4-criteria) A **runtime contract** in ggui MUST satisfy all four. Three of four is not a contract. Documenting a fourth in prose without mechanism is not a contract. ### 1. Named parties [Section titled “1. Named parties”](#1-named-parties) The contract states WHO is on each side. Not “producer” and “consumer” in the abstract — the concrete roles. The `ggui_runtime_submit_action` handler and the consume-buffer pipe. The iframe-runtime and the render-channel server. The contract author and the platform validator. If you can’t name the parties in one sentence, there is no contract — only a data type adrift. 
* ✅ “The `ggui_runtime_submit_action` handler appends every valid `kind: 'dispatch'` envelope to the render-keyed pending-events pipe; the agent drains it via `ggui_consume`.” * ❌ “Clients should respect the ordering semantics.” ### 2. Explicit obligations per party [Section titled “2. Explicit obligations per party”](#2-explicit-obligations-per-party) Each party has a list of MUSTs and MUST-NOTs. SHOULDs are discipline, not contract — a SHOULD belongs in a style guide. The obligations must be specific enough that a reviewer can literally check a diff against them. * ✅ “Agent-authored `streamSpec` MUST NOT declare a channel whose name starts with `_ggui:`.” * ❌ “Handlers should behave predictably.” An obligation without a verifiable test is a hope, not a contract. ### 3. Defined failure mode per obligation [Section titled “3. Defined failure mode per obligation”](#3-defined-failure-mode-per-obligation) When the obligation breaks, there is a **named, typed, observable failure**. Generic `throw new Error()` is not a failure mode. `console.warn()` is not a failure mode. “Log the outcome to telemetry and carry on” is not a failure mode. Acceptable mechanisms: * A canonical error-code union (`CONTRACT_VIOLATION`, `SESSION_NOT_FOUND`, `SCHEMA_MISMATCH_ERROR`, the runtime-tool rejection codes `INVALID_ACTION_KIND` / `PIPE_NOT_FOUND` / `CONTEXT_TOO_LARGE`). * A typed rejection frame the caller receives on the call that caused it (e.g. a `CONTRACT_VIOLATION` error frame on the live channel, or a `{ok:false, code}` `structuredContent` rejection on a runtime tool). * Boot-time refusal with a message naming the offender (e.g. mount tool-name collision). * Compile-time impossibility (the type has no field → consumers can’t read it). Unacceptable: silent noop, silent degradation, swallowed exception, “it’ll surface in the next call if it’s a real problem.” ### 4. Observable violation [Section titled “4. 
Observable violation”](#4-observable-violation) An operator can **see the violation without running a debugger**. Surfaces: * An error envelope/frame the caller observes synchronously (the gold standard — a `CONTRACT_VIOLATION` error frame on the live channel when an inbound action violates the contract). * A validation response the caller receives synchronously. * An operator UI that reads the above. * A structured log emitted at a known event name, documented as the contract’s telemetry point. A contract whose violations only show up in production stack traces fails this bar — an operator who inherits the system in six months can’t tell what’s working and what’s silently degraded. *** ## The Protocol Bar — 6 criteria [Section titled “The Protocol Bar — 6 criteria”](#the-protocol-bar--6-criteria) A **protocol** is the emergent layer above a set of contracts. Every layer called “a protocol” MUST satisfy all six. Five of six is a protocol-in-progress; the gap MUST be flagged explicitly and the layer treated as experimental until closed. ### 1. Wire-format specification [Section titled “1. Wire-format specification”](#1-wire-format-specification) A canonical prose spec describing every envelope shape, every field, every value constraint, and — critically — every intentional omission and why. Not just TypeScript types. Types say the shape; the spec says the **semantics**. For ggui, this lives in the site-level wire references — [Envelopes](/protocol/envelopes/), [MCP Protocol](/api/mcp-protocol/), [WebSocket Protocol](/api/websocket-protocol/) — which document every envelope shape, field, and intentional omission. ### 2. Message sequencing and state [Section titled “2. Message sequencing and state”](#2-message-sequencing-and-state) Given envelope X at time T in state S, which transitions are legal? What happens on reconnect? On resume? On concurrent emission? Crash recovery? Out-of-order delivery? A protocol without sequencing is a data type with pretensions. 
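
One sequencing rule ggui states concretely is the `seq` field on `StreamEnvelope`: server-assigned, gap-free within a render, starting at 1, and used by clients both to dedupe at-least-once deliveries and to resume via `fromSeq`. A consumer-side check for that rule might look like the sketch below. `SeqCursor` and its verdict strings are hypothetical names for illustration, and what a client does after a `"gap"` verdict (for example, resubscribing) is left open here.

```typescript
type SeqVerdict = "accept" | "duplicate" | "gap";

class SeqCursor {
  // seq starts at 1, so 0 means "nothing seen yet".
  private lastSeenSeq = 0;

  // Classify one inbound delivery and advance the cursor only on success.
  observe(seq: number): SeqVerdict {
    if (seq <= this.lastSeenSeq) return "duplicate"; // replayed delivery
    if (seq !== this.lastSeenSeq + 1) return "gap"; // something was skipped
    this.lastSeenSeq = seq;
    return "accept";
  }

  // The value a client would pass back as `fromSeq` on reconnect.
  get cursor(): number {
    return this.lastSeenSeq;
  }
}
```

The cursor deliberately does not advance on a gap, so the value handed back on reconnect still points at the last delivery that was actually applied.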
ggui’s sequencing lives in the [three-channel topology](/protocol/overview/) + the `GguiSessionStreamBuffer` interface + the `fromSeq` replay contract. Any new channel or envelope class MUST answer the sequencing question explicitly. ### 3. Version negotiation [Section titled “3. Version negotiation”](#3-version-negotiation) Producer and consumer MUST be able to agree on what version they speak, or explicitly refuse. Stamping a version on every envelope is infrastructure; actually rejecting a mismatched peer is the protocol feature. Pre-launch “advisory stamp only” is acceptable IF the launch-cutover plan names the policy flip (producer-stamp + consumer-reject). Anti-pattern: calling version-stamping “versioning” and shipping. See [Version policy](/protocol/version-policy/) for the post-launch handshake semantics. ### 4. Conformance testability at the seam [Section titled “4. Conformance testability at the seam”](#4-conformance-testability-at-the-seam) A third-party implementer can verify their implementation without running the ggui server. This means: * A public fixture table of envelopes and expected behaviors. * An executable conformance kit (a test package a third party `pnpm add`s and runs). * Coverage of every canonical failure mode — a third-party implementation that passes the kit can be used interchangeably with the first-party one. Without this, “third party adopts the protocol” means “third party reads the spec and hopes.” ### 5. Named failure modes — closed, or extensibly-closed [Section titled “5. Named failure modes — closed, or extensibly-closed”](#5-named-failure-modes--closed-or-extensibly-closed) Every way the protocol can fail has a name, a code, and a shape. Acceptable forms: * **Closed union** — exhaustive by design. Bumping requires a major version bump. Use when the set of values is fixed by an external spec or by a pre-launch design decision the protocol owners control end-to-end. * **Extensibly-closed** — e.g. 
the `channel_error` code union `'CHANNEL_UNKNOWN' | 'SESSION_NOT_FOUND' | 'SUBSCRIBE_UNAUTHORIZED' | 'POLL_FAILED' | (string & {})`, or the open `code: string` on the WS `error` frame whose canonical literals include `CONTRACT_VIOLATION` and `UPGRADE_REQUIRED`. Consumers MUST handle unknown values gracefully. Adding new values does NOT bump the protocol version. Use when producer-side failure modes are expected to grow (new tool classes, new transports, new preconditions) without consumers needing a new version to render them. Unacceptable: ad-hoc `message: string` as the only failure surface. ### 6. Vendor-neutral separation from implementation [Section titled “6. Vendor-neutral separation from implementation”](#6-vendor-neutral-separation-from-implementation) The protocol package imports nothing from a specific runtime, transport, cloud vendor, or framework. `@ggui-ai/protocol` MUST remain consumable by a third-party implementation that has never seen `@ggui-ai/mcp-server`. Violation of this criterion is the loudest signal that what you have is an SDK feature pretending to be a protocol. *** ## The conformance kit [Section titled “The conformance kit”](#the-conformance-kit) The kit lives at [`packages/protocol-conformance`](https://github.com/ggui-ai/ggui/tree/main/packages/protocol-conformance) in the open repo — a fixture-based test suite a candidate implementation runs against to demonstrate conformance. ```bash pnpm add -D @ggui-ai/protocol-conformance ``` Beyond the WS runner, the package ships pure-function catalogs for the gadget obligations — schema, registration, and resolution conformance — importable from `@ggui-ai/protocol-conformance/{schema-conformance,registration-conformance,resolution-conformance}`; the raw fixture catalog is also exported at `@ggui-ai/protocol-conformance/fixtures`. 
Programmatic and CLI entry points: ```ts import { runConformance } from "@ggui-ai/protocol-conformance"; const result = await runConformance({ serverUrl: "http://localhost:3000", auth: { kind: "bearer", token: process.env.TOKEN! }, }); if (result.failed.length > 0) process.exit(1); ``` ```bash npx ggui-protocol-conformance --url http://localhost:3000 --auth bearer:$TOKEN ``` `serverUrl` / `--url` take the implementation’s **base** `http://` / `https://` URL — the runner appends `/ws` and derives the `ws://` / `wss://` scheme itself. **Transport.** v1.0 is **WebSocket-only** — the canonical ggui transport (see the [WebSocket Protocol reference](/api/websocket-protocol/)). The kit’s `TransportConfig` is an extensibly-closed union, so later bindings (stdio MCP, HTTP long-poll) can be added post-v1.1 without breaking the public API. **Path-A vs Path-B fixtures.** The fixture catalog spans both wire-observable claims and surface-observable claims. The runner handles **Path-A** — behaviors a runner can assert from WS frames alone (no MCP-Apps-host adapter, no Playwright). **Path-B** fixtures (e.g., `bootstrap-failure`, `props-update`) require a browser-host harness driving Playwright + `page.route()` fault injection + DOM assertion; they are recorded as SKIP on the WS runner, not FAIL. The partition is intentional: Path-A FAILs are vendor-neutrality bugs the server owns; Path-B SKIPs are claims a different driver is responsible for. The kit covers: * **Envelope round-trip** — every fixture in the table can be emitted, observed, validated, and round-tripped. * **Reserved-channel authority** — the implementation rejects agent-authored `streamSpec` entries in the `_ggui:` namespace and validates the platform-owned shapes (`_ggui:preview`, `_ggui:lifecycle`). * **Schema enforcement** — `actionSpec[action].schema` ⊆ the hinted tool’s `inputSchema`, with a `SCHEMA_MISMATCH_ERROR` push-time rejection on violation. 
* **Sequencing** — `seq` is gap-free per-session, `fromSeq` replay returns the right tail, `replayTruncated` is honored when the requested cursor is unrecoverable. * **Version handshake** — schema-version stamping on every envelope; `UPGRADE_REQUIRED` emission on mismatch (post-launch — pre-launch is advisory). * **Action persistence** — an action frame round-trips to an ack carrying `payload.sequence`, proving the event landed on the GguiSession’s consume buffer (append-then-ack). * **Contract enforcement at receipt** — an action for an undeclared name is rejected with a `CONTRACT_VIOLATION` error frame and nothing reaches the consume buffer. A change is **breaking** iff at least one fixture that passed against version N now fails against version N+1 when the only change is the protocol version. See [Version policy → §2](/protocol/version-policy/#2-breaking-change-definition--the-kit-is-the-arbiter) for the policy this anchors. *** ## Pre-launch vs launch posture [Section titled “Pre-launch vs launch posture”](#pre-launch-vs-launch-posture) Pre-launch (`draft-` versions), some protocol criteria are intentionally at ⚠️ — flagged gaps with owners and closing slices. The point is not zero gaps forever; it is **zero silent gaps**. | Criterion | Pre-launch posture | Launch (v1.0) | | --------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------- | | Version negotiation | Envelope `schemaVersion`: advisory stamp; consumers MUST NOT reject on mismatch. 
Subscribe handshake (`supportedVersions` / `serverVersion`): server default `versionPolicy: 'reject'` — emits `UPGRADE_REQUIRED` and closes; `'advisory'` is a legacy opt-out for migration windows. | Same mechanisms; envelope stamps stay advisory, the subscribe handshake stays the rejection point. Mismatched majors surface as `UPGRADE_REQUIRED`. | | Conformance kit | Optional today; available for first-party servers + early third-party implementers. | Required before third-party implementers are invited to build on the protocol. | | Schema-compat checker | Render-time + console blueprint-try (`SCHEMA_MISMATCH_ERROR` rejection on the `ggui_render` tool result); operator policy `'reject' \| 'warn' \| 'off'` (default `'reject'`). | Same; default unchanged. | At launch, all load-bearing boundaries MUST be at 4/4 contract + 6/6 protocol for any layer described as a public interop surface. Load-bearing = any boundary a third-party implementer is expected to adopt. Internal-only boundaries may sit at lower scores if the gap is explicitly scoped to internal use. 
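
The subscribe-handshake row above reduces to a small decision. The sketch below assumes the simplest possible comparison (exact membership of `serverVersion` in the client’s `supportedVersions`); the real matching rules live in the Version policy page, and `negotiate` and `HandshakeOutcome` are hypothetical names, not part of `@ggui-ai/protocol`.

```typescript
type VersionPolicy = "reject" | "advisory";

type HandshakeOutcome =
  | { kind: "ok" }
  | { kind: "rejected"; code: "UPGRADE_REQUIRED" } // server closes the connection
  | { kind: "advisory-mismatch"; event: "schema-version-mismatch" }; // stays open

function negotiate(
  supportedVersions: readonly string[],
  serverVersion: string,
  policy: VersionPolicy = "reject", // documented server default
): HandshakeOutcome {
  // Assumption: exact string membership stands in for the real matching rule.
  if (supportedVersions.indexOf(serverVersion) !== -1) return { kind: "ok" };
  return policy === "reject"
    ? { kind: "rejected", code: "UPGRADE_REQUIRED" }
    : { kind: "advisory-mismatch", event: "schema-version-mismatch" };
}
```

Modelling the outcome as a tagged union keeps the two mismatch paths distinct: a host treats `rejected` as terminal, while `advisory-mismatch` is only a signal for an inline upgrade prompt.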
***

## Anti-patterns

Things that feel contract-shaped but fail the bar:

| Looks like | Is actually | Why |
| --- | --- | --- |
| “TypeScript type is the contract” | Shape-only | Types say what, not who/when/what-on-violation |
| “Docstring says MUST” | Author discipline | Prose without mechanism decays |
| “A validator function exists” | Half-contract | Validator without observable failure path wastes the check |
| “Error gets logged” | Not observable | Logs aren’t observability unless somebody reads them |
| “Pre-launch advisory” | Infrastructure, not contract | Valid interim stance; MUST flag the gap + name the flip |
| “We cover the common cases” | Denylist discipline | Rock-paper-scissors with future adversaries |
| “Three similar implementations agree” | Folklore | Interop-by-intuition is not interop |

If a new “contract” maps to a row here, either strengthen it to meet the bar or pick a different word.

***

## Vocabulary

Used precisely throughout this site:

* **Shape**: a TypeScript type with no enforcement story.
* **Convention**: author discipline; no mechanism.
* **Contract**: passes the 4-criterion bar.
* **Protocol**: passes the 6-criterion bar; a set of contracts with sequencing + versioning + conformance-kit.

If a doc says “this contract is enforced by convention”, that is a category error: either the contract framing or the convention framing is wrong.

***

## See also

* [Protocol overview](/protocol/overview/): three-channel topology and reference implementation.
* [Version policy](/protocol/version-policy/): semver semantics, breaking-change definition, deprecation timeline.
* [Envelopes](/protocol/envelopes/): wire shapes the conformance kit asserts on.
* [Conformance kit on GitHub](https://github.com/ggui-ai/ggui/tree/main/packages/protocol-conformance) — the test package itself.

# Live-channel envelopes

> Wire shapes for ActionEnvelope (inbound), StreamEnvelope (outbound), and the reserved _ggui namespace.

The live WebSocket carries two envelope shapes:

| Envelope | Direction | Carries |
| --- | --- | --- |
| `ActionEnvelope` | client → server | A canonical user action (form submit, click, …) |
| `StreamEnvelope` | server → client | An outbound delivery on a named stream channel |

Canonical wire reference. Shapes below mirror the TypeScript in [`@ggui-ai/protocol`](https://github.com/ggui-ai/ggui/tree/main/packages/protocol/src) verbatim: `types/events.ts` (ActionEnvelope), `types/live-channel.ts` (StreamEnvelope).

***

## ActionEnvelope

Inbound live-channel envelope: the body of a `type: 'action'` WebSocket message. Flat, narrow, limited to fields the server actually enforces or diagnostic fields real consumers populate today.

```ts
interface ActionEnvelope<TPayload> {
  sessionId: string;      // required: render identity
  type: EventType;        // required: see EventType below
  payload?: TPayload;     // shape depends on `type`
  clientSeq?: number;     // client-monotonic dedup hint
  schemaVersion?: string; // producer's PROTOCOL_SCHEMA_VERSION (advisory pre-launch)
}
```

| Field | Type | Required | Semantics |
| --- | --- | --- | --- |
| `sessionId` | `string` | Yes | Server enforces subscriber-render binding: envelopes whose `sessionId` doesn’t match the render the WebSocket subscriber is bound to are rejected with `SESSION_MISMATCH`. |
| `type` | `EventType` | Yes | Always `data:submit`, the only member of the `EventType` union (see below). The active render’s `actionSpec` gates which action names are accepted. (The pre-actionSpec `EventSubscription` / `DEFAULT_SUBSCRIPTION` gating shapes were deleted from `@ggui-ai/protocol`; there is no subscription gating on the wire.) |
| `payload` | `TPayload?` | No | For `type: 'data:submit'` this carries an `ActionEventValue` shape (`{action, data, tool?}`) whose `data` field is validated against the render’s `actionSpec[action].schema`. |
| `clientSeq` | `number?` | No | Client-monotonic sequence number for at-least-once dedup. Declared shape; no server enforcement today (no inbound dedup infrastructure yet). Clients SHOULD populate it when their transport can replay (e.g., reconnect-with-backfill). |
| `schemaVersion` | `string?` | No | Protocol schema version stamped by the producer. Pre-launch: advisory; consumers MUST NOT reject on mismatch. Post-launch: a major-version mismatch on receipt surfaces as `UPGRADE_REQUIRED`. |

### Fields intentionally NOT on the envelope

These fields appear on neighboring shapes but are excluded from `ActionEnvelope` by design:

| Field | Why omitted |
| --- | --- |
| `appId` | Server resolves it from the render; client-claimed values are ignored for enforcement. |
| `userId` / `user` | Diagnostic render metadata captured at subscribe time, not per-delivery. |
| `deviceInfo` / `interfaceContext` | Same as above: render-level, not action-level. |
| `componentId` / `contractHash` | Diagnostic; no enforcement consumer today. |
| `timestamp` | Server uses its own clock for ordering + log emission; client-supplied timestamps aren’t authoritative. |
| `correlationId` | The doctrine names this for agent-push ↔ user-action pairing; `sessionId` covers the narrow case today. |

### `EventType`

`EventType` is a single-member union: `'data:submit'` is the only action type the protocol recognizes.

| Type | Notes |
| --- | --- |
| `data:submit` | Carries `ActionEventValue` (`{action, data, tool?}`); schema-validated. |

The old `data:change` / `lifecycle:focus` / `lifecycle:blur` / `interaction:click` / `interaction:hover` / `interaction:scroll` / `error:validation` / `error:connection` members were removed in `draft-2026-06-12` because they had no producers.

### `ActionEventValue`

Payload shape for `data:submit`:

```ts
interface ActionEventValue<TData> {
  action: string; // action ID from the contract (e.g. "submit", "archive")
  data: TData;    // action payload (e.g. form data)
  tool?: string;  // MCP tool name mirrored from actionSpec[action].nextStep (when declared)
}
```

**Derivation.** `tool` is derived server-side from `actionSpec[action].nextStep` at event-build time; clients do not populate it. When the action has no declared `nextStep`, `tool` is absent and the agent decides the next tool freely from broader context. The hint is advisory either way; the agent owns the call decision.

***

## StreamEnvelope

Outbound live-channel envelope: the body of a `type: 'data'` WebSocket message. Carries a single delivery on a named stream channel.
```ts
interface StreamEnvelope {
  sessionId: string;       // required
  channel: string;         // required: keys into streamSpec
  mode: StreamChannelMode; // required: 'append' | 'replace'
  payload: JsonValue;      // required: channel-specific shape
  complete?: boolean;      // terminal marker for completable channels
  seq?: number;            // server-assigned monotonic outbound sequence
  schemaVersion?: string;  // producer's PROTOCOL_SCHEMA_VERSION
}
```

| Field | Type | Required | Semantics |
| --- | --- | --- | --- |
| `sessionId` | `string` | Yes | Render this delivery belongs to. |
| `channel` | `string` | Yes | Channel name. Keys into the agent’s declared `streamSpec`. Names starting with `_ggui:` are server-owned (see [Reserved channels](#reserved-channels)). |
| `mode` | `StreamChannelMode` | Yes | State-folding mode. `'append'` (default for narrative streams such as message logs and telemetry) accumulates deliveries; `'replace'` (default for snapshot streams such as task progress and session state) replaces the current value. Senders declare; receivers honor. Typically equals the channel’s declared `mode` on the spec, but the envelope is the authoritative per-delivery signal. |
| `payload` | `JsonValue` | Yes | Validated against `streamSpec[channel].schema`. Shape is channel-specific; consumers typecheck via contract inference when they use `defineContract` + `useStream`. |
| `complete` | `boolean?` | No | Terminal completion marker: truthy on the last delivery for a completable channel (one declared with `complete: true` on the spec). Consumers use this to transition subscribers into a “channel closed” state. Absent on non-terminal deliveries. |
| `seq` | `number?` | No | Render-scoped monotonic outbound sequence. Server-assigned; clients MUST NOT populate it on producer-side inputs. Gap-free within a single render, starting at 1. Used by the client to track `lastSeenSeq` for reconnect (pass it back as `SubscribePayload.fromSeq`) and to dedupe deliveries (at-least-once semantics). OPTIONAL today because hosted cloud doesn’t yet stamp `seq`; OSS `@ggui-ai/mcp-server` always populates it. A future slice promotes this to required. |
| `schemaVersion` | `string?` | No | Same advisory-pre-launch / `UPGRADE_REQUIRED`-post-launch semantics as on `ActionEnvelope`. |

### Fields intentionally NOT on the envelope

| Field | Why omitted |
| --- | --- |
| `replay` | Per-channel policy declared on `streamSpec[channel].replay`. A per-delivery field would imply replay can vary message-to-message, which it can’t. |
| `timestamp` | Replay correctness needs `seq` only; timestamp is a future optional addition driven by a concrete client-UX need. |

***

## Reserved channels

Channel names starting with `_ggui:` are **reserved** for the server. Agent-authored `streamSpec` MUST NOT declare any name in this namespace; structural validation rejects every reserved-prefix entry with the message `Stream channel '' is in the reserved '_ggui:' namespace — server-owned channels cannot be declared in agent streamSpec`.
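
The prefix rule is simple enough to check before a spec ever reaches the server. The sketch below is an illustration only: `RESERVED_PREFIX`, `isReservedChannel`, and `findReservedChannels` are hypothetical names, and the server’s own structural validation remains the authority on what is rejected.

```typescript
const RESERVED_PREFIX = "_ggui:";

// True for any name in the server-owned namespace, recognized or not.
function isReservedChannel(name: string): boolean {
  return name.slice(0, RESERVED_PREFIX.length) === RESERVED_PREFIX;
}

// Returns the offending channel names in an agent-authored streamSpec.
// An empty result means the spec declares nothing in the reserved namespace.
function findReservedChannels(streamSpec: Record<string, unknown>): string[] {
  return Object.keys(streamSpec).filter(isReservedChannel);
}
```

The check is deliberately prefix-based rather than a list of known names, so a misspelled reserved name in an agent spec is flagged the same way as a correctly spelled one.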
Two channel names are recognized today: | Channel | Owner | Payload shape | Purpose | | ----------------- | ------------- | -------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | `_ggui:preview` | Server (A2UI) | A2UI `ServerMessage` — see `@ggui-ai/preview-a2ui` | Provisional A2UI assembly stream emitted by the server during fresh-gen `ggui_render` flows. The renderer subscribes implicitly and dispatches the payload through its preview surface. | | `_ggui:lifecycle` | Server | `GguiLifecyclePayload` | Generation-progress lifecycle kinds (`handshake_started`, `handshake_completed`, `render_started`, `consume_polling`) the renderer surfaces as a progress affordance. | A typo inside the reserved namespace (e.g. `_ggui:preveiw`) is NOT recognized — delivery falls through to the normal “unknown channel” rejection rather than silently passing as a reserved channel. The `_ggui:contract-error` channel was a reserved channel in earlier drafts but was removed entirely in `draft-2026-06-11` — the channel, its payload shape, and its validator are gone. Contract violations now surface as a typed `CONTRACT_VIOLATION` error frame on the live channel (code `-32020`) on the call that caused them; nothing lands on the consume buffer. *** ## Schema versioning [Section titled “Schema versioning”](#schema-versioning) The `schemaVersion` field is present on both envelopes. It carries the producer’s `PROTOCOL_SCHEMA_VERSION` constant. **Pre-launch (`draft-` versions):** advisory. Consumers MUST NOT reject on mismatch; missing `schemaVersion` fields are normal. **Post-launch:** receiving an envelope whose major version diverges from the consumer’s known major surfaces as a live-channel error with `code: UPGRADE_REQUIRED`. 
The handshake resolution flow is documented in the [WebSocket Protocol](/api/websocket-protocol/) reference. See [Version policy](/protocol/version-policy/) for the full mapping of what counts as a major / minor / patch bump. *** ## See also [Section titled “See also”](#see-also) * [WebSocket Protocol](/api/websocket-protocol/) — live-channel message types (`subscribe`, `action`, `ack`, `render`, `data`, `error`) that carry these envelope payloads. * [MCP Protocol](/api/mcp-protocol/) — MCP JSON-RPC methods (`ggui_handshake`, `ggui_render`, `ggui_consume`, etc.). * [Protocol overview](/protocol/overview/) — the three-channel topology. * [Version policy](/protocol/version-policy/) — semver semantics, breaking-change definition, deprecation timeline. * [`@ggui-ai/protocol` source](https://github.com/ggui-ai/ggui/tree/main/packages/protocol/src) — canonical TypeScript definitions.

# Protocol overview

> ggui is an open protocol for AI agents to render interactive UIs over three wire channels. MCP-native, framework-agnostic, server-enforced.

`ggui` is an **open protocol** that lets AI agents render interactive UIs to humans. An agent describes what it needs in natural language; ggui generates React components, delivers them over a typed live channel, and routes user actions back through a server-side enforcement point.

This site documents the protocol itself, version `draft-2026-06-12` (see `PROTOCOL_SCHEMA_VERSION` in [`@ggui-ai/protocol`](https://github.com/ggui-ai/ggui/tree/main/packages/protocol/src/version.ts)). Everything below is implementable from the spec: the open-source [`ggui serve`](/cli/serve/) is the reference conformant implementation (a hosted endpoint at `mcp.ggui.ai` is coming soon). The `@ggui-ai/*` packages are the reference SDKs.

## Three parties, three channels

```plaintext
   ┌───────────────────────┐
   │      user client      │
   │  (React / RN / web)   │
   └─────┬────────────┬────┘
         │            │
       chat         live (WebSocket)
    (HTTP SSE)        ↑↓
         │            │
┌────────▼────┐   ┌───▼────────────┐
│  wrapped-   │   │                │
│  agent      │   │    core-mcp    │
│  (developer │───▶   (session     │
│   code)     │ MCP   authority)   │
└─────────────┘   └────────────────┘
           MCP (over HTTP)
```

| Channel | Direction | Transport | Purpose | Status |
| --- | --- | --- | --- | --- |
| Chat | user ↔ wrapped-agent | HTTP SSE | Conversational surface (chat-completion style) | Optional |
| MCP | agent ↔ core-mcp | MCP over HTTP | Tool control + session mutation | Mandatory |
| Live | core-mcp ↔ user client | WebSocket | Live UI session delivery + canonical user actions | Mandatory |

The MCP and live channels are the **protocol kernel**: every conformant implementation MUST provide them. The chat channel is product-mandatory (the OSS `ggui serve` ships a chat surface) but protocol-optional: an embed that uses ggui purely for typed live UIs, with no conversational chat, is still conformant.
The WebSocket endpoint is `ws://127.0.0.1:6781/ws` (default) when running `ggui serve` locally; self-hosters expose it as `wss:///ws`. (A hosted endpoint at `wss://mcp.ggui.ai/ws` is coming soon.)

## Why the live channel is mandatory

The typed live-channel contract needs a server-side enforcement point. That point is the live channel itself.

A `streamSpec` declares named streams, payload schemas (validated as `StreamEnvelope`s), replay policy, ordering, and completion semantics. Subscriptions declare which events the client UI consumes. None of that means anything unless a non-agent authority validates it on the wire. If the agent shipped bytes directly to the user, the “contract” would be documentation, not contract — see [Conformance](/protocol/conformance/) for the 4-criterion contract bar this enforces.

So **no live channel, no enforcement point, no contract.** The live channel and the typed live-channel contract are a pair — reconsider one, you reconsider the other.
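To make the enforcement-point argument concrete, here is a minimal sketch of a server-side check that refuses to forward a stream payload unless it matches what was declared for that stream. The `StreamDecl`, `Envelope`, `Verdict`, and `enforce` names and shapes are hypothetical and exist only for this illustration; they are not the `@ggui-ai/protocol` types, and the real `streamSpec` and `StreamEnvelope` shapes are defined in the Envelopes reference.

```typescript
// Hypothetical shapes for illustration only. These are NOT the
// @ggui-ai/protocol types; see the Envelopes reference for the real ones.
type FieldType = "string" | "number" | "boolean";

interface StreamDecl {
  // Field name -> primitive type every payload on this stream must carry.
  fields: Record<string, FieldType>;
}

interface Envelope {
  stream: string;
  payload: Record<string, unknown>;
}

type Verdict = { ok: true } | { ok: false; reason: string };

// Runs on the server, between the agent and the user client. The agent
// cannot skip it because the agent never talks to the client directly.
function enforce(spec: Map<string, StreamDecl>, env: Envelope): Verdict {
  const decl = spec.get(env.stream);
  if (decl === undefined) {
    return { ok: false, reason: `undeclared stream: ${env.stream}` };
  }
  for (const [name, type] of Object.entries(decl.fields)) {
    if (typeof env.payload[name] !== type) {
      return { ok: false, reason: `field "${name}" must be ${type}` };
    }
  }
  return { ok: true };
}

const spec = new Map<string, StreamDecl>([
  ["progress", { fields: { percent: "number", label: "string" } }],
]);

// A payload that matches the declaration passes.
const accepted = enforce(spec, {
  stream: "progress",
  payload: { percent: 40, label: "Loading" },
});

// A payload with the wrong type for `percent` is rejected before delivery.
const rejected = enforce(spec, {
  stream: "progress",
  payload: { percent: "40" },
});
```

If the same check lived only in the agent, nothing would stop a misbehaving agent from skipping it, which is the reason the text above places validation on the live channel.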
## Wire surfaces

Each channel maps to a documented wire surface on this site:

| Surface | Channel | What it covers |
| ------- | ------- | -------------- |
| [MCP Protocol](/api/mcp-protocol/) | MCP | JSON-RPC methods (canonical lifecycle order): `ggui_handshake`, `ggui_render`, `ggui_consume`, `ggui_update`, `ggui_emit`, `ggui_get_session`, `ggui_list_sessions`, plus gadget/theme/blueprint discovery (`ggui_list_gadgets`, `ggui_list_themes`, `ggui_list_featured_blueprints`, `ggui_search_blueprints`, `ggui_render_blueprint`) |
| [WebSocket Protocol](/api/websocket-protocol/) | Live | Subscribe / `ActionEnvelope` / ack / render / `StreamEnvelope`; replay; reconnection |
| [MCP Apps support](/api/mcp-apps/) | MCP + Live | Resource shape (`{contents:[{uri,mimeType,text}]}`); host-side iframe-runtime boot |
| [OAuth (`ggui serve --oauth`)](/api/oauth/) | MCP | OAuth 2.1 + PKCE + Dynamic Client Registration for hosts with no pre-shared bearer |

## Reference implementation

The open implementation lives at [`github.com/ggui-ai/ggui`](https://github.com/ggui-ai/ggui):

* `packages/mcp-server` — the MCP + live-channel server.
* `packages/ggui-react` — the host-side SDK.
* `packages/ggui-cli` — ships [`ggui serve`](/cli/serve/), which runs the whole protocol locally on one command.

The self-hosted open-source path is the available-now story — `ggui serve` runs the whole protocol on one command. A hosted endpoint at `mcp.ggui.ai` (same protocol, managed infrastructure, no self-host required) is coming soon.
## Core vocabulary

Three nouns recur on every page. Get them right and the rest of the spec falls into place — see the [glossary](/glossary/) for the full list.

* **Gadget** — a renderer-side capability (Mapbox, Leaflet, a charting library). Declared per-app in `ggui.json#app.gadgets` (the `@ggui-ai/gadgets` stdlib is always the floor); loaded at iframe boot. The agent never touches gadget code.
* **Tool** — an agent-side action exposed over MCP (`ggui_render`, `ggui_update`, etc.). Tools mutate session state.
* **Blueprint** — a cached recipe (component code + contract) keyed by intent. Matched on the fast path; generated on miss.

Gadgets and tools are the protocol’s two **symmetric capability surfaces** — gadgets give the renderer something to render with; tools give the agent something to act with. Both are operator-bounded and declared per-app (`clientCapabilities.gadgets`, `agentCapabilities.tools`). See [glossary → Capabilities](/glossary/#capabilities).

## What the protocol is not

* **Not a renderer.** The protocol carries component code and props; how the host evaluates that code is its concern. The reference SDK uses dynamic ESM import plus the `@ggui-ai/iframe-runtime` bundle (the self-contained ESM bundle that boots inside the MCP-Apps iframe), but a conformant host can swap in a sandboxed iframe, a server-rendered HTML pipeline, or anything else.
* **Not a UI library.** Components are generated per render by the agent; ggui ships no fixed component set. Primitives (Button, Input) are catalog-declared, not protocol-declared.
* **Not framework-coupled.** The wire is JSON over HTTP and WebSocket. React is the most-developed SDK today, but `@ggui-ai/protocol` types and `@modelcontextprotocol/sdk` (Anthropic’s official MCP TypeScript SDK) are framework-neutral.
## Versioning + conformance

The protocol follows semver. The arbiter for “what counts as a breaking change” is the **conformance kit** at [`packages/protocol-conformance`](https://github.com/ggui-ai/ggui/tree/main/packages/protocol-conformance) — a fixture-based test suite that any candidate implementation runs against. A change is **major** iff at least one conformance fixture that passed against version N now fails against version N+1, with the protocol version as the only delta.

See the [Version policy](/protocol/version-policy/) for the full semver mapping, deprecation timeline, and version support matrix.

## See also

* [MCP Protocol reference](/api/mcp-protocol/) — MCP wire grammar.
* [WebSocket Protocol reference](/api/websocket-protocol/) — live-channel wire grammar.
* [Envelopes](/protocol/envelopes/) — the live-channel wire shapes: `ActionEnvelope` (inbound) and `StreamEnvelope` (outbound), plus the reserved `_ggui:` channel namespace.
* [Bootstrap handshake](/protocol/bootstrap-handshake/) — host-side postMessage + JSON-RPC contract for mounting a session in an iframe.
* [Conformance](/protocol/conformance/) — the kit, the bar, and how to claim conformance.
* [`ggui serve`](/cli/serve/) — run the protocol locally.

# Version policy

> Major / minor / patch semantics for the ggui protocol, deprecation timeline, support matrix, and CI enforcement.

> **Semver promise:** a change to `@ggui-ai/protocol` is breaking if-and-only-if a consumer built against the prior version now fails the conformance kit.

This page defines “major”, “minor”, and “patch” on the ggui protocol, how deprecation windows work, which server ↔ client pairs are supported, and how a breaking change is migrated.

The canonical version constant is `PROTOCOL_SCHEMA_VERSION` (an alias of `PROTOCOL_VERSION` in [`@ggui-ai/protocol`](https://github.com/ggui-ai/ggui/blob/main/packages/protocol/src/version.ts)). Current value: `draft-2026-06-12`. Wire-level negotiation runs through the schema-version handshake; mismatches surface as the `UPGRADE_REQUIRED` error. The current server default is `versionPolicy: 'reject'` — on mismatch the server emits `UPGRADE_REQUIRED` and closes the connection; `'advisory'` is a legacy opt-out that keeps it open.

***

## 1. Semver semantics applied to the protocol

The protocol follows semver. What counts as each bump kind is defined by the **conformance kit** ([`packages/protocol-conformance`](https://github.com/ggui-ai/ggui/tree/main/packages/protocol-conformance)), not by surface-syntax changes in the `@ggui-ai/protocol` package.

### Major (N → N+1)

A bump is **major** iff at least one conformance-kit fixture that passed against version N now fails against version N+1 when the only change is the protocol version.

Examples of changes that are major:

* Renaming a **reserved channel** (e.g., `_ggui:preview` → `_ggui:assembly`).
* Removing a canonical live-channel error code that the kit asserts (e.g., deleting `CONTRACT_VIOLATION`).
* **Narrowing** an extensibly-closed union (e.g., collapsing `SubmitActionKind` variants so previously-recognized kinds now fail validation).
* Changing the shape of an **envelope** (`ActionEnvelope`, `StreamEnvelope`) in a way that invalidates fixtures emitted by prior-version producers — required-field additions, required-field renames, required-field type changes.
* Tightening a `MUST` / `MUST NOT` clause in the spec such that prior-conformant implementations become non-conformant.
* Changing handshake semantics in a way that rejects a subscribe that would previously have succeeded (e.g., the `versionPolicy` default flip `advisory` → `reject` — already shipped on 2026-04-24 as a pre-launch config-default change with its own release note; post-v1.0, a flip of this shape is major).
* Removing or replacing a **transport binding** previously declared conformant.

### Minor (N.x → N.(x+1))

A bump is **minor** iff every conformance-kit fixture that passed against N.x still passes against N.(x+1), and the delta is additive.

Examples of changes that are minor:

* Adding a new **`channel_error` code** literal (consumers that don’t recognize it degrade per the extensibly-closed union rule).
* Adding a new **observability event kind** to the renderer’s `ggui:observe` postMessage union (extensibly-closed — see the [Bootstrap handshake](/protocol/bootstrap-handshake/) page).
* Adding an **optional field** on an envelope. Pre-existing consumers see `undefined` and MUST behave as before.
* Adding a new **reserved channel** whose absence is not a failure mode for prior-version consumers.
* Adding a new **canonical live-channel error code** (the `code` field is typed open).
* Adding a new **transport binding** (e.g., stdio MCP, HTTP long-poll) while keeping WebSocket canonical.
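The "degrade, don't fail" rule for extensibly-closed unions is easier to see in consumer code. The sketch below is one possible consumer-side pattern, not the reference SDK's implementation: the two code literals are taken from this page, while `KNOWN_MESSAGES`, `isKnownCode`, `describeError`, and the message strings are hypothetical names chosen for the example.

```typescript
// Literals this consumer was built against. Both names appear in the spec
// text; the message strings are hypothetical.
const KNOWN_MESSAGES = {
  CONTRACT_VIOLATION: "payload rejected by the session contract",
  UPGRADE_REQUIRED: "client protocol version is not supported",
} as const;

type KnownCode = keyof typeof KNOWN_MESSAGES;

// Own-property check so inherited keys such as "toString" are not
// mistaken for known codes.
function isKnownCode(code: string): code is KnownCode {
  return Object.prototype.hasOwnProperty.call(KNOWN_MESSAGES, code);
}

// `code` is typed as an open string, so a literal added in a later minor
// still type-checks and lands in the fallback branch instead of throwing.
function describeError(code: string): string {
  if (isKnownCode(code)) {
    return KNOWN_MESSAGES[code];
  }
  return `unrecognized error code "${code}"; treating as a generic failure`;
}
```

A consumer written this way keeps working when a later minor introduces a literal it has never seen, which is what lets additions of this kind stay minor.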
### Patch (N.x.y → N.x.(y+1))

A bump is **patch** iff the only changes are documentation clarifications, fixture additions that do not change assertions on prior-version consumers, or internal implementation adjustments to first-party packages that do not affect the wire or the kit.

Examples:

* Spec prose clarifications that close a reader-ambiguity without changing obligations.
* New conformance-kit fixtures that assert behavior already required by the prior version’s spec text.
* Bug fixes in first-party servers / clients that bring them back into conformance (the bar was always there — the code now honors it).

### The `draft-` prefix

Pre-v1 versions use a `draft-` prefix (e.g., `draft-2026-06-12`). While `draft-`, the semver rules above describe **intent**, not **obligation** — the protocol reserves the right to ship breaking changes without a migration doc until the first stable release tags `v1.0`.

**Rule flip at v1.0:** once `PROTOCOL_SCHEMA_VERSION` drops the `draft-` prefix, every major bump MUST have a migration doc (CI-enforced) and every deprecation MUST honor the window described in §3.

***

## 2. Breaking change definition — the kit is the arbiter

> **A change is breaking if-and-only-if a consumer built against the prior version now fails the conformance kit.**

Anchor every future “is this breaking?” debate to a kit run. Opinion, intent, and prose-level spec reading are NOT the arbiter — the kit is.

Operational consequences:

1. If you’re unsure whether a PR is breaking, **run the kit** against the prior-version tag and against the PR branch. If fixtures that passed before now fail, the PR is major.
2. If the kit passes but you “feel” the change is risky, **add a fixture** that captures the worry.
Either the fixture passes (the change is not breaking) or it fails (the change is breaking and you just found out) — both outcomes are productive.
3. If you think a change is major but the kit disagrees, the kit has a gap. Patch the kit in the same PR; the patched kit decides.

This anchor means every change’s breaking-ness is **observable and reproducible**, not a matter of taste.

***

## 3. Deprecation timeline

Deprecation is the only graceful path from a minor-version ship to a major-version removal.

### The window

* **v(N)** — the version in which a surface is first tagged `@deprecated` (in TSDoc) AND documented in the release notes as deprecated. The surface MUST continue to work identically to the prior version; `@deprecated` is a signal to consumers, not a behavior change.
* **v(N+1)** — deprecated surface still works. Callers SHOULD migrate in this window. Release notes repeat the deprecation warning.
* **v(N+2)** — removal is allowed here at the earliest. Removal is a major bump, so this version is v(N+2) = major. The two-minor window guarantees consumers saw at least one full minor cycle with the `@deprecated` warning before removal.

### Minimum

**At least two minor versions MUST elapse between `@deprecated` and removal.** If N is the first deprecated version and N+2 is the major that removes it, the intermediate v(N+1) minor MUST ship. Shorter windows (v(N) deprecated → v(N+1) removed) are NOT allowed for non-security changes.

### Security-fix escape hatch

A **critical security fix** MAY skip the window — shipping a breaking change without a prior `@deprecated` minor — if and only if:

1. The release note explicitly names the CVE or security class.
2. The release note explains why the window was skipped.
3. A migration doc per §5 ships alongside the release.

Non-security urgency (e.g., “this mistake is embarrassing”) is NOT grounds for skipping the window.

### What “deprecated” means on the wire

The protocol does not have a wire-level “deprecated field” marker. Deprecation lives in TSDoc on the `@ggui-ai/protocol` types and in the release notes. Consumers parsing the wire cannot detect a field is `@deprecated` from the frame alone — they learn it from the package changelog.

### Policy-default flips count as breaking

Changing a **default** that the wire handshake depends on is breaking under §2 because consumers that relied on the prior default observably fail against the new default. The canonical example already happened pre-launch: the `versionPolicy` default flip `advisory` → `reject` shipped on 2026-04-24 as a config-default change with its own release note. Post-v1.0, a flip of that shape MUST use the same window: ship the new value as an opt-in in v(N), document the flip in the release notes, then flip the default in v(N+2).

***

## 4. Version support matrix

Which server versions support which client versions. Populated at launch (v1.0); the row below is illustrative — the protocol is pre-v1 (`draft-2026-06-12`) today:

| Server `PROTOCOL_SCHEMA_VERSION` | Min client | Max client | Status                   | EOL date |
| -------------------------------- | ---------- | ---------- | ------------------------ | -------- |
| *`1.0.x`*                        | *`1.0.0`*  | *`1.0.x`*  | *current (illustrative)* | —        |

### Column meanings

* **Server version** — the `PROTOCOL_SCHEMA_VERSION` the server emits in the subscribe-ack payload’s `serverVersion`.
* **Min client / Max client** — the lowest / highest entries in the client’s `CLIENT_SUPPORTED_VERSIONS` set guaranteed to subscribe successfully.
* **Status** — one of:
  * `current` — actively maintained; all fixes land here.
  * `security-only` — no new features; only security patches.
  * `EOL` — no longer maintained. Servers on EOL versions MAY refuse to start.
* **EOL date** — the date `current` transitions to `security-only`, or `security-only` transitions to EOL.

### Matrix update cadence

* New **minor** ship → append a row; prior row may update Max client.
* New **major** ship → append a new row; prior-major row transitions to `security-only` for at least 6 months before EOL.
* Security-only → EOL → at least 6 additional months.

### Out-of-matrix clients

If a client targets a version outside the matrix, the server emits `UPGRADE_REQUIRED`. Whether the connection stays open depends on the server’s `versionPolicy` at the time.

***

## 5. Migration-guide requirement

Every major bump MUST ship a migration doc at `docs/protocol/migrations/v-to-v.md`. (Current pre-v1 drafts use date-named migration docs — e.g. `2026-06-05-gguisession-reintroduction.md`; the `v-to-v.md` pattern applies from v1.0.) The CI check (§7) enforces the file’s existence; content follows a standard template with required sections:

* **What changed** — 1–2 paragraph summary.
* **Why** — motivation (conformance-kit failure mode that drove the change).
* **Breaking-change summary** — bullet list of renamed / removed / tightened surfaces.
* **Migration steps per consumer kind** — fixture authors, ConformanceHost implementers, agent builders, SDK consumers.
* **Timeline** — dates for v(N) deprecation ship, v(N+1) soft cut, v(N+2) removal.
* **Rollback procedure** — how to pin to prior version if a consumer can’t migrate fast enough.

***

## 6. Release cadence expectations

These are targets, not promises:

* **Minor (N.x → N.(x+1)):** roughly monthly. Additive adjustments driven by conformance-kit fixture gaps, new observability events, or new canonical error codes tend to accumulate on this cadence. Empty minor cycles are skipped rather than force-shipped.
* **Major (N → N+1):** annual or slower. Rare and pre-announced. A major ship is a disruption event for every downstream — the protocol leans hard on minor + deprecation windows to defer majors.
* **Patch (N.x.y → N.x.(y+1)):** ad hoc. Doc corrections, fixture additions that tighten existing assertions, and first-party impl bug fixes ship when ready.

Announcements ship in the version history maintained in [`@ggui-ai/protocol`’s `version.ts`](https://github.com/ggui-ai/ggui/blob/main/packages/protocol/src/version.ts) and in the public protocol release notes.

***

## 7. CI enforcement

Two workflows guard this policy:

* **Spec drift** — ensures spec envelope code blocks match the `@ggui-ai/protocol` types. Catches structural drift that would otherwise ship as silent breakage.
* **Version migration** — reads `PROTOCOL_SCHEMA_VERSION` on the PR head and on the base ref; if the **major** component changed, asserts the migration doc exists in the PR diff.

The second check is the policy’s teeth: you cannot ship a major bump without the migration doc the policy requires.
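The version-migration check described above can be sketched in a few lines. This is an illustrative approximation, not the repository's actual workflow: `majorOf` and `checkMigrationDoc` are hypothetical names, the `MAJOR.MINOR.PATCH` version format is assumed from the support matrix, the file name follows the `vN-to-v(N+1).md` shape used in the appendix, and skipping the check for `draft-` versions is an assumption based on the pre-v1 rule in §1.

```typescript
interface CheckResult {
  ok: boolean;
  message: string;
}

// Returns the major component of a post-v1.0 version such as "2.1.0",
// or null for anything else (including pre-v1 `draft-` versions).
function majorOf(version: string): number | null {
  const match = /^(\d+)\.\d+\.\d+$/.exec(version);
  return match ? Number(match[1]) : null;
}

// Compares the version on the base ref with the version on the PR head and,
// when the major component changed, requires the migration doc in the diff.
function checkMigrationDoc(
  baseVersion: string,
  headVersion: string,
  changedFiles: readonly string[],
): CheckResult {
  const from = majorOf(baseVersion);
  const to = majorOf(headVersion);
  if (from === null || to === null) {
    // Assumption: draft versions are exempt, per the pre-v1 rule.
    return { ok: true, message: "draft version: migration doc not enforced" };
  }
  if (from === to) {
    return { ok: true, message: "no major bump" };
  }
  // Assumed path shape; the real naming pattern is defined in section 5.
  const expected = `docs/protocol/migrations/v${from}-to-v${to}.md`;
  if (changedFiles.indexOf(expected) !== -1) {
    return { ok: true, message: `found ${expected}` };
  }
  return { ok: false, message: `major bump ${from} -> ${to} requires ${expected}` };
}
```

A CI job built on this idea would read the two version strings and the list of changed paths from the pull request, then fail the build whenever `ok` is `false`.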
***

## Appendix: Worked examples

### A.1 Adding a new live-channel error code

**Scenario:** a new failure mode — `TOOL_DENIED` — needs to land alongside the existing canonical literals on the WS `error` / `channel_error` frame.

* **Bump kind:** minor. Prior-version consumers that don’t know `TOOL_DENIED` see an extensibly-closed (`code: string`) value and MUST degrade gracefully. No kit fixture asserting the prior closed set should fail.
* **Required work:** add the constant to `@ggui-ai/protocol`, document it as a canonical live-channel error-code literal, add a conformance-kit fixture asserting the new code round-trips.
* **Migration doc:** none needed (not a major bump).

### A.2 Removing a canonical error-code literal

**Scenario:** `SCHEMA_MISMATCH_ERROR` is merged into `CONTRACT_VIOLATION` with a structured `causedBy`.

* **Bump kind:** major. A consumer that pattern-matches on `SCHEMA_MISMATCH_ERROR` now never sees it, fails its fixture.
* **Required work:** first ship `@deprecated SCHEMA_MISMATCH_ERROR` in v(N), continue emitting both `SCHEMA_MISMATCH_ERROR` and the new shape during v(N+1), then remove `SCHEMA_MISMATCH_ERROR` emission in v(N+2).
* **Migration doc:** `vN-to-v(N+1).md` (enforced by CI on the major-bump PR). Covers the `causedBy` shape, fixture changes, and consumer-side pattern-match migration.

### A.3 Renaming a reserved channel

**Scenario:** `_ggui:preview` → `_ggui:assembly` for parity with transport-layer conventions.

* **Bump kind:** major. Every consumer that subscribes to the old name now fails. Every fixture that references the old name needs updating.
* **Required work:** can NOT ship via deprecation (reserved-channel names are matched verbatim). Must bundle with other breaking changes into the next major. Migration doc is non-trivial — every producer AND consumer changes.
* **Operational note:** changes of this shape are exactly why the protocol prefers extensibly-closed unions + additive fields + separate channels over in-place renames.

### A.4 Clarifying a spec MUST

**Scenario:** the spec’s `refresh` semantics paragraph has two readings; the kit’s fixture matched only one.

* **Bump kind:** patch IF the kit’s existing assertion already covered the intended reading (prose clarifies code); minor IF the clarification surfaces a new required behavior that prior-version impls weren’t necessarily honoring (new assertion); major IF the clarification tightens behavior such that previously-conformant impls now fail.
* **Test:** run the kit against the prior-version tag. If it passes, the clarification is patch/minor. If it fails, it was major all along — the original spec text was ambiguous and shipping the clarification is a breaking change.

***

## See also

* [Protocol overview](/protocol/overview/) — three-channel topology and reference implementation.
* [Conformance kit](https://github.com/ggui-ai/ggui/tree/main/packages/protocol-conformance) — the test suite this policy treats as the arbiter.
* [Conformance](/protocol/conformance/) — the 4-criteria contract bar + 6-criteria protocol bar this policy gates.

# Gadgets SDK

> Wrap any 3rd-party browser library (Leaflet, Mapbox, Stripe, …) into an LLM-callable React hook with `createGguiGadget` from `@ggui-ai/gadgets`.

`@ggui-ai/gadgets` is the standard library of browser-capability hooks **and** the wrapper SDK that lets you bind any 3rd-party library into the same uniform contract the LLM already knows.

* **STDLIB hooks** — `useGeolocation`, `useCamera`, `useClipboardWrite`, `useClipboardPaste`, `useNotifications`, `useFilePicker`, `useMicrophone`.
* **Wrapper SDK** — `createGguiGadget` for Leaflet, Mapbox, Stripe, Chart.js, Three.js, or anything else with a browser API.

See [Glossary](/glossary/) for the gadget / tool / blueprint vocabulary, and [Marketplace Registry](/sdk/marketplace/) for distribution.

## Installation

```bash
npm install @ggui-ai/gadgets
```

## Authoring a wrapper

```ts
import { createGguiGadget } from "@ggui-ai/gadgets";

export const useLeafletMap = createGguiGadget<
  // TOutput — what the hook resolves with on `status: 'completed'`.
  { containerRef: (el: HTMLDivElement | null) => void },
  // TOptions — what the calling component passes in.
  { center: [number, number]; zoom: number }
>({
  // Canonical hook name — the export name a contract references
  // under clientCapabilities.gadgets[][]. The
  // `use`-prefixed camelCase name marks this export as a hook.
  hook: "useLeafletMap",

  // REQUIRED teaching text. Synth + decision LLM both see this with
  // a 300-char per-entry budget; description is preserved, usage
  // truncates first when over budget.
  description:
    "Render an interactive Leaflet map with tile layer and pan/zoom controls. Returns a container ref to attach to a <div>.",
  usage:
    "Mount when the intent names a rendered map (location browsing, route preview, points-of-interest grid). Pass `center: [lat, lng]` + `zoom: 2..20`.",

  // REQUIRED example. JsonValue — concrete shape the LLM uses to
  // pattern-match on cold gen.
  example: {
    call: "const map = useLeafletMap({ center: [37.7749, -122.4194], zoom: 12 });",
    returns: {
      status: "completed",
      value: { containerRef: "" },
    },
  },

  // Optional anti-patterns / library quirks. Surfaces in the
  // boilerplate generator's prompt; the LLM uses them to avoid known
  // misuse patterns.
  gotchas:
    "Leaflet requires the container <div> to have a non-zero height before the map mounts — apply `style={{ height: 400 }}` (or similar) directly. Default-marker icons require leaflet.css to be in the document; the styleUrl on this descriptor covers that.",

  // REQUIRED. `version` is part of the blueprint cache key, so a
  // wrapper bump automatically invalidates stale cached generations.
  version: "0.0.1",

  // `package` AND `version` are both REQUIRED — bare npm name, exact
  // semver pin (no URLs, no ranges). `bundleUrl` is optional; when
  // set it wins for boilerplate import emission.
  //
  // Resolution order (operator wins over author wins over spec
  // default): explicit `bundleUrl` → operator `bundleHost` override
  // → author `bundleHost` → spec default `registry.ggui.ai`. With
  // `bundleHost` + `package` + `version` the server assembles
  // `https:///bundles////bundle.js`
  // — prefer this over hardcoded `bundleUrl`.
  package: "@my-org/ggui-leaflet",
  bundleHost: "registry.ggui.ai",

  // Optional companion CSS bundle. The wrapper SHOULD self-inject
  // any required CSS in its hook; the styleUrl is purely an origin
  // declaration so the renderer's CSP allows the load. Same
  // `bundleHost`-based resolution applies — explicit `styleUrl`
  // overrides the host-derived `style.css` path.

  // API-call origins (XHR / fetch / WebSocket / `<img>`). The
  // renderer's Content-Security-Policy unions these into BOTH
  // connect-src AND img-src so map tiles loaded via <img> work.
  connect: ["https://tile.openstreetmap.org"],

  // Public env keys this wrapper consumes (see "Public env channel"
  // below). Both wire + registry zod schemas enforce the same
  // `GGUI_PUBLIC_APP_[A-Z0-9_]+` prefix as App.publicEnv. Leaflet
  // doesn't need a token; Mapbox does.
  // requires: ["GGUI_PUBLIC_APP_MAPBOX_TOKEN"],

  // The React hook. Wrappers bundle their underlying library at
  // build time (esbuild / tsup) and `import` it inside the hook.
  // The function MUST return the standard `{value, status, start, …}`
  // shape that satisfies `GadgetHook`.
  hookImpl: (props) => {
    // … wrap Leaflet's lifecycle into ggui's stable hook contract …
    return {
      value: { containerRef: () => undefined },
      status: "completed",
      start: async () => undefined,
    };
  },
});
```

The factory **synchronously zod-validates** the spec at module load. Missing `description` / `usage` / `example`, or a missing `package` / `version`, throws `WrapperConformanceError` with the field path. Authors see the error at import time — not at runtime.

`useLeafletMap` is a callable hook; the immutable wrapper descriptor is attached as `useLeafletMap.descriptor`. Operators register the descriptor on `App.gadgets`.

For distribution, wrappers ship as a `ggui.gadget.json` manifest (see [Marketplace Registry](/sdk/marketplace/)). The registry’s publish endpoint computes the bundle’s `sha384` and stamps it on the registered entry’s `bundleSri`; `ggui gadget install` writes the descriptor (including `bundleSri` + resolved `bundleUrl`) into `ggui.json#app.gadgets[]`. Hand-authored entries omit `bundleSri` — the iframe runtime falls back to integrity-less dynamic `import()`.

Non-stdlib registered descriptors also carry `typesUrl` (HTTPS URL to the wrapper’s `.d.ts`) and `typesSri` (SHA-384 SRI over those types), so the generator can type-check against the wrapper’s real surface. `typesUrl` is REQUIRED at registration time for any non-stdlib package; `typesSri` is stamped by the registry on publish (optional for hand-authored entries).
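For readers who want to see what a SHA-384 integrity value looks like, the sketch below computes one in the standard Subresource Integrity format (`sha384-` followed by the base64 digest) using Node's built-in `crypto` module. It is an illustration only: `sriSha384` is a hypothetical helper, and whether the registry hashes exactly these bytes or stores `bundleSri` / `typesSri` in exactly this string form is an assumption, not something this page specifies.

```typescript
import { createHash } from "node:crypto";

// Builds a Subresource Integrity string: "<algorithm>-<base64 digest>".
// Hypothetical helper; the registry's own implementation is not shown here.
function sriSha384(bytes: Uint8Array | string): string {
  const digest = createHash("sha384").update(bytes).digest("base64");
  return `sha384-${digest}`;
}

// Stand-in for the contents of a wrapper's bundle.js.
const bundleSource = "export const answer = 42;\n";
const bundleSri = sriSha384(bundleSource);
```

Because the digest covers every byte of the input, any change to the bundle produces a different value, which is what lets a loader detect a tampered or stale file.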
## Operator registration

Add the descriptor (or its JSON shape) to `ggui.json#app.gadgets`:

A `GadgetDescriptor` is a PACKAGE — `package` + `version` + transport metadata declared once, plus an `exports[]` array carrying each export’s teaching text:

```json
{
  "app": {
    "slug": "leaflet-demo",
    "name": "Leaflet gadget demo",
    "gadgets": [
      {
        "package": "@my-org/ggui-leaflet",
        "version": "0.0.1",
        "bundleHost": "registry.ggui.ai",
        "connect": ["https://tile.openstreetmap.org"],
        "exports": [
          {
            "hook": "useLeafletMap",
            "description": "Render an interactive Leaflet map…",
            "usage": "Mount when the intent names a rendered map…",
            "example": {
              "call": "const map = useLeafletMap({ center: [37.7749, -122.4194], zoom: 12 });",
              "returns": {
                "status": "completed",
                "value": { "containerRef": "" }
              }
            },
            "gotchas": "Leaflet requires the container …"
          }
        ]
      }
    ]
  }
}
```

A package that ships more than one export (two hooks, or a hook plus a React `component`) adds further entries to the same `exports` array — the transport metadata is declared once and shared.

The CLI threads `app.gadgets` into the in-process `AppMetadataStore` so the same singleton powers `ggui_list_gadgets`, handshake-time prompt injection, render-time validation + enrichment, and CSP derivation.

## Contract gadget refs vs `GadgetDescriptor`

Two distinct shapes — do not confuse them. One is the wire map a host agent authors on a contract; the other is the registry-side package descriptor an operator registers. Both are organized around a gadget **package** that ships one or more **exports**, where each export is either a hook (a `use`-prefixed camelCase name) or a component (a PascalCase name):

* **Contract gadget refs** — `clientCapabilities.gadgets` on a `DataContract` is a **package-keyed two-level map** carrying IDENTITY ONLY.
The outer key is the npm package name; the inner key is the export name; the inner value is a `GadgetExportUse` — `{ description?, usage? }`, usually just `{}` (optional intent-specific override prose). There is NO `hook` / `component` field on the wire — the export-name GRAMMAR is the discriminator: `useGeolocation` is a hook, `LeafletMap` is a component. There is NO `version`, NO `permission`, NO transport metadata, and NO `binding` key. The wire carries `(package, exportName)` and nothing else.
* **`GadgetDescriptor`** — the full registry-side PACKAGE descriptor (what operators register on `App.gadgets`). Package-level identity (`package` + `version`) plus transport metadata (bundle URLs, `connect`, `typesUrl` / `typesSri`) declared once, plus an `exports: GadgetExport[]` array (≥1). Each `GadgetExport` is a field-presence-discriminated union — a hook export `{ hook, … }` or a component export `{ component, … }` — carrying that export’s teaching text + per-export `permission`. `version`, transport metadata, `permission`, and `requires` ALL resolve server-side from this descriptor at render time; the contract author never restates them. `createGguiGadget` authors a single-export package; multi-export packages add further `exports` entries.

## Agent reference shape

Once a package is registered on `App.gadgets`, contracts reference its exports through the package-keyed `clientCapabilities.gadgets` map — outer key = package name, inner key = export name:

```ts
const contract = {
  // … propsSpec / actionSpec / contextSpec …
  clientCapabilities: {
    gadgets: {
      "@my-org/ggui-leaflet": {
        // PascalCase key ⇒ a component export. Empty value is the
        // common case; an optional `{ description?, usage? }` override
        // sharpens the teaching text for this specific intent.
        LeafletMap: {},
      },
      "@ggui-ai/gadgets": {
        // `use`-prefixed camelCase key ⇒ a hook export.
        useGeolocation: {},
      },
    },
  },
};
```

The wire is identity-only: render resolves each `(package, exportName)` pair against the registered `GadgetDescriptor` and enriches the persisted `ComponentGguiSession` with the canonical `version`, transport metadata, teaching text, and `permission`. To walk the map in code, call `listContractGadgets(contract)` — it flattens the package-keyed map into a `readonly GadgetUse[]`, each entry `{ package, name, description?, usage? }`.

References to unregistered exports reject at render validation with `gadget_not_registered` and a did-you-mean hint when a close stdlib match exists (Levenshtein < 3 cutoff). Render also rejects with `gadget_package_mismatch` when a referenced export name belongs to a different registered package than the one keyed.

## What the renderer derives

When ANY gadget in `clientCapabilities.gadgets` declares `bundleUrl` / `styleUrl` / `connect[]`, the render’s spec-canonical `_meta.ui.csp` block carries the derived `{ connectDomains, resourceDomains }` (per the MCP Apps spec). The MCP-Apps host applies them to the sandboxed iframe as a Content-Security-Policy — conceptually:

```plaintext
script-src 'self' 'unsafe-inline' ;
style-src 'self' 'unsafe-inline' ;
connect-src 'self' ;
img-src 'self' data:
```

* `'unsafe-inline'` on `script-src` is needed because the render shell embeds the bootstrap as an inline `