docs: Little Blue (Plan 7 brick 2) design spec

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-04 21:33:01 +10:00
parent 6ceb27fa2f
commit e58090e607
1 changed files with 80 additions and 0 deletions
--- a/docs/superpowers/specs/2026-06-04-little-blue-design.md
+++ b/docs/superpowers/specs/2026-06-04-little-blue-design.md
@@ -0,0 +1,80 @@
+# Little Blue — Design
+
+**Date:** 2026-06-04 · **Component:** Void 2.0 · **Phase:** Plan 7 (Agent Layer), brick 2 · **Status:** Approved (design)
+
+## Goal
+
+Give Little Blue, the homelab caretaker agent, the ability to **fix things** — restart lab services and power-manage Proxmox guests — through a least-privilege, tiered-approval, fully-audited action framework. The LLM is the weakest link by design: it can only name an action id from a fixed, version-controlled whitelist; it never constructs commands.
+
+## Background
+
+- Void app runs on CT 311 (`.216`) and today has **no execution access** to any host (SSH to Z = Permission denied; no Proxmox token).
+- Shared agent-chat foundation exists (`run_turn.js`, `agent_chat.js`, `personas/`) — Little Blue reuses it like Yerin did.
+- `config/services.json` provides the service→host map (action targets). `pending_changes` provides the approval lifecycle to mirror.
+- Little Blue today = the read-only health band UI + `littleblue_avatar.js`. No `little-blue` agent seeded yet.
+
+## Decisions (locked)
+
+1. **Tiered actions:** `safe` → execute directly + audit; `risky` → queue for owner approval, then execute.
+2. **Two execution channels:** scoped **Proxmox API token** for guest power; **SSH forced-command wrapper** for in-guest service restarts. Both enforce the whitelist **server-side**.
+3. **One brick:** action framework + manual UI + conversational Little Blue together.
+4. **Whitelist = `config/actions.json`** (version-controlled, immutable at runtime) — the single source of truth.
+5. **Dedicated `agent_actions` queue** (not `pending_changes`) — isolate command execution from entity-CRUD apply.
+
+## Architecture
+
+### 1. Action whitelist — `config/actions.json`
+Array of action defs. Nothing outside this file is runnable.
+```json
+{ "id": "restart-caddy-ct100", "label": "Restart Caddy on mediastack", "kind": "service_restart", "host": "ct100", "service": "caddy", "tier": "safe" }
+{ "id": "stop-ct107",          "label": "Stop iVentoy (CT 107)",       "kind": "guest_power",    "node": "z", "vmid": 107, "op": "stop", "tier": "risky" }
+```
+Tier convention: `service_restart` → `safe`; guest `start` → `safe`; guest `stop`/`shutdown`/`reboot` → `risky`. The `tier` is explicit per action (config wins).
+
+`lib/actions/registry.js`: `loadActions()` (read + validate config), `getAction(id)`, `listActions()`. Validation rejects unknown `kind`/`op`/`tier` at load.
+
+### 2. Execution channels — `lib/actions/channels/`
+- `proxmox.js`: `power(node, vmid, op)` → Proxmox REST (`POST /nodes/{node}/lxc|qemu/{vmid}/status/{op}`) with `Authorization: PVEAPIToken=...`. Env: `PROXMOX_API_URL`, `PROXMOX_API_TOKEN`. Token scoped server-side to `VM.PowerMgmt` on whitelisted guests only.
+- `ssh.js`: `restart(host, actionId)` → spawns `ssh -i <key> -o BatchMode=yes voidact@<host-ip> <actionId>`. The host's `authorized_keys` pins `command="/usr/local/bin/void-act"` (forced). The wrapper holds its OWN whitelist copy, reads `SSH_ORIGINAL_COMMAND` (the action id), maps it to `systemctl restart <service>`, and refuses anything else. The Void can send only an id; the host decides what runs.
+- Both adapters take an injectable transport (fetch / spawn) for tests; they NEVER interpolate a raw shell command from caller input.
+
+### 3. Action service — `lib/actions/service.js`
+- `runAction(id, actor)`: `getAction(id)` (404 if absent). `safe` → execute via channel now, audit, return `{ executed:true, result }`. `risky` → `agentActions.create({...status:'pending'})`, return `{ queued:true, action_row_id }`. No execution for risky.
+- `approveAction(rowId, owner)`: claim pending row → execute via channel → `status:'executed'`+result, or `status:'failed'`+error; audit.
+- `rejectAction(rowId, owner)`: `status:'rejected'`; audit.
+- `execute(action)`: dispatch on `kind` → `proxmox.power` / `ssh.restart`. Single choke point; audits every call.
+
+### 4. `agent_actions` table — new migration `016_agent_actions.sql`
+`id uuid pk, action_id text, params jsonb, agent_id uuid null, tier text, status text default 'pending' (pending|executed|failed|rejected), result jsonb, requested_by jsonb, resolved_by jsonb, created_at, resolved_at`. Repo `lib/db/repos/agent_actions.js`: `create`, `listPending`, `getById`, `resolve(id,status,result,by)` (claims `WHERE status='pending'`), `recent(limit)`.
+
+### 5. API — `lib/api/routes/actions.js` (owner-gated, mounted at `/api/actions`)
+- `GET /` → whitelist (+ optional live guest/service status later).
+- `POST /:id/run` → `runAction` (safe executes, risky queues).
+- `GET /pending` → queued risky actions.
+- `POST /pending/:rowId/approve` | `/pending/:rowId/reject`.
+- `GET /recent` → recent executed/failed (audit view).
+
+### 6. Little Blue — agent + tools + chat
+- Migration seeds `little-blue` agent (`kind:'claude'`, capabilities `{ read:true, act:true }`).
+- MCP `blueRegistry` (`lib/ai/agent/tools/blue/`): `service_status` (read health/registry), `list_actions` (the whitelist she may use), `propose_action(action_id)` → calls `service.runAction` (safe runs + reports; risky queues + tells the owner it needs approval). She only passes ids; never commands.
+- `companion-stdio.js` already selects registry by `VOID_TOOL_REGISTRY`; add `blue` → `blueRegistry`.
+- Persona (`personas/index.js` key `little-blue`): the small blue water-creature caretaker who keeps the lab alive — warm, protective, practical; calls `list_actions`/`service_status` before proposing; explains what she'll do and that risky fixes wait for the owner's nod.
+- Chat surface: `#/little-blue` view (reuses `agent_chat`, `toolLabels` for blue tools) + a **manual Actions panel**: whitelisted actions with Run buttons (risky → creates an approval card), and a pending-actions queue with Approve/Reject. Sidebar nav entry.
+
+## Safety model
+Least-privilege server-side on both channels (SSH forced-command wrapper + scoped PVE token) → even a fully compromised Void only triggers whitelisted actions. Tiered approval gates destructive ops. Full audit of request→approve→execute→result. The whitelist lives in version control, not a runtime-editable store. No code path constructs a shell command from agent/user free text.
+
+## Provisioning (deploy-time, owner-authorized)
+Create the scoped Proxmox API token (`VM.PowerMgmt` on chosen guests); generate the Void's SSH keypair; deploy `/usr/local/bin/void-act` wrapper + restricted `authorized_keys` (`command=`, `no-port-forwarding,no-pty`) on each target host; populate `config/actions.json` with the real action set; set `PROXMOX_API_URL`/`PROXMOX_API_TOKEN` + SSH key path in the app `.env`. Done interactively at the deploy task.
+
+## Testing (vitest + supertest, serial)
+- `registry`: loads/validates `actions.json`; rejects bad defs.
+- `channels/proxmox`: mocked fetch → asserts correct URL/op/token header; never called for non-whitelisted.
+- `channels/ssh`: mocked spawn → asserts argv is `[..., 'voidact@<ip>', '<action-id>']` with **no** shell string; rejects ids with shell metachars.
+- `service`: safe → executes via stub channel; risky → queues (no channel call); approve → executes; reject → no execution; all audited.
+- `agent_actions` repo: create/listPending/resolve claim-once.
+- routes: run (safe/risky), pending list, approve/reject (owner-gated).
+- Little Blue route: `propose_action` safe runs + reports; risky queues; uses fake-claude fixture. Frontend: manual.
+
+## Out of scope (YAGNI / later)
+Live guest/service status polling in the actions list; runtime-editable whitelist; arbitrary-command tool; multi-step remediation playbooks; Little Blue acting on a schedule/cron (that's the scheduled-agent track). SSH wrapper supports only `service_restart`; Proxmox token only `VM.PowerMgmt`.