Donna Management GUI¶

Developer-facing control panel for monitoring, debugging, and configuring the Donna AI assistant system.

Separate from the end-user Flutter app — this is the development/ops tool for understanding agent behavior, tracking costs, iterating on prompts/configs, and debugging data flows.

Tech Stack¶

Layer	Choice	Notes
Frontend	React 18 + Vite + TypeScript	SPA with client-side routing
Components	Ant Design 5 (dark theme)	Full admin component library
Charts	Recharts	Time series, bar, area, pie charts
HTTP Client	Axios	Proxied through Vite in dev
Backend	FastAPI (existing app, extended)	New `/admin/*` routes on port 8200
Deployment	Docker (nginx) + compose	Port 8400 in production
Dev server	Vite	Port 5173, proxies to FastAPI

Architecture¶

donna-ui (React SPA, port 5173/8400)
    │
    ├── /admin/* ──► FastAPI (port 8200)
    │                   ├── SQLite (donna_tasks.db)
    │                   ├── Loki HTTP API (port 3100)
    │                   └── Config/prompt files (disk)
    │
    └── Static assets (nginx in production)

The GUI communicates exclusively through the FastAPI admin API. No direct database or Loki access from the frontend.

Admin API Endpoints¶

All endpoints are under the /admin prefix. No authentication required today — this is a local dev tool bound to the loopback interface.

If /admin/* is ever exposed externally, the minimum bar would be:

A shared-secret bearer token (or Cloudflare Access / Tailscale header) enforced via a single FastAPI dependency — Depends(require_admin) — applied at the router layer so every admin_* router inherits it in one place.
Write-side endpoints (PUT /admin/configs/{filename}, POST /admin/skills/{id}/state, automation create/update/delete, POST /admin/skill-runs/{id}/capture-fixture) rate-limited more aggressively than read endpoints, and logged to invocation_log with caller identity.
No new persistence — reuse the existing access/auth infrastructure in src/donna/api/routes/admin_access.py (IP / device / caller tables) rather than adding a parallel admin-user table.

This is a note, not a plan — implementation is deferred until the tool leaves the loopback.

Dashboard KPIs¶

Endpoint	Description
`GET /admin/dashboard/parse-accuracy?days=30`	Parse accuracy over time from correction_log vs invocation_log
`GET /admin/dashboard/agent-performance?days=30`	Per-agent call counts, latency, cost, success rates
`GET /admin/dashboard/task-throughput?days=30`	Tasks created vs completed, status distribution, overdue count
`GET /admin/dashboard/cost-analytics?days=30`	Daily/monthly spend, by task_type, by model, projections

Log Viewer¶

Endpoint	Description
`GET /admin/logs?event_type=&level=&service=&search=&start=&end=&limit=50&offset=0`	Paginated log query via Loki with fallback to invocation_log
`GET /admin/logs/trace/{correlation_id}`	All events for a correlation ID (trace timeline)
`GET /admin/logs/event-types`	Static event type hierarchy for tree filter

Invocations¶

Endpoint	Description
`GET /admin/invocations?task_type=&model=&is_shadow=&limit=50&offset=0`	Paginated invocation log
`GET /admin/invocations/{id}`	Single invocation with full output JSON

Tasks (Admin)¶

Endpoint	Description
`GET /admin/tasks?status=&domain=&priority=&search=&limit=50&offset=0`	Extended task list with agent/nudge/quality fields
`GET /admin/tasks/{id}`	Full task detail + linked invocations, nudges, corrections, subtasks

Configs & Prompts¶

Endpoint	Description
`GET /admin/configs`	List YAML config files with metadata
`GET /admin/configs/{filename}`	Read config file content
`PUT /admin/configs/{filename}`	Write config file (validates YAML, atomic write)
`GET /admin/prompts`	List prompt template files
`GET /admin/prompts/{filename}`	Read prompt file content
`PUT /admin/prompts/{filename}`	Write prompt file (atomic write)

Agents¶

Endpoint	Description
`GET /admin/agents`	List all agents with config + summary metrics from invocation_log
`GET /admin/agents/{name}`	Detailed agent view: config, recent invocations, daily latency, tool usage, cost summary

Shadow Scoring¶

Endpoint	Description
`GET /admin/shadow/comparisons?task_type=&days=30&limit=50`	Pair primary and shadow invocations by input_hash or task_id proximity
`GET /admin/shadow/stats?days=30`	Aggregate shadow vs primary quality and cost stats
`GET /admin/shadow/spot-checks?limit=50&offset=0`	Invocations flagged for review (spot_check_queued or quality < 0.7)

Preferences¶

Endpoint	Description
`GET /admin/preferences/rules?enabled=&rule_type=&limit=50`	List learned preference rules with filters
`PATCH /admin/preferences/rules/{id}`	Toggle rule enabled/disabled state
`GET /admin/preferences/corrections?field=&task_type=&limit=50&offset=0`	Paginated correction log
`GET /admin/preferences/stats`	Aggregate preference and correction statistics

Pages¶

Session 1 (Implemented)¶

Route	Page	Description
`/`	Dashboard	4 KPI sections: Parse Accuracy, Agent Performance, Task Throughput, Cost Analytics
`/logs`	Log Viewer	Event type tree filter, level/service filters, correlation trace drawer, entity links

Session 2 (Implemented)¶

Route	Page	Description
`/configs`	Config Editor	Structured form editing for agents/models/task_types/states YAML. Raw YAML tab with Monaco editor. Diff view before save.
`/prompts`	Prompt Editor	Monaco markdown editor with live preview. Template variable inspector. Schema link from task_types.yaml.
`/agents`	Agent Details	Card grid with metrics. Detail view: config, latency chart, tool usage chart, cost summary, recent invocations.
`/tasks`	Task Browser	Filterable table (status/domain/priority/agent/search). Detail view with state timeline, linked invocations/nudges/corrections/subtasks.

Session 3 (Implemented)¶

Route	Page	Description
`/shadow`	Shadow Scoring	Side-by-side diff of primary vs shadow model outputs. Quality score scatter plot + trend chart. Spot-check queue. Filter by task_type/days.
`/preferences`	Preference Manager	Learned rules table with confidence scores, enable/disable toggle. Correction history with filters. Rule provenance drawer. Stats cards.
Polish	UX Refinements	Keyboard shortcuts (`g`+key nav, `r` refresh, `Esc` close), saved filter presets (Log Viewer), CSV export on all tables, anomaly toast notifications on dashboard refresh.

Design Decisions Made¶

Separate admin API prefix (/admin/*) rather than a separate FastAPI service — keeps deployment simple, shares DB connection, avoids service sprawl.
No auth on admin routes — this is a local dev tool. Can add admin auth later if exposed externally.
Ant Design over shadcn/ui — richer out-of-the-box admin components (tables, trees, drawers, forms).
Loki as primary log source with SQLite fallback — structured logs already flow to Loki via Promtail. Direct SQLite queries cover invocation_log when Loki is down.
Manual refresh + 30s auto-refresh on dashboard — avoids WebSocket complexity for a dev tool.
pnpm as package manager — fast, disk-efficient.
Dark theme only — this is a developer tool, dark mode is the default.

Frontend Project Structure¶

donna-ui/
├── package.json
├── vite.config.ts
├── tsconfig.json
├── tsconfig.node.json
├── index.html
├── Dockerfile
├── nginx.conf
├── src/
│   ├── main.tsx                          # App entry, ConfigProvider + BrowserRouter
│   ├── App.tsx                           # Route definitions
│   ├── vite-env.d.ts
│   ├── theme/
│   │   └── darkTheme.ts                  # Ant Design dark theme config
│   ├── api/
│   │   ├── client.ts                     # Axios instance with error handling
│   │   ├── dashboard.ts                  # Dashboard KPI fetchers
│   │   ├── logs.ts                       # Log query fetchers
│   │   ├── invocations.ts               # Invocation log fetchers
│   │   ├── configs.ts                    # Config/prompt CRUD fetchers
│   │   ├── agents.ts                     # Agent list/detail fetchers
│   │   ├── tasks.ts                      # Task list/detail fetchers
│   │   ├── shadow.ts                     # Shadow scoring fetchers
│   │   └── preferences.ts               # Preference rules/corrections fetchers
│   ├── components/
│   │   ├── Layout.tsx                    # App shell: sidebar + header + content
│   │   ├── PageShell.tsx                 # Placeholder for unbuilt pages
│   │   └── RefreshButton.tsx             # Manual refresh with "updated X ago"
│   ├── hooks/
│   │   └── useKeyboardShortcuts.ts       # Global keyboard nav (g+key, r refresh, Esc close)
│   ├── utils/
│   │   └── csvExport.ts                  # CSV export utility for tables
│   └── pages/
│       ├── Dashboard/
│       │   ├── index.tsx                 # 2x2 grid layout, time range selector
│       │   ├── ParseAccuracyCard.tsx     # Accuracy %, corrections breakdown
│       │   ├── AgentPerformanceCard.tsx  # Per-agent metrics, latency chart
│       │   ├── TaskThroughputCard.tsx    # Created vs completed, status pie
│       │   └── CostAnalyticsCard.tsx     # Spend charts, budget progress bar
│       ├── Logs/
│       │   ├── index.tsx                 # Two-panel layout, URL filter state
│       │   ├── EventTypeTree.tsx         # Checkbox tree sidebar filter
│       │   ├── LogTable.tsx              # Paginated, expandable, color-coded
│       │   └── TraceView.tsx             # Correlation trace drawer/timeline
│       ├── Configs/
│       │   ├── index.tsx                 # Layout with sidebar + structured/raw tabs
│       │   ├── ConfigFileList.tsx        # Sidebar file menu
│       │   ├── RawYamlEditor.tsx         # Monaco YAML editor
│       │   ├── StructuredEditor.tsx      # Routes to form by filename
│       │   ├── SaveDiffModal.tsx         # Monaco diff before save
│       │   └── forms/
│       │       ├── AgentsForm.tsx        # Agent cards with tools/autonomy
│       │       ├── ModelsForm.tsx        # Model defs, routing, cost, quality
│       │       ├── TaskTypesForm.tsx     # Collapsible task type editor
│       │       └── StatesForm.tsx        # SVG state diagram + transitions table
│       ├── Prompts/
│       │   ├── index.tsx                 # Split editor + preview layout
│       │   ├── PromptFileList.tsx        # Sidebar file menu
│       │   ├── VariableInspector.tsx     # Jinja2 variable extractor
│       │   └── MarkdownPreview.tsx       # Simple markdown renderer
│       ├── Agents/
│       │   ├── index.tsx                 # Card grid + detail toggle
│       │   ├── AgentCard.tsx             # Summary card with metrics
│       │   └── AgentDetail.tsx           # Config, charts, invocations
│       ├── Tasks/
│       │   ├── index.tsx                 # Filter bar + table
│       │   ├── TaskFilters.tsx           # Status/domain/priority/search
│       │   ├── TaskTable.tsx             # Paginated table with color tags
│       │   └── TaskDetail.tsx            # Full detail with state timeline
│       ├── Shadow/
│       │   ├── index.tsx                 # Filter controls, stats cards, tabbed layout
│       │   ├── ComparisonTable.tsx        # Side-by-side diff, expandable rows
│       │   ├── SpotCheckTable.tsx         # Flagged invocations with quality bars
│       │   └── ShadowCharts.tsx           # Scatter plot + trend line chart
│       └── Preferences/
│           ├── index.tsx                  # Stats cards, rules/corrections tabs
│           ├── RulesTable.tsx             # Rules with confidence bar, enable toggle
│           ├── CorrectionsTable.tsx        # Paginated correction history
│           └── RuleDetailDrawer.tsx        # Rule provenance with supporting corrections

Backend Files¶

src/donna/api/routes/
├── admin_dashboard.py      # 4 dashboard KPI endpoints
├── admin_logs.py           # Log query + trace endpoints
├── admin_invocations.py    # Invocation log browse/detail
├── admin_tasks.py          # Extended task list/detail
├── admin_config.py         # Config/prompt file CRUD (read + write)
├── admin_agents.py         # Agent list/detail with merged metrics
├── admin_shadow.py         # Shadow scoring comparisons, stats, spot-checks
└── admin_preferences.py    # Preference rules CRUD, corrections, stats

Reused Existing Code¶

Class/Module	File	Used For
`CostTracker`	`src/donna/cost/tracker.py`	Cost analytics dashboard queries
`Database` / `TaskRow`	`src/donna/tasks/database.py`	Task queries, field definitions
`InvocationMetadata`	`src/donna/logging/invocation_logger.py`	Schema reference for invocation queries
`load_*_config()`	`src/donna/config.py`	Reading YAML config files
Agent activity patterns	`src/donna/api/routes/agents.py`	SQL query patterns for invocation_log

Docker Deployment¶

Service	Port	Compose File
donna-ui	8400	`docker/donna-ui.yml`
donna-api	8200	`docker/donna-app.yml` (existing)
donna-loki	3100	`docker/donna-monitoring.yml` (existing)

Progress Tracking¶

Session 1 — Status: COMPLETE¶

Backend admin API routes (5 files)
Register routes in FastAPI app
Frontend project scaffold (React + Vite + Ant Design dark theme)
API client layer (Axios + typed fetchers)
Dashboard page (4 KPI cards: Parse Accuracy, Agent Performance, Task Throughput, Cost Analytics)
Log Viewer (event type tree filter, level/service filters, paginated table, correlation trace drawer)
Page shells (6 placeholders with feature descriptions for sessions 2-3)
Docker files (Dockerfile, nginx.conf, donna-ui.yml compose)

Session 2 — Status: COMPLETE¶

Config editor pages (structured YAML editing for agents/models/task_types/states + raw YAML fallback)
Prompt editor with Monaco markdown editor, live preview, template variable inspector
Agent detail views (card grid + detail with charts and invocation feed)
Task browser with filterable table, detail view, state timeline, linked entities
PUT endpoints for config/prompt editing (YAML validation, atomic writes)
Agent API endpoints (GET /admin/agents, GET /admin/agents/{name})
Save diff modal with Monaco diff editor
@monaco-editor/react for YAML/markdown editing
CSS/SVG state machine diagram for task_states.yaml

Session 3 — Status: COMPLETE¶

Shadow scoring comparison view (side-by-side diff, scatter plot, trend chart, spot-check queue)
Preference rules manager (rules table with toggle, correction history, rule provenance drawer, stats)
UX polish (keyboard shortcuts, saved filter presets, CSV export on all tables, empty states)
Anomaly notifications (daily cost, parse accuracy, overdue tasks)
Backend: admin_shadow.py (3 endpoints), admin_preferences.py (4 endpoints)