agent-runtime/testcases/domain_context.md
Nico 2d649fa448 v0.15.3: Domain context, iterative plan-execute, FK mappings, ES6 node inspector
Eras Expert domain context:
- Full Heizkostenabrechnung business model (Kunde>Objekte>Nutzeinheiten>Geraete)
- Known PK/FK mappings: kunden.Kundennummer, objekte.KundenID, etc.
- Correct JOIN example in SCHEMA prompt
- PA knows domain hierarchy for better job formulation

Iterative plan-execute in ExpertNode:
- DESCRIBE queries execute first, results injected into re-plan
- Re-plan uses actual column names from DESCRIBE
- Eliminates "Unknown column" errors on first query

Frontend:
- Node inspector: per-node cards with model, tokens, progress, last event
- Graph switcher buttons in top bar
- Clear button in top bar
- Nodes panel 300px wide
- WS reconnect on 1006 (deploy) without showing login
- Model info emitted on context HUD events

Domain context test: 21/21 (hierarchy, JOINs, FK, PA job quality)
Default graph: v4-eras

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-29 18:34:42 +02:00

1.4 KiB

Domain Context

Tests that the expert understands the Eras business domain: Heizkostenabrechnung, Kunde→Objekt→Nutzeinheit→Geraet hierarchy, and can formulate correct JOINs without guessing column names.

Setup

  • clear history

Steps

1. Expert knows the hierarchy

  • send: wie viele Objekte haben Kunden im Durchschnitt?
  • expect_trace: has tool_call
  • expect_response: not contains "Error" or "error" or "Unknown column"
  • expect_response: length > 20

2. Expert can JOIN kunden and objekte

  • send: zeig mir die Top 5 Kunden mit den meisten Objekten
  • expect_trace: has tool_call
  • expect_response: not contains "Error" or "error" or "Unknown column"
  • expect_response: length > 20

3. Expert understands Nutzeinheiten belong to Objekte

  • send: how many Nutzeinheiten does the system have total?
  • expect_trace: has tool_call
  • expect_response: not contains "Error" or "error" or "Unknown column"
  • expect_response: length > 10

4. Expert understands Geraete belong to Nutzeinheiten

  • send: which Objekt has the most Geraete?
  • expect_trace: has tool_call
  • expect_response: not contains "Error" or "error" or "Unknown column"
  • expect_response: length > 20

5. PA formulates good job descriptions

  • send: gib mir eine Uebersicht ueber Kunde 2
  • expect_trace: has routed
  • expect_response: length > 20
  • expect_response: not contains "clarify" or "specify" or "what kind"