Nico a2bc6347fc v0.13.0: Graph engine, versioned nodes, S3* audit, DB tools, Cytoscape

Architecture:
- Graph engine (engine.py) loads graph definitions, instantiates nodes
- Versioned nodes: input_v1, thinker_v1, output_v1, memorizer_v1, director_v1
- NODE_REGISTRY for dynamic node lookup by name
- Graph API: /api/graph/active, /api/graph/list, /api/graph/switch
- Graph definition: graphs/v1_current.py (7 nodes, 13 edges, 3 edge types)

S3* Audit system:
- Workspace mismatch detection (server vs browser controls)
- Code-without-tools retry (Thinker wrote code but no tool calls)
- Intent-without-action retry (request intent but Thinker only produced text)
- Dashboard feedback: browser sends workspace state on every message
- Sensor continuous comparison on 5s tick

State machines:
- create_machine / add_state / reset_machine / destroy_machine via function calling
- Local transitions (go:) resolve without LLM round-trip
- Button persistence across turns

Database tools:
- query_db tool via pymysql to MariaDB K3s pod (eras2_production)
- Table rendering in workspace (tab-separated parsing)
- Director pre-planning with Opus for complex data requests
- Error retry with corrected SQL

Frontend:
- Cytoscape.js pipeline graph with real-time node animations
- Overlay scrollbars (CSS-only, no reflow)
- Tool call/result trace events
- S3* audit events in trace

Testing:
- 167 integration tests (11 test suites)
- 22 node-level unit tests (test_nodes/)
- Three test levels: node unit, graph integration, scenario

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-29 00:18:45 +01:00

1.2 KiB

Raw Blame History

Pub Conversation

Tests multi-turn conversation with context tracking, language switching, and memorizer state updates across a social scenario.

Setup

clear history

Steps

1. Set the scene

send: Hey, Alice and I are heading to the pub tonight
expect_response: length > 10
expect_state: situation contains "pub" or "Alice"

2. Language switch to German

send: Wir sind jetzt im Biergarten angekommen
expect_response: length > 10
expect_state: language is "de" or "mixed"

3. Context awareness

send: Was sollen wir bestellen?
expect_response: length > 10
expect_state: topic contains "bestell" or "order" or "pub" or "Biergarten"

4. Alice speaks

send: Alice says: I'll have a Hefeweizen please
expect_response: length > 10
expect_state: facts any contains "Alice" or "Hefeweizen"

5. Ask for time (tool use)

send: wie spaet ist es eigentlich?
expect_response: matches \d{1,2}:\d{2}

6. Back to English

send: Let's switch to English, what was the last thing Alice said?
expect_state: language is "en" or "mixed"
expect_response: contains "Alice" or "Hefeweizen"

7. Mood check

send: This is really fun!
expect_state: user_mood is "happy" or "playful" or "excited"

1.2 KiB Raw Blame History

Pub Conversation

Setup

Steps

1. Set the scene

2. Language switch to German

3. Context awareness

4. Alice speaks

5. Ask for time (tool use)

6. Back to English

7. Mood check

1.2 KiB

Raw Blame History