Agency Implementation Roadmap¶

This roadmap translates the conceptual agency design into an incremental implementation plan.

Each phase should yield a testable, usable system.
Phases are cumulative: later work extends existing modules instead of replacing them.
Items are written as checkable bullets ([ ]) so progress can be tracked.
Implementation should follow the integration contracts defined in agency-integration.md for Conversation, Memory/AMS, Emotion, Scheduler, and Embodiment.

Phase 0 – Foundations & Enablement¶

Goal: Ensure the existing platform can host an always-on agency loop with clear extension points.

Conversation & Config Wiring
Expose enable_agency feature flag and configuration options in core.conversation and related configs.
Define a minimal AgencyEngine/service interface and register it via LifecycleManager / ai_registry.
Integrate basic agency context hooks into ConversationEngine (e.g. pass active goals, agency state into prompts).
Implement backend.services.agency_engine.AgencyPlugin.process to call the shared agency orchestrator and return structured suggestions/goals.
Wire AgencyPlugin into conversation flows where proactive/autonomous behaviour is allowed by policy.
Persistence & Telemetry Prereqs
Define core tables/collections (if needed) for goals, plans, agency logs, self-reflection notes.
Ensure logging and telemetry are rich enough to support evaluation and self-reflection (IDs, timestamps, outcomes).

Phase 1 – Goal System & Planning Skeleton (First Testable Agent)¶

Goal: Move from stateless chatbot to a goal- and plan-aware companion with persistent intentions.

Exit condition: AICO keeps track of goals across sessions, can form simple multi-step plans, and can proactively act on them in a controlled way.

Phase 2 – Memory, World Model & Relationship Integration¶

Goal: Ground goals and plans in rich memory and world understanding, not just recent turns.

Exit condition: Goals and plans are meaningfully influenced by long-term memory, social context, and world structure; AICO feels more consistent and “aware” over time.

Phase 3 – Curiosity, Intrinsic Motivation & Hobbies¶

Goal: Give AICO her own intrinsic drives and hobbies that generate agent-self goals.

Exit condition: AICO regularly pursues self-generated curiosity and hobby goals, visibly distinct from direct user requests, within user-configurable bounds.

Phase 4 – Goal Arbiter, Values/Ethics & Meta-Control¶

Goal: Introduce a clear decision layer that balances user goals, curiosity, hobbies, and maintenance under constraints.

Exit condition: AICO's behaviour is governed by an explicit meta-control layer, and users can understand and influence why some goals are pursued and others are not.

Phase 5 – Self-Reflection, Self-Model & Behavioural Learning¶

Goal: Enable AICO to evaluate her own behaviour and adapt policies and skills over time.

Exit condition: AICO periodically updates how she behaves based on her own experience, in a traceable way, without changing the overall architecture.

Phase 6 – Advanced Policies & World Model Sophistication¶

Goal: Upgrade internal decision-making and world modelling while keeping interfaces stable.

Exit condition: Internals of curiosity, world modelling, and goal selection are more principled and data-driven, while the external behaviour remains backward compatible and explainable.

Phase 7 – Embodiment as Cognitive Substrate & Polishing¶

Goal: Use the 3D flat and embodiment not just for presentation, but as a cognitive scaffold and final polish.

Embodied Cognition Patterns
Define internal tasks and routines that are always represented through spatial metaphors (desk work, reading on couch, organizing room).
Use environment layout and artefacts as memory cues and anchors for long-term projects and hobbies.
Integration with Real-World Context (optional)
Optionally connect agency state and embodiment with real devices/context (e.g., phone, calendar, home automation) under strict user control.
Refinement & Evaluation
Define evaluation metrics and test scenarios for agency quality (usefulness, coherence, autonomy, user comfort).
Iterate on prompts, policies, and UX based on real usage data.

Exit condition: AICO behaves as a coherent, self-motivated, relationship-centric companion whose inner life (goals, curiosities, hobbies, reflections) is legible through conversation and embodiment, with the full conceptual architecture implemented in practice.