From 662383138ca84bb2820bbc16e0a221ecb78d7bf7 Mon Sep 17 00:00:00 2001
From: Zhaofeng Zhang <24791380+vcfgv@users.noreply.github.com>
Date: Fri, 26 Sep 2025 11:29:20 +0800
Subject: [PATCH 1/3] docs: update README to clarify development installation and enhance project structure details

---
 python/README.md | 61 ++++++++++++++++++++++++++++++++++++++----------
 1 file changed, 49 insertions(+), 12 deletions(-)

diff --git a/python/README.md b/python/README.md
index b18f1fe86..1afff933b 100644
--- a/python/README.md
+++ b/python/README.md
@@ -6,7 +6,7 @@ ValueCell is a community-driven, multi-agent platform for financial applications

### Development Installation

-Install the package in development mode with all dependencies:
+Install the package in development mode with all dependencies (including testing tools like pytest, pytest-cov, and diff-cover):

```bash
uv sync --group dev
@@ -23,17 +23,54 @@ uv sync
- `valuecell/` - Main package
  - `adapters/` - External system adapters
  - `agents/` - Agent implementations
-  - `api/` - FastAPI application
-  - `core/` - Core types and utilities
-  - `services/` - Business logic services
-  - `examples/` - Usage examples
-  - `tests/` - Test suite
-
-## Running Tests
-
-```bash
-pytest
-```
+  - `config/` - Configuration and settings
+  - `contrib/` - Community-contributed modules
+  - `core/` - Core types, orchestration, and utilities
+  - `server/` - API server components
+  - `utils/` - Shared utility helpers
+  - `tests/` - Test suite (module-level tests)
+
+Top-level folders:
+
+- `examples/` - End-to-end examples and notebooks
+- `configs/` - Agent cards, locales, etc.
+- `third_party/` - Third-party integrations (isolated)
+
+### valuecell/core structure
+
+Core contains the orchestration engine, types, and building blocks used by agents and the server.

- `valuecell/core/`
  - `agent/`
    - `card.py` - Agent capability/config card definitions
    - `client.py` - Client helpers for invoking agents
    - `connect.py` - Wiring utilities to connect agents and handlers
    - `decorator.py` - Decorators/executors for wrapping agent functions
    - `listener.py` - Event/listener primitives for agent events
    - `responses.py` - Response primitives and helpers
    - `tests/` - Unit tests for the agent module
  - `conversation/`
    - `conversation_store.py` - Conversation-level lifecycle and storage
    - `item_store.py` - Pluggable item storage backends (in-memory/SQLite)
    - `manager.py` - High-level conversation manager
    - `models.py` - Pydantic models for conversation data
    - `tests/` - Unit tests for conversation components
  - `coordinate/`
    - `models.py` - Types for coordination/planning
    - `orchestrator.py` - Orchestrates planning, tool calls, and streaming
    - `planner.py` - Planner implementation for step generation
    - `planner_prompts.py` - Prompt templates for planning
    - `response.py` - Unified response model for streaming and final results
    - `response_buffer.py` - Buffers and aggregates streaming responses
    - `response_router.py` - Routes responses to sinks/handlers
    - `tests/` - Unit and e2e tests for coordination
  - `task/`
    - `manager.py` - Manages task creation, lifecycle, and querying
    - `models.py` - Pydantic models for tasks
    - `tests/` - Unit tests for tasks
  - `types.py` - Shared core types and enums
  - `constants.py` - Core constants
  - `exceptions.py` - Core exception types

## Third Party Agents Integration

From d557192518f120d0b3c6d92cecd558eb82b8377f Mon Sep 17 00:00:00 2001
From: Zhaofeng Zhang <24791380+vcfgv@users.noreply.github.com>
Date: Fri, 26 Sep 2025 12:00:12 +0800
Subject: [PATCH 2/3] docs: add CORE_ARCHITECTURE.md to detail module collaboration and execution flow

---
 docs/CORE_ARCHITECTURE.md | 191 ++++++++++++++++++++++++++++++++++++++
 python/README.md          |  76 +++++++-----------
 2 files changed, 212 insertions(+), 55 deletions(-)
 create mode 100644 docs/CORE_ARCHITECTURE.md

diff --git a/docs/CORE_ARCHITECTURE.md b/docs/CORE_ARCHITECTURE.md
new file mode 100644
index 000000000..ecac79e66
--- /dev/null
+++ b/docs/CORE_ARCHITECTURE.md
@@ -0,0 +1,191 @@
+# ValueCell Core Architecture
+
+This document explains how the modules under `valuecell/core/` collaborate at runtime. Instead of listing files, it focuses on the end-to-end execution flow, key abstractions, and important design choices (async, reentrancy, and human-in-the-loop).
+
+## Highlights
+
+- Async, re-entrant orchestrator: `process_user_input` is a streaming async entrypoint that can pause for HITL and resume safely.
+- Planner with HITL: pauses on missing info/risky steps via `UserInputRequest` (asyncio.Event), resumes after user feedback to produce an adequate plan.
+- Streaming pipeline: `Response` → `ResponseBuffer` (buffered vs immediate) → `ResponseRouter` to UI and Store, with stable item IDs for partial aggregation.
+- Agent2Agent (A2A) integration: tasks call remote agents via `a2a-sdk`; status events drive routing; agents can be wrapped by lightweight decorators/servers.
+- Conversation memory: in-memory/SQLite stores enable reproducible history, fast "resume from last", and auditability.
+- Robustness: typed errors, side-effects (e.g., fail task) from router, and room for retry/backoff policies where appropriate.
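
Before the detailed flow, here is a minimal, self-contained sketch of the consumption pattern the first two highlights describe: a streaming async entrypoint that can pause on a `UserInputRequest` and resume once an answer arrives. The names mirror the concepts above, but the signatures are illustrative stand-ins, not the actual ValueCell API.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class UserInputRequest:
    """A pause point: the generator waits until `resolve` supplies an answer."""
    prompt: str
    _event: asyncio.Event = field(default_factory=asyncio.Event)
    answer: str | None = None

    def resolve(self, answer: str) -> None:
        self.answer = answer
        self._event.set()

    async def wait(self) -> str:
        await self._event.wait()
        return self.answer or ""


async def process_user_input(query: str):
    """Illustrative stand-in for the streaming orchestrator entrypoint."""
    yield {"event": "plan_started", "query": query}
    if "?" not in query:                      # pretend the planner needs details
        request = UserInputRequest(prompt="Which ticker do you mean?")
        yield request                         # checkpoint surfaces to the caller
        detail = await request.wait()         # orchestrator pauses here (HITL)
        yield {"event": "plan_resumed", "detail": detail}
    yield {"event": "task_result", "payload": "...streamed chunk..."}
    yield {"event": "done"}


async def main() -> None:
    async for item in process_user_input("analyze my portfolio"):
        if isinstance(item, UserInputRequest):
            item.resolve("AAPL")              # a UI would collect this from the user
        else:
            print(item)


asyncio.run(main())
```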
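
A second, smaller sketch illustrates the aggregation idea behind the streaming-pipeline highlight: chunks that share a stable item ID are merged into a growing snapshot. This is an illustrative stand-in rather than the real `ResponseBuffer`.

```python
from collections import defaultdict


class ResponseBufferSketch:
    """Toy aggregator: merge streamed chunks keyed by a stable item_id."""

    def __init__(self) -> None:
        self._parts: dict[str, list[str]] = defaultdict(list)

    def ingest(self, item_id: str, chunk: str) -> str:
        self._parts[item_id].append(chunk)
        return "".join(self._parts[item_id])   # current stable snapshot


buf = ResponseBufferSketch()
for chunk in ["The price ", "of AAPL ", "is 123.45"]:
    snapshot = buf.ingest("item-1", chunk)     # each snapshot can be routed to the UI
print(snapshot)
```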

## High-level flow

The orchestration loop ingests a user input, plans next steps, optionally requests human input to resolve ambiguity, and then executes tasks via remote agents (Agent2Agent, A2A). Responses stream back incrementally and are routed to the appropriate sinks (UI, logs, stores).

```mermaid
flowchart TD
    U[User Input] --> O["process_user_input (Orchestrator)"]
    O -->|analyze input + context| P[Planner]
    P -->|adequate plan| PL[Plan]
    P -->|needs clarification| HITL["HITL: clarification / approval"]
    HITL --> UI[UI / Operator]
    UI -->|feedback| P
    PL --> T[Tasks]
    T --> A2A[A2A calls]
    A2A --> RA[Remote Agents]
    RA --> SR[Streamed Responses]
    SR --> RB[ResponseBuffer]
    RB --> RR[ResponseRouter]
    RR --> UI
    RR --> ST[Store]
```

### Sequence: async and reentrancy

```mermaid
sequenceDiagram
    autonumber
    participant U as User/UI
    participant O as Orchestrator
    participant CS as ConversationStore/ItemStore
    participant P as Planner
    participant RB as ResponseBuffer
    participant RR as ResponseRouter
    participant ST as Store
    participant A2A as A2A Client
    participant RA as Remote Agent

    U->>O: user_input(query, meta)
    O->>CS: load conversation context
    CS-->>O: context/items
    O->>P: create_plan(user_input, callback)
    alt needs clarification
    P-->>O: UserInputRequest(prompt)
    O-->>U: PLAN_REQUIRE_USER_INPUT(prompt)
    U->>O: provide_user_input(response)
    O->>P: resume with response
    end
    P-->>O: ExecutionPlan(tasks)
    loop each task
    O->>A2A: execute(task)
    A2A->>RA: request(stream)
    RA-->>O: TaskStatusUpdateEvent (streaming)
    O->>RB: annotate/ingest(resp)
    RB-->>O: SaveItem(s)
    O->>RR: route(resp)
    RR-->>U: stream to UI
    RR-->>ST: persist SaveItem(s)
    end
    O-->>U: done
```

## Orchestrator: process_user_input

The orchestrator entrypoint (conceptually `process_user_input`) receives a user message (plus context IDs) and coordinates the entire lifecycle:

1. Load conversation context and prior items (via ConversationStore/ItemStore)
2. Normalize the query into a typed request model
3. Delegate to the Planner to derive an actionable plan
4. If the plan needs confirmation or extra parameters, trigger Human-in-the-Loop (HITL)
5. Execute the plan as one or more tasks
6. Stream partial responses while executing
7. Persist results and emit final responses

The orchestrator is async and re-entrant:

- All I/O boundaries (`await`) are explicit to support concurrency
- If a human confirmation is required, the orchestrator can pause, surface a checkpoint, and resume later when feedback arrives
- Reentrancy is supported by idempotent response buffering and conversation state: resuming continues from the last acknowledged step

### Streaming model

Responses are produced incrementally while tasks execute:

- `Response` represents typed chunks (tokens, tool results, notifications)
- `ResponseBuffer` accumulates and aggregates partials into stable snapshots
- `ResponseRouter` fans out to multiple sinks (UI streams, logs, stores)

This allows the UI to render partial progress while long-running steps (such as remote agent calls) are still in flight.

## Planner: intent → plan (with HITL)

The Planner turns a natural-language user input into an executable plan.
Its responsibilities include: + +- Interpreting the user’s goal and available agent capabilities +- Identifying missing parameters and ambiguities +- Producing a typed plan describing the steps and tool/agent calls + +Human-in-the-loop is integrated into planning: + +- When the planner detects insufficient information or risky actions, it emits a “clarification/approval” checkpoint +- The orchestrator surfaces that checkpoint via the router to the UI/user +- Once the user adds information or approves the step, the orchestrator resumes with an updated plan context + +Under the hood: + +- `planner.py` encapsulates the decision logic +- `planner_prompts.py` centralizes prompt templates (when LLM-based planning is used) +- `coordinate/models.py` defines plan/step data models used by both planner and orchestrator + +## Task execution + +After planning, the orchestrator executes each task. A task is an atomic unit that typically invokes a remote agent to perform work. + +Execution characteristics: + +- Tasks are awaited asynchronously; independent tasks may run concurrently when safe +- Each task emits structured responses (tool results, logs, progress) as it runs +- Failures are converted into typed errors and can trigger retries or compensating steps (policy-dependent) + +The conversation and item stores record inputs/outputs for reproducibility and auditing. + +## A2A integration: talking to remote agents + +Each task uses the Agent2Agent (A2A) protocol to interact with remote agents: + +- Request/response schemas are defined by the agent capability “cards” and message models +- The local runtime uses `a2a-sdk` to send/receive over the selected transport (HTTP or others) +- Streaming results are fed into `ResponseBuffer` and routed live to clients + +This protocol boundary makes agents location-transparent: they can run locally, remotely, or be swapped without changing the orchestrator. + +## Agent implementation: decorators and wiring + +Remote agents can be embedded with a very small footprint using the core agent decorator and wiring utilities: + +- `agent/decorator.py` wraps a plain async function into a fully-typed agent handler +- `agent/connect.py` wires the decorated function into the runtime (registration, routing) +- `agent/card.py` describes capabilities, inputs, and outputs so the planner can select it + +The planner can select this capability when it fits the user’s goal, and the orchestrator will route a task through A2A to execute it. + +## Conversation and memory + +`conversation_store.py` and `item_store.py` abstract conversation history and per-item storage: + +- In-memory and SQLite backends are available +- Filtering and pagination support efficient context retrieval +- Latest items can be fetched for fast “resume from last” behaviors + +This memory layer underpins reentrancy and auditability. 
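
As a rough illustration of what this memory layer provides, the following sketch implements a toy SQLite-backed item store with a latest-items query for resume-from-last behavior. The schema and method names are assumptions for this example; the real `ItemStore`/`ConversationStore` interfaces in `valuecell/core/conversation/` differ in detail.

```python
import json
import sqlite3
from typing import Any


class SQLiteItemStore:
    """Toy stand-in for a pluggable item store (cf. item_store.py)."""

    def __init__(self, path: str = ":memory:") -> None:
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS items ("
            " id INTEGER PRIMARY KEY AUTOINCREMENT,"
            " conversation_id TEXT NOT NULL,"
            " role TEXT NOT NULL,"
            " payload TEXT NOT NULL)"
        )

    def append(self, conversation_id: str, role: str, payload: dict[str, Any]) -> None:
        self.db.execute(
            "INSERT INTO items (conversation_id, role, payload) VALUES (?, ?, ?)",
            (conversation_id, role, json.dumps(payload)),
        )
        self.db.commit()

    def latest(self, conversation_id: str, limit: int = 5) -> list[dict[str, Any]]:
        rows = self.db.execute(
            "SELECT role, payload FROM items WHERE conversation_id = ?"
            " ORDER BY id DESC LIMIT ?",
            (conversation_id, limit),
        ).fetchall()
        # Newest-first from SQL; reverse so callers replay in chronological order.
        return [{"role": r, **json.loads(p)} for r, p in reversed(rows)]


store = SQLiteItemStore()
store.append("conv-1", "user", {"text": "analyze AAPL"})
store.append("conv-1", "agent", {"text": "AAPL is up 2% today"})
print(store.latest("conv-1"))   # context a resumed session would reload
```

In this sketch, a resuming orchestrator would call `latest(...)` before producing the next plan; in ValueCell a higher-level conversation manager sits on top of such stores.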
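
Similarly, the decorator-and-card wiring described in the agent implementation section above can be pictured with a hypothetical minimal registry. The `AgentCard` fields, `agent` decorator, and `REGISTRY` below are stand-ins, not the actual APIs in `card.py`, `decorator.py`, and `connect.py`.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Awaitable, Callable


@dataclass
class AgentCard:
    """Describes a capability so a planner could select it (stand-in for card.py)."""
    name: str
    description: str
    inputs: dict[str, str] = field(default_factory=dict)


REGISTRY: dict[str, tuple[AgentCard, Callable[..., Awaitable[str]]]] = {}


def agent(card: AgentCard):
    """Toy decorator that registers an async handler under its card (cf. decorator.py)."""
    def wrap(handler: Callable[..., Awaitable[str]]):
        REGISTRY[card.name] = (card, handler)
        return handler
    return wrap


@agent(AgentCard(
    name="price_lookup",
    description="Return the latest price for a ticker symbol",
    inputs={"ticker": "string"},
))
async def price_lookup(ticker: str) -> str:
    return f"{ticker}: 123.45 (stub quote)"   # a real agent would call a data source


async def main() -> None:
    card, handler = REGISTRY["price_lookup"]
    print(card.description)                   # what a planner would match against
    print(await handler(ticker="AAPL"))


asyncio.run(main())
```

The point of the card is that the planner can match it against the user's goal, as described above; the decorated handler is what a routed task ultimately invokes.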
+ +## Async & reentrancy details + +- All external calls (planning, remote agents, storage) are awaited +- `ResponseBuffer` enables idempotent aggregation of partial output so a resumed session can safely replay or continue +- Orchestrator checkpoints (HITL) are modeled as explicit yield points; upon resumption, the same context IDs lead the flow to continue from the next step +- Backpressure: routers can apply flow control when sinks are slow + +## Error handling & resilience + +Typical edge cases and policies: + +- Missing parameters → HITL clarification +- Planner errors → structured failure with user-facing guidance +- Agent timeouts → retry/backoff policies; partial results remain in the buffer +- Transport errors → surfaced via typed exceptions; orchestration may retry or abort +- Consistency → conversation records ensure inputs/outputs are durable + +## Extensibility + +- Add a new agent: create a capability card, implement a decorated async handler, register/connect it +- Add a new store: implement the `ItemStore`/`ConversationStore` interfaces +- Add a new transport: integrate a compatible adapter and update the A2A client wiring +- Customize planning: extend planner prompts/logic and enrich plan models + +--- + +In short, the orchestrator coordinates an async, re-entrant loop of plan → execute → stream, with human checkpoints where appropriate. Tasks talk A2A to remote agents, and the response pipeline keeps users informed in real time while maintaining durable, reproducible state. diff --git a/python/README.md b/python/README.md index 1afff933b..f8852bac1 100644 --- a/python/README.md +++ b/python/README.md @@ -1,6 +1,26 @@ # ValueCell Python Package -ValueCell is a community-driven, multi-agent platform for financial applications. +> A community-driven, multi-agent platform for financial applications — typed, async-first, and built for orchestration. + +## Highlights + +- Async, re-entrant orchestrator: streaming `process_user_input` can pause for human-in-the-loop (HITL) and resume safely. +- Planner with HITL: pauses on missing info/risky steps via async `UserInputRequest`, resumes after user feedback to produce an adequate plan. +- Streaming pipeline: `Response` → `ResponseBuffer` (buffered vs immediate with stable item_id) → `ResponseRouter` to UI and Store. +- Agent2Agent (A2A) integration: first-class support via a2a-sdk for agent-to-agent protocols, message schemas, and optional HTTP server interop. +- Conversation memory: in-memory/SQLite stores enable reproducible history, fast "resume from last", and auditability. +- Robustness & extensibility: typed events/errors, router side-effects (e.g., fail task), and clear seams to add agents, stores, transports, and planner logic. + +See detailed flow diagrams and design notes in `../docs/CORE_ARCHITECTURE.md`. 

## Quickstart

Set up the environment and verify your install:

```bash
uv sync --group dev
uv run python -c "import valuecell as vc; print(vc.__version__)"
```

## Installation

@@ -18,60 +38,6 @@ uv sync --group dev
uv sync
```

-## Project Structure
-
-- `valuecell/` - Main package
-  - `adapters/` - External system adapters
-  - `agents/` - Agent implementations
-  - `config/` - Configuration and settings
-  - `contrib/` - Community-contributed modules
-  - `core/` - Core types, orchestration, and utilities
-  - `server/` - API server components
-  - `utils/` - Shared utility helpers
-  - `tests/` - Test suite (module-level tests)
-
-Top-level folders:
-
-- `examples/` - End-to-end examples and notebooks
-- `configs/` - Agent cards, locales, etc.
-- `third_party/` - Third-party integrations (isolated)
-
-### valuecell/core structure
-
-Core contains the orchestration engine, types, and building blocks used by agents and the server.
-
-- `valuecell/core/`
-  - `agent/`
-    - `card.py` - Agent capability/config card definitions
-    - `client.py` - Client helpers for invoking agents
-    - `connect.py` - Wiring utilities to connect agents and handlers
-    - `decorator.py` - Decorators/executors for wrapping agent functions
-    - `listener.py` - Event/listener primitives for agent events
-    - `responses.py` - Response primitives and helpers
-    - `tests/` - Unit tests for the agent module
-  - `conversation/`
-    - `conversation_store.py` - Conversation-level lifecycle and storage
-    - `item_store.py` - Pluggable item storage backends (in-memory/SQLite)
-    - `manager.py` - High-level conversation manager
-    - `models.py` - Pydantic models for conversation data
-    - `tests/` - Unit tests for conversation components
-  - `coordinate/`
-    - `models.py` - Types for coordination/planning
-    - `orchestrator.py` - Orchestrates planning, tool calls, and streaming
-    - `planner.py` - Planner implementation for step generation
-    - `planner_prompts.py` - Prompt templates for planning
-    - `response.py` - Unified response model for streaming and final results
-    - `response_buffer.py` - Buffers and aggregates streaming responses
-    - `response_router.py` - Routes responses to sinks/handlers
-    - `tests/` - Unit and e2e tests for coordination
-  - `task/`
-    - `manager.py` - Manages task creation, lifecycle, and querying
-    - `models.py` - Pydantic models for tasks
-    - `tests/` - Unit tests for tasks
-  - `types.py` - Shared core types and enums
-  - `constants.py` - Core constants
-  - `exceptions.py` - Core exception types
-

## Third Party Agents Integration

⚠️ **Caution**: Isolate third‑party libraries in separate virtual environments (uv, venv, virtualenv, or conda) to prevent dependency conflicts between components.

From b9b4ab3660fac01b5b8032d105f09db914cd42cf Mon Sep 17 00:00:00 2001
From: Zhaofeng Zhang <24791380+vcfgv@users.noreply.github.com>
Date: Fri, 26 Sep 2025 13:32:49 +0800
Subject: [PATCH 3/3] docs: refine CORE_ARCHITECTURE.md to clarify execution flow and remove redundant details

---
 docs/CORE_ARCHITECTURE.md | 14 ++++++--------
 1 file changed, 6 insertions(+), 8 deletions(-)

diff --git a/docs/CORE_ARCHITECTURE.md b/docs/CORE_ARCHITECTURE.md
index ecac79e66..0b4c8dd51 100644
--- a/docs/CORE_ARCHITECTURE.md
+++ b/docs/CORE_ARCHITECTURE.md
@@ -1,6 +1,6 @@
# ValueCell Core Architecture

-This document explains how the modules under `valuecell/core/` collaborate at runtime. Instead of listing files, it focuses on the end-to-end execution flow, key abstractions, and important design choices (async, reentrancy, and human-in-the-loop).
+This document explains how the modules under `valuecell/core/` collaborate at runtime.

## Highlights

@@ -76,13 +76,11 @@ sequenceDiagram

The orchestrator entrypoint (conceptually `process_user_input`) receives a user message (plus context IDs) and coordinates the entire lifecycle:

-1. Load conversation context and prior items (via ConversationStore/ItemStore)
-2. Normalize the query into a typed request model
-3. Delegate to the Planner to derive an actionable plan
-4. If the plan needs confirmation or extra parameters, trigger Human-in-the-Loop (HITL)
-5. Execute the plan as one or more tasks
-6. Stream partial responses while executing
-7. Persist results and emit final responses
+1. Delegate to the Planner to derive an actionable plan
+2. If the plan needs confirmation or extra parameters, trigger Human-in-the-Loop (HITL)
+3. Execute the plan as one or more tasks
+4. Stream partial responses while executing
+5. Persist results and emit final responses

The orchestrator is async and re-entrant: