Spaces:

DataQuests
/

DeepCritical

Running

App Files Files Community

DeepCritical / docs /architecture /graph-orchestration.md

Joseph Pollack

demo launches

53c4c46 unverified 12 days ago

preview code

raw

history blame

4.51 kB

Graph Orchestration Architecture

Overview

Phase 4 implements a graph-based orchestration system for research workflows using Pydantic AI agents as nodes. This enables better parallel execution, conditional routing, and state management compared to simple agent chains.

Graph Structure

Nodes

Graph nodes represent different stages in the research workflow:

Agent Nodes: Execute Pydantic AI agents
- Input: Prompt/query
- Output: Structured or unstructured response
- Examples: KnowledgeGapAgent, ToolSelectorAgent, ThinkingAgent
State Nodes: Update or read workflow state
- Input: Current state
- Output: Updated state
- Examples: Update evidence, update conversation history
Decision Nodes: Make routing decisions based on conditions
- Input: Current state/results
- Output: Next node ID
- Examples: Continue research vs. complete research
Parallel Nodes: Execute multiple nodes concurrently
- Input: List of node IDs
- Output: Aggregated results
- Examples: Parallel iterative research loops

Edges

Edges define transitions between nodes:

Sequential Edges: Always traversed (no condition)
- From: Source node
- To: Target node
- Condition: None (always True)
Conditional Edges: Traversed based on condition
- From: Source node
- To: Target node
- Condition: Callable that returns bool
- Example: If research complete → go to writer, else → continue loop
Parallel Edges: Used for parallel execution branches
- From: Parallel node
- To: Multiple target nodes
- Execution: All targets run concurrently

Graph Patterns

Iterative Research Graph

[Input] → [Thinking] → [Knowledge Gap] → [Decision: Complete?]
                                              ↓ No          ↓ Yes
                                    [Tool Selector]    [Writer]
                                              ↓
                                    [Execute Tools] → [Loop Back]

Deep Research Graph

[Input] → [Planner] → [Parallel Iterative Loops] → [Synthesizer]
                           ↓         ↓         ↓
                        [Loop1]  [Loop2]  [Loop3]

State Management

State is managed via WorkflowState using ContextVar for thread-safe isolation:

Evidence: Collected evidence from searches
Conversation: Iteration history (gaps, tool calls, findings, thoughts)
Embedding Service: For semantic search

State transitions occur at state nodes, which update the global workflow state.

Execution Flow

Graph Construction: Build graph from nodes and edges
Graph Validation: Ensure graph is valid (no cycles, all nodes reachable)
Graph Execution: Traverse graph from entry node
Node Execution: Execute each node based on type
Edge Evaluation: Determine next node(s) based on edges
Parallel Execution: Use asyncio.gather() for parallel nodes
State Updates: Update state at state nodes
Event Streaming: Yield events during execution for UI

Conditional Routing

Decision nodes evaluate conditions and return next node IDs:

Knowledge Gap Decision: If research_complete → writer, else → tool selector
Budget Decision: If budget exceeded → exit, else → continue
Iteration Decision: If max iterations → exit, else → continue

Parallel Execution

Parallel nodes execute multiple nodes concurrently:

Each parallel branch runs independently
Results are aggregated after all branches complete
State is synchronized after parallel execution
Errors in one branch don't stop other branches

Budget Enforcement

Budget constraints are enforced at decision nodes:

Token Budget: Track LLM token usage
Time Budget: Track elapsed time
Iteration Budget: Track iteration count

If any budget is exceeded, execution routes to exit node.

Error Handling

Errors are handled at multiple levels:

Node Level: Catch errors in individual node execution
Graph Level: Handle errors during graph traversal
State Level: Rollback state changes on error

Errors are logged and yield error events for UI.

Backward Compatibility

Graph execution is optional via feature flag:

USE_GRAPH_EXECUTION=true: Use graph-based execution
USE_GRAPH_EXECUTION=false: Use agent chain execution (existing)

This allows gradual migration and fallback if needed.