Spaces:
Running
Running
Commit
·
953b850
1
Parent(s):
481bdd7
fix: markdown lint warnings in roadmap.md
Browse files
docs/implementation/roadmap.md
CHANGED
|
@@ -75,7 +75,9 @@ tests/
|
|
| 75 |
## Phased Execution Plan
|
| 76 |
|
| 77 |
### **Phase 1: Foundation & Tooling (Day 1)**
|
|
|
|
| 78 |
*Goal: A rock-solid, CI-ready environment with `uv` and `pytest` configured.*
|
|
|
|
| 79 |
- [ ] Initialize `pyproject.toml` with `uv`.
|
| 80 |
- [ ] Configure `ruff` (strict) and `mypy` (strict).
|
| 81 |
- [ ] Set up `pytest` with sugar and coverage.
|
|
@@ -84,7 +86,9 @@ tests/
|
|
| 84 |
- **Deliverable**: A repo that passes CI with `uv run pytest`.
|
| 85 |
|
| 86 |
### **Phase 2: The "Search" Vertical Slice (Day 2)**
|
|
|
|
| 87 |
*Goal: Agent can receive a query and get raw results from PubMed/Web.*
|
|
|
|
| 88 |
- [ ] **TDD**: Write test for `SearchHandler`.
|
| 89 |
- [ ] Implement `src/tools/pubmed.py` (PubMed E-utilities).
|
| 90 |
- [ ] Implement `src/tools/websearch.py` (DuckDuckGo).
|
|
@@ -93,14 +97,18 @@ tests/
|
|
| 93 |
- **Deliverable**: Function that takes "long covid" -> returns `List[Evidence]`.
|
| 94 |
|
| 95 |
### **Phase 3: The "Judge" Vertical Slice (Day 3)**
|
|
|
|
| 96 |
*Goal: Agent can decide if evidence is sufficient.*
|
|
|
|
| 97 |
- [ ] **TDD**: Write test for `JudgeHandler` (Mocked LLM).
|
| 98 |
- [ ] Implement `src/prompts/judge.py` (Structured outputs).
|
| 99 |
- [ ] Implement `src/agent_factory/judges.py` (LLM interaction).
|
| 100 |
- **Deliverable**: Function that takes `List[Evidence]` -> returns `JudgeAssessment`.
|
| 101 |
|
| 102 |
### **Phase 4: The "Loop" & UI Slice (Day 4)**
|
|
|
|
| 103 |
*Goal: End-to-End User Value.*
|
|
|
|
| 104 |
- [ ] Implement `src/orchestrator.py` (Connects Search + Judge loops).
|
| 105 |
- [ ] Build `src/app.py` (Gradio with Streaming).
|
| 106 |
- **Deliverable**: Working DeepCritical Agent on HuggingFace.
|
|
|
|
| 75 |
## Phased Execution Plan
|
| 76 |
|
| 77 |
### **Phase 1: Foundation & Tooling (Day 1)**
|
| 78 |
+
|
| 79 |
*Goal: A rock-solid, CI-ready environment with `uv` and `pytest` configured.*
|
| 80 |
+
|
| 81 |
- [ ] Initialize `pyproject.toml` with `uv`.
|
| 82 |
- [ ] Configure `ruff` (strict) and `mypy` (strict).
|
| 83 |
- [ ] Set up `pytest` with sugar and coverage.
|
|
|
|
| 86 |
- **Deliverable**: A repo that passes CI with `uv run pytest`.
|
| 87 |
|
| 88 |
### **Phase 2: The "Search" Vertical Slice (Day 2)**
|
| 89 |
+
|
| 90 |
*Goal: Agent can receive a query and get raw results from PubMed/Web.*
|
| 91 |
+
|
| 92 |
- [ ] **TDD**: Write test for `SearchHandler`.
|
| 93 |
- [ ] Implement `src/tools/pubmed.py` (PubMed E-utilities).
|
| 94 |
- [ ] Implement `src/tools/websearch.py` (DuckDuckGo).
|
|
|
|
| 97 |
- **Deliverable**: Function that takes "long covid" -> returns `List[Evidence]`.
|
| 98 |
|
| 99 |
### **Phase 3: The "Judge" Vertical Slice (Day 3)**
|
| 100 |
+
|
| 101 |
*Goal: Agent can decide if evidence is sufficient.*
|
| 102 |
+
|
| 103 |
- [ ] **TDD**: Write test for `JudgeHandler` (Mocked LLM).
|
| 104 |
- [ ] Implement `src/prompts/judge.py` (Structured outputs).
|
| 105 |
- [ ] Implement `src/agent_factory/judges.py` (LLM interaction).
|
| 106 |
- **Deliverable**: Function that takes `List[Evidence]` -> returns `JudgeAssessment`.
|
| 107 |
|
| 108 |
### **Phase 4: The "Loop" & UI Slice (Day 4)**
|
| 109 |
+
|
| 110 |
*Goal: End-to-End User Value.*
|
| 111 |
+
|
| 112 |
- [ ] Implement `src/orchestrator.py` (Connects Search + Judge loops).
|
| 113 |
- [ ] Build `src/app.py` (Gradio with Streaming).
|
| 114 |
- **Deliverable**: Working DeepCritical Agent on HuggingFace.
|