📦 EqualifyEverything / equalify-reflow-docs

📄 feedback-system.md · 163 lines
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163# Human Feedback System

The feedback system adds iterative human review to the pipeline viewer. It follows a **GitHub PR review** pattern: submit feedback, review proposed changes, iterate or approve.

## Architecture

### Session Lifecycle

```
Pipeline SSE stream completes
    │
    ├─ SessionStore.create(result, structure, section_map)
    │   └─ Returns session_id in SSE "done" event
    │
    ▼
POST /feedback (submit edits + comments)
    │
    ├─ EDIT items: selector + new_text → direct str_replace, no LLM
    │
    ├─ COMMENT items: description → decompose_feedback() → run_revision()
    │   └─ LLM operates on a DEEP COPY (no side effects)
    │
    └─ Returns CandidateChange[] for review
    │
    ▼
POST /review (accept/reject per change)
    │
    ├─ Accepted → str_replace on real result → new version (v4, v5, ...)
    ├─ Rejected + comment → saved for next feedback round
    │
    └─ action: "request_changes" → loop back to /feedback
         action: "approve" → finalize session
    │
    ▼
POST /approve (finalize, no more rounds)
```

### Key Components

| Component | Location | Purpose |
|-----------|----------|---------|
| `session_store.py` | `src/services/` | In-memory session store with 1hr TTL |
| `pipeline_feedback.py` | `src/api/` | REST endpoints for feedback workflow |
| `pipeline_viewer.py` | `src/services/` | Service methods (resolve_selector, apply_direct_edits, etc.) |
| `pipeline_viewer_models.py` | `src/services/` | Data models (FeedbackItem, CandidateChange, etc.) |
| `revision.py` | `src/agents/prompts/` | Prompt module for decomposition + revision agents |

### Data Models

**FeedbackItem** — A single piece of human feedback:
- `type`: `edit` (direct replacement) or `comment` (LLM handles)
- `selector`: TextSelector with `exact`, optional `prefix`/`suffix`
- `new_text`: Replacement text (edits only)
- `description`: What's wrong / rationale

**CandidateChange** — A proposed change awaiting review:
- `change`: DocumentChange (page, old_text, new_text, reasoning)
- `source_type`: `direct_edit` or `ai_revision`
- `source_feedback_id`: Links back to originating FeedbackItem

**ChangeReview** — Human decision on a candidate:
- `decision`: `accept` or `reject`
- `comment`: Optional (rejection comments feed next round)

### Session Store

In-memory singleton at `src/services/session_store.py`. Each `PipelineSession` holds:
- `result`: PipelineViewerResult (mutated as changes are applied)
- `structure`: StructureResult (for outline context)
- `section_map`: SectionMap (for section context)
- `feedback_history`: All prior feedback rounds
- `candidate_changes`: Current pending candidates
- `revision_round`: Counter for feedback iterations
- `finalized`: Lock flag

Sessions expire after 1 hour of inactivity. Appropriate for the dev tool; production would use Redis.

## API Endpoints

Base: `/api/v1/pipeline/sessions`

### POST `/{session_id}/feedback`

Submit a batch of feedback items. Returns candidate changes.

```json
// Request
{
  "items": [
    {
      "id": "uuid1",
      "type": "edit",
      "selector": {"exact": "teh", "prefix": "fix ", "suffix": " typo"},
      "new_text": "the",
      "description": "Fix typo"
    },
    {
      "id": "uuid2",
      "type": "comment",
      "page": 3,
      "description": "The table headers are wrong",
      "feedback_type": "content"
    }
  ]
}

// Response
{
  "candidates": [...],
  "revision_round": 1
}
```

### POST `/{session_id}/review`

Accept/reject individual candidate changes.

```json
// Request
{
  "reviews": [
    {"change_id": "c1", "decision": "accept"},
    {"change_id": "c2", "decision": "reject", "comment": "Too aggressive"}
  ],
  "action": "request_changes"  // or "approve"
}

// Response
{
  "applied_count": 1,
  "rejected_count": 1,
  "new_version": "v4",
  "rejection_comments": ["Too aggressive"],
  "finalized": false
}
```

### POST `/{session_id}/approve`

Finalize session. No more feedback rounds.

### GET `/{session_id}/state`

Returns current versions, latest markdown, pending candidate count, feedback round count.

## TextSelector Resolution

Three-tier strategy to find text in the document:

1. **Prefix + exact + suffix** — Most precise, disambiguates repeated text
2. **Exact match alone** — Falls back when prefix/suffix don't match
3. **Fuzzy fallback** — Uses `_fuzzy_find_line()` for approximate single-line matches

## Design Decisions

**LLM on deep copy**: Comment feedback runs decomposition + revision agents on a `deepcopy()` of the result. The actual result is only mutated when changes are explicitly accepted. This eliminates rollback logic.

**Two feedback types**: EDITs are deterministic (zero LLM cost). COMMENTs reuse the existing decomposition + revision pipeline that was already built.

**Backward compatible**: The SSE `done` event gains a `session_id` field. Existing clients that don't consume it are unaffected.

**Batched reviews**: All feedback items in a round are submitted together, producing a combined set of candidates. This enables the LLM to consider all feedback holistically.