---
title: "Tutorial: Process your first PDF with the web app"
date: 2026-04-21
author: Equalify Tech Team
description: Open the Equalify Reflow web app in your browser, upload a PDF, approve the PII review, and download an accessible markdown version of the document end-to-end.
---
# Tutorial: Process your first PDF with the web app
By the end of this walkthrough you will have opened the Equalify Reflow web app at `https://reflow.equalify.uic.edu/`, converted a real PDF into an accessible document, approved (or cancelled) the PII review gate, and downloaded the accessible markdown. Plan for **~20 minutes**, including reading time for the output.
The web app is the right path when you don't run WordPress but still want to convert documents through a browser interface. For day-to-day reference on the web app's controls, see [use the web app](../how-to/use-the-web-app). This tutorial walks you through the path end-to-end; that page is the task reference after you're set up.
## What you need
- **A modern browser.** Current Chrome, Firefox, Edge, or Safari. The viewer uses standard web APIs; nothing needs to be installed.
- **The web app URL.** The UIC-hosted instance is at `https://reflow.equalify.uic.edu/`. There's no sign-in and no API key: open the URL and you're there. The service is protected by per-IP rate limits (see [use the web app § limits](../how-to/use-the-web-app#limits)) rather than authentication.
- **A PDF.** Up to **100 MB** and **50 pages**. Course materials, slide decks, handouts, and articles are the intended scope. Scanned PDFs work; OCR runs automatically.
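
If you want to check a file against the size and page limits before uploading, a small pre-flight helper along these lines may save a round trip. This is a minimal sketch, not part of Reflow: the function name `check_limits` is hypothetical, and it assumes "100 MB" means 100 × 1024 × 1024 bytes, which the source does not specify.

```python
# Hypothetical pre-flight check against the limits stated above.
# Assumption: 1 MB = 1024 * 1024 bytes (the exact definition is not documented here).
MAX_BYTES = 100 * 1024 * 1024
MAX_PAGES = 50


def check_limits(size_bytes: int, page_count: int) -> list[str]:
    """Return a list of human-readable problems; an empty list means the file fits."""
    problems = []
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes, over the {MAX_BYTES}-byte limit")
    if page_count > MAX_PAGES:
        problems.append(f"document has {page_count} pages, over the {MAX_PAGES}-page limit")
    return problems
```

In practice you could pass `os.path.getsize(path)` for the size, and get the page count from whichever PDF library you already use (for example, `len(PdfReader(path).pages)` with `pypdf`).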
## 1. Open the web app
Point your browser at `https://reflow.equalify.uic.edu/`. You'll land directly on the upload screen: no login, nothing to configure. At the top you'll see the Equalify Reflow header with a Beta badge. Below it are a row of pipeline phase tabs (greyed out until a document is loaded) and, in the middle of the page, a dashed drop zone that says **Drop a PDF here or click to upload**.

## 2. Upload a PDF
Three ways to start:
- **Drag and drop** a PDF from your file manager onto the drop zone.
- **Click** the drop zone to open a file picker.
- **Keyboard**: tab to the drop zone, then press **Enter** or **Space** to open the file picker.
As soon as the upload succeeds, the web app begins processing and the phase tabs start to light up.
## 3. Watch the pipeline run
The tab row at the top walks through five public phases, in order:
1. **Extraction**: IBM Docling parses the PDF structure (and runs OCR if the document is scanned).
2. **Analysis**: the pipeline classifies the document and builds a structure dossier.
3. **Headings**: heading levels are inferred and reconciled across the document.
4. **Translation**: each page is edited by a multimodal model to match what the visual page communicates.
5. **Assembly**: per-page markdown is joined into a single document and page-break artefacts are removed.
A tab with a spinner is the phase currently running. A green checkmark means a phase completed. An amber skip icon means a phase was skipped (most commonly OCR, when the PDF already had a text layer).
Expected duration for a 6–10 page document: **~2–5 minutes**. Longer documents scale roughly linearly.
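
To get a rough feel for a longer document, you can extrapolate from the figure above. The sketch below is only back-of-the-envelope arithmetic under the "roughly linear" assumption. It takes the widest reading of the quoted range (10 pages in 2 minutes as the fast case, 6 pages in 5 minutes as the slow case), and the function name `estimate_minutes` is hypothetical.

```python
# Hypothetical linear extrapolation from "6-10 pages takes about 2-5 minutes".
# Assumption: per-page time stays constant, which is only roughly true.
FAST_MIN_PER_PAGE = 2 / 10   # best case: 10 pages in 2 minutes
SLOW_MIN_PER_PAGE = 5 / 6    # worst case: 6 pages in 5 minutes


def estimate_minutes(pages: int) -> tuple[float, float]:
    """Return a (low, high) estimate in minutes for a document of `pages` pages."""
    if pages < 1:
        raise ValueError("pages must be at least 1")
    return pages * FAST_MIN_PER_PAGE, pages * SLOW_MIN_PER_PAGE
```

Under these assumptions a 50-page document lands somewhere between about 10 and 42 minutes. Treat this as a planning aid rather than a promise.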
## 4. Approve the PII review
Before any AI processing begins, the web app runs a PII (personally identifiable information) scan on the extracted text using Microsoft Presidio. If the scan finds anything that looks like an email address, phone number, SSN, credit card, or similar pattern, the pipeline **pauses** on the PII review panel and waits for a human decision.
On the PII panel you'll see:
- A list of findings grouped by type (Email address, Phone number, and so on) with counts.
- A **Show matches / Mask matches** toggle so you can reveal the specific strings that were flagged.
- Two buttons: **Continue anyway** and **Cancel processing**.
Choose **Continue anyway** if the matches are expected for the document (for example, an instructor's office email on a syllabus). Choose **Cancel processing** if something unexpected surfaced; no document data is sent to the AI pipeline once you cancel. If the scan finds nothing, the panel shows a green "No sensitive information detected" message and the pipeline continues without prompting.
Reflow is designed for course materials only. If a document contains student records or sensitive PII beyond the occasional contact detail, cancel and handle it outside Reflow.
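
To make the idea of "findings grouped by type" concrete, here is a deliberately simplified illustration. It is not Presidio and not Reflow's implementation: it swaps in three hand-written regular expressions, and the names `PATTERNS`, `scan`, and `mask` are hypothetical. Real PII detection covers far more cases than this.

```python
import re

# Illustrative patterns only; Presidio's recognizers are much more thorough.
PATTERNS = {
    "Email address": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "Phone number": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def scan(text: str) -> dict[str, list[str]]:
    """Group matched strings by finding type, omitting types with no matches."""
    findings = {}
    for label, pattern in PATTERNS.items():
        matches = pattern.findall(text)
        if matches:
            findings[label] = matches
    return findings


def mask(value: str) -> str:
    """Hide a matched string, similar in spirit to a 'Mask matches' toggle."""
    return "*" * len(value)
```

An empty result from `scan` corresponds to the "nothing found" case, where no human decision is needed. Any non-empty result is what would pause for review.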
## 5. Review the output in the pipeline viewer
Once the pipeline finishes, the web app opens into its working layout:
- A **page sidebar** on the left listing every page (click a number to jump to it).
- A **page image panel** in the centre-left showing the original PDF page.
- A **rendered markdown panel** in the centre-right showing the converted, accessible version of the current page.
- A **changes panel** on the far right showing how many edits the AI made in the currently selected phase and a **View Details** button that opens every change with its before, after, and the AI's reasoning.

Click through each phase tab to see what that step produced. The Analysis tab is special: it replaces the changes panel with a **structure metadata** panel showing the document outline, page attributes, footnotes, and detected code blocks. This is the context every downstream phase uses to make decisions.
If anything about the document prompted a warning (scanned pages, unusual layout), a yellow warnings banner appears above the working area.
## 6. Do a quick accessibility review
The single most valuable quality check is the **heading outline**. On the Analysis tab, scan the outline list: does the H1 match the document title, and are H2s the things you'd expect to be top-level sections? If the skeleton looks right, the rest of the output is usually solid. For the full reviewer checklist, see [interpret the output](../how-to/interpret-the-output).
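
If you prefer to run the same outline check on the downloaded markdown, a short script can list the headings and flag the two most common problems. This is a sketch under assumptions: it only recognises ATX-style headings (`#`, `##`, ...), skips fenced code blocks, and the function names are hypothetical rather than anything Reflow ships.

```python
import re

# Matches ATX headings such as "## Section title".
HEADING = re.compile(r"^(#{1,6})\s+(.*\S)\s*$")


def heading_outline(markdown: str) -> list[tuple[int, str]]:
    """Return (level, title) pairs, ignoring lines inside fenced code blocks."""
    outline = []
    in_fence = False
    for line in markdown.splitlines():
        if line.lstrip().startswith("```"):
            in_fence = not in_fence
            continue
        if in_fence:
            continue
        match = HEADING.match(line)
        if match:
            outline.append((len(match.group(1)), match.group(2)))
    return outline


def outline_problems(outline: list[tuple[int, str]]) -> list[str]:
    """Flag a missing or duplicated H1 and any heading that skips a level."""
    problems = []
    h1_count = sum(1 for level, _ in outline if level == 1)
    if h1_count != 1:
        problems.append(f"expected exactly one H1, found {h1_count}")
    previous = 0
    for level, title in outline:
        if previous and level > previous + 1:
            problems.append(f"'{title}' jumps from H{previous} to H{level}")
        previous = level
    return problems
```

A clean result here does not replace reading the outline yourself, since a script cannot tell whether the H1 is actually the document's title.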
## 7. Download the accessible output
From the rendered markdown panel's toolbar you can:
- **Download markdown**: the current phase's full-document markdown.
- **Copy markdown**: copy the current page's markdown to your clipboard.
- **Copy page image**: copy the current PDF page image as a PNG.
For the final accessible output, select the **Assembly** tab and use its download control. Figures extracted from the document are embedded inline in the rendered preview and in the downloaded markdown as base64 data URIs, so a single markdown file travels with its images.
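
If you later need the figures as separate files, the embedded data URIs can be decoded with the standard library. The sketch below assumes the images appear in standard markdown image syntax, `![alt](data:image/<type>;base64,<payload>)`. The exact form in Reflow's output is not confirmed here, and `extract_figures` is a hypothetical helper name.

```python
import base64
import re

# Assumed form: ![alt text](data:image/png;base64,AAAA...)
DATA_URI_IMAGE = re.compile(
    r"!\[([^\]]*)\]\(data:image/([A-Za-z0-9.+-]+);base64,([A-Za-z0-9+/=]+)\)"
)


def extract_figures(markdown: str) -> list[tuple[str, str, bytes]]:
    """Return (alt_text, image_subtype, raw_bytes) for each embedded image."""
    figures = []
    for alt, subtype, payload in DATA_URI_IMAGE.findall(markdown):
        figures.append((alt, subtype, base64.b64decode(payload)))
    return figures
```

Each tuple carries the alt text alongside the bytes, so you can write the image to disk and keep its description when moving the content elsewhere.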
## 8. Leave feedback (optional)
When the pipeline completes and feedback is enabled on the instance, a **Feedback** button appears in the stats bar. Use it to report anything that looked wrong: incorrect text, broken tables, missing alt text, or a heading at the wrong level.
Reports are reviewed by the Equalify team and directly inform pipeline improvements. See [provide feedback](../how-to/provide-feedback) for the details on each category.
## Troubleshooting
| Problem | First thing to try |
|---|---|
| Pipeline finishes in a few seconds with no AI improvements visible | The web app server's AWS Bedrock or Anthropic credentials likely expired. This is an instance-operator issue, not something you caused. Contact the team running the instance and ask them to refresh credentials. |
| Upload screen shows an "Unsupported Document" error straight after extraction | The PDF is an AcroForm or XFA form, encrypted, empty, or over the 50-page limit. The error panel names the specific reason. Upload a different PDF. |
| Stuck on the PII review panel | The pipeline is waiting for your decision. Click **Continue anyway** to proceed or **Cancel processing** to abort. If you've reloaded the page and lost the session, re-upload the PDF. |
| Progress bar appears frozen | The browser may have lost its live-progress connection. The web app falls back to polling the server; give it 30 seconds before reloading. |
| The output is missing a whole paragraph | Rare but possible on the first run. Click **New PDF** and re-upload; transient failures almost always clear on retry. |
## Where to go next
- [Use the web app](../how-to/use-the-web-app): day-to-day reference for the web app's controls, phases, and feedback flow
- [Interpret the output](../how-to/interpret-the-output): the reviewer's 4-minute quality scan for a converted document
- [Provide feedback](../how-to/provide-feedback): how to submit corrections and issue reports from the web app
- [How it works](../explanation/how-it-works): what the pipeline is actually doing during those five phases