Understanding the Output

Published March 23, 2026 · By Equalify Tech Team

When Equalify Reflow processes a PDF, it produces an accessible markdown document and a set of extracted images. This guide explains what the output includes, how to evaluate its quality, and what the system's current limitations are.

What You Get

Accessible Markdown

The primary output is a single markdown file containing the full document content with:

Extracted Figures

Images, charts, diagrams, and photos are extracted from the PDF and saved as separate files. Each figure includes:

During the Translation stage, a specialist sub-agent generates alt text for each figure and embeds it directly in the markdown (e.g., ![Description of chart](figures/figure-1.png)). Decorative images like logos are identified and left with empty alt text, following WCAG best practices.
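To spot-check the embedded alt text, the image references can be pulled out of the markdown and listed. The following is a minimal sketch, not part of Reflow itself: the function name `list_figures` and the second sample path (`figures/figure-2.png`) are illustrative assumptions, and the regular expression only covers inline `![alt](path)` images.

```python
import re

# Matches inline markdown images: ![alt](path), with an optional "title".
IMAGE_PATTERN = re.compile(
    r'!\[(?P<alt>[^\]]*)\]\((?P<path>[^)\s]+)(?:\s+"[^"]*")?\)'
)


def list_figures(markdown: str) -> list[dict]:
    """Return one record per embedded image, flagging empty alt text."""
    figures = []
    for match in IMAGE_PATTERN.finditer(markdown):
        alt = match.group("alt").strip()
        figures.append(
            {
                "path": match.group("path"),
                "alt": alt,
                # Empty alt text is how the output marks decorative images.
                "decorative": alt == "",
            }
        )
    return figures


# Hypothetical sample: the first line mirrors the example in this guide,
# the second path is made up to show a decorative image.
sample = (
    "![Description of chart](figures/figure-1.png)\n"
    "![](figures/figure-2.png)\n"
)
for figure in list_figures(sample):
    print(figure)
```

Images reported as decorative are worth a quick manual look, since an informative figure that was misclassified would end up with no description at all.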

The Change Ledger

Every edit the pipeline makes is recorded in a change ledger. Each entry includes:

The ledger is available through the API (GET /api/v1/documents/{job_id}/ledger) and in the pipeline viewer's Changes panel.
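As a rough illustration, the ledger endpoint could be called with Python's standard library. This sketch makes several assumptions that this guide does not confirm: the host name `reflow.example.edu` is a placeholder, the response body is assumed to be JSON, and any authentication your deployment requires is omitted.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen


def ledger_url(base_url: str, job_id: str) -> str:
    """Build the ledger endpoint URL for one conversion job."""
    # Percent-encode the job ID so it cannot alter the path.
    encoded_id = quote(job_id, safe="")
    return f"{base_url.rstrip('/')}/api/v1/documents/{encoded_id}/ledger"


def fetch_ledger(base_url: str, job_id: str, timeout: float = 30.0):
    """GET the ledger and decode the body (ASSUMPTION: the body is JSON)."""
    with urlopen(ledger_url(base_url, job_id), timeout=timeout) as response:
        return json.load(response)


# Placeholder host and job ID; replace both with real values before
# calling fetch_ledger().
print(ledger_url("https://reflow.example.edu", "job-123"))
```

Building the URL in its own function keeps the network call separate, so the path logic can be checked without a running server.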

Evaluating Quality

What to Check

When reviewing a converted document, focus on these areas:

Structure

Content Accuracy

Tables

Images

Formatting
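Part of the structure review can be automated. The sketch below flags headings that skip a level (for example, a level 1 heading followed directly by a level 3 heading). It assumes the markdown output uses `#`-style headings, which this guide does not state, and the function name `find_heading_jumps` is illustrative.

```python
import re

HEADING_PATTERN = re.compile(r"^(#{1,6})\s+(.+?)\s*$")
FENCE = "`" * 3  # code-fence marker, built this way to keep the example tidy


def find_heading_jumps(markdown: str) -> list[tuple[int, str]]:
    """Return (line_number, message) for each heading that skips a level."""
    problems = []
    previous_level = 0  # 0 means no heading has been seen yet
    in_fence = False
    for line_number, line in enumerate(markdown.splitlines(), start=1):
        # Ignore anything inside fenced code blocks, where "#" is not a heading.
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence
            continue
        if in_fence:
            continue
        match = HEADING_PATTERN.match(line)
        if not match:
            continue
        level = len(match.group(1))
        if level > previous_level + 1:
            origin = "start of document" if previous_level == 0 else f"h{previous_level}"
            problems.append((line_number, f"{origin} -> h{level}: {match.group(2)}"))
        previous_level = level
    return problems


sample = "# Course Syllabus\n\n### Grading\n\n## Schedule\n"
for line_number, message in find_heading_jumps(sample):
    print(f"line {line_number}: {message}")
```

A clean result does not prove the hierarchy is right, only that no level is skipped. Whether a section belongs under its parent heading still needs a human reader.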

Quality by Document Type

Some document types convert better than others:

| Document Type | Typical Quality | Common Issues |
| --- | --- | --- |
| Syllabi and course materials | High | Occasional heading level disagreements |
| Policy documents | High | Complex nested numbering schemes |
| Letters and memos | High | Letterhead content may be over-described |
| Academic chapters | Medium | Footnote ordering, reading order in multi-column layouts |
| Presentations (slides) | Medium | Slide boundaries, text embedded in images |
| Infographics and posters | Lower | Spatial relationships lost when linearized |
| Brochures with complex layouts | Lower | Multi-column reading order confusion |

Known Limitations

The system is designed for course materials: syllabi, academic papers, policy documents, presentations, and similar content. The following document types are outside the current scope and may produce lower-quality results:

When the pipeline detects a document type it handles poorly, it emits warnings in the response. These warnings appear in both the API response and the viewer interface.
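A client script might surface these warnings before anyone reviews the output. The sketch below is hypothetical: this guide does not specify the response schema, so the top-level `warnings` key and the sample message are assumptions to adjust against the actual API response.

```python
def collect_warnings(response_body: dict) -> list[str]:
    """Return pipeline warnings from an already decoded API response."""
    # ASSUMPTION: warnings sit under a top-level "warnings" key as a list.
    # The real field name and shape may differ.
    raw = response_body.get("warnings") or []
    return [str(item) for item in raw]


# Hypothetical response body, for illustration only.
example = {"warnings": ["Document type may convert poorly"]}
for warning in collect_warnings(example):
    print(f"WARNING: {warning}")
```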

Providing Feedback

If you find an issue in a converted document, see the Providing Feedback guide for how to report issues and suggest corrections through the WordPress plugin.