ReforMe: Re-Shaping Documents with Contextual Prompting and Layout-Aware Propagation

arXiv CS, Wednesday 03 June 2026, 04:00 UTC. By Nabin Khanal, Tongyan Wang, Jui-Cheng Chiu, Ningning Nicole Kong, Hannah Yanhua Zong, Yingjie Victor Chen

Key Points

arXiv:2606.03266v1. Abstract: Digitizing complex documents with handwritten content, irregular tables, and heterogeneous layouts remains challenging: traditional Optical Character Recognition (OCR) systems fail to capture writing nuances, author-specific conventions, and document structure, while recent LLM-based approaches lack mechanisms for precise, scalable correction. We present an interactive document digitization system that integrates layout-aware parsing, OCR, and LLM-based reconstruction with user-driven refinement. The system is informed by a formative study that identifies key challenges and interaction needs in real-world digitization workflows. It supports both direct edits and natural-language instructions, and introduces a layout-aware propagation mechanism that generalizes user corrections across structurally similar regions. This enables not only efficient error correction but also document re-shaping into structured, analyzable representations. We evaluate the system through a within-subjects user study (n=12) on real-world documents. Results show improved correction efficiency and reduced repetitive effort, demonstrating a more effective and controllable document digitization procedure.
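
The abstract does not say how the propagation mechanism is implemented. As a rough illustration of the general idea only, the Python sketch below applies one user correction to every region that shares the corrected region's structure. The `Region` class, the `structurally_similar` rule (same region kind and same table column), and the `propagate_correction` function are all hypothetical names and assumptions made for this example; they are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Region:
    """A parsed layout region (hypothetical structure, not from the paper)."""
    region_id: str
    kind: str      # e.g. "table_cell" or "paragraph"
    column: int    # column index within a table, -1 if not applicable
    text: str


def structurally_similar(a: Region, b: Region) -> bool:
    # Assumed similarity rule: same region kind and same table column.
    return a.kind == b.kind and a.column == b.column


def propagate_correction(regions, source_id, wrong, right):
    """Apply a substring fix made on one region to all similar regions.

    Returns the ids of the regions whose text was changed.
    """
    source = next(r for r in regions if r.region_id == source_id)
    changed = []
    for r in regions:
        if structurally_similar(source, r) and wrong in r.text:
            r.text = r.text.replace(wrong, right)
            changed.append(r.region_id)
    return changed


# Example: OCR read the digit "1" as the letter "l" in a weight column.
regions = [
    Region("r1", "table_cell", 2, "l2 kg"),
    Region("r2", "table_cell", 2, "l5 kg"),
    Region("r3", "table_cell", 0, "l2 kg"),
    Region("r4", "paragraph", -1, "see l2 above"),
]
changed = propagate_correction(regions, "r1", "l", "1")
```

In this toy example the fix made on `r1` also reaches `r2`, which sits in the same column, while the cell in a different column and the paragraph are left alone. A real system would presumably use a richer notion of structural similarity and let the user review propagated changes before accepting them.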


Originally published by arXiv CS.
