I turn images, scanned documents or PDFs into clean structured data using multimodal LLMs (Gemini / Qwen-VL): structured
extraction with bounding boxes, an LLM proofreading/QC stage, and JSON/CSV output — delivered with real accuracy and cost
metrics. Includes prompt design, OpenRouter fallback for quota limits, and a simple review UI. Great for document automation,
data-entry replacement, and translation pipelines