AuraExtract — Intelligent Invoice & Receipt Data Extractor
The extraction engine uses regex pattern matching that handles real-world invoice layouts: column-per-line PDF formats, inline tabular formats, and plain-text documents. It detects 10 fields automatically and...
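As a rough illustration of regex-based field detection, here is a minimal sketch. The three field names and their patterns are assumptions made up for this example; the actual 10 fields and the patterns AuraExtract uses are not listed here. Because `\s*` also matches newlines, the same pattern covers both inline text ("Total: $99.00") and column-per-line layouts where the label and the value sit on separate lines.

```python
import re

# Hypothetical field patterns, for illustration only. The real field list
# and regexes used by AuraExtract are not given in this description.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"invoice\s*(?:no\.?|number|#)\s*[:\-]?\s*([A-Z0-9][A-Z0-9\-/]*)",
        re.IGNORECASE,
    ),
    "date": re.compile(
        r"\bdate\s*[:\-]?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4}|\d{4}-\d{2}-\d{2})",
        re.IGNORECASE,
    ),
    "total": re.compile(
        r"\btotal\s*(?:due|amount)?\s*[:\-]?\s*[$€£]?\s*([\d,]+\.\d{2})",
        re.IGNORECASE,
    ),
}


def extract_fields(text):
    """Return a dict of field name -> first matched value, or None if absent."""
    result = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result


# Inline layout: label and value on the same line.
inline_text = "Invoice No: INV-2024-001  Date: 12/03/2024  Total: $1,234.50"

# Column-per-line layout: label on one line, value on the next.
column_text = "Invoice Number\nINV-1001\nDate\n2024-03-12\nTotal\n99.00"

inline_fields = extract_fields(inline_text)
column_fields = extract_fields(column_text)
```

Returning None for a missing field, rather than raising, lets the caller decide how to report incomplete documents.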
A fully offline document summarizer built in pure Python. It uses TF-IDF scoring, position weighting, and Jaccard deduplication to extract the most important sentences from any PDF, DOCX, or TXT file, and labels each extracted sentence with a relevance percentage.
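The three techniques named above can be combined roughly as follows. This is a sketch under stated assumptions, not the project's actual code: the function names, the choice to treat each sentence as its own "document" for IDF, the linear position bonus, the 0.6 Jaccard threshold, and the normalization of relevance against the top score are all illustrative choices. It works on plain text only; file parsing for PDF and DOCX is left out.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def jaccard(a, b):
    """Jaccard similarity of two token sets: |A & B| / |A | B|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def summarize(text, top_n=3, position_weight=0.2, jaccard_threshold=0.6):
    """Return up to top_n (sentence, relevance_percent) pairs, best first."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    token_lists = [tokenize(s) for s in sentences]
    n = len(sentences)
    if n == 0:
        return []

    # Document frequency, treating each sentence as one document (assumption).
    df = Counter()
    for tokens in token_lists:
        df.update(set(tokens))

    scores = []
    for i, tokens in enumerate(token_lists):
        if not tokens:
            scores.append(0.0)
            continue
        tf = Counter(tokens)
        # Smoothed TF-IDF summed over the distinct terms of the sentence.
        tfidf = sum(
            (count / len(tokens)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        )
        # Position weighting: earlier sentences get a larger bonus.
        scores.append(tfidf * (1 + position_weight * (1 - i / n)))

    max_score = max(scores) or 1.0
    ranked = sorted(range(n), key=lambda i: scores[i], reverse=True)

    chosen = []
    for i in ranked:
        tokens_i = set(token_lists[i])
        # Jaccard deduplication: skip near-duplicates of already chosen sentences.
        if any(jaccard(tokens_i, set(token_lists[j])) >= jaccard_threshold for j in chosen):
            continue
        chosen.append(i)
        if len(chosen) == top_n:
            break

    return [(sentences[i], round(100 * scores[i] / max_score)) for i in chosen]


sample = (
    "Invoices arrive as PDF files. Invoices arrive as PDF files. "
    "The parser reads totals and dates. Weather was nice."
)
summary = summarize(sample, top_n=3)
```

In the sample, the repeated sentence appears only once in the summary because its Jaccard similarity to the first copy is 1.0, which exceeds the threshold.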
Built AuraChat v3.0 — a fully offline Document Intelligence desktop app in pure Python. Users upload any PDF, Word, or TXT file and ask questions in plain English. The system returns cited answers with confidence scores instantly.