OCR Prodigy

Bhavarth Bhangdia

Backend Engineer

OCR Prodigy

OCR Prodigy is an application designed to recognize Thai ID cards using Optical Character Recognition (OCR). It employs the Google Vision API for OCR processing, integrates with MongoDB for data storage, and provides REST API endpoints using Node.js for CRUD operations.

Objective

The goal of this application is to accurately extract data from Thai ID cards via OCR processing and save the extracted information in a MongoDB database for retrieval.

Features

OCR Processing: Utilizes Google Vision API for OCR of Thai ID cards.
Data Extraction: Extracts key information like name, last name, ID number, date of birth, date of issue, and date of expiry.
User Interface: Allows users to upload Thai ID card images (png, jpeg, jpg) with a 2MB file size limit.
JSON Output: Displays JSON output on the UI for successful OCR operations.
History and Filtering: Provides options to query previous OCR results.
API Endpoints: Offers CRUD operations for OCR records.
Advanced Features: Includes error handling, unit tests, code comments, etc.

Technologies Used

Frontend: ReactJS
Backend: Node.js
Database: MongoDB
OCR Processing: Google Vision API

Installation

Initialization:
git init
Frontend
cd Frontend cd ocr npm install npm start
Backend Setup:
cd Backend npm install npm run dev ##UI of webpage

After Uploading the given sample id photo

Output of text extraction is given below.

API Endpoints

POST /api/users/upload: Create a new ID card entry.
PUT /api/users/:id: Update an existing ID card entry.
DELETE /api/users/:id: Delete an ID card entry.
GET /api/users: Retrieve all ID card entries.
GET /api/users/:id: Retrieve one ID card entry using its ID.
Partner With Bhavarth
View Services

More Projects by Bhavarth