Upload, Scan, and Extract Text from Documents Instantly
Start ScanningThis project — "Intelligently Extract Text & Data from Documents using OCR & NER" — was developed by Subrat Gupta as a practical implementation of modern Computer Vision and NLP techniques. The goal is to accurately extract key entities such as names, phone numbers, emails, and organizations from scanned documents, with a special focus on business cards. While this demonstration uses business cards for data privacy reasons, the framework is easily adaptable to documents like invoices, shipping bills, or other financial records.
To build this project, I integrated two core technologies in Data Science:
In the Computer Vision module, the system scans the uploaded document, enhances its quality, and detects the position of textual elements. Then, using NLP techniques, it processes the extracted text, cleans it, and uses a custom-trained NER model to identify structured data.
Python Libraries used in Computer Vision Module:
Python Libraries used in Natural Language Processing:
The entire project is divided into multiple development stages for better understanding and modular implementation:
Stage 1: Project Setup
Stage 2: Data Preparation
Stage 3: Manual Data Labeling (BIO Tagging)
Stage 4: Data Preprocessing
Stage 5: Model Training
Stage 6: Deployment and Prediction
Developed By: Subrat Gupta
B.Tech CSE, GITAM University