The OCR Photo Scanner App is a web application that allows users to upload images, perform Optical Character Recognition (OCR) using Tesseract, and extract text from the images. The app features user authentication for secure access, a landing page, and a smooth user experience for both signed-in and guest users.
- Optical Character Recognition (OCR): Extract text from images using the Tesseract OCR engine.
- User Authentication: Users can sign up, log in, and log out to access personalized features.
- Landing Page: An aesthetically designed landing page welcoming users and explaining the app’s functionalities.
- Image Upload: Upload images directly from your device to the app.
- Real-time Text Extraction: Get instant results after uploading an image.
- Secure Access: Only authenticated users can upload and scan images.
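As a rough illustration of the OCR feature, the sketch below wraps a recognizer function and tidies the text it returns. It is not taken from this repository: `extractText`, `OcrResult`, and `Recognize` are hypothetical names, and the recognizer is passed in as a parameter so the snippet runs without any dependencies. The assumed call shape, `(image, lang)` resolving to an object with `data.text`, mirrors `Tesseract.recognize` from Tesseract.js.

```typescript
// Result shape this sketch relies on: an object carrying data.text.
interface OcrResult {
  data: { text: string };
}

// Any function with the call shape (image, lang) => Promise<OcrResult>.
// Assumption: Tesseract.recognize from tesseract.js can be passed here.
type Recognize<I> = (image: I, lang: string) => Promise<OcrResult>;

// Hypothetical helper: runs OCR on one image and cleans up the output.
async function extractText<I>(
  image: I,
  recognize: Recognize<I>,
  lang: string = "eng",
): Promise<string> {
  const result = await recognize(image, lang);
  // Strip trailing whitespace from each line, then trim leading and
  // trailing blank lines from the whole text.
  return result.data.text
    .split("\n")
    .map((line) => line.replace(/\s+$/, ""))
    .join("\n")
    .trim();
}
```

In the app this might be called as `extractText(file, Tesseract.recognize)` after importing `Tesseract` from `tesseract.js`, where `file` is the image chosen by the user. Injecting the recognizer also makes the helper easy to exercise with a stub.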
Check out the live demo of the app here.
- Frontend: React JS
- Backend: Firebase
- OCR Engine: Tesseract.js
- Authentication: Firebase Auth
- Database: Firestore
To get started with the project locally, follow these steps:
- Node.js installed on your machine
- Tesseract.js installed
- Firebase or any other database service set up
- Git installed
git clone https://github.com/yourusername/ocr-photo-scanner.git
cd ocr-photo-scanner
npm install
npm run dev
The app should now be running at http://localhost:5173.
- Sign up / Log in: Create an account or log in with your existing credentials.
- Upload an Image: Once logged in, navigate to the OCR scanner page and upload an image.
- Get Text Output: The app will process the image and display the extracted text in real time.
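The usage steps above can be pictured as a small state machine. The reducer below is only a sketch of how the scanner page might track them; the state and event names (`ScanState`, `ScanEvent`, `scanReducer`) are hypothetical and do not come from this codebase.

```typescript
// Hypothetical page states: signed out, waiting for an image,
// running OCR, showing the extracted text, or showing an error.
type ScanState =
  | { status: "signedOut" }
  | { status: "idle" }
  | { status: "processing"; fileName: string }
  | { status: "done"; text: string }
  | { status: "error"; message: string };

// Hypothetical events that move the page between states.
type ScanEvent =
  | { type: "loggedIn" }
  | { type: "loggedOut" }
  | { type: "imageSelected"; fileName: string }
  | { type: "textExtracted"; text: string }
  | { type: "failed"; message: string };

function scanReducer(state: ScanState, event: ScanEvent): ScanState {
  switch (event.type) {
    case "loggedOut":
      return { status: "signedOut" };
    case "loggedIn":
      // Logging in only changes anything when the user was signed out.
      return state.status === "signedOut" ? { status: "idle" } : state;
    case "imageSelected":
      // Uploads are ignored until the user has logged in.
      if (state.status === "signedOut") return state;
      return { status: "processing", fileName: event.fileName };
    case "textExtracted":
      // Results only count while an image is being processed.
      return state.status === "processing"
        ? { status: "done", text: event.text }
        : state;
    case "failed":
      return state.status === "processing"
        ? { status: "error", message: event.message }
        : state;
    default:
      return state;
  }
}
```

In a React component, a reducer like this could be passed to `useReducer(scanReducer, { status: "signedOut" })`, with the login handler and the OCR call dispatching the events.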
If you'd like to contribute, please fork the repository and use a feature branch. Pull requests are warmly welcome.
- Fork it
- Create your feature branch (`git checkout -b feature/new-feature`)
- Commit your changes (`git commit -am 'Add some feature'`)
- Push to the branch (`git push origin feature/new-feature`)
- Create a new Pull Request
This project is licensed under the MIT License. See the LICENSE file for details.