Spaces:

Shabdobhedi
/

OCR-Document-Search-App-Using_gemini-1

Sleeping

App Files Files Community

OCR-Document-Search-App-Using_gemini-1 / README.md

Shabdobhedi

Update README.md

a464a1e verified 5 months ago

preview code

raw

history blame contribute delete

2.88 kB

	---
	title: OCR-Document-Search-App-Using Gemini-1
	emoji: 😻
	colorFrom: gray
	colorTo: purple
	sdk: streamlit
	sdk_version: 1.38.0
	app_file: app.py
	pinned: false
	license: mit
	short_description: Extract Hindi and English Text from image
	---




	# OCR and Document Search Web Application Prototype(Using gemini-1.5)

	## Introduction
	This project is a web-based prototype designed to perform Optical Character Recognition (OCR) on images containing text in both Hindi and English. The application allows users to upload an image, extract the text, and search for specific keywords within the extracted content. The goal is to demonstrate the functionality of OCR technology in a user-friendly web application.



	## Technologies
	The following technologies and libraries were used in this project:

	- Python: Programming language used for development.
	- Google Generative AI: Library used for performing OCR on images.
	- Streamlit: Framework for building the web application interface.
	- Pillow: Library for image processing.

	## Requirements
	To run this project, you will need the following libraries installed:

	```plaintext
	streamlit
	google-generativeai
	Pillow
	```





	## Project Structure



	```

	/OCR-Document-Search-App-Using_gemini-1.5-pro
	│
	├── app.py # Main application file
	├── ocr_utils.py # Utility functions for OCR processing
	└── requirements.txt # List of required libraries

	```
	# How to Run the Application

	## 1.Clone the Repository:


	```
	git clone https://github.com/pratim808/OCR-Document-Search-App-Using_gemini-1.5-pro.git
	```

	## 2.Navigate to the Project Directory:


	```
	cd OCR-Document-Search-App-Using_gemini-1.5-pro
	```

	## 3.Install Dependencies:

	```
	pip install -r requirements.txt
	```
	## 4.Run the Application:

	```
	streamlit run app.py
	```
	## 5.Access the Application:
	```
	Open your web browser and go to http://localhost:8501.
	```



	## Features


	- Image Upload: Users can upload images in JPG, JPEG, or PNG format.
	- Text Extraction: The application uses EasyOCR to extract text from the uploaded image.
	- Keyword Search: Users can enter keywords to search within the extracted text, with matching sections highlighted.
	## Deployment
	The application can be deployed on platforms such as Streamlit Sharing or Hugging Face Spaces. Follow their instructions to make your application accessible via a public URL.
	## Live URL

	https://huggingface.co/spaces/Shabdobhedi/OCR-Document-Search-App-Using_gemini-1
	## Conclusion
	This project demonstrates how OCR technology can be integrated into a web application, providing an efficient tool for extracting and searching text from images.
	## Demo

	https://drive.google.com/file/d/1RKYpN6zbqqWFhCHslFAPxJqDgLZbkma6/view?usp=sharing



	Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference