OCR Image Menu Text Extractor

Overview

This script extracts text from images of restaurant menus using Optical Character Recognition (OCR) and then enhances and structures the extracted text using a language model. The script processes images in a specified directory and outputs the structured data to a JSON file.

Features

OCR Extraction: Uses pytesseract to extract text from images.
Text Enhancement: Utilizes a language model (litellm) to convert raw text into structured JSON.
Multiprocessing: Processes multiple images in parallel to improve performance.
Supports Multiple Formats: Handles various image formats such as JPEG, PNG, BMP, and TIFF.

Requirements

Python 3.12
pytesseract
Pillow (PIL)
litellm
multiprocessing (part of Python standard library)

You can install the required Python packages using pip:

pip install -r requirements.txt

Configuration

IMAGES_DIR: Directory where the images are located (default: images/).
OUTPUT_FILE: File where the structured JSON data will be saved (default: output.json).

Usage

Place the script in your desired directory.
Ensure your images are located in the directory specified by IMAGES_DIR.
pip install -r requirements.txt
python main.py

Script Details

Functions

extract_text(image_path: str) -> str: Extracts text from the specified image using OCR.
enhance_and_jsonify_text(raw_text: str) -> dict: Enhances raw text and converts it into structured JSON using a language model.
process_image(image_file: Path) -> dict: Processes a single image file and returns the result as a dictionary.
process_images_from_directory(directory_path: str) -> dict: Processes all images in the given directory using multiprocessing.
main(directory_path: str, output_file: str) -> None: Main function to process images and write results to a JSON file.

📜 License

Distributed under the MIT License. See LICENSE for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.github		.github
images		images
.env.sample		.env.sample
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
main.py		main.py
output.json		output.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Image Menu Text Extractor

Overview

Features

Requirements

Configuration

Usage

Script Details

Functions

📜 License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OCR Image Menu Text Extractor

Overview

Features

Requirements

Configuration

Usage

Script Details

Functions

📜 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages