Stars
🏡 Open source home automation that puts local control and privacy first.
A high-throughput and memory-efficient inference and serving engine for LLMs
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Get your documents ready for gen AI
A community-supported supercharged document management system: scan, index and archive all your documents
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Faster Whisper transcription with CTranslate2
OCR, layout analysis, reading order, table recognition in 90+ languages
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Open-source offline translation library written in Python
Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home,…
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
The World's Leading Cross Platform AI Engine for Edge Devices
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR
AI-powered radio signal classifier using RTL-SDR + ARM SBC. Identifies FM, NOAA Weather, APRS, FRS/GMRS, ISM sensors, pagers with 96.9% accuracy. Complete pipeline: capture → train → classify. Requ…
Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder 436C
tesseractXplore a tesseract ease of use gui with full control
Remove Silence From Audio using pydub