Ivan2getdmodelz

Stars

30 stars written in Python

home-assistant / core

🏡 Open source home automation that puts local control and privacy first.

Python 85,999 37,192 Updated Apr 10, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,974 15,402 Updated Apr 10, 2026

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 75,286 10,222 Updated Apr 6, 2026

docling-project / docling

Get your documents ready for gen AI

Python 57,459 3,911 Updated Apr 10, 2026

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 55,674 10,721 Updated Apr 10, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 38,155 4,400 Updated Apr 10, 2026

paperless-ngx / paperless-ngx

A community-supported supercharged document management system: scan, index and archive all your documents

Python 37,965 2,412 Updated Apr 10, 2026

google / langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 35,556 2,414 Updated Apr 10, 2026

ocrmypdf / OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 33,173 2,299 Updated Apr 8, 2026

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 29,261 3,552 Updated Dec 5, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 22,060 1,787 Updated Nov 19, 2025

datalab-to / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 19,568 1,346 Updated Apr 3, 2026

chidiwilliams / buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 18,592 1,372 Updated Mar 29, 2026

LibreTranslate / LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.

Python 14,136 1,445 Updated Apr 7, 2026

vikhyat / moondream

tiny vision language model

Python 9,556 755 Updated Nov 14, 2025

RapidAI / RapidOCR

📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.

Python 6,276 612 Updated Apr 8, 2026

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 6,006 635 Updated Mar 29, 2026

argosopentech / argos-translate

Open-source offline translation library written in Python

Python 5,841 438 Updated Apr 4, 2026

roflcoopter / viseron

Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home,…

Python 2,702 328 Updated Apr 9, 2026

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

Python 2,359 230 Updated Jan 30, 2026

JuergenFleiss / aTrain

A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.

Python 1,095 78 Updated Apr 9, 2026

mittagessen / kraken

OCR engine for all the languages

Python 977 162 Updated Apr 7, 2026

openpaperwork / pyocr

A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab

Python 929 150 Updated Jun 13, 2018

johnolafenwa / DeepStack

The World's Leading Cross Platform AI Engine for Edge Devices

Python 810 122 Updated Jun 13, 2024

sbrunner / deskew

Library used to deskew a scanned document

Python 507 45 Updated Apr 2, 2026

felixdittrich92 / OnnxTR

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

Python 177 18 Updated Mar 29, 2026

TrevTron / rtl-ml

AI-powered radio signal classifier using RTL-SDR + ARM SBC. Identifies FM, NOAA Weather, APRS, FRS/GMRS, ISM sensors, pagers with 96.9% accuracy. Complete pipeline: capture → train → classify. Requ…

Python 140 15 Updated Mar 26, 2026

GNOME / ocrfeeder

Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder

Python 94 30 Updated Mar 15, 2026

JKamlah / tesseractXplore

tesseractXplore a tesseract ease of use gui with full control

Python 28 9 Updated Nov 10, 2021

NeuralFalconYT / Remove-Silence-From-Audio

Remove Silence From Audio using pydub

Python 9 4 Updated Feb 23, 2026
