Stars
PostgreSQL connection pooler, load balancer and database sharder.
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
The Arcade Learning Environment (ALE) -- a platform for AI research.
C++20 idiomatic APIs for the Apache Arrow Columnar Format
A flexible distributed key-value database that is optimized for caching and other realtime workloads.
😎 A curated list of awesome MLOps tools
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
A modern replacement for Redis and Memcached
Bear is a tool that generates a compilation database for clang tooling.
A cross-platform, OpenGL terminal emulator.
🌙 LunarVim is an IDE layer for Neovim. Completely free and community driven.
A composable and fully extensible C++ execution engine library for data management systems.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
High performance server-side application framework
Official Rust implementation of Apache Arrow