delta-io / delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
8000
See what the GitHub community is most excited about this week.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Open-source high-performance RISC-V processor
♞ lichess.org: the forever free, adless and open source chess server ♞
State of the Art Natural Language Processing
Avro Data Source for Apache Spark
The Scala 3 compiler, also known as Dotty.
Spark reference applications
Protocol buffer compiler for Scala.
Chisel: A Modern Hardware Design Language
A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
Performance tests for Apache Spark
Redshift data source for Apache Spark
CSV Data Source for Apache Spark 1.x
RISC-V Torture Test
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Stanford CoreNLP wrapper for Apache Spark
Rocket Chip Generator
proof-of-concept implementation of Pig-on-Spark integrated at the logical node level
An sbt plugin for deploying code to Databricks Cloud