Skip to content
View andygrove's full-sized avatar

Highlights

  • Pro

Organizations

@apache
Block or Report

Block or report andygrove

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Rust implementation of the Iceberg REST Catalog specification.

Rust 91 6 Updated Jul 25, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,086 393 Updated Jul 26, 2024

Apache DataFusion Comet Spark Accelerator

Rust 689 128 Updated Jul 26, 2024
Rust 1 Updated Apr 30, 2024

Rust-based WebAssembly bindings to read and write Apache Parquet data

Rust 484 19 Updated Jul 23, 2024

Pure Rust Iceberg Implementation

Rust 166 17 Updated Jun 19, 2024

Car Universal Rental Business (CURB)

TypeScript 2 1 Updated Apr 1, 2024

Utility to run/debug Spark RAPIDS in REPL

Jupyter Notebook 7 3 Updated Nov 6, 2023

Apache Iceberg

Rust 529 113 Updated Jul 25, 2024

reproducible benchmark of database-like ops

R 138 27 Updated Jul 11, 2024

A high-performance, zero-overhead, extensible Python compiler using LLVM

C++ 13,997 497 Updated Jul 18, 2024

Apache Arrow Ballista Python bindings

Shell 32 8 Updated Feb 10, 2024

Making data lake work for time series

Python 1,107 59 Updated Sep 30, 2023

Open Source ElasticSearch Alternative. Parseable helps you search and get insights from your logs in the most simple way possible.

Rust 1,799 93 Updated Jul 25, 2024

Data pipeline example written in Rust with Polars and DataFusion DataFrame package

Rust 37 1 Updated Mar 12, 2023

Rust bindings for the Python interpreter

Rust 11,595 711 Updated Jul 25, 2024

Ergonomic bindings to duckdb for Rust

Rust 442 91 Updated Jul 23, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 39,000 28,121 Updated Jul 26, 2024

Experimental DataFusion Optimizer

Rust 46 6 Updated Jun 9, 2023

Apache DataFusion Python Bindings

Rust 328 64 Updated Jul 25, 2024

Database connectivity API standard and libraries for Apache Arrow

C# 337 85 Updated Jul 26, 2024

Quickly view your data

Rust 254 16 Updated Jul 25, 2024

C language bindings for DataFusion

C 16 4 Updated Mar 11, 2024

Arrow Flight Sql Client

Rust 2 2 Updated Jul 25, 2024

Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark

Python 34 26 Updated Jul 15, 2024

Apache DataFusion Ballista Distributed Query Engine

Rust 1,411 183 Updated Jul 15, 2024

Terminal based, extensible, interactive data analysis tool using SQL

Rust 73 4 Updated Jul 26, 2023

Stream processing & Service framework.

Rust 134 7 Updated May 1, 2024
Next