Skip to content
View FANNG1's full-sized avatar
Block or Report

Block or report FANNG1

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Rust implementation of the Iceberg REST Catalog specification.

Rust 92 6 Updated Jul 26, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,004 284 Updated Jul 26, 2024

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 799 128 Updated Jul 20, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 7,791 621 Updated Jul 25, 2024

Python SQL Parser and Transpiler

Python 6,105 611 Updated Jul 25, 2024

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Python 3,728 250 Updated Jul 26, 2024

Gradle plugin to create fat/uber JARs, apply file transforms, and relocate packages for applications and libraries. Gradle version of Maven's Shade plugin.

Groovy 3,631 389 Updated Jul 22, 2024

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,181 858 Updated Jul 26, 2024

Apache DataFusion Comet Spark Accelerator

Rust 689 128 Updated Jul 26, 2024

Uniffle is a high performance, general purpose Remote Shuffle Service.

Java 363 141 Updated Jul 26, 2024

CMU-DB's Cascades optimizer framework

Rust 335 19 Updated Jun 16, 2024

Polycat is a cutting-edge cloud-native metastore system, purpose-built to cater to the demands of modern data management in lakehouse deployments. It offers a comprehensive solution for organizatio…

Java 18 6 Updated May 9, 2024

Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.

Java 262 73 Updated Jul 23, 2024

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,024 891 Updated Jul 25, 2024

Open protocol for decentralized exchange and transformation of data

Python 121 11 Updated Jun 27, 2024

A GPT4 powered tool for detecting bugs in Databend

Python 16 4 Updated Jun 18, 2024

Apache Iceberg

Rust 529 113 Updated Jul 25, 2024

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 764 236 Updated Jul 26, 2024

A networking framework that evolves with your application

Java 906 175 Updated Jul 25, 2024

📈 Capturing JVM- and application-level metrics. So you know what's going on.

Java 7,816 1,806 Updated Jul 26, 2024

Dropwizard bundle and reporter for Prometheus

Java 25 10 Updated Jul 20, 2024

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

10,071 485 Updated May 5, 2024
Java 1,597 277 Updated Jul 10, 2024

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,113 148 Updated Jul 25, 2024

Simple project to expose a catalog over REST using a Java catalog backend

Java 91 39 Updated Jul 11, 2024

A list of semi to fully remote-friendly companies (jobs) in tech.

JavaScript 28,443 2,992 Updated Jul 8, 2024

Brings SQL and AI together.

Go 5,054 698 Updated Apr 18, 2024

Gluten: Plugin to Boost Trino's Performance

Java 68 13 Updated Oct 25, 2023
Next