SteNicholas

Organizations

@apache

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,651 195 Updated Jul 26, 2024

A playground to experience Gravitino

Jupyter Notebook 19 17 Updated Jul 19, 2024

Apache Paimon Rust: the Rust implementation of Apache Paimon.

Rust 57 21 Updated Jul 25, 2024

Apache Paimon Rust: the Rust implementation of Apache Paimon.

Rust 1 Updated Jul 8, 2024

Apache DataFusion SQL Query Engine

Rust 5,646 1,055 Updated Jul 26, 2024

Apache DataFusion SQL Query Engine

Rust 1 Updated Jun 22, 2024

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,181 858 Updated Jul 26, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,007 284 Updated Jul 26, 2024

Z80 open-source silicon clone. Goal is to become a silicon proven, pin compatible, open-source replacement for classic Z80.

Verilog 578 22 Updated Jun 10, 2024

Apache Spark Kubernetes Operator

Java 29 7 Updated Jul 26, 2024

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 799 129 Updated Jul 20, 2024

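🌍 Algorithm training for beginners | Four parts: ① interview write-ups from major tech companies ② illustrated LeetCode solutions ③ a thousand open-source e-books ④ a hundred technical mind maps (the project took hundreds of hours; please consider starring it to show support, 🌹 thanks~). Recommended free ChatGPT sites.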

Java 35,022 6,457 Updated Jun 13, 2023

Apache DataFusion Comet Spark Accelerator

Rust 689 128 Updated Jul 26, 2024

KubeBlocks is an open-source control plane software that runs and manages databases, message queues and other stateful applications on K8s.

Go 1,921 160 Updated Jul 26, 2024

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 831 341 Updated Jul 26, 2024

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

10,071 484 Updated May 5, 2024

Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm

Jupyter Notebook 129 34 Updated Jul 24, 2024

Flink Connector for Apache Doris

Java 299 208 Updated Jul 26, 2024

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 764 236 Updated Jul 26, 2024

Apache Celeborn Site

Shell 6 23 Updated Jul 26, 2024

Restate examples

TypeScript 1 Updated Jul 19, 2024

Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD.

Rust 1,330 33 Updated Jul 26, 2024

Go 1 Updated Nov 23, 2023

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,509 459 Updated Jul 26, 2024

A cloud native implementation for Apache RocketMQ 5.0

Java 189 33 Updated Jul 18, 2024

AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.

Java 2,342 124 Updated Jul 26, 2024

Wangle is a framework providing a set of common client/server abstractions for building services in a consistent, modular, and composable way.

C++ 3,045 537 Updated Jul 26, 2024

Lakehouse storage system benchmark

Scala 62 9 Updated Feb 22, 2023

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 2 Updated Jul 4, 2023