A high-throughput and memory-efficient inference and serving engine for LLMs
-
Updated
Jul 25, 2024 - Python
A high-throughput and memory-efficient inference and serving engine for LLMs
An orchestration platform for the development, production, and observation of data assets.
Materials for the useR!2024 "Deploy and Monitor ML Pipelines with Open Source and Free Applications" workshop
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
🦋 A personal research and development (R&D) lab that facilitates the sharing of knowledge.
Machine Learning Pipelines for Kubeflow
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.
AI Observability & Evaluation
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
🔮 SuperDuper: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
Open source AI platform for rapid development of advanced AI and AGI pipelines.
Add a description, image, and links to the mlops topic page so that developers can more easily learn about it.
To associate your repository with the mlops topic, visit your repo's landing page and select "manage topics."