Stars
A curated list of awesome research papers, projects, code, datasets, workshops, etc. related to virtual try-on.
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Whisper realtime streaming for long speech-to-text transcription and translation
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
An implementation of the paper "Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning" (CVPR19)
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
Restate is the platform for building resilient applications that tolerate all infrastructure faults without the need for a PhD.
Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.
Enjoy the magic of Diffusion models!
(arXiv 2021) NeRF--: Neural Radiance Fields Without Known Camera Parameters
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
[CVPR 2023] STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.
Fast lexical search library implementing BM25 in Python using SciPy (on average 2x faster than Elasticsearch in a single-threaded setting)
High performance self-hosted photo and video management solution.
Video+code lecture on building nanoGPT from scratch
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
MARS5 speech model (TTS) from CAMB.AI