Skip to content

Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q3 2024
#5805 opened Jun 25, 2024 by simon-mo
Open 21
Virtual Office Hours: July 9 and July 25
#5937 opened Jun 27, 2024 by mgoin
Open 2
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[RFC]: Performance Roadmap RFC
#6801 opened Jul 25, 2024 by simon-mo
5 tasks
[Bug]: Engine iteration timed out. This should never happen! bug Something isn't working
#6790 opened Jul 25, 2024 by Kelcin2
[Performance]: Slow TTFT(?) for Qwen2-72B-GPTQ-Int4 on H100 *2 performance Performance-related issues
#6781 opened Jul 25, 2024 by cyc00518
[Bug]: N-gram spec_decode in flash_attention bug bug Something isn't working
#6780 opened Jul 25, 2024 by chenglu66
[Performance]: Medusa SD have poor performance than baseline performance Performance-related issues
#6777 opened Jul 25, 2024 by cwlseu
[Bug]: --max-model-len configuration robustness bug Something isn't working
#6774 opened Jul 25, 2024 by gargnipungarg
[Bug]: Possible data race when running Llama 405b fp8 bug Something isn't working
#6767 opened Jul 25, 2024 by tlrmchlsmth
[Bug]: premature stopping or cut off output bug Something isn't working
#6764 opened Jul 25, 2024 by ndao600
[Doc]: ROCm installation instructions do not work documentation Improvements or additions to documentation rocm
#6762 opened Jul 24, 2024 by rlrs
[Bug]: Unable to run meta-llama/Llama-Guard-3-8B-INT8 bug Something isn't working
#6756 opened Jul 24, 2024 by xfalcox
ProTip! What’s not been updated in a month: updated:<2024-06-25.