Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[TPU] Support collective communications in XLA devices ready tpu Related to Google TPUs
#6813 opened Jul 26, 2024 by WoosukKwon Loading…
[Misc] Support TPU in initialize_ray_cluster ready tpu Related to Google TPUs
#6812 opened Jul 26, 2024 by WoosukKwon Loading…
[Core] Get KV from Block, add KV to Block
#6808 opened Jul 26, 2024 by KrishnaM251 Loading…
Fix ReplicatedLinear weight loading ready
#6793 opened Jul 25, 2024 by qingquansong Loading…
[Core] Use array to speedup padding ready
#6779 opened Jul 25, 2024 by peng1999 Loading…
[wip] spmd delta optimization ready
#6771 opened Jul 25, 2024 by rkooo567 Loading…
(Dont Merge) Add rwkv6
#6749 opened Jul 24, 2024 by uniartisan Draft
[Model][Jamba] Mamba cache single buffer ready
#6739 opened Jul 24, 2024 by mzusman Loading…
[CORE] support for *.pt type prompt adapters
#6709 opened Jul 23, 2024 by prashantgupta24 Loading…
ProTip! Filter pull requests by the default branch with base:main.