-
-
Notifications
You must be signed in to change notification settings - Fork 6.2k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[DO NOT MERGE] [V1] Implement Scheduler Interface + SimpleScheduler
v1
#14731
opened Mar 13, 2025 by
WoosukKwon
•
Draft
[Neuron] flatten test parameterization for neuron attention kernels
ci/build
#14712
opened Mar 12, 2025 by
liangfu
Loading…
Re-enable the AMD Entrypoints Test
ci/build
#14711
opened Mar 12, 2025 by
Alexei-V-Ivanov-AMD
Loading…
[Misc] Better RayExecutor and multiprocessing compatibility
v1
#14705
opened Mar 12, 2025 by
comaniac
Loading…
[V1][Feature] Enable Speculative Decoding with Structured Outputs
ci/build
v1
#14702
opened Mar 12, 2025 by
benchislett
Loading…
[V1][Metrics] Updated list of deprecated metrics in v0.8
documentation
Improvements or additions to documentation
[V1] Refactor Structured Output for multiple backends
structured-output
v1
#14694
opened Mar 12, 2025 by
russellb
Loading…
setup.py: drop assumption about local ONLY add when PR is ready to merge/full CI is needed
main
branch
ci/build
ready
#14692
opened Mar 12, 2025 by
russellb
Loading…
[VLM] Add video inputs support to InternVL2.5/InternVideo2.5 models
documentation
Improvements or additions to documentation
[Kernels] LoRA - Retire SGMV and BGMV Kernels
#14685
opened Mar 12, 2025 by
varun-sundar-rabindranath
Loading…
[Misc] Optimize Qwen2-VL's M-RoPE pos calc using numba
multi-modality
Related to multi-modality (#4194)
[Bugfix][IPEX]
use_prepack=False
for MoE when it's not supported
#14681
opened Mar 12, 2025 by
gau-nernst
Loading…
[Core] Add a level 3 sleep/wake_up that offloads tensors to disk
frontend
v1
#14678
opened Mar 12, 2025 by
manoelmarques
•
Draft
fix "Total generated tokens:" is 0 if using --backend tgi and --endpo…
#14673
opened Mar 12, 2025 by
sywangyi
Loading…
[VLM] Support pan-and-scan for Gemma3 multi-modal processor
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
[Bugfix][Kernel][CPU] Fix num_tokens in CPU rotary embedding kernel
ready
ONLY add when PR is ready to merge/full CI is needed
#14667
opened Mar 12, 2025 by
gau-nernst
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.