Pull requests: sgl-project/sglang

Pull requests list

Support page size > 1
#4356 opened Mar 12, 2025 by merrymercy
2 tasks
Implement triton per token quantization of fp8
#4349 opened Mar 12, 2025 by zcnrex
6 tasks
Add Logits tests for Qwen2.5VL
#4347 opened Mar 12, 2025 by ravi03071991 Draft
6 tasks
[k8s] Clarified the usage of shared memory.
#4341 opened Mar 12, 2025 by jsuchome
2 of 6 tasks
Fix Llama3.3 tool call support (label: high priority)
#4320 opened Mar 11, 2025 by CatherineSue
3 of 6 tasks
[Feature] Support "strict" in function calling
#4310 opened Mar 11, 2025 by DarkSharpness
3 of 6 tasks
Add make_layers for deepseek_v2
#4307 opened Mar 11, 2025 by CharlesRiggins
6 tasks
Add metrics for tokenization/detokenization/wait in queue latency
#4280 opened Mar 11, 2025 by hebiao064
1 of 6 tasks
[Feature] Support Tensor Parallelism and Weight Slicing for Lora
#4274 opened Mar 10, 2025 by aoshen524
3 of 4 tasks
Fix the output of hidden states after HTTP requests
#4269 opened Mar 10, 2025 by Qiaolin-Yu
1 of 6 tasks
remove moe_align vllm dep
#4249 opened Mar 10, 2025 by sleepcoo
[Feature] Support EAGLE 3 (label: high priority)
#4247 opened Mar 10, 2025 by chromecast56
6 tasks
Integrate DeepEP into SGLang (label: high priority)
#4232 opened Mar 9, 2025 by liz-badada Draft
1 of 6 tasks
Fix MoE quant args
#4190 opened Mar 8, 2025 by Edenzzzz
6 tasks
[ROCm/Draft/No-Merge]: Flex Attention Enablement (labels: amd, collaboration, documentation)
#4172 opened Mar 7, 2025 by HaiShaw Draft
6 tasks