-
Notifications
You must be signed in to change notification settings - Fork 293
Issues: vllm-project/aibrix
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[RFC]: Cache and Router refactoring for concurrent performance, concurrent safety and stateful routing.
area/gateway
area/heterogeneous
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#868
opened Mar 14, 2025 by
zhangjyr
[Dist KV] vllm pods which do not have kvcache pods running in the same node crashes.
area/kv-cache
#863
opened Mar 14, 2025 by
gangmuk
Automate local disk management and ai runtime model management
area/runtime
kind/feature
Categorizes issue or PR as related to a new feature.
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#854
opened Mar 12, 2025 by
Jeffwan
Pod scale success,aibrix-controller-manager failed to parse metrics
area/autoscaling
kind/support
Categorizes issue as a support question.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#852
opened Mar 12, 2025 by
ying2025
Provide production grade overlay manifests
area/installation
kind/enhancement
New feature or request
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#847
opened Mar 11, 2025 by
Jeffwan
[RFC]: Make API Gateway interface OpenAI compatible
area/gateway
kind/enhancement
New feature or request
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#846
opened Mar 11, 2025 by
Jeffwan
[Observation] Improve AIBrix control plane monitoring
area/stability
kind/feature
Categorizes issue or PR as related to a new feature.
priority/important-longterm
Important over the long term, but may not be staffed and/or may need multiple releases to complete.
#845
opened Mar 11, 2025 by
Jeffwan
[Docs] Provide AIBrix upgrade guidance
area/installation
kind/documentation
Improvements or additions to documentation
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#844
opened Mar 11, 2025 by
Jeffwan
Ask for testing suggestions
kind/support
Categorizes issue as a support question.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#842
opened Mar 10, 2025 by
ying2025
Some prompts with special character fail the benchmark script
area/benchmark
kind/bug
Something isn't working
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#832
opened Mar 9, 2025 by
Jeffwan
Making prefix-cache-and-load-aware routing more general
area/gateway
area/performance
kind/enhancement
New feature or request
kind/feature
Categorizes issue or PR as related to a new feature.
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
#814
opened Mar 7, 2025 by
gangmuk
Prefix sharing workload generation
area/benchmark
area/performance
priority/important-soon
Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#813
opened Mar 7, 2025 by
gangmuk
ModelAdapter seems to be working abnormally
area/lora
kind/support
Categorizes issue as a support question.
triage/needs-information
Indicates an issue needs more information in order to work on it.
#801
opened Mar 5, 2025 by
ying2025
Making max-tokens configurable in the benchmark client.
area/benchmark
#797
opened Mar 5, 2025 by
gangmuk
Recording request routing(target-pod) in the benchmark client
area/benchmark
#796
opened Mar 5, 2025 by
gangmuk
Previous Next
ProTip!
no:milestone will show everything without a milestone.