Issues: mlc-ai/mlc-llm
[Question] Does it support multi-GPU (Intel Arc A770)? · question · #3175 · opened Mar 14, 2025 by savvadesogle
A significant difference in answer quality between the officially provided model and a model obtained by converting weights following the official documentation · question · #3174 · opened Mar 14, 2025 by FFchopon
[Bug] The Medusa model has some differences from the current official implementation · bug · #3173 · opened Mar 14, 2025 by Songyanfei
[Question] How to get the last-layer hidden state of a transformer model when converting the model to MLC format? · question · #3170 · opened Mar 11, 2025 by Jianshu1only
[Bug] App closes itself when initializing a model, DeepSeek-R1-Distill-Qwen-1.5B-q4f16_1-MLC · bug · #3169 · opened Mar 11, 2025 by GTMssj
[Question] VLM inference on Android · question · #3167 · opened Mar 10, 2025 by amirvenus
[Bug] <iframe src="https://reach-vb-smollm2-structured-generation.static.hf.space" frameborder="0" width="850" height="450" ></iframe> · bug · #3163 · opened Mar 10, 2025 by Jose1370
[Bug] /mlc-llm/3rdparty/tvm/src/runtime/relax_vm/attn_utils.h:712:19: error: no member named 'cl' in namespace 'tvm::runtime' · bug · #3157 · opened Mar 8, 2025 by PineJuneYang
[Bug] Unable to convert weight: "PagedKVCache.attention_with_fused_qkv() missing 1 required positional argument: 'sm_scale'" · bug · #3149 · opened Mar 2, 2025 by Kisaragi-ng
[Question] mlc-llm server cannot return correct logprobs · question · #3142 · opened Feb 19, 2025 by kunxiongzhu
[Question] How to use function calling · question · #3141 · opened Feb 19, 2025 by tebie6
[Model Request] GLiNER for entity recognition · new-models · #3139 · opened Feb 17, 2025 by manasaniprashanth
[Bug] Gemma 2 models fail due to errors in tokenizer · bug · #3138 · opened Feb 17, 2025 by julioasotodv
[Question] I followed the instructions to build for Orange Pi, but they seem outdated (ChatModule) · question · #3134 · opened Feb 16, 2025 by LivingLinux
[Question] While waiting for the model's response on an Android phone, performing other operations may make the phone unresponsive or cause it to reboot · question · #3131 · opened Feb 13, 2025 by yangshgetui
[Bug] mlc-llm server cannot return correct logprobs · bug · #3130 · opened Feb 13, 2025 by kunxiongzhu
[Bug] Mistral-Nemo-Instruct-2407 produces confused results · bug · #3120 · opened Feb 7, 2025 by fierceX
Very slow time to first token on ROCm · question · #3119 · opened Feb 5, 2025 by Jyers
[Bug] Android app does not take input; 'user role is not defined' error · bug · #3117 · opened Feb 4, 2025 by afsara-ben