Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[HELP] 求好心人帮忙量化 DeepSeek-R1-Distill-Qwen-32Bint8 #3200

Open
SolomonLeon opened this issue Mar 1, 2025 · 1 comment
Open

Comments

@SolomonLeon
Copy link

我的设备配置是 4 x V100 ,虽然理论上四卡显存足够量化,但 LMDeploy 目前不支持多卡量化 ( #3145 )。也尝试过 SGLang 但其并不支持 V100 显卡。

请问有好心人能帮忙使用 LMDeploy 量化 DeepSeek-R1-Distill-Qwen-32Bint8 并上传到 Huggingface 吗?非常感谢!

@lvhan028
Copy link
Collaborator

lvhan028 commented Mar 3, 2025

你是指 w8a8 吗

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants