We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DeepSeek-R1-Distill-Qwen-32B
int8
我的设备配置是 4 x V100 ,虽然理论上四卡显存足够量化,但 LMDeploy 目前不支持多卡量化 ( #3145 )。也尝试过 SGLang 但其并不支持 V100 显卡。
SGLang
请问有好心人能帮忙使用 LMDeploy 量化 DeepSeek-R1-Distill-Qwen-32B 到 int8 并上传到 Huggingface 吗?非常感谢!
The text was updated successfully, but these errors were encountered:
你是指 w8a8 吗
Sorry, something went wrong.
No branches or pull requests
我的设备配置是 4 x V100 ,虽然理论上四卡显存足够量化,但 LMDeploy 目前不支持多卡量化 ( #3145 )。也尝试过
SGLang
但其并不支持 V100 显卡。请问有好心人能帮忙使用 LMDeploy 量化
DeepSeek-R1-Distill-Qwen-32B
到int8
并上传到 Huggingface 吗?非常感谢!The text was updated successfully, but these errors were encountered: