-
Notifications
You must be signed in to change notification settings - Fork 5.4k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
H200全量训练DeepSeek-R1-Distill-Llama-70B,采用zero3(batch_size=1)时溢出。采用zero3+offload(优化器、参数)时,显存占用少(141G显存只占用27G),cpu占用高
bug
Something isn't working
pending
This problem is yet to be addressed
#7282
opened Mar 13, 2025 by
github-eliviate
1 task done
Failed to load relative path images for multimodality models
bug
Something isn't working
pending
This problem is yet to be addressed
#7280
opened Mar 13, 2025 by
shaowei-su
1 task done
unsloth偶尔出现loss为0
bug
Something isn't working
pending
This problem is yet to be addressed
#7268
opened Mar 12, 2025 by
EntropyYue
1 task done
多模态数据集多的时候,数据加载失败
bug
Something isn't working
pending
This problem is yet to be addressed
#7266
opened Mar 12, 2025 by
zhaop-l
1 task done
streaming 训练卡在第一个step
bug
Something isn't working
pending
This problem is yet to be addressed
#7261
opened Mar 12, 2025 by
zfr00
1 task done
GPU Imbalanced Loading
bug
Something isn't working
pending
This problem is yet to be addressed
#7250
opened Mar 11, 2025 by
WillDreamer
1 task done
微调 DeepSeek-R1 蒸馏模型,在 Chat 加载秩表现出色,但在导出部署到 Ollama 后问答准确率大幅下降
bug
Something isn't working
pending
This problem is yet to be addressed
#7238
opened Mar 11, 2025 by
Nehcknarf
1 task done
单机多卡(4 x 3090)Linux 系统 使用默认的llamafactory-cli train /homeqwen3b_lora_pretrain.yaml 报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7233
opened Mar 10, 2025 by
Johnnythefool
1 task done
Error when training
bug
Something isn't working
pending
This problem is yet to be addressed
#7232
opened Mar 10, 2025 by
catn1pdeal3r
1 task done
安装报错:Failed to build Something isn't working
pending
This problem is yet to be addressed
autoawq==0.2.8
bug
#7225
opened Mar 9, 2025 by
thinkingInWorldByNull
1 task done
希望提供对phi4-mini:3.8b的支持。
enhancement
New feature or request
pending
This problem is yet to be addressed
#7224
opened Mar 9, 2025 by
liuaifu
1 task done
raise RuntimeError("Cannot find valid samples, check Something isn't working
pending
This problem is yet to be addressed
data/README.md
for the data format.") when wikipedia_en
bug
#7220
opened Mar 8, 2025 by
new-Sunset-shimmer
1 task done
vllm_infer对qwen2.5vl推理很慢,10000个图文对卡住很久
bug
Something isn't working
pending
This problem is yet to be addressed
#7216
opened Mar 8, 2025 by
2019211753
1 task done
TypeError: unhashable type: 'list'
bug
Something isn't working
pending
This problem is yet to be addressed
#7214
opened Mar 7, 2025 by
CaiJichang212
1 task done
Reward Model 推理
bug
Something isn't working
pending
This problem is yet to be addressed
#7212
opened Mar 7, 2025 by
SFTJBD
1 task done
训练deepseek蒸馏的7B时,loss在每个epoch开始时翻倍
bug
Something isn't working
pending
This problem is yet to be addressed
#7208
opened Mar 7, 2025 by
Y56611
1 task done
同一个数据集和模型,相同参数设置,训练两次,0.5epoch时会因为模型见到数据顺序不同的原因导致很大效果差异吗?
invalid
This doesn't seem right
#7200
opened Mar 7, 2025 by
tiphaineeee
1 task done
when will you release the new version?
bug
Something isn't working
pending
This problem is yet to be addressed
#7199
opened Mar 7, 2025 by
ganisback
1 task done
deepseek r1 微调后我应该怎么加载lora参数推理呢
bug
Something isn't working
pending
This problem is yet to be addressed
#7185
opened Mar 6, 2025 by
joyyyhuang
1 task done
使用unsloth加速报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7177
opened Mar 6, 2025 by
GEK1
1 task done
MiniCPM-o-2_6的sft、lora训练报错:Some weights of the model checkpoint at /app123/model/MiniCPM-o-2_6 were not used when initializing MiniCPMO:
bug
Something isn't working
pending
This problem is yet to be addressed
#7169
opened Mar 5, 2025 by
winni0
1 task done
deepseek-moe-16B预训练问题
bug
Something isn't working
pending
This problem is yet to be addressed
#7165
opened Mar 5, 2025 by
zyp-byte
1 task done
跑open_r1_math数据集,qwen7b-instruct每次跑到53个step报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7163
opened Mar 5, 2025 by
fsq77
1 task done
Qwen/Qwen2.5-VL-7B-Instruct PPO 训练报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7159
opened Mar 5, 2025 by
ulovecode
1 task done
Previous Next
ProTip!
Follow long discussions with comments:>50.