Issues: hiyouga/LLaMA-Factory
Issues list
Error when running full-parameter fine-tuning of qwen2-1.5b on a single GPU
pending
This problem is yet to be addressed
#4978
opened Jul 26, 2024 by
fgo65654
1 task done
After fine-tuning, does a GLM model deployed with vllm support function calling?
pending
This problem is yet to be addressed
#4977
opened Jul 26, 2024 by
RyanOvO
Can pytorch/pytorch be used as the PyTorch Docker image?
pending
This problem is yet to be addressed
#4974
opened Jul 26, 2024 by
xiaoyaolangzhi
1 task done
Why do LoRA fine-tuning of all linear layers and full fine-tuning of qwen2 0.5b take roughly the same time?
pending
This problem is yet to be addressed
#4966
opened Jul 25, 2024 by
Yun-Peng-Wang
1 task done
API service deployed on Windows runs inference on the CPU instead of the GPU?
pending
This problem is yet to be addressed
#4965
opened Jul 25, 2024 by
dudneytarash
1 task done
GPU memory usage keeps growing over long runs, leading to out-of-memory
pending
This problem is yet to be addressed
#4964
opened Jul 25, 2024 by
dravinbox
1 task done
Quantization hangs; many people have reported the same problem in Issues, but none of the reports has a solution
pending
This problem is yet to be addressed
#4963
opened Jul 25, 2024 by
ConniePK
1 task done
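Full fine-tuning of an 8B llama-architecture model with 128k context on Tencent Cloud: I am confident the JSON parsing error does not come from the dataset or the model config, yet I still get json.decoder.JSONDecodeError: Expecting value: line 1 column 2 (char 1).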
pending
This problem is yet to be addressed
#4952
opened Jul 24, 2024 by
1ring2rta
1 task done
How to fine tune 405B
pending
This problem is yet to be addressed
#4940
opened Jul 23, 2024 by
etemiz
1 task done
Cannot find any model weights when activating VLLM-based inference backend
pending
This problem is yet to be addressed
#4931
opened Jul 23, 2024 by
Jerry-jwz
1 task done
Already applied the fix for the deepseek-v2-lite float32 issue; it still fails, with no clear error message
pending
This problem is yet to be addressed
#4924
opened Jul 22, 2024 by
yiyepiaoling0715
1 task done
After converting InternLM2 7b with LLamaFactory and importing it into ollama, I get an error: tensor 'token_embd.weight' has wrong shape.
pending
This problem is yet to be addressed
#4919
opened Jul 22, 2024 by
Sakura4036
1 task done
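How to assess multi-node multi-GPU training speed: one node with 8 GPUs and two nodes with 16 GPUs report the same training speed. How should parallelism be configured?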
pending
This problem is yet to be addressed
#4916
opened Jul 21, 2024 by
Tengfei9228
1 task done
Continued pre-training with all parameters trainable: dataset loading fails with an out-of-memory error, which is not expected
pending
This problem is yet to be addressed
#4915
opened Jul 21, 2024 by
Adam-fei
1 task done
Model gives incoherent answers after merging LoRA weights, clearly different from before merging (not Issue #2505)
good first issue
Good for newcomers
pending
This problem is yet to be addressed
#4913
opened Jul 21, 2024 by
CloudyDory
1 task done
Question about model sharding in the latest code
pending
This problem is yet to be addressed
#4912
opened Jul 20, 2024 by
Jayce1kk
1 task done
Freeze fine-tuning emits the warning: None of the inputs have requires_grad=True. Gradients will be None
pending
This problem is yet to be addressed
#4905
opened Jul 20, 2024 by
zhoujinyi66
1 task done
Reward model training for PPO hangs
pending
This problem is yet to be addressed
#4904
opened Jul 20, 2024 by
bingkunyao
1 task done
Problems with full-parameter fine-tuning of qwen2-7b-instruct using DeepSpeed in llama-factory
pending
This problem is yet to be addressed
#4898
opened Jul 19, 2024 by
Micro647
1 task done
Problem encountered with multi-GPU inference in vllm
pending
This problem is yet to be addressed
#4893
opened Jul 19, 2024 by
ConniePK
1 task done
The tool_call_id field is missing from the toolcall response message in the openai API
pending
This problem is yet to be addressed
#4881
opened Jul 18, 2024 by
xiaojun777-huang
1 task done
After training an RM on qwen72B, prediction raises a Memory error
pending
This problem is yet to be addressed
#4863
opened Jul 17, 2024 by
yaopanyaopan
Implementation of Flash Attention 3
pending
This problem is yet to be addressed
#4854
opened Jul 17, 2024 by
GitIgnoreMaybe
kto_pair not available error
pending
This problem is yet to be addressed
#4839
opened Jul 15, 2024 by
JianbangZ
1 task done