-
Notifications
You must be signed in to change notification settings - Fork 732
Insights: modelscope/ms-swift
Overview
Could not load contribution data
Please try again later
31 Pull requests merged by 8 people
-
[grpo] fix offpolicy check
#4852 merged
Jul 7, 2025 -
[grpo]Fix bug when repeatedly call inputs_to_rolloutrequest
#4823 merged
Jul 7, 2025 -
[grpo] deprecated params for 3.6
#4848 merged
Jul 7, 2025 -
[megatron] fix eval_iters -1
#4847 merged
Jul 7, 2025 -
fix bug: grpo train error for deepseek model
#4833 merged
Jul 7, 2025 -
[megatron] Fix the display issue for train_type=lora
#4845 merged
Jul 7, 2025 -
update stream & fix bugs
#4842 merged
Jul 7, 2025 -
[Feature] SwanLab Lark callback
#4830 merged
Jul 6, 2025 -
fix multimodal padding_free prediction_step
#4839 merged
Jul 6, 2025 -
[train] fix multimodal packing & padding_free
#4838 merged
Jul 6, 2025 -
Support gemma3n
#4836 merged
Jul 4, 2025 -
[grpo] fix apply_chat_template
#4827 merged
Jul 4, 2025 -
[rollout] fix request from dict
#4826 merged
Jul 4, 2025 -
[rollout] Fix non-serializable torch.dtype bug in VLLM weight sync
#4825 merged
Jul 4, 2025 -
[rollout] fix external plugins
#4822 merged
Jul 4, 2025 -
[GITHUB WORKFLOW]add close stale issues workflow
#4816 merged
Jul 3, 2025 -
[RM] support margin & update doc
#4817 merged
Jul 3, 2025 -
Support ring attention for llm sft/dpo/grpo (packing/padding_free only).
#4814 merged
Jul 3, 2025 -
Refactor Web-UI
#4687 merged
Jul 3, 2025 -
[train] Update split_dataset_ratio
#4798 merged
Jul 3, 2025 -
[model] support GLM4.1V
#4804 merged
Jul 2, 2025 -
fix template bug for qwen3 reranker
#4795 merged
Jul 2, 2025 -
update custom_dataset_docs
#4792 merged
Jul 2, 2025 -
update resume from checkpoint & update timeout
#4774 merged
Jul 1, 2025 -
Fix media downloading from hf
#4788 merged
Jul 1, 2025 -
[grpo] check eval_dataset length
#4781 merged
Jul 1, 2025 -
[grpo] pass trainer state to reward funcs
#4779 merged
Jul 1, 2025 -
[docs] fix grpo docs
#4777 merged
Jul 1, 2025 -
[grpo] update vllm weight sync & wake up
#4770 merged
Jul 1, 2025 -
update megatron shell
#4773 merged
Jun 30, 2025 -
update wechat
#4769 merged
Jun 30, 2025
6 Pull requests opened by 5 people
-
[Safety]Fix torch load
#4802 opened
Jul 2, 2025 -
[WIP][megatron] support LoRA
#4812 opened
Jul 3, 2025 -
Update template_meta.prefix bug
#4813 opened
Jul 3, 2025 -
Aacedar patch 3
#4832 opened
Jul 4, 2025 -
[grpo] entropy mask
#4850 opened
Jul 7, 2025 -
Fix test bug
#4851 opened
Jul 7, 2025
24 Issues closed by 13 people
-
GRPO训练结果异常
#4800 closed
Jul 7, 2025 -
grpo + gen_rm 流程中的GenRMPlugin是否重复跑了数据
#4846 closed
Jul 7, 2025 -
Padding free feature
#4439 closed
Jul 6, 2025 -
支持Gemma-3n模型
#4759 closed
Jul 5, 2025 -
Zero loss in case lora qwen3-4b-reranker tuning
#4820 closed
Jul 4, 2025 -
grpo can not support deepseek-6.7b-base model
#4785 closed
Jul 4, 2025 -
Missing attribute when generate infer_request in VLM GRPO
#4824 closed
Jul 4, 2025 -
grpo训练qwen,报通信超时错误
#4797 closed
Jul 4, 2025 -
Rollout Stuck after “Core engine process 0 ready.” if use custom plugin
#4807 closed
Jul 4, 2025 -
Load lora finetuned model and further finetune with GRPO
#4821 closed
Jul 4, 2025 -
使用PPO训练完毕保存时报错:zero_gather_16bit_weights_on_model_save
#4815 closed
Jul 4, 2025 -
reward model的训练能否再详细整理一下文档
#4379 closed
Jul 3, 2025 -
我想使用Gemini蒸馏的特定场景带有思考过程的多轮对话数据,微调qwen3-32B,想问一下,损失计算的时候是不是思考过程只计算最后一轮的<think>\n\n</think>的内容呢?
#4809 closed
Jul 2, 2025 -
Qwen3Reranker自定义数据构造bug
#4784 closed
Jul 2, 2025 -
How to Input Frame Sequence Without Original Video? 如何直接输入帧图片序列?
#4776 closed
Jul 2, 2025 -
megatron sft使用packing时报“cannot pickle _io.TextIOWrapper”错误
#4778 closed
Jul 2, 2025 -
vllmengine推理的GuidedDecodingParams参数失效
#4790 closed
Jul 1, 2025 -
为什么我在使用GRPO微调时,我的reward始终为0
#4789 closed
Jul 1, 2025 -
grpo字段解析bug
#4783 closed
Jul 1, 2025 -
GRPO,Why is it that I am incoming in a video, which is recognized as a picture?
#4772 closed
Jul 1, 2025 -
grpo输入格式多处描述不一致
#4782 closed
Jul 1, 2025 -
支持GRPO训练Qwen2-audio-7B-Instruct吗
#4768 closed
Jul 1, 2025 -
grpo_trainer.py TypeError: must be real number, not NoneType
#4751 closed
Jun 30, 2025
29 Issues opened by 25 people
-
qwen2.5vl是否支持4bit的kv_cache量化?
#4849 opened
Jul 7, 2025 -
使用ms-swift sft之后模型的config.json文件变了,导致我不能直接使用vllm部署模型
#4844 opened
Jul 7, 2025 -
ALL_PARALLEL_STYLES argument of type 'NoneType' is not iterable
#4843 opened
Jul 7, 2025 -
grpo + gen_rm padding index error
#4841 opened
Jul 7, 2025 -
DDP环境下FileNotFoundError问题
#4840 opened
Jul 6, 2025 -
Need to update requirements.txt
#4837 opened
Jul 5, 2025 -
Trained Qwen 3 model seems to be broken.
#4835 opened
Jul 4, 2025 -
Feature Request: RTX 5090 Support with ms-swift docker image with CUDA 12.8
#4834 opened
Jul 4, 2025 -
[有人碰到过吗?]qwen2.5vl微调agent出现坐标点偏移问题
#4831 opened
Jul 4, 2025 -
SwanLab Notification Integration
#4829 opened
Jul 4, 2025 -
awq量化qwen2.5-vl-7b报错
#4828 opened
Jul 4, 2025 -
Qwen2-VL-2B 预训练到后期会出现梯度爆炸,其他VLM不会出现
#4819 opened
Jul 3, 2025 -
Ovis-2B 预训练报错
#4818 opened
Jul 3, 2025 -
error when finetuning qwen3 in modelscope notebook.
#4811 opened
Jul 3, 2025 -
带有思考过程的多轮对话数据,微调qwen3-32B,想问一下,损失计算的时候是不是思考过程只计算最后一轮的<think>\n\n</think>的内容呢?
#4810 opened
Jul 2, 2025 -
grpo微调deepseek_coder模型填充信息有误
#4808 opened
Jul 2, 2025 -
Reranker training requires A LOT of VRAM
#4805 opened
Jul 2, 2025 -
设置packing_cache后,第二次训练没有从cache读取数据,又重新packing了。
#4803 opened
Jul 2, 2025 -
预训练/微调很慢
#4801 opened
Jul 2, 2025 -
sft A3B模型一直卡在这里不动了
#4799 opened
Jul 2, 2025 -
Logged accuracy doesn't change when training a reranker
#4796 opened
Jul 2, 2025 -
qwen2.5-vl grounding GRPO
#4794 opened
Jul 2, 2025 -
llava-next-110b seq_cls 微调出错,AttributeError: 'Identity' object has no attribute 'weight'
#4793 opened
Jul 2, 2025 -
关于SFT微调语料
#4791 opened
Jul 2, 2025 -
test kimi vl thinking meet error!
#4780 opened
Jul 1, 2025 -
对DeepSeek-VL2进行GRPO训练
#4775 opened
Jun 30, 2025 -
GRPO training: skip_special_tokens config
#4771 opened
Jun 30, 2025 -
支持keye-vl-8b模型
#4766 opened
Jun 30, 2025
34 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
请教下GRPO训练时出现模型多次异常触碰到Max_length的问题
#4758 commented on
Jun 30, 2025 • 0 new comments -
shape mismatch internvl3
#4636 commented on
Jun 30, 2025 • 0 new comments -
GRPO npu多机多卡显存问题
#4748 commented on
Jul 1, 2025 • 0 new comments -
Fatal Python error: none_dealloc: deallocating None
#4353 commented on
Jul 1, 2025 • 0 new comments -
关于resume_from_checkpoint加载deepspeed
#4765 commented on
Jul 2, 2025 • 0 new comments -
agent推理时是否还不支持实际的工具调用,参考demo_agent.py
#4764 commented on
Jul 2, 2025 • 0 new comments -
蒸馏Qwen2.5-Omni模型报错:IndexError: max(): Expected reduction dim 1 to have non-zero size.
#4755 commented on
Jul 2, 2025 • 0 new comments -
wandb,开了海外代理还一直报错(网络连接超时,network error (connectiontimeout))
#4152 commented on
Jul 2, 2025 • 0 new comments -
GPTQ量化模型GRPO强化微调报错:AttributeError: 'GPTQLoraLinear' object has no attribute 'get_delta_weight'
#3949 commented on
Jul 2, 2025 • 0 new comments -
请问支持 webdataset 作为 qwen2.5VL 的输入么?
#3214 commented on
Jul 2, 2025 • 0 new comments -
使用lora的方式单机多卡微调最新的Qwen3_embedding模型会报错
#4585 commented on
Jul 2, 2025 • 0 new comments -
a question for rl
#4735 commented on
Jul 2, 2025 • 0 new comments -
请问是否支持自定义dataloader,想要实现两种不同格式dataset的训练,每个batch仅有一类数据
#4750 commented on
Jul 3, 2025 • 0 new comments -
关于序列并行训练
#2837 commented on
Jul 3, 2025 • 0 new comments -
SFT利用上一次加载数据的缓存
#3762 commented on
Jul 4, 2025 • 0 new comments -
评测时,长度最大只能输出2048,不知道为啥。。。。
#3761 commented on
Jul 4, 2025 • 0 new comments -
评测时,生成参数改了没效果,评测的配置文件还是显示默认参数
#3758 commented on
Jul 4, 2025 • 0 new comments -
自定义评测集报错
#3757 commented on
Jul 4, 2025 • 0 new comments -
ValueError: Cannot use chat template functions because tokenizer.chat_template
#3755 commented on
Jul 4, 2025 • 0 new comments -
Support SGLang in Swift
#3750 commented on
Jul 4, 2025 • 0 new comments -
关于rejected_response的引入
#3748 commented on
Jul 4, 2025 • 0 new comments -
qwen2.5-vl的awq量化问题
#4762 commented on
Jul 4, 2025 • 0 new comments -
支持GME微调么
#3019 commented on
Jul 4, 2025 • 0 new comments -
MaxLengthError
#4754 commented on
Jul 4, 2025 • 0 new comments -
使用xtuner作为序列并行的实现方式时没有调用pad_and_split_inputs对输入进行pad和split
#4760 commented on
Jul 4, 2025 • 0 new comments -
使用qwen32b-vl训练grounding发现连基本的格式遵循都做不到
#3746 commented on
Jul 5, 2025 • 0 new comments -
more logs in wandb
#3737 commented on
Jul 5, 2025 • 0 new comments -
deepseek-r蒸馏模型funcation_calling训练没有效果
#3733 commented on
Jul 5, 2025 • 0 new comments -
async_infer无法实现异步调用的疑问
#3717 commented on
Jul 6, 2025 • 0 new comments -
ModuleNotFoundError: No module named 'torch.distributed.device_mesh'
#4092 commented on
Jul 6, 2025 • 0 new comments -
开启断点训练后,为什么剩余时间越来越大了,epoch 和max_steps也和断点训练前接不上呢
#3783 commented on
Jul 7, 2025 • 0 new comments -
Use the feature of resume_from_checkpoint when using python code to run finetuning.
#3774 commented on
Jul 7, 2025 • 0 new comments -
Update dataset_info.json
#3723 commented on
Jul 6, 2025 • 0 new comments -
support ernie_vl
#4763 commented on
Jul 1, 2025 • 0 new comments