Pull requests: PaddlePaddle/PaddleFormers

Pull requests list

fix moe_subbatch_token_num conflict contributor
#2719 opened Oct 11, 2025 by zjjlivein
2 tasks
fix dpo training parallel contributor
#2717 opened Oct 10, 2025 by llbdyiu66
subbatch cast logits to fp32 in dpo_loss
#2716 opened Oct 10, 2025 by cheng221
2 tasks
【gpt-oss】cherry-pick
#2713 opened Oct 9, 2025 by xiaoguoguo626807
2 tasks
【gpt-oss】Add Fp4 to bf16 test
#2712 opened Oct 9, 2025 by xiaoguoguo626807
2 tasks
GLM4.5 support sp + moe aux loss contributor
#2682 opened Sep 24, 2025 by WYB27
[DSv3]: Add Tokenizer Config for DSv3
#2650 opened Sep 22, 2025 by hushenwei2000
Glm4Moe: fix attn_mask && fused_loss contributor
#2648 opened Sep 20, 2025 by WYB27
Update CODE_OF_CONDUCT.md contributor
#2636 opened Sep 18, 2025 by Jagdish2810 Draft
2 tasks done
[dsv3]Move dsv3 model from paddlenlp-dsv3-sft
#2593 opened Sep 11, 2025 by Difers
1 of 7 tasks
【FlexCP】add Flexcp for trainer
#2541 opened Sep 4, 2025 by xiaoguoguo626807
2 tasks
feat(dsv3):Runnable N1C8 configs
#2525 opened Sep 1, 2025 by hushenwei2000
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2524 opened Aug 31, 2025 by chen2016013
2 tasks
feat(dsv3):Runnable N1C8 configs
#2523 opened Aug 31, 2025 by chen2016013
2 tasks
add moe
#2510 opened Aug 28, 2025 by a31413510
fix bug support download ernie model contributor
#2509 opened Aug 28, 2025 by fjjF77
fix typos contributor
#2500 opened Aug 28, 2025 by co63oc
2 tasks
feat(dsv3): add dsv3 fast pretrain into paddleformers
#2496 opened Aug 27, 2025 by chen2016013
2 tasks
Merge dsv3 tainer part
#2487 opened Aug 27, 2025 by hushenwei2000 Draft
change deepseekv2 model
#2486 opened Aug 26, 2025 by chen2016013
2 tasks