Pulse · PaddlePaddle/PaddleNLP · GitHub

June 29, 2025 – July 6, 2025

Overview

41 Active pull requests

12 Active issues

28 Pull requests merged by 12 people

[RL] fix bug in qwen fuse qkv
#10821 merged Jul 6, 2025
Move setup_fp8.py to FleetY
#10820 merged Jul 6, 2025
[RL] disable aistudio download and fix qwen bug
#10819 merged Jul 5, 2025
Move FP8 ops to FleetY branch
#10803 merged Jul 5, 2025
support dispatch both bf16 & fp8
#10817 merged Jul 4, 2025
[AutoParallel] close dynamic sharding CI test
#10791 merged Jul 4, 2025
[LLM] fix_qwen3_moe
#10801 merged Jul 4, 2025
Set FP8Linear weight update by inplace add
#10813 merged Jul 4, 2025
Fix nccl ut
#10812 merged Jul 4, 2025
[tests] migrate to unittest.assertEqual
#10670 merged Jul 4, 2025
optimize scale transpose
#10810 merged Jul 4, 2025
optimizer_dual_pp_post_node_memory
#10806 merged Jul 3, 2025
lock gemm sm 112
#10805 merged Jul 3, 2025
Add fp8 opt config for dualpipe
#10804 merged Jul 3, 2025
Add expert num 32 support for FP8 ops and fix ut
#10802 merged Jul 3, 2025
Refine fp8 node
#10798 merged Jul 3, 2025
Support expert number 16 in FP8 operators
#10799 merged Jul 3, 2025
Adapt dispatch_quant_node to fp8 fusion moe node
#10794 merged Jul 2, 2025
[Auto-parallel] Fix sharding all_gather overlap in auto_dy
#10782 merged Jul 2, 2025
[Auto-parallel] Add llama13b benchmark fast_ln
#10785 merged Jul 2, 2025
[Auto-parallel] Add llama7b benchmark fast_ln
#10786 merged Jul 2, 2025
optimize dual pp memory
#10787 merged Jul 2, 2025
Reset EMA state dict when distributed strategy not matched
#10790 merged Jul 1, 2025
[CherryPick]Adapt new nccl version (#10768)
#10784 merged Jul 1, 2025
[Auto-Parallel] optimize the performance of GPT-3
#10780 merged Jul 1, 2025
adapt new nccl version
#10768 merged Jul 1, 2025
[Auto Parallel] Add enable_linear_fused_grad_add in qwen benchmark
#10779 merged Jun 30, 2025
[Auto Parallel] Adapt mp_async_allreduce optimize in auto parallel
#10770 merged Jun 30, 2025

13 Pull requests opened by 9 people

[AutoParallel] init sync param
#10783 opened Jul 1, 2025
Fix the _save function so that it can save the optimizer parameters.
#10789 opened Jul 1, 2025
[AutoParallel] Open dynamic sharding CI test
#10793 opened Jul 2, 2025
[AutoParallel] Remove dynamic pipeline import adaption
#10795 opened Jul 2, 2025
N1C8 dsv3 config
#10796 opened Jul 2, 2025
[Auto Parallel] change fused_layers.py to support fused_linear_grad add/sync_mp_allreduce/sp_async_reduce_scatter in same time
#10797 opened Jul 2, 2025
[Auto-parallel] temp adjust the tensor_fusion config
#10800 opened Jul 3, 2025
support quant return transpose only
#10811 opened Jul 4, 2025
dispatch support fp8
#10814 opened Jul 4, 2025
Add recompute for post_norm and moe_gate
#10815 opened Jul 4, 2025
[Auto-parallel] Fix sp_async and mp_async used together
#10816 opened Jul 4, 2025
Fix a series of errors in qwen3moe (justin-0704)
#10818 opened Jul 4, 2025
Add tokens_zip_unique_add_subbatch and merge_subbatch_cast ops
#10822 opened Jul 6, 2025

9 Issues closed by 3 people

[Question]: predict_pointwise in text_matching gives different results on every run
#10459 closed Jul 6, 2025
[Question]: Download address for the UIE large model
#10414 closed Jul 2, 2025
[Bug]: Static graph export reports no error but produces no pdmodel file
#10415 closed Jul 2, 2025
DocVQA-ZH dataset link is broken
#6470 closed Jul 1, 2025
[Bug]: ImportError: cannot import name 'download' from 'aistudio_sdk.hub'
#10781 closed Jul 1, 2025
[Bug]: Serving deployment fails, ModuleNotFoundError: No module named 'predict'
#10265 closed Jun 30, 2025
[Question]: After fine-tuning pp_uie, inference writes nothing to output.json
#10406 closed Jun 30, 2025
[Question]: UIE result output after fine-tuning
#10420 closed Jun 30, 2025
[Question]: Error when exporting a static model after UIE fine-tuning
#10421 closed Jun 30, 2025

3 Issues opened by 3 people

aistudio_sdk version issue in the paddlenlp library
#10809 opened Jul 3, 2025
[Bug]: RuntimeError: (PreconditionNotMet) Tensor's dimension is out of bound.Tensor's dimension must be equal or less than the size of its memory.But received Tensor's dimension is 8, memory's size is 0.
#10792 opened Jul 2, 2025
Does ernie-1.0 support padding? I tried padding and the results were wrong
#10788 opened Jul 1, 2025

10 Unresolved conversations

Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.

[Bug]: Following the new PP-UIE documentation, calling the large-model fine-tuning script llm/run-finetune.py fails with No module named 'paddlenlp.datasets.json'
#10543 commented on Jul 6, 2025 • 0 new comments
[Question]: Error when running on GPU: terminate called after throwing an instance of 'thrust::system::system_error'
#10535 commented on Jul 6, 2025 • 0 new comments
Dsv3 dev
#10273 commented on Jul 4, 2025 • 0 new comments
[LLM] add fuse attention options to LlmMetaConfig
#10542 commented on Jul 6, 2025 • 0 new comments
[Inference] Add new wint2.75/wint2.5 quant type and support DeepseekV3
#10578 commented on Jul 4, 2025 • 0 new comments
[CI]add workflow yml
#10718 commented on Jun 30, 2025 • 0 new comments
add auto_parallel context_parallel strategy
#10722 commented on Jul 4, 2025 • 0 new comments
Fp8 gemm & quant operators replacement
#10761 commented on Jul 2, 2025 • 0 new comments
[CI, do not review] test a100
#10764 commented on Jul 2, 2025 • 0 new comments
[fix] add parameters arg into AdamWMini
#10774 commented on Jul 1, 2025 • 0 new comments