-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Insights: PaddlePaddle/PaddleNLP
Overview
Could not load contribution data
Please try again later
28 Pull requests merged by 12 people
-
[RL] fix bug in qwen fuse qkv
#10821 merged
Jul 6, 2025 -
Move setup_fp8.py to FleetY
#10820 merged
Jul 6, 2025 -
[RL] disable aistudio download and fix qwen bug
#10819 merged
Jul 5, 2025 -
Move FP8 ops to FleetY branch
#10803 merged
Jul 5, 2025 -
support dispatch both bf16 & fp8
#10817 merged
Jul 4, 2025 -
[AutoParallel] close dynamic sharding CI test
#10791 merged
Jul 4, 2025 -
[LLM] fix_qwen3_moe
#10801 merged
Jul 4, 2025 -
Set FP8Linear weight update by inplace add
#10813 merged
Jul 4, 2025 -
Fix nccl ut
#10812 merged
Jul 4, 2025 -
[tests] migrate to unittest.assertEqual
#10670 merged
Jul 4, 2025 -
optimize scale transpose
#10810 merged
Jul 4, 2025 -
optimizer_dual_pp_post_node_memory
#10806 merged
Jul 3, 2025 -
lock gemm sm 112
#10805 merged
Jul 3, 2025 -
Add fp8 opt config for dualpipe
#10804 merged
Jul 3, 2025 -
Add expert num 32 support for FP8 ops and fix ut
#10802 merged
Jul 3, 2025 -
Refine fp8 node
#10798 merged
Jul 3, 2025 -
Support expert number 16 in FP8 operators
#10799 merged
Jul 3, 2025 -
Adapt dispatch_quant_node to fp8 fusion moe node
#10794 merged
Jul 2, 2025 -
[Auto-parallel] Fix sharding all_gather overlap in auto_dy
#10782 merged
Jul 2, 2025 -
[Auto-parallel] Add llama13b benchmark fast_ln
#10785 merged
Jul 2, 2025 -
[Auto-parallel] Add llama7b benchmark fast_ln
#10786 merged
Jul 2, 2025 -
optimize dual pp memory
#10787 merged
Jul 2, 2025 -
Reset EMA state dict when distributed strategy not matched
#10790 merged
Jul 1, 2025 -
[CherryPick]Adapt new nccl version (#10768)
#10784 merged
Jul 1, 2025 -
[Auto-Parallel] optimize the perfermance of GPT-3
#10780 merged
Jul 1, 2025 -
adapt new nccl version
#10768 merged
Jul 1, 2025 -
[Auto Parallel] Add enable_linear_fused_grad_add in qwen benchmark
#10779 merged
Jun 30, 2025 -
[Auto Parallel] Adapt mp_async_allreduce optimize in auto paralle
#10770 merged
Jun 30, 2025
13 Pull requests opened by 9 people
-
[AutoParallel] init sync param
#10783 opened
Jul 1, 2025 -
Fix the _save function so that it can save the optimizer parameters.
#10789 opened
Jul 1, 2025 -
[AutoParallel] Open dynamic sharding CI test
#10793 opened
Jul 2, 2025 -
[AutoParallel] Remove dynamic pipeline import adaption
#10795 opened
Jul 2, 2025 -
N1C8 dsv3 config
#10796 opened
Jul 2, 2025 -
[Auto-parallel] temp adjust the tensor_fusion config
#10800 opened
Jul 3, 2025 -
support quant return transpose only
#10811 opened
Jul 4, 2025 -
dispatch support fp8
#10814 opened
Jul 4, 2025 -
Add recompute for post_norm and moe_gate
#10815 opened
Jul 4, 2025 -
[Auto-parallel] Fix sp_async and mp_async used together
#10816 opened
Jul 4, 2025 -
修复qwen3moe的一系列报错(justin-0704)
#10818 opened
Jul 4, 2025 -
Add tokens_zip_unique_add_subbatch and merge_subbatch_cast ops
#10822 opened
Jul 6, 2025
9 Issues closed by 3 people
-
[Question]: 使用text_matching的predict_pointwise每次检测结果不一样
#10459 closed
Jul 6, 2025 -
[Question]: UIE大模型下载地址
#10414 closed
Jul 2, 2025 -
[Bug]: export静态图无报错但没有pdmodel文件
#10415 closed
Jul 2, 2025 -
DocVQA-ZH数据集链接失效
#6470 closed
Jul 1, 2025 -
[Bug]: ImportError: cannot import name 'download' from 'aistudio_sdk.hub'
#10781 closed
Jul 1, 2025 -
[Bug]: 服务化部署失败,ModuleNotFoundError: No module named 'predict'
#10265 closed
Jun 30, 2025 -
[Question]: pp_uie微调之后, 推理的output.json没有输出
#10406 closed
Jun 30, 2025 -
[Question]: 微调之后的UIE结果输出
#10420 closed
Jun 30, 2025 -
[Question]: UIE微调之后导出静态模型报错
#10421 closed
Jun 30, 2025
3 Issues opened by 3 people
-
paddlenlp库中的aistudio_sdk版本问题
#10809 opened
Jul 3, 2025 -
ernie-1.0 支持padding 吗?我试验了padding, 结果不对呢
#10788 opened
Jul 1, 2025
10 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[Bug]: 按照PP-UIE的新版本说明文档,调用大模型精调脚本 llm/run-finetune.py时,报错 No module named 'paddlenlp.datasets.json'
#10543 commented on
Jul 6, 2025 • 0 new comments -
[Question]: 使用GPU运行时报错:terminate called after throwing an instance of 'thrust::system::system_error'
#10535 commented on
Jul 6, 2025 • 0 new comments -
Dsv3 dev
#10273 commented on
Jul 4, 2025 • 0 new comments -
[LLM] add fuse attention options to LlmMetaConfig
#10542 commented on
Jul 6, 2025 • 0 new comments -
[Inference] Add new wint2.75/wint2.5 quant type and support DeepseekV3
#10578 commented on
Jul 4, 2025 • 0 new comments -
[CI]add workflow yml
#10718 commented on
Jun 30, 2025 • 0 new comments -
add auto_parallel context_parallel strategy
#10722 commented on
Jul 4, 2025 • 0 new comments -
Fp8 gemm & quant operators replacement
#10761 commented on
Jul 2, 2025 • 0 new comments -
[CI 不review]test a100
#10764 commented on
Jul 2, 2025 • 0 new comments -
[fix] add parameters arg into AdamWMini
#10774 commented on
Jul 1, 2025 • 0 new comments