-
Notifications
You must be signed in to change notification settings - Fork 546
Insights: PaddlePaddle/FastDeploy
Overview
-
- 8 Merged pull requests
- 15 Open pull requests
- 0 Closed issues
- 1 New issue
Could not load contribution data
Please try again later
8 Pull requests merged by 6 people
-
[Bug fix] Add the missing
pod_ip
param to the launch_cache_manager function.#2742 merged
Jul 8, 2025 -
【Sync】Release/2.0.1
#2745 merged
Jul 8, 2025 -
[Bug fix] Fixed the garbled text issues in Qwen3-8B
#2737 merged
Jul 8, 2025 -
[GCU] Support gcu platform
#2702 merged
Jul 8, 2025 -
【Fearture】support qwen2 some func
#2740 merged
Jul 8, 2025 -
[SOT] Remove BreakGraph with
paddle.maximum
#2731 merged
Jul 8, 2025 -
[Bug fix] fix complie bug when sm < 89
#2738 merged
Jul 8, 2025 -
[Optimize] Optimize tensorwise fp8 performance
#2729 merged
Jul 7, 2025
15 Pull requests opened by 13 people
-
Support use safetensors with paddle.MmapStorage to load model files
#2730 opened
Jul 7, 2025 -
add precision check for ci
#2732 opened
Jul 7, 2025 -
[SOT] Make custom_op dy&st unified
#2733 opened
Jul 7, 2025 -
[draft] change rejectionsampling topk=40
#2734 opened
Jul 7, 2025 -
[SOT] Enable SOT Dy2St in Multimodal Model
#2735 opened
Jul 7, 2025 -
Opt wint2
#2741 opened
Jul 8, 2025 -
[Bug fix] fix attention rank init
#2743 opened
Jul 8, 2025 -
[vl]remove duplicated load logic
#2744 opened
Jul 8, 2025 -
[Doc] modify offline inference docs
#2747 opened
Jul 8, 2025 -
[vl] mm and thinking model support structred output
#2749 opened
Jul 8, 2025 -
Support for non-CUDA builds
#2750 opened
Jul 8, 2025 -
[Feature] Add speculative decoding simulation benchmark.
#2751 opened
Jul 8, 2025 -
【Feature】add fd commit/branch info when start server
#2752 opened
Jul 8, 2025 -
[Feature]support top_k_top_p sampling
#2753 opened
Jul 8, 2025 -
【Sync develop】 add commit info
#2755 opened
Jul 8, 2025
1 Issue opened by 1 person
-
ERNIE-4.5-VL-28B-A3B-Paddle 加载卡主不动,无论是单卡4090 48b还是双卡4090 48g都不行
#2739 opened
Jul 7, 2025
7 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[RL Feature] add rl qwen model support
#2713 commented on
Jul 7, 2025 • 2 new comments -
OpenAI接口兼容性不佳以及一些其他问题
#2722 commented on
Jul 8, 2025 • 0 new comments -
ERNIE-4.5-VL-424B-A47B-Paddle加载卡住不动
#2723 commented on
Jul 8, 2025 • 0 new comments -
python -m fastdeploy.entrypoints.openai.api_server --model /root/fssd/PaddlePaddle/ERNIE-4.5-VL-28B-A3B-Base-Paddle 执行失败
#2693 commented on
Jul 8, 2025 • 0 new comments -
8卡 h200 部署ERNIE-4.5-VL-424B-A47B-Paddle 失败
#2683 commented on
Jul 8, 2025 • 0 new comments -
ERNIE-4.5-VL-28B-A3B-Paddle的int4量化加载,4090单卡成功,双卡失败
#2663 commented on
Jul 8, 2025 • 0 new comments -
[WIP] optimzie wint2 moe_group_gemm.
#2661 commented on
Jul 8, 2025 • 0 new comments