
[Integration] add swanlab logger #10594


Merged
merged 7 commits into PaddlePaddle:develop
May 27, 2025

Conversation

Zeyi-Lin
Contributor

@Zeyi-Lin Zeyi-Lin commented May 14, 2025

Pull Request Description

This PR introduces SwanLab, a lightweight open-source experiment tracking tool, as a new logging option for the training framework. The integration provides both online and offline tracking capabilities, along with a local dashboard for visualizing results.


SwanLab is already officially integrated with excellent open-source projects such as transformers, LLaMA Factory, and veRL. We are also very eager to integrate with the outstanding 🌟PaddleNLP to give developers a better training experience.

🎬 Here is an online demo of the integration in action:

https://swanlab.cn/@ZeyiLin/Qwen2.5-0.5B-SFT-paddlenlp/runs/myyyg5lbbikbnvdtw11zr/chart


Below is a detailed overview of the changes and usage instructions:

Key Features of SwanLab Integration

1. Online and Offline Tracking:

  • Online Mode: Track experiments remotely and store data on SwanLab's cloud platform.
  • Offline Mode: Use a local dashboard to visualize training logs without an internet connection.

2. Hardware Monitoring:

  • Automatically tracks GPU usage, power consumption, temperature, and other hardware metrics.
  • Supports NVIDIA GPUs, Huawei Ascend NPUs, and Kunlunxin XPUs.

3. Remote Access:

  • View training progress remotely via the SwanLab web interface or mobile app.

4. Local Dashboard:

  • Includes an open-source local dashboard for offline visualization of training logs.
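To illustrate the idea behind features 1 and 4, here is a conceptual mock (this is NOT SwanLab's implementation; every name in it is invented) showing how online and offline modes differ only in where logged metrics end up: uploaded to a backend, or appended to a local run file that a dashboard can later read.

```python
# Conceptual mock of online vs. offline experiment tracking.
# All names are illustrative; this does not use the swanlab package.
import json
import os
import tempfile


class TrackerSketch:
    def __init__(self, mode="online"):
        if mode not in ("online", "offline"):
            raise ValueError(f"unknown mode: {mode}")
        self.mode = mode
        self.uploaded = []                   # stands in for the cloud backend
        self.logdir = tempfile.mkdtemp()     # stands in for the local run dir

    def log(self, metrics, step):
        record = {"step": step, **metrics}
        if self.mode == "online":
            self.uploaded.append(record)     # a real tracker would upload here
        else:
            # Offline mode: append to a local file the dashboard can read.
            with open(os.path.join(self.logdir, "run.jsonl"), "a") as f:
                f.write(json.dumps(record) + "\n")


run = TrackerSketch(mode="offline")
run.log({"loss": 0.9}, step=1)
```

The same `log()` call works in both modes; only the destination changes, which is why a training script can switch modes without touching its logging code.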

Usage Instructions

Step 1: Set Up Online Tracking (Optional)

Install:

pip install swanlab

To use SwanLab's online tracking, log in to the SwanLab website and obtain your API key from the Settings page. Then, authenticate using the following command:

swanlab login

If you prefer offline mode, skip this step.

Step 2: Configure SwanLab as the Logger

To enable SwanLab as the experiment tracker, run the following script:

"""
Tested on:
pip install paddlepaddle-gpu==3.0.0 -i https://www.paddlepaddle.org.cn/packages/stable/cu126/
pip install paddlenlp==3.0.0b4
"""
from paddlenlp.trl import SFTConfig, SFTTrainer
from datasets import load_dataset

dataset = load_dataset("ZHUI/alpaca_demo", split="train")

training_args = SFTConfig(
    output_dir="Qwen/Qwen2.5-0.5B-SFT",
    device="gpu",
    per_device_train_batch_size=1,
    logging_steps=20,
    report_to="swanlab",
)

trainer = SFTTrainer(
    args=training_args,
    model="Qwen/Qwen2.5-0.5B-Instruct",
    train_dataset=dataset,
)

trainer.train()

You can now happily use SwanLab for experiment tracking!
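Under the hood, `report_to`-style integrations typically register a trainer callback that forwards each logged metrics dict to the tracker. The sketch below is a simplified, hypothetical illustration of that wiring; the class, registry, and function names are invented and are not PaddleNLP's actual API (the real integration lives in `paddlenlp/trainer/integrations.py`).

```python
# Hypothetical sketch of how a `report_to` name is resolved to a callback.
# Names are illustrative, not PaddleNLP's actual API.

class SwanLabCallbackSketch:
    """Receives trainer logs; here we just record them in memory."""

    def __init__(self):
        self.history = []

    def on_log(self, logs, step):
        # The real callback would forward this to the tracker,
        # e.g. something like swanlab.log(logs, step=step).
        self.history.append((step, dict(logs)))


REPORT_TO_REGISTRY = {"swanlab": SwanLabCallbackSketch}


def build_callbacks(report_to):
    # Trainers typically accept a string or a list of integration names
    # and instantiate one callback per name.
    names = [report_to] if isinstance(report_to, str) else list(report_to)
    return [REPORT_TO_REGISTRY[name]() for name in names]


callbacks = build_callbacks("swanlab")
for cb in callbacks:
    cb.on_log({"loss": 0.42}, step=20)
```

This is why passing `report_to="swanlab"` in `SFTConfig` is the only change a training script needs: the trainer does the lookup and invokes the callback on every logging step.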

Step 3: View Training Logs

After logging in, you will see a confirmation message in the console.

For more details, refer to the SwanLab Cloud Documentation.

  • Offline Tracking: Use the local dashboard to visualize logs:
swanlab watch

For advanced configurations, such as setting a custom port, refer to the Offline Dashboard Documentation and CLI Documentation.

Impact

  • Provides a lightweight, flexible, and user-friendly experiment tracking solution.
  • Supports both online and offline use cases, making it suitable for environments with restricted internet access.
  • Enhances hardware monitoring capabilities for better resource utilization.


paddle-bot bot commented May 14, 2025

Thanks for your contribution!

@CLAassistant

CLAassistant commented May 14, 2025

CLA assistant check
All committers have signed the CLA.

@Zeyi-Lin Zeyi-Lin requested a review from ZHUI May 16, 2025 03:06
@Zeyi-Lin
Contributor Author

Zeyi-Lin commented May 16, 2025

@ZHUI Hey, 🤔it seems that the test failure was not caused by my commits. Could you please run the test again? 😄Thank you!

@ZHUI
Collaborator

ZHUI commented May 16, 2025

OK, I'll re-run those tests.

@ZHUI ZHUI closed this May 16, 2025
@ZHUI ZHUI reopened this May 16, 2025

codecov bot commented May 23, 2025

Codecov Report

Attention: Patch coverage is 76.00000% with 12 lines in your changes missing coverage. Please review.

Project coverage is 46.96%. Comparing base (c309aa7) to head (dc68aac).
Report is 9 commits behind head on develop.

Current head dc68aac differs from pull request most recent head 5145f13

Please upload reports for the commit 5145f13 to get more accurate results.

Files with missing lines Patch % Lines
paddlenlp/trainer/integrations.py 76.00% 12 Missing ⚠️

❌ Your patch status has failed because the patch coverage (76.00%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
❌ Your project status has failed because the head coverage (46.96%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop   #10594      +/-   ##
===========================================
+ Coverage    46.91%   46.96%   +0.05%     
===========================================
  Files          799      799              
  Lines       132457   132398      -59     
===========================================
+ Hits         62136    62175      +39     
+ Misses       70321    70223      -98     

☔ View full report in Codecov by Sentry.

@ZHUI ZHUI merged commit 062debf into PaddlePaddle:develop May 27, 2025
4 of 5 checks passed
4 participants