
Conversation

@Glaceon-Hyy
Member

No description provided.

@gemini-code-assist
Contributor

Summary of Changes

Hello @Glaceon-Hyy, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the diffsynth_engine by integrating SVD quantization capabilities through the nunchaku library, specifically for the QwenImage model. It introduces new configuration options and model classes to support quantized inference and specialized LoRA handling for these quantized models. The changes aim to improve model efficiency and provide a robust framework for working with quantized models, complete with dynamic detection and clear installation guidance for the nunchaku dependency.

Highlights

  • SVD Quantization Support: Introduced comprehensive support for SVD quantization using the nunchaku library, enabling more efficient inference for models like QwenImage.
  • Nunchaku-specific LoRA Classes: Added new LoRA wrapper classes (LoRASVDQW4A4Linear, LoRAAWQW4A16Linear) to seamlessly integrate LoRA with Nunchaku's quantized linear layers.
  • Dynamic Nunchaku Integration: The pipeline now dynamically detects if a loaded model is Nunchaku-quantized based on its state dictionary and automatically configures the appropriate Nunchaku-specific settings.
  • Enhanced LoRA Loading Mechanism: Updated LoRA loading logic to correctly handle the new Nunchaku-specific LoRA types, including advanced fusion for QKV projections in SVDQW4A4Linear layers.
  • User-Friendly Nunchaku Installation Guide: Implemented a detailed error message that guides users on how to manually install the nunchaku library if it's not found, specifying PyTorch, Python, and OS versions.
  • New Test Coverage: Added a dedicated test suite to validate the functionality of SVD quantization and LoRA loading within the QwenImage pipeline, ensuring stability and correctness.
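The "dynamic detection" highlight above means the pipeline decides whether to build the Nunchaku-quantized DiT by inspecting the checkpoint's state dict rather than requiring a user flag. A minimal sketch of that idea (the key suffixes below are illustrative placeholders, not the exact tensor names Nunchaku uses):

```python
def is_nunchaku_quantized(state_dict: dict) -> bool:
    """Heuristically detect a Nunchaku SVD-quantized checkpoint.

    SVD-quantized checkpoints carry extra per-layer tensors (packed
    quantized weights, scales, and low-rank projection factors) that a
    plain fp16/bf16 checkpoint does not have. Checking key suffixes is
    enough to tell the two apart without loading any tensor data.
    """
    markers = (".qweight", ".wscales", ".proj_down", ".proj_up")
    return any(key.endswith(markers) for key in state_dict)
```

When detection succeeds, the pipeline can then swap in the quantized model class and the Nunchaku-specific LoRA wrappers automatically.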
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

  • Code Review (/gemini review): Performs a code review for the current pull request in its current state.
  • Pull Request Summary (/gemini summary): Provides a summary of the current pull request in its current state.
  • Comment (@gemini-code-assist): Responds in comments when explicitly tagged, both in pull request comments and review comments.
  • Help (/gemini help): Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, and code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces support for SVD quantization using the nunchaku library. It adds new configuration options, LoRA wrapper modules for quantized layers, and a quantized DiT model for Qwen Image. The pipeline logic is also updated to detect and load these quantized models. My review has identified a few critical bugs in the new LoRA handling logic for quantized layers that need to be addressed. Specifically, there are issues with rank management when fusing QKV LoRAs and a variable scope bug when loading them. There's also a minor bug in the clear method of a LoRA wrapper.
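For context on the rank-management point above: when separate q, k, and v LoRAs are fused into a single LoRA on a fused qkv linear, the fused rank must be the sum of the individual ranks. The down matrices stack along the rank axis and the up matrices form a block-diagonal, so each projection's update only touches its own output rows. A minimal NumPy sketch of that construction (illustrative only; the actual implementation lives in the Nunchaku-specific LoRA classes):

```python
import numpy as np

def fuse_qkv_lora(loras):
    """Fuse per-projection (down, up) LoRA pairs into one pair for a
    fused qkv linear.

    loras: [(down_q, up_q), (down_k, up_k), (down_v, up_v)], where
    down_i has shape (r_i, in_dim) and up_i has shape (out_i, r_i).
    Returns (down, up) with total rank r_q + r_k + r_v.
    """
    ups = [u for _, u in loras]
    # Stack down matrices along the rank axis: (r_q + r_k + r_v, in_dim).
    down = np.concatenate([d for d, _ in loras], axis=0)
    # Place up matrices block-diagonally so projection i's low-rank
    # activations only contribute to projection i's output rows.
    up = np.zeros((sum(u.shape[0] for u in ups), down.shape[0]))
    row = col = 0
    for u in ups:
        up[row:row + u.shape[0], col:col + u.shape[1]] = u
        row += u.shape[0]
        col += u.shape[1]
    return down, up
```

The bug class the review flags is exactly here: if the fused module keeps tracking a single per-projection rank instead of the summed rank, the shapes above stop lining up.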

# override OptimizationConfig
fbcache_relative_l1_threshold = 0.009

# svd quant
Contributor


Judging from the logic later on, these parameters don't seem to be meant for users to set? You could follow the boundary parameters in WanPipelineConfig and make them non-init fields.

Member Author


done

Contributor


It's probably still better to keep the fields in the config? Written with Field(init=False), users can't pass them to the initializer, but they still get field hints.
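The pattern being suggested, shown here with the standard-library dataclasses version of field(init=False) on a simplified, hypothetical config class (the real diffsynth_engine config may use a different base and field names):

```python
from dataclasses import dataclass, field

@dataclass
class QwenImagePipelineConfig:  # simplified sketch, not the actual class
    model_path: str
    # Derived at load time from the checkpoint contents; init=False means
    # users cannot pass it to __init__, yet the field still appears in
    # repr(), type hints, and IDE completion.
    use_nunchaku: bool = field(default=False, init=False)
```

The pipeline can then set cfg.use_nunchaku after inspecting the state dict, while the constructor signature stays clean.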

"4. Install it using pip, for example:\n"
" pip install nunchaku @ https://.../your_specific_nunchaku_file.whl\n"
)
raise ImportError(error_message)
Contributor


This error_message is pretty long; it feels like it could be imported from the flag module.
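One way to follow that suggestion, sketched with hypothetical module and constant names: keep the long installation hint next to the availability flag, and have call sites import both instead of rebuilding the message inline.

```python
# flag-module sketch: the availability probe and the install hint live
# together, so pipelines only need a one-line check.
NUNCHAKU_INSTALL_HINT = (
    "nunchaku is not installed. To install it manually:\n"
    "1. Check your PyTorch, Python, and OS versions.\n"
    "2. Find a matching wheel on the nunchaku release page.\n"
    "3. Download the wheel for your environment.\n"
    "4. Install it using pip.\n"
)

try:
    import nunchaku  # noqa: F401
    NUNCHAKU_AVAILABLE = True
except ImportError:
    NUNCHAKU_AVAILABLE = False

def require_nunchaku() -> None:
    """Raise a helpful ImportError when nunchaku is missing."""
    if not NUNCHAKU_AVAILABLE:
        raise ImportError(NUNCHAKU_INSTALL_HINT)
```

A quantized-model loader can then call require_nunchaku() up front and keep its own body free of the multi-line message.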

@akaitsuki-ii akaitsuki-ii merged commit ae4faeb into main Nov 12, 2025
@akaitsuki-ii akaitsuki-ii deleted the feature/svd branch November 12, 2025 11:12