Allow max shard size to be specified when saving pipeline #9440
What does this PR do?
When using `.save_pretrained` to save the different modeling components, we currently have no way to specify a maximum shard size. Being able to store smaller shards for all modeling components, without saving each component individually, would be a useful control to have. This is particularly relevant for CogVideoX, where keeping both the text encoder and the transformer in 10GB shards results in an OOM on a Colab CPU. Saving smaller shards makes it possible to load the components, create the pipeline, and then run inference with something like `enable_sequential_cpu_offload`.
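To make the intent concrete, here is a minimal sketch of how this could be used once the pipeline's `save_pretrained` accepts a maximum shard size. The `max_shard_size` keyword and the `"5GB"` value are illustrative assumptions; the exact name and default are defined by the PR itself.

```python
import torch
from diffusers import CogVideoXPipeline

pipe = CogVideoXPipeline.from_pretrained(
    "THUDM/CogVideoX-2b", torch_dtype=torch.bfloat16
)

# Save every modeling component (text encoder, transformer, VAE, ...) in
# shards no larger than 5GB so they can be reloaded on low-RAM machines.
# `max_shard_size` here is the argument this PR proposes to expose.
pipe.save_pretrained("cogvideox-small-shards", max_shard_size="5GB")

# Reload from the smaller shards and offload submodules sequentially to
# keep peak memory low during inference.
pipe = CogVideoXPipeline.from_pretrained(
    "cogvideox-small-shards", torch_dtype=torch.bfloat16
)
pipe.enable_sequential_cpu_offload()
```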
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@yiyixuxu @sayakpaul