Closed
Description
Describe the bug
when using torch.float16 dtype, frame pack pipeline return pure black output. is this normal? i want to use torch.backends.cuda.matmul.allow_fp16_accumulation = True to accelerate the inference of framepack. it is a new feature of torch 2.7. but ifound that even without this feature, framepack pipeline still return black output.
Reproduction
import torch
from diffusers import HunyuanVideoFramepackPipeline, HunyuanVideoFramepackTransformer3DModel,AutoencoderKLHunyuanVideo
from diffusers.utils import export_to_video, load_image
from transformers import SiglipImageProcessor, SiglipVisionModel
# torch.backends.cuda.matmul.allow_fp16_accumulation = True
transformer = HunyuanVideoFramepackTransformer3DModel.from_pretrained(
"lllyasviel/FramePack_F1_I2V_HY_20250503", torch_dtype=torch.float16
)
feature_extractor = SiglipImageProcessor.from_pretrained(
"lllyasviel/flux_redux_bfl", subfolder="feature_extractor"
)
image_encoder = SiglipVisionModel.from_pretrained(
"lllyasviel/flux_redux_bfl", subfolder="image_encoder", torch_dtype=torch.float16
)
pipe = HunyuanVideoFramepackPipeline.from_pretrained(
"HunyuanVideo-Diffusers/",
transformer=transformer,
feature_extractor=feature_extractor,
image_encoder=image_encoder,
torch_dtype=torch.float16,
)
# Enable memory optimizations
pipe.enable_model_cpu_offload()
pipe.vae.enable_tiling()
image = load_image(
"./penguin.png"
)
output = pipe(
image=image,
prompt="A penguin dancing in the snow",
height=832,
width=480,
num_frames=91,
num_inference_steps=30,
guidance_scale=9.0,
generator=torch.Generator().manual_seed(0),
sampling_type="vanilla",
).frames[0]
export_to_video(output, "output_hunyuan.mp4", fps=30)
Logs
System Info
diffusers:0.34.0.dev0
torch:2.7.1
Who can help?
No response