Inquiry About Wan-I2V Training/Inference Performance on A6000 GPUs #503

ZhouQianang · 2025-03-31T04:32:58Z

I'd like to consult about the training and inference speeds of Wan-I2V-14B-480P. My setup consists of 4×A6000 (49GB GPUs). After installing Diffsynth-Studio, I ran the example code test and observed the following performance:

wan-1.3B-T2V: ~5 minutes per video generation

wan-14B-I2V-480P:

~50 minutes for 81 frames (bfloat16, 50 iterations)

~37 minutes for 21 frames

My questions:

Baseline Validation: Are these inference times normal?
Inference Acceleration: Is multi-GPU parallelization supported for inference? (I couldn't find related documentation)
Training Acceleration: The current 50min/it training speed is impractical. Are there optimization strategies?

Thank you for your help!

Artiprocher · 2025-04-03T02:22:51Z

My answers:

Yes.
Please refer to the usp script.
If your GPU memory is sufficient, we recommend disabling gradient checkpointing to achieve faster performance.

MukundVarmaT · 2025-05-02T21:29:59Z

Hi,
I wanted to perform some full finetuning expts using WAN I2V, is there any scripts that you can point me to for the same.

Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inquiry About Wan-I2V Training/Inference Performance on A6000 GPUs #503

Inquiry About Wan-I2V Training/Inference Performance on A6000 GPUs #503

ZhouQianang commented Mar 31, 2025

Artiprocher commented Apr 3, 2025

MukundVarmaT commented May 2, 2025 •

edited

Loading

Inquiry About Wan-I2V Training/Inference Performance on A6000 GPUs #503

Inquiry About Wan-I2V Training/Inference Performance on A6000 GPUs #503

Comments

ZhouQianang commented Mar 31, 2025

Artiprocher commented Apr 3, 2025

MukundVarmaT commented May 2, 2025 • edited Loading

MukundVarmaT commented May 2, 2025 •

edited

Loading