Add cross attention type for Sana-Sprint training in diffusers. #11514

Merged
merged 12 commits into from
May 8, 2025
Update examples/research_projects/sana/README.md
lawrence-cj authored May 8, 2025
commit 740baa9e1fbed0535d2723c1e70fe44b62fad680
2 changes: 1 addition & 1 deletion examples/research_projects/sana/README.md
@@ -26,7 +26,7 @@ huggingface-cli download Efficient-Large-Model/SANA_Sprint_1.6B_1024px_teacher_d

### 3. Acquire the dataset shards

- The training script in this example uses specific `.parquet` shards from the `brivangl/midjourney-v6-llava` dataset instead of downloading the entire dataset automatically via `dataset_name`.
+ The training script in this example uses specific `.parquet` shards from a randomly selected `brivangl/midjourney-v6-llava` dataset instead of downloading the entire dataset automatically via `dataset_name`.

The script specifically uses these three files:
* `data/train_000.parquet`
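
Reviewer note: fetching individual shards rather than the whole dataset could be sketched as below. This is a minimal sketch and not the training script's own code. It assumes the `huggingface_hub` package is installed and that `hf_hub_download` with `local_dir` preserves the in-repo path. `SHARD_FILES`, `expected_local_paths`, and `download_shards` are hypothetical names, and only the shard visible in this hunk is listed; the remaining entries are collapsed in the diff and are left out on purpose.

```python
from pathlib import Path

# Dataset repo named in the README hunk above.
REPO_ID = "brivangl/midjourney-v6-llava"

# Only the shard visible in this hunk; the rest of the list is collapsed
# in the diff, so it is not reproduced here (extend it from the README).
SHARD_FILES = ["data/train_000.parquet"]


def expected_local_paths(local_dir, shard_files=SHARD_FILES):
    """Return where each shard should land under local_dir (hypothetical helper)."""
    return [Path(local_dir) / name for name in shard_files]


def download_shards(local_dir, shard_files=SHARD_FILES):
    """Download only the listed shards instead of the entire dataset.

    Assumption: huggingface_hub is installed; imported lazily so the
    helper above works without it.
    """
    from huggingface_hub import hf_hub_download

    return [
        hf_hub_download(
            repo_id=REPO_ID,
            filename=name,
            repo_type="dataset",  # the repo is a dataset, not a model
            local_dir=local_dir,
        )
        for name in shard_files
    ]
```

Calling `download_shards("shards")` would need network access; `expected_local_paths("shards")` can be used offline to check where the files are expected.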