Skip to content

Commit 6031ecb

Browse files
charchit7yiyixuxupatrickvonplaten
authored
added doc for Kandinsky3.0 (huggingface#5937)
* added en doc for Kandinsky3.0 * required changes * Update docs/source/en/api/pipelines/kandinsky3.md * Update docs/source/en/api/pipelines/kandinsky3.md * Update docs/source/en/api/pipelines/kandinsky3.md --------- Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: Patrick von Platen <[email protected]>
1 parent fdd003d commit 6031ecb

File tree

1 file changed

+26
-1
lines changed

1 file changed

+26
-1
lines changed

docs/source/en/api/pipelines/kandinsky3.md

Lines changed: 26 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,32 @@ specific language governing permissions and limitations under the License.
99

1010
# Kandinsky 3
1111

12-
TODO
12+
Kandinsky 3 is created by [Vladimir Arkhipkin](https://github.com/oriBetelgeuse),[Anastasia Maltseva](https://github.com/NastyaMittseva),[Igor Pavlov](https://github.com/boomb0om),[Andrei Filatov](https://github.com/anvilarth),[Arseniy Shakhmatov](https://github.com/cene555),[Andrey Kuznetsov](https://github.com/kuznetsoffandrey),[Denis Dimitrov](https://github.com/denndimitrov), [Zein Shaheen](https://github.com/zeinsh)
13+
14+
The description from it's Github page:
15+
16+
*Kandinsky 3.0 is an open-source text-to-image diffusion model built upon the Kandinsky2-x model family. In comparison to its predecessors, enhancements have been made to the text understanding and visual quality of the model, achieved by increasing the size of the text encoder and Diffusion U-Net models, respectively.*
17+
18+
Its architecture includes 3 main components:
19+
1. [FLAN-UL2](https://huggingface.co/google/flan-ul2), which is an encoder decoder model based on the T5 architecture.
20+
2. New U-Net architecture featuring BigGAN-deep blocks doubles depth while maintaining the same number of parameters.
21+
3. Sber-MoVQGAN is a decoder proven to have superior results in image restoration.
22+
23+
24+
25+
The original codebase can be found at [ai-forever/Kandinsky-3](https://github.com/ai-forever/Kandinsky-3).
26+
27+
<Tip>
28+
29+
Check out the [Kandinsky Community](https://huggingface.co/kandinsky-community) organization on the Hub for the official model checkpoints for tasks like text-to-image, image-to-image, and inpainting.
30+
31+
</Tip>
32+
33+
<Tip>
34+
35+
Make sure to check out the schedulers [guide](../../using-diffusers/schedulers) to learn how to explore the tradeoff between scheduler speed and quality, and see the [reuse components across pipelines](../../using-diffusers/loading#reuse-components-across-pipelines) section to learn how to efficiently load the same components into multiple pipelines.
36+
37+
</Tip>
1338

1439
## Kandinsky3Pipeline
1540

0 commit comments

Comments
 (0)