Does Liquid support generating both image and text tokens in a single generation sequence?

@wjf5203 
Does Liquid support generating both image and text tokens in a single generation sequence, or is it limited to producing either an image sequence or a text sequence per generation?
For example, given the input:
"Please show an astronaut riding a horse, and briefly describe which planet is in the background."

I would like the output to be:
[generated image tokens of the astronaut on a horse] + "Behind him is Mars, with a reddish-brown surface and a thin atmosphere."
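
To make the desired output concrete, here is a minimal sketch of how I would expect to post-process such a mixed sequence. It assumes image tokens and text tokens share one flat id space and can be told apart by an id boundary. The boundary value `TEXT_VOCAB_SIZE` and the helper `split_by_modality` are hypothetical names for illustration only; they are not taken from the Liquid codebase, and the real token layout may differ.

```python
from itertools import groupby

# Hypothetical boundary: ids below this are treated as text tokens, ids at or
# above it as image codebook tokens. This value is an assumption for the
# example and is NOT taken from the Liquid codebase.
TEXT_VOCAB_SIZE = 32000


def split_by_modality(token_ids, text_vocab_size=TEXT_VOCAB_SIZE):
    """Group one flat token sequence into consecutive (modality, ids) runs."""
    segments = []
    for is_image, run in groupby(token_ids, key=lambda t: t >= text_vocab_size):
        segments.append(("image" if is_image else "text", list(run)))
    return segments


# Toy mixed output: three image tokens followed by two text tokens.
mixed = [32005, 32017, 32001, 415, 9021]
print(split_by_modality(mixed))
# → [('image', [32005, 32017, 32001]), ('text', [415, 9021])]
```

In this picture, the image run would be passed to the image decoder and the text run to the text tokenizer, so a single generation call would yield both parts of the answer.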

Does Liquid support generating both image and text tokens in a single generation sequence? #15
