Questions about how to fine-tune fastvlm

Thank you very much for open-sourcing such an excellent model. I have the following questions regarding fine-tuning:
1. The repo mentions a `train_qwen.py` file and references the LLaVA repo code, but I haven't found any configuration scripts for passing parameters to `train_qwen.py`.
2. I haven't found any code related to the FastViTHD visual encoder in the entire repo. Are there plans to open-source this part later?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Questions about how to fine-tune fastvlm #62

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Questions about how to fine-tune fastvlm #62

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions