Skip to content

Conversation

winglian
Copy link
Contributor

Summary

using self.vocab_size for the multimodal forward likely never worked or was deprecated in a transformers change.

@Tcc0403 Tcc0403 mentioned this pull request Jun 25, 2025
3 tasks
@ELanning
Copy link

Needs image_features = self.model.get_image_features(pixel_values) on line 232 too, otherwise worked for me.

Copy link
Collaborator

@shimizust shimizust Jun 28, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you remove and add .idea to the .gitignore

shimizust pushed a commit that referenced this pull request Jul 7, 2025
## Summary
<!--- This is a required section; please describe the main purpose of
this proposed code change. --->
Fix #786, #774.

<!---
## Details
This is an optional section; is there anything specific that reviewers
should be aware of?
--->

## Testing Done
<!--- This is a required section; please describe how this change was
tested. --->

<!-- 
Replace BLANK with your device type. For example, A100-80G-PCIe

Complete the following tasks before sending your PR, and replace `[ ]`
with
`[x]` to indicate you have done them. 
-->

- Hardware Type: <BLANK>
- [ ] run `make test` to ensure correctness
- [ ] run `make checkstyle` to ensure code style
- [ ] run `make test-convergence` to ensure convergence

---------

Signed-off-by: Tcc0403 <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants