fix: providing a tensor to `cache_position` in `model.generate` kwargs always crashes because of boolean test #39261
Currently, giving `cache_position` to `model.generate` is broken: providing any tensor value to it results in a boolean test on the tensor, which causes a runtime error. This PR fixes it. This is basically an oversight in the change committed in #37986: the code path is untested.
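To illustrate the failure mode described above, here is a minimal, torch-free sketch. It is not the code from this PR or from `generate`. `FakeTensor`, `resolve_buggy`, and `resolve_fixed` are hypothetical names, and `FakeTensor` only mimics the way a multi-element `torch.Tensor` raises when its truth value is evaluated. The assumption is that the problematic check was a plain truthiness test and that the fix is an explicit `is not None` comparison.

```python
class FakeTensor:
    """Hypothetical stand-in for torch.Tensor (assumption, not the real class).

    Like a real tensor, its truth value is only defined for a single element.
    """

    def __init__(self, values):
        self.values = list(values)

    def __bool__(self):
        if len(self.values) != 1:
            raise RuntimeError(
                "Boolean value of Tensor with more than one element is ambiguous"
            )
        return bool(self.values[0])


def resolve_buggy(cache_position=None):
    # Truthiness check: invokes __bool__ on the tensor, which raises
    # for any tensor holding more than one element.
    if cache_position:
        return cache_position
    return FakeTensor([0])  # placeholder default, for illustration only


def resolve_fixed(cache_position=None):
    # Identity check: never evaluates the tensor as a boolean.
    if cache_position is not None:
        return cache_position
    return FakeTensor([0])  # placeholder default, for illustration only


positions = FakeTensor(range(4))

try:
    resolve_buggy(positions)
    buggy_raised = False
except RuntimeError:
    buggy_raised = True

fixed_result = resolve_fixed(positions)
```

With the truthiness check, the call raises as soon as a multi-element value is supplied. The `is not None` form passes the caller's value through untouched and still falls back to a default when nothing is given.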
sample code (test-kv.py):
Before submitting
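- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case. --- could not find any similar issue
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings. --- does not apply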
Who can review?
@FremyCompany @ArthurZucker @zucchini-nlp @gante