server : fix cache_tokens bug with no cache_prompt #13533

Merged (1 commit) on May 14, 2025

ngxson
Collaborator

@ngxson ngxson commented May 14, 2025

Fix #13484

I also added a test case for it. The test case currently fails on master but passes with this PR.

I also renamed the function .resize(n) to .keep_first(n) to make its purpose easier to understand.
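To illustrate why the new name reads more clearly, here is a minimal sketch of what a keep_first(n) helper could look like. The `token_cache` struct and its `tokens` member are hypothetical stand-ins for illustration only and are not the actual llama.cpp server types; only the names .resize(n) and .keep_first(n) come from this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a server-side token cache (assumption, not the real type).
struct token_cache {
    std::vector<int32_t> tokens;

    // Keep only the first n tokens and drop everything after them.
    // A generic resize(n) could also grow the container; this helper only shrinks it,
    // which is what the name is meant to convey.
    void keep_first(std::size_t n) {
        assert(n <= tokens.size()); // growing is not a valid use of this helper
        tokens.resize(n);
    }
};
```

With a name like keep_first, a call such as `cache.keep_first(n_past)` (where `n_past` is again a hypothetical variable) states directly that the prefix is preserved and the tail is discarded.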

@ngxson ngxson requested a review from ggerganov May 14, 2025 09:50
@github-actions github-actions bot added examples python python script changes server labels May 14, 2025
@ngxson ngxson merged commit 360a9c9 into ggml-org:master May 14, 2025
46 checks passed
Silver267 pushed a commit to Silver267/llama.cpp that referenced this pull request May 14, 2025