Skip to content

Conversation

ngxson
Copy link
Collaborator

@ngxson ngxson commented May 14, 2025

Fix #13484

Also added a test case for it. The test case currently fails on master, but passed in this PR.

I also renamed function .resize(n) to .keep_first(n) to make it easier to understand

@ngxson ngxson requested a review from ggerganov May 14, 2025 09:50
@github-actions github-actions bot added examples python python script changes server labels May 14, 2025
@ngxson ngxson merged commit 360a9c9 into ggml-org:master May 14, 2025
46 checks passed
Silver267 pushed a commit to Silver267/llama.cpp that referenced this pull request May 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
examples python python script changes server
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Misc. bug: GGML_ASSERT(n <= tokens.size()) failed - Memory in use ('/completion' endpoint and 'cache_prompt=false')
2 participants