Skip to content

kv-cache : log (debug) all streams in find_slot #15176

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Aug 11, 2025

Conversation

danbev
Copy link
Member

@danbev danbev commented Aug 8, 2025

This commit updates llama_kv_cache_unified::find_slot to log information for all streams when debug is enabled.

The motivation for this change is that currently if a non-unified kv-cache is used, then only one stream will be logged because the code is currently using seq_to_stream[1].

This commit updates `llama_kv_cache_unified::find_slot` to log
information for all streams when debug is enabled.

The motivation for this change is that currently if a non-unified
kv-cache is used, then only one stream will be logged because the
code was currently uses `seq_to_stream[1]`.
@danbev danbev merged commit cd3069d into ggml-org:master Aug 11, 2025
47 checks passed
@danbev danbev deleted the find_slot_debug_streams branch August 13, 2025 08:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants