Improve token counter to handle more response types #15501
Conversation
Generally, looks good to me. I have one question re: estimate_tokens_in_messages.
```python
possible_input_keys = ("prompt_tokens", "input_tokens")
possible_output_keys = ("completion_tokens", "output_tokens")
```
Looks like Anthropic uses input_tokens and output_tokens.
Yup!
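To make the thread concrete, here is a minimal sketch of how a key-name fallback could cover both providers. The two key tuples come from the diff above; `_extract_token_counts` and the example usage dicts are hypothetical names for illustration, not the PR's actual code.

```python
from typing import Tuple


def _extract_token_counts(usage: dict) -> Tuple[int, int]:
    """Return (input, output) token counts, trying each known key name."""
    possible_input_keys = ("prompt_tokens", "input_tokens")
    possible_output_keys = ("completion_tokens", "output_tokens")

    # Take the first key that is present; default to 0 if none match.
    input_tokens = next((usage[k] for k in possible_input_keys if k in usage), 0)
    output_tokens = next((usage[k] for k in possible_output_keys if k in usage), 0)
    return input_tokens, output_tokens


# OpenAI-style payloads use the prompt/completion naming ...
assert _extract_token_counts({"prompt_tokens": 12, "completion_tokens": 5}) == (12, 5)
# ... while Anthropic-style payloads use the input/output naming.
assert _extract_token_counts({"input_tokens": 12, "output_tokens": 5}) == (12, 5)
```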
```python
    response: Union["CompletionResponse", "ChatResponse"]
) -> Tuple[int, int]:
    """Get the token counts from a raw response."""
    usage = response.raw.get("usage", {})
```
We can just model-dump the raw response if it's not already a dict.
APIs like Anthropic may have a slightly different naming structure for pulling out token counts.
IMO, longer term this logic should probably be specific to the LLM class, but this solves some common issues.
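As a hedged sketch of the "model-dump if it's not already a dict" idea, something like the following could normalize the raw response before the `usage` lookup. `_normalize_raw` and the stand-in response class are illustrative names assumed here, not the PR's actual implementation.

```python
from typing import Any


def _normalize_raw(raw: Any) -> dict:
    """Coerce a raw LLM response into a plain dict so `.get("usage")` works."""
    if isinstance(raw, dict):
        return raw
    # Pydantic v2 models expose model_dump(); v1 models expose dict().
    if hasattr(raw, "model_dump"):
        return raw.model_dump()
    if hasattr(raw, "dict"):
        return raw.dict()
    return {}


class _FakeAnthropicRaw:
    """Stand-in for an SDK response object that is not a plain dict."""

    def model_dump(self) -> dict:
        return {"usage": {"input_tokens": 12, "output_tokens": 5}}


usage = _normalize_raw(_FakeAnthropicRaw()).get("usage", {})
assert usage == {"input_tokens": 12, "output_tokens": 5}
```

Pushing this normalization into each LLM integration class, as suggested above, would let providers with non-standard raw payloads override it without growing this shared fallback chain.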