fix: Fireworks chat completion broken due to telemetry #3392

slekkala1 · 2025-09-09T22:24:43Z

What does this PR do?

Fix fireworks chat completion broken due to telemetry expecting response.usage
Closes #3391

Test Plan

uv run --with llama-stack llama stack build --distro starter --image-type venv --run
Try

curl -X POST http://0.0.0.0:8321/v1/openai/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
      "model": "fireworks/accounts/fireworks/models/llama-v3p1-8b-instruct",
      "messages": [{"role": "user", "content": "Hello!"}]
    }'

{"id":"chatcmpl-ee922a08-0df0-4974-b0d3-b322113e8bc0","choices":[{"message":{"role":"assistant","content":"Hello! How can I assist you today?","name":null,"tool_calls":null},"finish_reason":"stop","index":0,"logprobs":null}],"object":"chat.completion","created":1757456375,"model":"fireworks/accounts/fireworks/models/llama-v3p1-8b-instruct"}%

Without fix fails as mentioned in #3391

franciscojavierarceo

lgtm

mattf · 2025-09-10T16:11:18Z

$ curl https://api.fireworks.ai/inference/v1/chat/completions -s \
-H "Content-Type: application/json" \
-H "Authorization: Bearer ... \
-d '{
  "model": "accounts/fireworks/models/kimi-k2-instruct-0905",                  
  "messages": [{
      "role": "user",
      "content": "Explain the importance of fast language models"
  }]
}' | jq
{
  "id": "c07ea231-a59d-4828-a169-a8e4243f907f",
  "object": "chat.completion",
  "created": 1757520350,
  "model": "accounts/fireworks/models/kimi-k2-instruct-0905",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 23,
    "total_tokens": 727,
    "completion_tokens": 704
  }
}

the fireworks endpoint returns usage information. before making a project wide change, especially one that may result in quietly inconsistent results, a fix must be attempted for the fireworks provider.

@raghotham @franciscojavierarceo i recommend reverting and proceeding w/ a fix in the fireworks provider.

This reverts commit 935b8e2.

franciscojavierarceo · 2025-09-10T16:44:07Z

@mattf I've created a revert PR here: #3402

slekkala1 · 2025-09-10T16:45:14Z

$ curl https://api.fireworks.ai/inference/v1/chat/completions -s \
-H "Content-Type: application/json" \
-H "Authorization: Bearer ... \
-d '{
  "model": "accounts/fireworks/models/kimi-k2-instruct-0905",                  
  "messages": [{
      "role": "user",
      "content": "Explain the importance of fast language models"
  }]
}' | jq
{
  "id": "c07ea231-a59d-4828-a169-a8e4243f907f",
  "object": "chat.completion",
  "created": 1757520350,
  "model": "accounts/fireworks/models/kimi-k2-instruct-0905",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 23,
    "total_tokens": 727,
    "completion_tokens": 704
  }
}

the fireworks endpoint returns usage information. before making a project wide change, especially one that may result in quietly inconsistent results, a fix must be attempted for the fireworks provider.

@raghotham @franciscojavierarceo i recommend reverting and proceeding w/ a fix in the fireworks provider.

@mattf Thanks for the suggestion!

Well, it didnt return usage in my test with fireworks provider. (May be I miss something in provider impl, I can have a second look)

Would it ok for the api to be broken because the telemetry depends on response to have certain fields?

Reverts #3392

mattf · 2025-09-11T15:53:47Z

$ curl https://api.fireworks.ai/inference/v1/chat/completions -s
...
the fireworks endpoint returns usage information. before making a project wide change, especially one that may result in quietly inconsistent results, a fix must be attempted for the fireworks provider.
@raghotham @franciscojavierarceo i recommend reverting and proceeding w/ a fix in the fireworks provider.

@mattf Thanks for the suggestion!

Well, it didnt return usage in my test with fireworks provider. (May be I miss something in provider impl, I can have a second look)

Would it ok for the api to be broken because the telemetry depends on response to have certain fields?

api.fireworks.ai returns the info, if it doesn't get propagated then check the fireworks provider.

# What does this PR do? Fix fireworks chat completion broken due to telemetry expecting response.usage Closes llamastack#3391 ## Test Plan 1. `uv run --with llama-stack llama stack build --distro starter --image-type venv --run` Try ``` curl -X POST http://0.0.0.0:8321/v1/openai/v1/chat/completions \ -H "Content-Type: application/json" \ -d '{ "model": "fireworks/accounts/fireworks/models/llama-v3p1-8b-instruct", "messages": [{"role": "user", "content": "Hello!"}] }' ``` ``` {"id":"chatcmpl-ee922a08-0df0-4974-b0d3-b322113e8bc0","choices":[{"message":{"role":"assistant","content":"Hello! How can I assist you today?","name":null,"tool_calls":null},"finish_reason":"stop","index":0,"logprobs":null}],"object":"chat.completion","created":1757456375,"model":"fireworks/accounts/fireworks/models/llama-v3p1-8b-instruct"}% ``` Without fix fails as mentioned in llamastack#3391 Co-authored-by: Francisco Arceo <[email protected]>

…#3402) Reverts llamastack#3392

fix: Fireworks chat completion broken due to telemetry

6b36f25

slekkala1 requested review from ashwinb, yanxi0830, hardikjshah, raghotham, ehhuang, terrytangyuan, leseb, bbrowning, reluctantfuturist and mattf as code owners September 9, 2025 22:24

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 9, 2025

franciscojavierarceo approved these changes Sep 10, 2025

View reviewed changes

Merge branch 'main' into fix-fireworks

23628c5

raghotham approved these changes Sep 10, 2025

View reviewed changes

slekkala1 merged commit 935b8e2 into main Sep 10, 2025
22 checks passed

slekkala1 deleted the fix-fireworks branch September 10, 2025 15:48

franciscojavierarceo added a commit that referenced this pull request Sep 10, 2025

Revert "fix: Fireworks chat completion broken due to telemetry (#3392)"

14bb7d6

This reverts commit 935b8e2.

franciscojavierarceo mentioned this pull request Sep 10, 2025

revert: Fireworks chat completion broken due to telemetry #3402

Merged

ashwinb pushed a commit that referenced this pull request Sep 10, 2025

revert: Fireworks chat completion broken due to telemetry (#3402)

a6b1588

Reverts #3392

slekkala1 mentioned this pull request Sep 11, 2025

fix: fireworks provider chat completion failing #3422

Closed

iamemilio pushed a commit to iamemilio/llama-stack that referenced this pull request Sep 24, 2025

revert: Fireworks chat completion broken due to telemetry (llamastack…

adc1b3a

…#3402) Reverts llamastack#3392

cdoern mentioned this pull request Oct 6, 2025

fix: Update watsonx.ai provider to use LiteLLM mixin and list all models #3674

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: Fireworks chat completion broken due to telemetry #3392

fix: Fireworks chat completion broken due to telemetry #3392

Uh oh!

slekkala1 commented Sep 9, 2025 •

edited

Loading

Uh oh!

franciscojavierarceo left a comment

Uh oh!

Uh oh!

mattf commented Sep 10, 2025

Uh oh!

franciscojavierarceo commented Sep 10, 2025

Uh oh!

slekkala1 commented Sep 10, 2025 •

edited

Loading

Uh oh!

mattf commented Sep 11, 2025

Uh oh!

Uh oh!

fix: Fireworks chat completion broken due to telemetry #3392

fix: Fireworks chat completion broken due to telemetry #3392

Uh oh!

Conversation

slekkala1 commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Uh oh!

franciscojavierarceo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mattf commented Sep 10, 2025

Uh oh!

franciscojavierarceo commented Sep 10, 2025

Uh oh!

slekkala1 commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattf commented Sep 11, 2025

Uh oh!

Uh oh!

slekkala1 commented Sep 9, 2025 •

edited

Loading

slekkala1 commented Sep 10, 2025 •

edited

Loading