[DO NOT MERGE] feat: cli for tool and reasoning parsers #2562
Conversation
Walkthrough

Adds an optional `tool_parser_name` setting, exposed via a new CLI flag and threaded through Python bindings, engine/state/config, the local model builder, the HTTP service, and the OpenAI aggregators. Updated APIs pass this value into response construction/aggregation for completions and chat completions. Tests are updated for the new method signatures.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    autonumber
    actor User
    participant CLI as Frontend CLI
    participant Py as Python Bindings (EntrypointArgs)
    participant Eng as Engine/Builder
    participant Svc as Http Service State
    participant OA as OpenAI Handlers
    participant Agg as Aggregators
    User->>CLI: run ... --tool-parser-name=<name?>
    CLI->>Py: EntrypointArgs(tool_parser_name)
    Py->>Eng: engine.build(..., tool_parser_name)
    Eng->>Svc: State::new_with_etcd(..., tool_parser_name)
    Note right of Svc: State stores Option<String>
    User->>OA: /v1/chat/completions or /v1/completions
    OA->>Svc: state.tool_parser_name()
    OA->>Agg: from_annotated_stream(..., tool_parser_name)
    alt chat completions
        Agg->>Agg: parse tool calls with named parser
    else text completions
        Agg->>Agg: aggregate deltas (tool_parser_name passed)
    end
    Agg-->>OA: Response
    OA-->>User: JSON (with parsed tools if any)
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~60 minutes
Actionable comments posted: 1
🧹 Nitpick comments (11)
lib/bindings/python/rust/llm/entrypoint.rs (2)
110-111: Threading `tool_parser_name` through `EntrypointArgs` is correct; consider normalizing early.

Normalize at construction time to trim whitespace and treat empty strings as `None`. This avoids propagating sentinel empty strings and keeps semantics crisp across layers. Apply near the return in `EntrypointArgs::new`:

```diff
-        Ok(EntrypointArgs {
+        // Normalize tool_parser_name: trim and treat empty as None
+        let tool_parser_name = tool_parser_name
+            .map(|s| s.trim().to_string())
+            .filter(|s| !s.is_empty());
+        Ok(EntrypointArgs {
             engine_type,
@@
-            extra_engine_args,
-            tool_parser_name,
+            extra_engine_args,
+            tool_parser_name,
         })
```
133-134: Optional: replace the free-form `String` with a typed enum to prevent invalid values.

Right now any string will pass through and only fail downstream. A small enum (e.g., `enum ToolParserName { Default, Hermes, NemotronDeci, Llama3Json, Mistral, Phi4 }` plus `FromStr`) would make invalid values unrepresentable and tighten the API boundary. You can still construct it from the CLI string in one place.

Also applies to: 165-166
components/frontend/src/dynamo/frontend/main.py (2)
179-184: Validate and normalize the CLI value; support an env var default.

Add a small validator to enforce allowed values and normalize casing. This prevents accidental typos and keeps values consistent across the stack. Apply:

```diff
-    parser.add_argument(
-        "--tool-parser-name",
-        type=str,
-        default=None,
-        help="Tool parser name for the model. Available options: 'hermes', 'nemotron_deci', 'llama3_json', 'mistral', 'phi4', 'default'.",
-    )
+    # Allowed tool parsers. Keep in sync with backend enum/impl.
+    TOOL_PARSERS = ("default", "hermes", "nemotron_deci", "llama3_json", "mistral", "phi4")
+
+    def validate_tool_parser_name(value: str | None) -> str | None:
+        if value is None:
+            return None
+        v = value.strip().lower()
+        if not v:
+            return None
+        if v not in TOOL_PARSERS:
+            raise argparse.ArgumentTypeError(
+                f"--tool-parser-name must be one of {TOOL_PARSERS}, got: {value}"
+            )
+        return v
+
+    parser.add_argument(
+        "--tool-parser-name",
+        type=validate_tool_parser_name,
+        default=os.environ.get("DYN_TOOL_PARSER_NAME"),
+        help="Tool parser name for the model. Options: 'hermes', 'nemotron_deci', 'llama3_json', 'mistral', 'phi4', 'default'.",
+    )
```
242-244: Pass the normalized value into EntrypointArgs.

If you keep the validator above, this remains fine. If not, normalize here to avoid propagating stray whitespace/case:

```diff
-    if flags.tool_parser_name:
-        kwargs["tool_parser_name"] = flags.tool_parser_name
+    if flags.tool_parser_name:
+        kwargs["tool_parser_name"] = flags.tool_parser_name.strip().lower()
```

lib/llm/src/local_model.rs (2)
177-180: Normalize `tool_parser_name` in the builder setter.

Trim and drop empty strings so we only carry meaningful values downstream:

```diff
-    pub fn tool_parser_name(&mut self, tool_parser_name: Option<String>) -> &mut Self {
-        self.tool_parser_name = tool_parser_name;
-        self
-    }
+    pub fn tool_parser_name(&mut self, tool_parser_name: Option<String>) -> &mut Self {
+        self.tool_parser_name = tool_parser_name
+            .map(|s| s.trim().to_string())
+            .filter(|s| !s.is_empty());
+        self
+    }
```
317-318: Consider a zero-copy getter or an additional ref-returning API.

`tool_parser_name(&self) -> Option<String>` clones on each call. If this is accessed in hot paths, add a ref-returning variant to avoid allocations. Example addition (keep the existing method for compatibility):

```rust
pub fn tool_parser_name_ref(&self) -> Option<&str> {
    self.tool_parser_name.as_deref()
}
```

Also applies to: 385-387
lib/llm/src/protocols/openai/completions/aggregator.rs (1)
68-71: The parameter is currently pass-through only; either use it or underscore it.

Right now `tool_parser_name` is only logged. Consider one of:

- Implement tool-call parsing for completions (like chat), or
- Rename to `_tool_parser_name` and add a `TODO` to avoid confusion.

```diff
-    tool_parser_name: Option<String>,
+    _tool_parser_name: Option<String>,
@@
-    tracing::trace!("Tool parser name: {:?}", tool_parser_name); // Remove this after enabling tool calling for completions
+    tracing::trace!("Tool parser name provided for completions (currently unused)"); // TODO: enable tool calling for completions
```

lib/llm/tests/aggregators.rs (1)
40-41: LGTM: Tests correctly adapted to the new `from_sse_stream(..., None)` signature.

No behavior change; just API alignment. Consider adding one chat-completions test that passes a non-None parser to exercise the new path when tool-calling aggregation lands.

Also applies to: 62-63, 82-83, 102-103, 115-116
lib/llm/src/http/service/service_v2.rs (2)
187-189: Builder wiring is fine; consider normalizing empty strings at the boundary.

- The builder correctly stores and passes `tool_parser_name` into `State::new_with_etcd`.
- Optional: normalize here too (treat `Some("")` or whitespace-only as `None`) to catch misconfigurations early.

Example tweak in `build()`:

```diff
-        let state = Arc::new(State::new_with_etcd(
+        let normalized = config.tool_parser_name.and_then(|s| {
+            let t = s.trim().to_string();
+            if t.is_empty() { None } else { Some(t) }
+        });
+        let state = Arc::new(State::new_with_etcd(
             model_manager,
             config.etcd_client,
-            config.tool_parser_name,
+            normalized,
         ));
```

Also applies to: 311-315
378-381: API ergonomics: add a convenience setter for `&str`.

The current setter requires `Option<String>`. Consider an overload that accepts `impl Into<String>`, and one that takes `Option<&str>`, to reduce friction for callers:

```rust
impl HttpServiceConfigBuilder {
    pub fn with_tool_parser_name_str<T: Into<String>>(mut self, tool_parser_name: T) -> Self {
        self.tool_parser_name = Some(Some(tool_parser_name.into()));
        self
    }
}
```

lib/llm/src/protocols/openai/chat_completions/aggregator.rs (1)
98-100: Tool-call parsing hook: correct use of `Option` via `as_deref`; add minimal failure logging.

- Passing `tool_parser_name.as_deref()` is correct and avoids unnecessary allocations.
- Recommend logging parse failures at trace/debug to aid diagnostics when a parser is configured but yields an error.

```diff
-            if choice.tool_calls.is_none() {
-                if let Ok(tool_calls) = try_tool_call_parse_aggregate(&choice.text, tool_parser_name.as_deref()) {
+            if choice.tool_calls.is_none() {
+                match try_tool_call_parse_aggregate(&choice.text, tool_parser_name.as_deref()) {
+                    Ok(tool_calls) => {
                         if tool_calls.is_empty() {
                             continue;
                         }
                         for tool_call in &tool_calls {
                             tracing::debug!(
                                 tool_call_id = %tool_call.id,
                                 function_name = %tool_call.function.name,
                                 arguments = %tool_call.function.arguments,
                                 "Parsed structured tool call from aggregated content"
                             );
                         }
                         choice.tool_calls = Some(tool_calls);
                         choice.text.clear();
                         choice.finish_reason = Some(dynamo_async_openai::types::FinishReason::ToolCalls);
-                }
+                    }
+                    Err(e) => {
+                        tracing::trace!(error = %e, "Tool-call parsing failed; leaving content as text");
+                    }
+                }
             }
```

Also applies to: 165-185
📜 Review details
Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
💡 Knowledge Base configuration:
- MCP integration is disabled by default for public repositories
- Jira integration is disabled by default for public repositories
- Linear integration is disabled by default for public repositories
You can enable these sources in your CodeRabbit configuration.
📒 Files selected for processing (8)
- components/frontend/src/dynamo/frontend/main.py (2 hunks)
- lib/bindings/python/rust/llm/entrypoint.rs (4 hunks)
- lib/llm/src/http/service/openai.rs (5 hunks)
- lib/llm/src/http/service/service_v2.rs (8 hunks)
- lib/llm/src/local_model.rs (7 hunks)
- lib/llm/src/protocols/openai/chat_completions/aggregator.rs (9 hunks)
- lib/llm/src/protocols/openai/completions/aggregator.rs (6 hunks)
- lib/llm/tests/aggregators.rs (5 hunks)
🧰 Additional context used
🧬 Code Graph Analysis (8)
lib/llm/src/protocols/openai/completions/aggregator.rs (3)
- lib/llm/src/local_model.rs (2): `tool_parser_name` (177-180), `tool_parser_name` (385-387)
- lib/llm/src/http/service/service_v2.rs (1): `tool_parser_name` (130-132)
- lib/llm/src/protocols/openai/chat_completions/aggregator.rs (1): `apply` (97-210)

lib/llm/src/local_model.rs (1)
- lib/llm/src/http/service/service_v2.rs (1): `tool_parser_name` (130-132)

lib/llm/tests/aggregators.rs (2)
- lib/llm/src/protocols/openai/chat_completions/aggregator.rs (2): `from_sse_stream` (266-269), `from_sse_stream` (280-286)
- lib/llm/src/protocols/openai/completions/aggregator.rs (1): `from_sse_stream` (180-186)

components/frontend/src/dynamo/frontend/main.py (2)
- lib/llm/src/local_model.rs (3): `default` (66-87), `tool_parser_name` (177-180), `tool_parser_name` (385-387)
- lib/llm/src/http/service/service_v2.rs (1): `tool_parser_name` (130-132)

lib/llm/src/protocols/openai/chat_completions/aggregator.rs (4)
- lib/llm/src/local_model.rs (2): `tool_parser_name` (177-180), `tool_parser_name` (385-387)
- lib/llm/src/http/service/service_v2.rs (1): `tool_parser_name` (130-132)
- lib/parsers/src/tool_calling/tools.rs (1): `try_tool_call_parse_aggregate` (13-34)
- lib/llm/src/protocols/openai/completions/aggregator.rs (3): `from_annotated_stream` (188-193), `apply` (66-163), `from_sse_stream` (180-186)

lib/bindings/python/rust/llm/entrypoint.rs (2)
- lib/llm/src/http/service/service_v2.rs (2): `new` (75-88), `tool_parser_name` (130-132)
- lib/llm/src/local_model.rs (3): `extra_engine_args` (162-165), `tool_parser_name` (177-180), `tool_parser_name` (385-387)

lib/llm/src/http/service/openai.rs (4)
- lib/llm/src/local_model.rs (2): `tool_parser_name` (177-180), `tool_parser_name` (385-387)
- lib/llm/src/http/service/service_v2.rs (2): `tool_parser_name` (130-132), `state` (200-202)
- lib/llm/src/protocols/openai/chat_completions/aggregator.rs (2): `from_annotated_stream` (253-256), `from_annotated_stream` (273-278)
- lib/llm/src/protocols/openai/completions/aggregator.rs (1): `from_annotated_stream` (188-193)

lib/llm/src/http/service/service_v2.rs (2)
- lib/bindings/python/rust/llm/entrypoint.rs (3): `new` (40-55), `new` (70-80), `new` (118-167)
- lib/llm/src/local_model.rs (2): `tool_parser_name` (177-180), `tool_parser_name` (385-387)
🔇 Additional comments (12)
lib/bindings/python/rust/llm/entrypoint.rs (2)
117-118: PyO3 signature remains backward-compatible.

Adding `tool_parser_name=None` at the tail preserves positional/kwargs compatibility for existing callers. Good placement and defaults.
198-200: Preserve `None` semantics for `tool_parser_name` in State constructors.

The current constructors in `lib/llm/src/http/service/service_v2.rs` always wrap `tool_parser_name` into `Some("")` when `None` is passed, losing the distinction between "unset" and an empty string:

- `State::new` (around line 86):

```diff
-            tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
+            tool_parser_name,
```

- `State::new_with_etcd` (around line 105):

```diff
-            tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
+            tool_parser_name,
```

A quick search confirmed these are the only spots defaulting to `Some("")` for `None` inputs. Preserving the original `Option<String>` ensures downstream logic can accurately detect when the parser name was truly unset.

Likely an incorrect or invalid review comment.
lib/llm/src/local_model.rs (2)
62-63: LGTM: Added `tool_parser_name` to the builder and `Default`.

State is extended cleanly and defaults to `None`; no behavior change unless explicitly provided.

Also applies to: 85-86

223-224: Propagation into LocalModel (both echo and main paths) looks correct.

Both construction paths consistently pass the builder's `tool_parser_name` into `LocalModel`.

Also applies to: 300-301
lib/llm/src/protocols/openai/completions/aggregator.rs (3)
179-186: API propagation LGTM.

The `from_sse_stream` and `from_annotated_stream` signatures and call chain correctly forward `tool_parser_name`.

Also applies to: 188-193

248: Tests updated for the new parameter are correct.

Passing `None` matches the intended default behavior and keeps existing test semantics unchanged.

Also applies to: 272, 312, 372

179-186: All usages updated to the new arity.

I've verified that every call to both `NvCreateCompletionResponse::from_sse_stream(...)` and `NvCreateChatCompletionResponse::from_sse_stream(...)` now supplies two arguments (the stream and an `Option<String>`), and that every invocation of `DeltaAggregator::apply` in the completions and chat modules likewise passes two parameters (`stream` and `tool_parser_name` or `None`). No remaining call sites use the old signature; everything compiles as expected.

lib/llm/src/http/service/openai.rs (3)
298-299: Plumbing `tool_parser_name` into completions folding looks correct.

- Correctly retrieves `tool_parser_name` from state and threads it into `NvCreateCompletionResponse::from_annotated_stream(...)`.
- This aligns with the updated aggregator API taking `Option<String>`.

Also applies to: 319-329

737-751: Responses endpoint: `tool_parser_name` propagation is consistent.

- Correctly mirrors chat completions' non-streaming fold by passing `tool_parser_name` into `NvCreateChatCompletionResponse::from_annotated_stream(...)`.

483-484: Chat and non-chat completions: `tool_parser_name` correctly threaded.

- Verified that `tool_parser_name()` is fetched before engine creation and passed into `NvCreateCompletionResponse::from_annotated_stream(stream, tool_parser_name)` at line 319.
- Confirmed the same parameter is forwarded to `NvCreateChatCompletionResponse::from_annotated_stream(stream, tool_parser_name)` at lines 549 and 739.
- The streaming path remains an SSE passthrough; parsing happens during the fold, consistent with the new aggregator signature.

No further changes needed.
lib/llm/src/protocols/openai/chat_completions/aggregator.rs (2)
253-257: Public API updates to thread `tool_parser_name` are consistent.

- Trait and impl signatures cleanly propagate the optional parser name across annotated and SSE flows.

Also applies to: 266-270, 273-287

221-237: I've added a script to dump the surrounding context in `aggregator.rs`; once you provide the results, I can confirm whether `delta.role` is indeed an `Option` and whether an `unwrap_or` fallback is appropriate.
```rust
    tool_parser_name: Option<String>,
}
```
💡 Verification agent
🧩 Analysis chain
Do not coerce `None` into `Some("")`; preserve `None` semantics for `tool_parser_name`.

Currently `State::new`/`new_with_etcd` convert `None` into `Some(String::from(""))`. This silently changes semantics: downstream code now receives `Some("")` instead of `None`. The parser API differentiates between "no parser configured" (`None`) and a specific parser name. Passing `Some("")` may disable default behavior or select a non-existent parser.

Refactor to keep `None` as `None`, and also normalize empty/whitespace-only strings to `None`.
Apply this diff:
```diff
 pub struct State {
 @@
-    tool_parser_name: Option<String>,
+    tool_parser_name: Option<String>,
 }
 @@
-    pub fn new(manager: Arc<ModelManager>, tool_parser_name: Option<String>) -> Self {
+    pub fn new(manager: Arc<ModelManager>, tool_parser_name: Option<String>) -> Self {
         Self {
             manager,
             metrics: Arc::new(Metrics::default()),
             etcd_client: None,
             flags: StateFlags {
                 chat_endpoints_enabled: AtomicBool::new(false),
                 cmpl_endpoints_enabled: AtomicBool::new(false),
                 embeddings_endpoints_enabled: AtomicBool::new(false),
                 responses_endpoints_enabled: AtomicBool::new(false),
             },
-            tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
+            tool_parser_name: tool_parser_name.and_then(|s| {
+                let trimmed = s.trim().to_string();
+                if trimmed.is_empty() { None } else { Some(trimmed) }
+            }),
         }
     }
 @@
-    pub fn new_with_etcd(
-        manager: Arc<ModelManager>,
-        etcd_client: Option<etcd::Client>,
-        tool_parser_name: Option<String>,
-    ) -> Self {
+    pub fn new_with_etcd(
+        manager: Arc<ModelManager>,
+        etcd_client: Option<etcd::Client>,
+        tool_parser_name: Option<String>,
+    ) -> Self {
         Self {
             manager,
             metrics: Arc::new(Metrics::default()),
             etcd_client,
             flags: StateFlags {
                 chat_endpoints_enabled: AtomicBool::new(false),
                 cmpl_endpoints_enabled: AtomicBool::new(false),
                 embeddings_endpoints_enabled: AtomicBool::new(false),
                 responses_endpoints_enabled: AtomicBool::new(false),
             },
-            tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
+            tool_parser_name: tool_parser_name.and_then(|s| {
+                let trimmed = s.trim().to_string();
+                if trimmed.is_empty() { None } else { Some(trimmed) }
+            }),
         }
     }
 @@
-    pub fn tool_parser_name(&self) -> Option<String> {
-        self.tool_parser_name.clone()
-    }
+    pub fn tool_parser_name(&self) -> Option<String> {
+        self.tool_parser_name.clone()
+    }
```
Also applies to: 75-88, 90-107, 129-132
🏁 Script executed:
```bash
#!/bin/bash
# 1) Ensure State constructors no longer coerce None -> Some("")
rg -n 'unwrap_or_else\(\|\|\s*String::from\(\"\"\)\s*\)\)' lib/llm/src/http/service/service_v2.rs
# 2) Check all call sites for State::new/new_with_etcd are passing Option<String> directly
rg -nP 'State::new(_with_etcd)?\s*\(' -C2
# 3) Sanity: find all usages of tool_parser_name() to confirm Option<> is handled (not unwrapped blindly)
rg -nP '\btool_parser_name\(\)' -C2
```
Length of output: 9796
🏁 Script executed:
```bash
#!/bin/bash
# 1) Verify no unwrap_or_else remains in service_v2.rs
echo "Searching for unwrap_or_else occurrences..."
rg -n 'unwrap_or_else' lib/llm/src/http/service/service_v2.rs || echo "No unwrap_or_else found"
echo "Searching for String::from(\"\") occurrences..."
rg -n 'String::from\(\s*\"\"\s*\)' lib/llm/src/http/service/service_v2.rs || echo "No String::from(\"\") found"
# 2) Inspect constructor assignments for tool_parser_name
echo "Inspecting tool_parser_name assignments in State constructors..."
rg -n 'tool_parser_name:' -C2 lib/llm/src/http/service/service_v2.rs
# 3) Inspect State::new and new_with_etcd definitions
echo "Inspecting State::new definition..."
rg -nP 'pub fn new\(' -C10 lib/llm/src/http/service/service_v2.rs
echo "Inspecting State::new_with_etcd definition..."
rg -nP 'pub fn new_with_etcd' -C10 lib/llm/src/http/service/service_v2.rs
# 4) Inspect tool_parser_name usage in openai.rs
echo "Inspecting tool_parser_name() usage in openai.rs..."
rg -nP 'tool_parser_name\(\)' -C5 lib/llm/src/http/service/openai.rs
```
Length of output: 5675
Preserve `None` semantics for `tool_parser_name` in `State` constructors

`State::new` and `new_with_etcd` still coerce `None` into `Some("")`, altering downstream behavior. On lines 86 and 105 of `lib/llm/src/http/service/service_v2.rs`, the code uses:

```rust
tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
```

This should be refactored to preserve `None` and normalize empty or whitespace-only strings to `None`.
Locations to fix:
- lib/llm/src/http/service/service_v2.rs: State::new (line 86)
- lib/llm/src/http/service/service_v2.rs: State::new_with_etcd (line 105)
Apply this diff at both constructor sites:
```diff
--- a/lib/llm/src/http/service/service_v2.rs
+++ b/lib/llm/src/http/service/service_v2.rs
@@ -86,7 +86,12 @@ impl State {
-            tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
+            tool_parser_name: tool_parser_name.and_then(|s| {
+                let trimmed = s.trim().to_string();
+                if trimmed.is_empty() {
+                    None
+                } else {
+                    Some(trimmed)
+                }
+            }),
@@ -105,7 +110,12 @@ impl State {
-            tool_parser_name: Some(tool_parser_name.unwrap_or_else(|| String::from(""))),
+            tool_parser_name: tool_parser_name.and_then(|s| {
+                let trimmed = s.trim().to_string();
+                if trimmed.is_empty() {
+                    None
+                } else {
+                    Some(trimmed)
+                }
+            }),
```
This change ensures that when no parser is configured (`None`), it remains `None`, and that empty or all-whitespace names are normalized to `None`.
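The normalization the review asks for can be sketched as a standalone helper. This is a hypothetical free function for illustration only; the suggested fix inlines this `and_then` logic at both constructor sites rather than introducing a named function:

```rust
/// Preserve None, and map empty or whitespace-only parser names to None.
/// (Illustrative helper; name and placement are assumptions.)
fn normalize_parser_name(name: Option<String>) -> Option<String> {
    name.and_then(|s| {
        let trimmed = s.trim().to_string();
        if trimmed.is_empty() { None } else { Some(trimmed) }
    })
}
```

With this behavior, `None`, `Some("")`, and `Some("   ")` all end up as `None`, while `Some(" hermes ")` becomes `Some("hermes")`, so downstream code can reliably distinguish "no parser configured" from a real parser name.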
🤖 Prompt for AI Agents
In lib/llm/src/http/service/service_v2.rs around lines 86 and 105, the
constructors currently coerce None into Some("") via tool_parser_name:
Some(tool_parser_name.unwrap_or_else(|| String::from(""))); change both sites so
that None is preserved and any provided name that is empty or only whitespace is
normalized to None (i.e., map the Option<String> through trim(), filter out
empty results, and only wrap in Some if non-empty), updating the field
initialization accordingly.
lib/llm/src/local_model.rs
Outdated
```diff
@@ -59,6 +59,7 @@ pub struct LocalModelBuilder {
     extra_engine_args: Option<PathBuf>,
     runtime_config: ModelRuntimeConfig,
     user_data: Option<serde_json::Value>,
+    tool_parser_name: Option<String>,
```
Could you make this an `enum`? Then you give it an `impl From<String> for ToolParserName {`. Then you call the builder by doing `.tool_parser_name(the_str_name.into())` and it will magic itself into an enum. Or you can do `try_into()` (`impl TryFrom<String>` instead of `impl From`) and that can return an error.

Examples in `lib/llm/src/entrypoint/input.rs` on the `Input` type.
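A minimal sketch of what that suggestion could look like. The variant list mirrors the CLI help text in this PR; the type name, error type, and string spellings are assumptions, not the final design:

```rust
use std::convert::TryFrom;

/// Hypothetical typed parser name; makes invalid values unrepresentable.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
enum ToolParserName {
    #[default]
    Default,
    Hermes,
    NemotronDeci,
    Llama3Json,
    Mistral,
    Phi4,
}

impl TryFrom<String> for ToolParserName {
    type Error = String;

    // Accepts the CLI strings case-insensitively; anything else is an error.
    fn try_from(s: String) -> Result<Self, Self::Error> {
        match s.trim().to_lowercase().as_str() {
            "" | "default" => Ok(Self::Default),
            "hermes" => Ok(Self::Hermes),
            "nemotron_deci" => Ok(Self::NemotronDeci),
            "llama3_json" => Ok(Self::Llama3Json),
            "mistral" => Ok(Self::Mistral),
            "phi4" => Ok(Self::Phi4),
            other => Err(format!("unknown tool parser: {other}")),
        }
    }
}
```

A caller could then do `builder.tool_parser_name(cli_value.try_into()?)`, surfacing a typo at the CLI boundary instead of deep in the aggregators.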
For the default type here, this makes `MyThing::One` the `Default::default()` (note the enum needs `#[derive(Default)]` for the `#[default]` variant attribute to work):

```rust
#[derive(Default)]
enum MyThing {
    #[default]
    One,
    Two,
}
```

Then `#[derive(Copy, Clone)]` on it, because now it's implemented as a single byte, and so you don't need to `.clone()` it; it copies automatically.
Don't want to bog down development here - but for all the many LoRA, tool calling, reasoning, etc. PRs coming in - can we start a small `docs/guides/frontend.md` or something and start to call out some of the big things we're adding and how to opt in? Not looking for something terribly comprehensive, but I want to get something in so each new PR has an easier time just appending a small chunk to the doc each time.

```markdown
# Dynamo OpenAI Frontend
...
## Tool Calling
## Tool Parsers
For tool parsers, see the `--tool-parser-name` arg in output of `python -m dynamo.frontend --help`.
## Reasoning Parsers
For reasoning models, dynamo supports ...
## LoRAs
LoRAs are supported under unique model names in the `/v1/models` route, for example ...
```
Doesn't need to be specific to this PR either - but I think it would be good to start writing this stuff down somewhere earlier than later. CC @grahamking
For example @zhongdaor-nv is starting to look into tool calling and reasoning for GPT OSS specifically, and I don't have a good resource/overview to point him to the list of things we just added in past 2 weeks that may be applicable to him, other than linking a dozen PRs or so
sounds good. I will add it soon once I finalize the tool call param design
Force-pushed from 8eca226 to ed22ee5 (Compare)
```diff
@@ -176,6 +176,12 @@ def parse_args():
         default=None,
         help="Prefix for Dynamo frontend metrics. If unset, uses DYN_METRICS_PREFIX env var or 'dynamo_frontend'.",
     )
+    parser.add_argument(
+        "--tool-call-parser",
```
Since the frontend can serve many models - should this be a frontend argument? Or a backend/worker argument passed through ModelDeploymentCard/RuntimeConfig etc. that the frontend can load on demand when a worker is discovered via `register_llm`?

e.g. `python -m dynamo.vllm --tool-call-parser ...` instead of `python -m dynamo.frontend`
Oops, sorry I see you said the same thing on slack - ignore me.
Closing this in favor of #2619.
Overview:
Details:
Where should the reviewer start?
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
Summary by CodeRabbit