🌿 Fern Regeneration -- July 2, 2025 #321

Closed
wants to merge 1 commit into from
31 changes: 17 additions & 14 deletions .mock/definition/tts/__package__.yml
@@ -390,22 +390,19 @@ types:
  see our documentation on [instant
  mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).

- - Dynamic voice generation is not supported with this mode; a
- predefined
+ - A
  [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice)
- must be specified in your request.
+ must be specified when instant mode is enabled. Dynamic voice
+ generation is not supported with this mode.

- - This mode is only supported for streaming endpoints (e.g.,
+ - Instant mode is only supported for streaming endpoints (e.g.,
  [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming),
  [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).

  - Ensure only a single generation is requested
  ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations)
  must be `1` or omitted).
-
- - With `instant_mode` enabled, **requests incur a 10% higher cost**
- due to increased compute and resource requirements.
- default: false
+ default: true
  source:
  openapi: tts-openapi.yml
  ReturnTts:
@@ -514,14 +511,20 @@ types:
  docs: >-
  Natural language instructions describing how the synthesized speech
  should sound, including but not limited to tone, intonation, pacing,
- and accent (e.g., 'a soft, gentle voice with a strong British
- accent').
+ and accent.

- - If a Voice is specified in the request, this description serves as
- acting instructions. For tips on how to effectively guide speech
- delivery, see our guide on [Acting
+
+ **This field behaves differently depending on whether a voice is
+ specified**:
+
+ - **Voice specified**: the description will serve as acting directions
+ for delivery. Keep directions concise—100 characters or fewer—for best
+ results. See our guide on [acting
  instructions](/docs/text-to-speech-tts/acting-instructions).
- - If no Voice is specified, a new voice is generated based on this description. See our [prompting guide](/docs/text-to-speech-tts/prompting) for tips on designing a voice.
+
+ - **Voice not specified**: the description will serve as a voice
+ prompt for generating a voice. See our [prompting
+ guide](/docs/text-to-speech-tts/prompting) for design tips.
  validation:
  maxLength: 1000
  speed:
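
The two roles that `description` plays in the updated docs can be illustrated with a minimal Python sketch. The helper names `description_role` and `is_concise_direction`, and the labels they return, are hypothetical and not part of the Hume SDK; only the 1000-character limit (`validation.maxLength`) and the 100-character guidance are taken from the diff.

```python
from typing import Optional

# Taken from the diff: `validation.maxLength: 1000` on the description field.
MAX_DESCRIPTION_LENGTH = 1000
# Taken from the new docs text: "100 characters or fewer" for acting directions.
RECOMMENDED_ACTING_LENGTH = 100


def description_role(description: str, voice: Optional[dict] = None) -> str:
    """Return how a description would be interpreted (hypothetical labels).

    With a voice specified, the description serves as acting directions;
    without one, it serves as a prompt for generating a new voice.
    """
    if len(description) > MAX_DESCRIPTION_LENGTH:
        raise ValueError(
            f"description exceeds {MAX_DESCRIPTION_LENGTH} characters"
        )
    return "acting_directions" if voice else "voice_prompt"


def is_concise_direction(description: str) -> bool:
    """Check the documented length guidance for acting directions."""
    return len(description) <= RECOMMENDED_ACTING_LENGTH
```

A caller might run `description_role(text, voice)` before sending a request and, when the result is `"acting_directions"`, warn if `is_concise_direction(text)` is false.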
277 changes: 152 additions & 125 deletions poetry.lock

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion pyproject.toml
@@ -3,7 +3,7 @@ name = "hume"

  [tool.poetry]
  name = "hume"
- version = "0.9.1"
+ version = "0.9.2"
  description = "A Python SDK for Hume AI"
  readme = "README.md"
  authors = []
20 changes: 8 additions & 12 deletions reference.md
@@ -145,10 +145,9 @@ This setting affects how the `snippets` array is structured in the response, whi
  **instant_mode:** `typing.Optional[bool]`

  Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
- - Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
- - This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
+ - A [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified when instant mode is enabled. Dynamic voice generation is not supported with this mode.
+ - Instant mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
  - Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
- - With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.

  </dd>
  </dl>
@@ -294,10 +293,9 @@ This setting affects how the `snippets` array is structured in the response, whi
  **instant_mode:** `typing.Optional[bool]`

  Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
- - Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
- - This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
+ - A [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified when instant mode is enabled. Dynamic voice generation is not supported with this mode.
+ - Instant mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
  - Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
- - With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.

  </dd>
  </dl>
@@ -441,10 +439,9 @@ This setting affects how the `snippets` array is structured in the response, whi
  **instant_mode:** `typing.Optional[bool]`

  Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
- - Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
- - This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
+ - A [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified when instant mode is enabled. Dynamic voice generation is not supported with this mode.
+ - Instant mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
  - Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
- - With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.

  </dd>
  </dl>
@@ -596,10 +593,9 @@ This setting affects how the `snippets` array is structured in the response, whi
  **instant_mode:** `typing.Optional[bool]`

  Enables ultra-low latency streaming, significantly reducing the time until the first audio chunk is received. Recommended for real-time applications requiring immediate audio playback. For further details, see our documentation on [instant mode](/docs/text-to-speech-tts/overview#ultra-low-latency-streaming-instant-mode).
- - Dynamic voice generation is not supported with this mode; a predefined [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified in your request.
- - This mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
+ - A [voice](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.utterances.voice) must be specified when instant mode is enabled. Dynamic voice generation is not supported with this mode.
+ - Instant mode is only supported for streaming endpoints (e.g., [/v0/tts/stream/json](/reference/text-to-speech-tts/synthesize-json-streaming), [/v0/tts/stream/file](/reference/text-to-speech-tts/synthesize-file-streaming)).
  - Ensure only a single generation is requested ([num_generations](/reference/text-to-speech-tts/synthesize-json-streaming#request.body.num_generations) must be `1` or omitted).
- - With `instant_mode` enabled, **requests incur a 10% higher cost** due to increased compute and resource requirements.

  </dd>
  </dl>
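
The instant mode constraints listed in the `instant_mode` docs can be sketched as a client-side pre-flight check. This is a minimal illustration under stated assumptions, not SDK code: the function `instant_mode_problems` and the plain-dict request shape are hypothetical, the endpoint set contains only the two paths the docs give as examples, and requiring a voice on every utterance is this sketch's own (possibly stricter) reading of "a voice must be specified".

```python
from typing import List, Optional, Sequence

# Assumption: only the two streaming paths named as examples in the docs.
STREAMING_ENDPOINTS = {"/v0/tts/stream/json", "/v0/tts/stream/file"}


def instant_mode_problems(
    endpoint: str,
    utterances: Sequence[dict],
    num_generations: Optional[int] = None,
    instant_mode: bool = True,  # mirrors the new `default: true` in the diff
) -> List[str]:
    """Return reasons a request conflicts with instant mode (hypothetical helper)."""
    if not instant_mode:
        return []  # none of the constraints apply when instant mode is off

    problems: List[str] = []
    if endpoint not in STREAMING_ENDPOINTS:
        problems.append("instant mode is only supported for streaming endpoints")
    # Assumption: every utterance must carry a voice (dynamic voice
    # generation is not supported with instant mode).
    if any(not u.get("voice") for u in utterances):
        problems.append("a voice must be specified when instant mode is enabled")
    if num_generations not in (None, 1):
        problems.append("num_generations must be 1 or omitted")
    return problems
```

An empty list means the request satisfies all three documented constraints; otherwise the caller can either fix the request or set `instant_mode=False` explicitly.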