👉🏻 FireRedTTS-1S Paper 👈🏻
👉🏻 FireRedTTS-1S Demos 👈🏻
⚠️ Note: The current branch is FireRedtts-1s. To access FireRedtts-1, please switch to the FireRedtts-1 branch.
- [2025/04/14] 🔥 We release the pre-trained checkpoints and inference code.
- [2025/03/25] 🔥 We release the technical report.
- 2025/04: Release the pre-trained checkpoints and inference code.
- Clone the repo
https://github.com/FireRedTeam/FireRedTTS.git
cd FireRedTTS
- Create conda env
# step 1: create env
conda create --name redtts python=3.10
# step 2: install torch (the PyTorch build should match the CUDA version on your machine)
# CUDA 11.8
conda install pytorch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 pytorch-cuda=11.8 -c pytorch -c nvidia
# CUDA 12.1
conda install pytorch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 pytorch-cuda=12.1 -c pytorch -c nvidia
# step 3: install fireredtts from source
cd fireredtts
pip install -e .
# step 4: install other requirements
pip install -r requirements.txt
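After step 4, a quick sanity check confirms the environment is wired up correctly (a sketch; `check_install` is our own helper name, not part of the repo):

```python
def check_install():
    """Return (torch version, CUDA availability) for the active environment."""
    try:
        import torch
    except ImportError:
        # torch missing: revisit step 2 above.
        return None, False
    return torch.__version__, torch.cuda.is_available()

print(check_install())
```

If this prints `(None, False)`, PyTorch did not install into the `redtts` env.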
Download the required model files from Model_Lists and place them in the folder pretrained_models.
import os
import torchaudio
from fireredtts.fireredtts import FireRedTTS
tts = FireRedTTS(
config_path="configs/config_24k.json",
pretrained_path=<pretrained_models_dir>,
)
# Same-language synthesis
# For the test-hard evaluation, we enabled the use_tn=True setting.
rec_wavs = tts.synthesize(
prompt_wav="examples/prompt_1.wav",
prompt_text="对,所以说你现在的话,这个账单的话,你既然说能处理,那你就想办法处理掉。",
text="小红书,是中国大陆的网络购物和社交平台,成立于二零一三年六月。",
lang="zh",
use_tn=True
)
rec_wavs = rec_wavs.detach().cpu()
out_wav_path = "./example.wav"
torchaudio.save(out_wav_path, rec_wavs, 24000)
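If torchaudio is unavailable at save time, the 24 kHz mono output can also be written with the standard-library wave module. This is a sketch: `write_wav_16bit` is a hypothetical helper, and it assumes the samples are floats in [-1, 1]:

```python
import struct
import wave

def write_wav_16bit(path, samples, sample_rate=24000):
    """Write mono float samples in [-1, 1] as a 16-bit PCM WAV file."""
    with wave.open(path, "wb") as f:
        f.setnchannels(1)
        f.setsampwidth(2)  # 16-bit
        f.setframerate(sample_rate)
        frames = b"".join(
            struct.pack("<h", max(-32768, min(32767, int(s * 32767))))
            for s in samples
        )
        f.writeframes(frames)
```

For the example above, `rec_wavs.squeeze(0).tolist()` would yield the flat sample list to pass in.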
- The reference audio should not be too long or too short; a duration of 3 to 10 seconds is recommended.
- The reference audio should be smooth and natural, and the accompanying text must be accurate to enhance the stability and naturalness of the synthesized audio.
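The duration guideline above can be checked programmatically before synthesis. A minimal sketch with the standard-library wave module (`prompt_duration_ok` is our own name; it assumes the prompt is a PCM WAV file):

```python
import wave

def prompt_duration_ok(wav_path, min_s=3.0, max_s=10.0):
    """Return (duration in seconds, whether it falls in the recommended window)."""
    with wave.open(wav_path, "rb") as f:
        duration = f.getnframes() / float(f.getframerate())
    return duration, min_s <= duration <= max_s
```

Running this on `examples/prompt_1.wav` before calling `tts.synthesize` catches out-of-range prompts early.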
- Tortoise-tts and XTTS-v2 offer invaluable insights for constructing an autoregressive-style system.
- We referred to the fish-speech text tokenizer solution.
- We referred to the BigCodec speech codec solution.
- We referred to the Encodec causal convolution solution.
- We referred to the SpeechBrain ECAPA-TDNN solution.
- We referred to the ChineseSpeechPretrain HuBERT model.
- The project incorporates zero-shot voice cloning functionality. Please note that this capability is intended solely for academic research purposes.
- DO NOT use this model for ANY illegal activities❗️❗️❗️❗️❗️❗️
- The developers assume no liability for any misuse of this model.
- If you identify any instances of abuse, misuse, or fraudulent activities related to this project, please report them to our team immediately.