Accelerate is a dependency for voice cloning script #1468

gustrd · 2025-04-06T01:57:55Z

Describe the Issue
"NameError: name 'init_empty_weights' is not defined" happens when trying to run the voice cloning example, at the line: "self.model = HFModel(config.model_path, self.device, config.dtype, config.additional_model_config)"

Additional Information:
Installing the "accelerate" package via pip solves this issue.

LostRuins · 2025-04-06T03:22:59Z

Ah yup alright I'll update the wiki.

How's the voice cloning so far? Should be compatible with all existing speakers. Hoping that @edwko brings us new OuteTTS models in future.

edwko · 2025-04-06T07:20:50Z

@LostRuins New model on it's way! 🚚📦

gustrd · 2025-04-07T01:58:17Z

I tested with some voices, but with them I think that is kind of slow, and often generates audio just to a part of the message.

I'm not sure why, but seems like when calling the API it generates faster than in the chat ui. Very strange, I know.

LostRuins · 2025-04-07T11:06:40Z

@gustrd did you test with the sample speaker jsons included here or did you make your own? Are they working? The voice cloning process requires an accurate transcription from the source audio to work.

New model looks interesting but it's blocked due to dependency on DAC.speech.v1.0 arch being implemented. Kinda wish they has stuck with WavTokenizer.

Unfortunately this might be rough as the number of people who can implement a new arch in ggml is quite limited - we did not get xcodec for YuE, nor snac_24khz for Orpheus, and now those are kind of DoA. Hopefully someone will pick it up, thats the only major blocker and the rest is quite trivial to add.

gustrd · 2025-04-07T22:15:05Z

The ones provided works ok.

I had issues with the ones I cloned myself. Maybe I was using an audio too short or too long what is the recommended length?

LostRuins · 2025-04-08T11:43:04Z

About 10 seconds or so works best. Choose clear, articulate speech used for cloning, so that whisper can transcribe it properly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accelerate is a dependency for voice cloning script #1468

Accelerate is a dependency for voice cloning script #1468

gustrd commented Apr 6, 2025

LostRuins commented Apr 6, 2025

edwko commented Apr 6, 2025

gustrd commented Apr 7, 2025

LostRuins commented Apr 7, 2025 •

edited

Loading

gustrd commented Apr 7, 2025

LostRuins commented Apr 8, 2025

Accelerate is a dependency for voice cloning script #1468

Accelerate is a dependency for voice cloning script #1468

Comments

gustrd commented Apr 6, 2025

LostRuins commented Apr 6, 2025

edwko commented Apr 6, 2025

gustrd commented Apr 7, 2025

LostRuins commented Apr 7, 2025 • edited Loading

gustrd commented Apr 7, 2025

LostRuins commented Apr 8, 2025

LostRuins commented Apr 7, 2025 •

edited

Loading